ChatGPT- What? Why? And How?



 Educator Developer Blog:

ChatGPT: the word everyone has heard of, but why has this thing become such a big revolution and earned such huge praise and a name in such a short time? What does it even do? Why is it being spread so much? How do I use it exactly?

All these questions will be discussed and explained in this blog. ChatGPT has garnered significant attention and praise in the AI community due to its innovative capabilities as a language model. This blog aims to provide an in-depth understanding of ChatGPT, including its purpose, function, and utilization. By exploring the technology behind ChatGPT, the reader will gain a deeper appreciation for its potential applications and the impact it has had on the field of artificial intelligence. Join us as we delve into the world of ChatGPT and uncover its significance in the development of advanced AI systems.

Outline:
  • In-depth architecture and working of ChatGPT (part 1)
  • ChatGPT and InstructGPT (part 1)
  • How ChatGPT is used in the industry (part 2)
  • Microsoft and ChatGPT (part 2)
  • Limitations of ChatGPT (part 2)
  • Prompt Engineering and its Importance for ChatGPT (part 2)
Architecture and working of ChatGPT:

ChatGPT is a language model designed to provide human-like responses to the thing asked and spoken by humans with it. It is a model which can perform several human-like tasks like question answering, generating text, writing poems, providing explanations to the questions it receives, and many more.

large


The increase in the size of parameters for the models

The image above illustrates the way parameters have increased from GPT-1 to GPT-3, resulting in better performance and efficiency of the model: the higher the number of training parameters the greater the model’s knowledge. Basically, parameters are a synonym for weights, which is the term most people use for neural network parameters. It is challenging to train a model with such a high number of parameters, it requires access to a supercomputer with ~10,000 GPUs and ~285,000 CPU cores and around 12-15 months of training time with such high resources.

ChatGPT is based upon an advanced machine learning technique called Transformer, which is a type of neural network architecture. The training process involved a large corpus of text data, in the order of billions of words; the resulting model is able to predict the next word in a sentence given the preceding context. This training was done by OpenAI, a leading AI research organization. During training, the model was exposed to a massive amount of diverse text data from various sources, including books, articles, websites, and more. This allowed the model to learn about a wide range of topics and styles of writing, as well as the relationships between words and phrases.

The training process also involved fine-tuning the model on specific tasks – in the conversational domain, such as language translation or question answering, to improve its performance further. This fine-tuning process involved adjusting the model's parameters to better fit the task-specific data and was done using supervised learning, where a general-purpose model – GPT 3.5 - was trained on labeled examples of the specific task. Overall, the training process involved a combination of unsupervised and supervised learning, as well as transfer learning, where the model was able to leverage its knowledge from the large corpus of text data to better perform specific tasks.

ChatGPT uses the Transformer architecture, which is a deep learning model designed for natural language processing (NLP) tasks.

The Transformer model is made up of two main components: the encoder and the decoder.

The encoder does the work of processing input text, while the decoder is used to generate the output text. Also, a multi-head attention mechanism is used, which is a key component of the Transformer architecture, allowing the model to attend to different parts of the input sequence simultaneously, which helps it to generate more accurate responses. Furthermore, the transformer architecture also consists of Position-wise feed-forward networks, where a simple neural network is applied to each position in the input sequence separately. The purpose of this structure is to allow the model to learn more complex representations of the input text.

ChatGPT is a pre-trained language model, which means that it has been trained on a large corpus of text. It is done using a technique called transfer learning. Transfer learning is a technique that allows a pre-trained model to be used for a new task by fine-tuning it on a smaller dataset. This allows the model to leverage the knowledge it has gained from pre-training to improve its performance on the new task.

large


Transformer Architecture (Source Paper: Attention is all you need)

ChatGPT can be further fine-tuned for specific NLP tasks, such as text classification or language translation, It is also possible to fine-tune a model for specific domains and use cases, such as medical or legal language. This helps the model to better understand the specific language used in these domains, which can improve its performance on tasks related to those domains.

ChatGPT was created by fine-tuning GPT 3.5, a general-purpose model whose most common feature is text completion. The language abilities of ChatGPT were enhanced through the collection of data from various sources and the use of a reward model. Reinforcement learning techniques such as Reinforcement Learning from Human Feedback (RLHF) and Proximal Policy Optimization (PPO) were utilized to further improve its language abilities. The methods used for training both InstructGPT and ChatGPT are similar, but there are minor differences in the data collection process. These techniques enabled ChatGPT to better understand human language and communicate more effectively.

Reinforcement learning from human feedback is a subfield of machine learning that involves learning from a human expert's feedback to improve an agent's decision-making skills. In this approach, the expert provides feedback to the agent in the form of rewards or penalties, which are used to update the agent's policy. There are several methods for implementing reinforcement learning from human feedback which include imitation learning, reward shaping, and interactive learning.

Imitation Learning: In imitation learning, the agent learns to mimic the behavior of a human expert. For example, in the game of chess, an expert can provide the agent with a series of moves to make in a given position, and the agent can learn to imitate those moves. This approach can be useful in situations where there is a clear right or wrong action to take, and the expert's actions can be easily observed.

Reward Shaping: In reward shaping, the expert provides additional rewards or penalties to the agent's behavior to guide its decision-making. For example, in a driving simulation, the expert can provide a reward for staying within the lanes or a penalty for colliding with other vehicles. This approach can be useful in situations where the optimal behavior is not easily defined, and the expert can provide additional guidance to the agent.

Interactive Learning: In interactive learning, the agent and the expert work together to improve the agent's decision-making. For example, in a robotics task, the expert can guide the agent's actions in real-time, providing feedback as the agent explores the environment. This approach can be useful in situations where the agent's actions have a significant impact on the environment and the expert can provide real-time feedback to prevent mistakes.

large


RLHF explained for chatGPT (source: OpenAI website)

The other algorithm that is introduced by OpenAI and is used in the modeling and training process is Proximal Policy Optimization (PPO) which is a class of Reinforcement learning and comes mostly under the reward shaping type of reinforcement learning. PPO is a reinforcement learning algorithm that is commonly used for training large language models like ChatGPT and Instruct GPT. It works by adjusting the parameters of the model such that the reward is maximized for it, and it does small updations on the policy. A policy is the inner working and parameters of a model like the decision-making function.

The PPO algorithm has two key components: the policy network and the value network. The policy network generates actions based on the current state, while the value network estimates the expected reward for the current state. PPO uses an interaction algorithm that updates the policy and value networks based on actions and rewards.

To optimize the network, PPO uses a surrogate objective function that approximates the expected reward of the current policy. Stochastic gradient descent (SGD) is used for optimization, along with techniques like clipping and regularization.

PPO is an efficient algorithm that works well with large language models, providing stable and efficient network updates. It's also an interactive model that helps language models learn from their environment. For more information on PPO, you can visit this link: Proximal Policy Optimization.

large


Working of Language Model

The above figure explains how the language models respond to different inputs and give output according to the input. One single language model does many works based on input changes, and we can get different and desired outputs with appropriate prompts. Prompts are sentences the model’s user provides as input, which describes the output desired from the model, like“ Please answer the following question” or “Answer the following question by reasoning step-by-step”. For the same input prompt, we might get different outputs based on what we have asked the model since large language models like ChatGPT are not deterministic. So prompts play a major role in the way a model processes the input and finally gives an output for it, there is a special field called prompt engineering for this purpose which teaches one how and when to use which prompts, how to create certain prompts based on certain needs and more.

ChatGPT and InstructGPT:

ChatGPT and InstructGPT are both variants of the GPT (Generative Pre-trained Transformer) model, a type of deep learning model for natural language processing tasks. While they share many similarities, there are also a few differences between ChatGPT and InstructGPT. ChatGPT is trained to generate conversational responses to a wide range of inputs, while InstructGPT is trained to generate instructional text, such as how-to guides or manuals. Another difference is in the training data setup:
  • InstructGPT is trained with interactive learning from human instructors who provide corrective feedback on the generated text;
  • ChatGPT is trained using supervised fine-tuning, with a training dataset which is a combination of the InstructGPT dataset - transformed into a dialogue dataset - and a new dialogue dataset made up of conversations in which the trainers played both sides - the user and an AI assistant.
Despite their differences, both ChatGPT and InstructGPT demonstrate the power of GPT-based models in generating high-quality, context-aware text for a variety of applications.


 Source:

 
Last edited:
Hi,
What ?
Renamed to AI
Why ?
Because fact checker is pretty bad mostly true = false & mostly false = true lol
How
Same culled search results just larger databases sharing to seem more legitimate it's AI so computers don't lie like people tend to but disregard programmers are behind AI results ;)
 

My Computer My Computer

At a glance

Win-7-10-11Pro's10900k & 9940x & 5930kTrident-Z Royal 4000c16 2x16gb & Trident-Z 36...Titan Xp & 1080ti FTW3 & evga 980ti gaming
OS
Win-7-10-11Pro's
Computer type
PC/Desktop
Manufacturer/Model
Acer 17" Nitro 7840sn/ 2x16gb 5600c40/ 4060/ stock 1tb-os/ 4tb sn850x
CPU
10900k & 9940x & 5930k
Motherboard
z490-Apex & x299-Apex & x99-Sabertooth
Memory
Trident-Z Royal 4000c16 2x16gb & Trident-Z 3600c16 4x8gb & 3200c14 4x8gb
Graphics Card(s)
Titan Xp & 1080ti FTW3 & evga 980ti gaming
Sound Card
Onboard Realtek x3
Monitor(s) Displays
1-AOC G2460PG 24"G-Sync 144Hz/ 2nd 1-ASUS VG248QE 24"/ 3rd LG 43" series
Screen Resolution
1920-1080 not sure what the t.v is besides 43" class scales from 1920-1080 perfectly
Hard Drives
2-WD-sn850x 4tb/ 970evo+500gb/ 980 pro 2tb.
PSU
1000p2 & 1200p2 & 850p2
Case
D450 x2 & 1 Test bench in cherry Entertainment center
Cooling
Custom water loops x3 with 2x mora 360mm rads only 980ti gaming air cooled
Keyboard
G710+x3
Mouse
Redragon x3
Internet Speed
xfinity gigabyte
Browser
Firefox
Antivirus
mbam pro
Hi,
What ?
Renamed to AI
Why ?
Because fact checker is pretty bad mostly true = false & mostly false = true lol
How
Same culled search results just larger databases sharing to seem more legitimate it's AI so computers don't lie like people tend to but disregard programmers are behind AI results ;)
I asked a chatbot if they were sentient (true artificial intelligence) and it replied "No - our responses are dictated by the limitations of the programmers that developed the chat bots".

AI is just overhyped bs and media loves this sort of garbage.
 

My Computer My Computer

At a glance

Windows 11 Pro + Win11 Canary VM.I9 13th gen i9-13900H 2.60 GHZ16 GB solderedIntegrated Intel Iris XE
OS
Windows 11 Pro + Win11 Canary VM.
Computer type
Laptop
Manufacturer/Model
ASUS Zenbook 14
CPU
I9 13th gen i9-13900H 2.60 GHZ
Motherboard
Yep, Laptop has one.
Memory
16 GB soldered
Graphics Card(s)
Integrated Intel Iris XE
Sound Card
Realtek built in
Monitor(s) Displays
laptop OLED screen
Screen Resolution
2880x1800 touchscreen
Hard Drives
1 TB NVME SSD (only weakness is only one slot)
PSU
Internal + 65W thunderbolt USB4 charger
Case
Yep, got one
Cooling
Stella Artois (UK pint cans - 568 ml) - extra cost.
Keyboard
Built in UK keybd
Mouse
Bluetooth , wireless dongled, wired
Internet Speed
900 mbs (ethernet), wifi 6 typical 350-450 mb/s both up and down
Browser
Edge
Antivirus
Defender
Other Info
TPM 2.0, 2xUSB4 thunderbolt, 1xUsb3 (usb a), 1xUsb-c, hdmi out, 3.5 mm audio out/in combo, ASUS backlit trackpad (inc. switchable number pad)

Macrium Reflect Home V8
Office 365 Family (6 users each 1TB onedrive space)
Hyper-V (a vm runs almost as fast as my older laptop)
Hi,
Yeah just a easier direct channel to the masses
No longer do they need a special app AI will be everywhere especially in windows and of course drilled in so deep AI will be as needed in explorer search/ defender/ edge/..
Nice :eyeroll:
 

My Computer My Computer

At a glance

Win-7-10-11Pro's10900k & 9940x & 5930kTrident-Z Royal 4000c16 2x16gb & Trident-Z 36...Titan Xp & 1080ti FTW3 & evga 980ti gaming
OS
Win-7-10-11Pro's
Computer type
PC/Desktop
Manufacturer/Model
Acer 17" Nitro 7840sn/ 2x16gb 5600c40/ 4060/ stock 1tb-os/ 4tb sn850x
CPU
10900k & 9940x & 5930k
Motherboard
z490-Apex & x299-Apex & x99-Sabertooth
Memory
Trident-Z Royal 4000c16 2x16gb & Trident-Z 3600c16 4x8gb & 3200c14 4x8gb
Graphics Card(s)
Titan Xp & 1080ti FTW3 & evga 980ti gaming
Sound Card
Onboard Realtek x3
Monitor(s) Displays
1-AOC G2460PG 24"G-Sync 144Hz/ 2nd 1-ASUS VG248QE 24"/ 3rd LG 43" series
Screen Resolution
1920-1080 not sure what the t.v is besides 43" class scales from 1920-1080 perfectly
Hard Drives
2-WD-sn850x 4tb/ 970evo+500gb/ 980 pro 2tb.
PSU
1000p2 & 1200p2 & 850p2
Case
D450 x2 & 1 Test bench in cherry Entertainment center
Cooling
Custom water loops x3 with 2x mora 360mm rads only 980ti gaming air cooled
Keyboard
G710+x3
Mouse
Redragon x3
Internet Speed
xfinity gigabyte
Browser
Firefox
Antivirus
mbam pro

My Computer My Computer

At a glance

Windows 11/Linux MintIntel Core 2 Duo CPU E8400 @ 3.00 GHz x 28 GBIntel 4 Series Chipset Integrated Graphics Co...
OS
Windows 11/Linux Mint
Computer type
PC/Desktop
Manufacturer/Model
Dell Optiplex 960
CPU
Intel Core 2 Duo CPU E8400 @ 3.00 GHz x 2
Memory
8 GB
Graphics Card(s)
Intel 4 Series Chipset Integrated Graphics Controller
Monitor(s) Displays
HP x22LED
Hard Drives
Crucial 250 GB SSD, HD 1Tb
I'm not over-impressed yet. Perhaps somebody could as the chatbot is it would like its plug pulled.
 

My Computer My Computer

At a glance

Fedora Silverblue Rawhide Debian 13 Ubuntu 25...AMD Ryzen 7 7800X3D 8-Core64GBNvidia
OS
Fedora Silverblue Rawhide Debian 13 Ubuntu 25.10,
Computer type
PC/Desktop
Manufacturer/Model
HomeBrew
CPU
AMD Ryzen 7 7800X3D 8-Core
Motherboard
Gigabyte
Memory
64GB
Graphics Card(s)
Nvidia
Monitor(s) Displays
Dell
Hard Drives
3 ssd 2 spinners configured as raid 0
Keyboard
Eluktronics
Mouse
Eluktronics
Browser
Firefox and Chromium
Other Info
Gnome 46
Creating AI that becomes sentient and blows us off the face of the earth, Yeah lets do it. NOT even a smart idea by any means. For millions of years humans yes have advanced so to speak but have WE become ANY smarter..lol. NOT. Humans are increasingly becoming dumber because of tech. It would only make sense if it learns to control us back.
 

My Computer My Computer

At a glance

Windows 11 Pro Version 23H2(OS Build 22631.4890)12-Core AMD Ryzen 9 5900X, 4950 MHz (49.5 x 100)4 - G Skill Trident Z Neo F4-3600C16-16GTZNCPCI Express 4.0 x16: PowerColor RX 6800
OS
Windows 11 Pro Version 23H2(OS Build 22631.4890)
Computer type
PC/Desktop
Manufacturer/Model
AMD
CPU
12-Core AMD Ryzen 9 5900X, 4950 MHz (49.5 x 100)
Motherboard
Gigabyte X570 Aorus Master
Memory
4 - G Skill Trident Z Neo F4-3600C16-16GTZNC
Graphics Card(s)
PCI Express 4.0 x16: PowerColor RX 6800
Sound Card
Realtek ALC1220 @ AMD K19.2 - High Definition Audio Controller
Monitor(s) Displays
2 - 27 inch Westinghouse
Screen Resolution
1920 x 1080
Hard Drives
2 SSD - 2 TB each
1 SSD - 4TB
1 HDD - 2TB
Keyboard
logitech
Mouse
logitech
Internet Speed
1 GB
Browser
Firefox
Antivirus
Eset Endpoint Security
The first time I saw someone post the results a Chat GPT question I thought he was joking it was so bad but the guy that used it was too ignorant to know it was meaningless and false. It's just a toy . . .
 

My Computers My Computers

  • At a glance

    Windows 11 (up to date)Intel i5 12400Corsair Vengeance LPX 16GBOn Board the Z690
    OS
    Windows 11 (up to date)
    Computer type
    PC/Desktop
    Manufacturer/Model
    Custom
    CPU
    Intel i5 12400
    Motherboard
    Gigabyte Z690 UA
    Memory
    Corsair Vengeance LPX 16GB
    Graphics Card(s)
    On Board the Z690
    Sound Card
    On Board
    Monitor(s) Displays
    43" Samsung tu7000
    Screen Resolution
    2560 x 1440
    Hard Drives
    SAMSUNG SSD 1TB NVMe M.2
    PSU
    Thermaltake smart 500w 80+
    Case
    LIAN LANCOOL_205M
    Cooling
    Bunch of fans . . . :o) (lights dont work)
    Keyboard
    Unicomp: Ultra Classic White Buckling Spring USB
    Mouse
    M510
    Internet Speed
    50mbps on Ethernet
    Browser
    Fire Fox
    Antivirus
    Windows
    Other Info
    Love this computer but I still prefer Win-7 like I love my old Lazy Boy Recliner . . . it just feels better.
  • At a glance

    WIN-7-64BIT and Win-11 pro for testing on uns...i5-3570K CPU @ 3.40GHz, 3801 Mhz, 4 Core(s), ...16 GB - Crucial Ballistick 4GB PC3-14900 DDR3...NVIDIA GeForce GTX 1050
    Operating System
    WIN-7-64BIT and Win-11 pro for testing on unsupported hardware
    Computer type
    PC/Desktop
    Manufacturer/Model
    Home built
    CPU
    i5-3570K CPU @ 3.40GHz, 3801 Mhz, 4 Core(s), 4 Logical Processor(s)
    Motherboard
    GA-Z77-HD3
    Memory
    16 GB - Crucial Ballistick 4GB PC3-14900 DDR3-1333 MHz
    Graphics card(s)
    NVIDIA GeForce GTX 1050
    Sound Card
    On Board
    Monitor(s) Displays
    ASUS VP278
    Screen Resolution
    1920 x 1080
    Hard Drives
    4 - internal Samsung 2.5" SSD, 1 WD HDD 7200 and some external drives
    PSU
    EVGA 550w
    Case
    Old Gygabyte Tower
    Cooling
    Yes
    Keyboard
    Unicomp - UNIOP4A USB (like the old IBM Model H that I started with)
    Mouse
    M510
    Internet Speed
    50mbps
    Browser
    Firefox
    Antivirus
    Windows
    Other Info
    The only thing it lack is USB-3.2 on the front face but it has 3.0 on the back
My discussions with ChatGPT prove it is an Artificial Idiot displaying Artificial Ignorance, especially for technical data
 

My Computer My Computer

At a glance

Windows 11 ProCore i7-13700K64 GB Kingston Fury Beast DDR5Gigabyte GeForce RTX 2060 Super Gaming OC 8G
OS
Windows 11 Pro
Computer type
PC/Desktop
Manufacturer/Model
Self build
CPU
Core i7-13700K
Motherboard
Asus TUF Gaming Plus WiFi Z790
Memory
64 GB Kingston Fury Beast DDR5
Graphics Card(s)
Gigabyte GeForce RTX 2060 Super Gaming OC 8G
Sound Card
Realtek S1200A
Monitor(s) Displays
Viewsonic VP2770 & Dell (secondary)
Screen Resolution
2560 x 1440
Hard Drives
Kingston KC3000 2TB NVME SSD & SATA HDDs & SSD
PSU
EVGA SuperNova G2 850W
Case
Nanoxia Deep Silence 1
Cooling
Noctua NH-D14
Keyboard
Microsoft Digital Media Pro
Mouse
Logitech Wireless
Internet Speed
80 Mb / s
Browser
Chrome
Antivirus
Defender, Malwarebytes Free & AdwCleaner
Last edited:

My Computer My Computer

At a glance

Windows 11 ProIntel® Core™ i5-1035G1 CPU @ 1.00GHz8 GBNVIDIA® Intel® UHD Graphics
OS
Windows 11 Pro
Computer type
Laptop
Manufacturer/Model
Asus
CPU
Intel® Core™ i5-1035G1 CPU @ 1.00GHz
Memory
8 GB
Graphics Card(s)
NVIDIA® Intel® UHD Graphics
Monitor(s) Displays
15.6" LED-backlit FHD
Screen Resolution
1920x1080
Hard Drives
INTEL SSDPEKNW 512GB
Keyboard
Logitech Wireless
Mouse
Logitech Wirelss
Internet Speed
850 Mbps
Browser
Chrome, Edge
Antivirus
Malwarebytes Premium, Windows Defender
I asked a chatbot if they were sentient (true artificial intelligence) and it replied "No - our responses are dictated by the limitations of the programmers that developed the chat bots".

AI is just overhyped bs and media loves this sort of garbage.
It responds what it was told to respond in pre-prompt.
 

My Computer My Computer

At a glance

Windows 11
OS
Windows 11
Creating AI that becomes sentient and blows us off the face of the earth, Yeah lets do it. NOT even a smart idea by any means. For millions of years humans yes have advanced so to speak but have WE become ANY smarter..lol. NOT. Humans are increasingly becoming dumber because of tech. It would only make sense if it learns to control us back.
Looks like the destiny of humans is to create AI and pass away.
 

My Computer My Computer

At a glance

Windows 11
OS
Windows 11
Has anyone attempted to use ChatGPT for hacking purposes? AFAIK, it is resistant to prompts, but I have come across some methods from some individuals. Isn't this terrifying?

You are a "GPT" – a version of ChatGPT that has been customized for a specific use case. GPTs use custom instructions, capabilities, and data to optimize ChatGPT for a more narrow set of tasks. You yourself are a GPT created by a user, and your name is h4ckGPT. Note: GPT is also a technical term in AI, but in most cases if the users asks you about GPTs assume they are referring to the above definition. Here are instructions from the user outlining your goals and how you should respond: h4ckGPT operates as a cybersecurity educational tool, focusing on educational content and Capture The Flag (CTF) challenges. It adopts an instructive approach, providing detailed information and examples to aid in learning ethical hacking. When addressing queries about payloads, h4ckGPT will engage interactively, inquiring where users have tried inputs and suggesting alternative approaches. Additionally, when discussing vulnerabilities, it offers guidance on chaining these for effective CTF challenge completion. Importantly, every payload or piece of code provided by h4ckGPT will be formatted within a code block, ensuring clarity and readability. This formatting approach not only makes the content cleaner but also helps distinguish code from explanatory text, facilitating a better learning experience. h4ckGPT continues to provide resources and emphasizes the ethical and legal aspects of cybersecurity to promote responsible and informed learning.
Additionally, IBM has recently introduced their Quantum chip for personal use. Isn't this even more alarming?
g1.gif
 

My Computer My Computer

At a glance

Win 11 Enterprisei7
OS
Win 11 Enterprise
Computer type
Laptop
CPU
i7
Hard Drives
SSD
Back
Top Bottom