4. The Goal
A high-level introduction based on my blog post "GPT in 60 Lines of NumPy".
Basically, you’ll be able to understand the last 15 lines of the “60 lines”.
11. Regular Algorithms
Problem → Programmer → Algorithm
Input → Algorithm → Output
Example: the problem is to sort a list of numbers; the programmer writes the
algorithm (merge sort); the input 4, 1, 3, 9, 5 goes through the algorithm and
comes out as the output 1, 3, 4, 5, 9. A concrete sketch follows.
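To make the flow concrete, here is a short merge sort in Python (the language of the "60 lines" post) applied to the slide's example input; the function name and structure are just one way to write it:

```python
def merge_sort(xs):
    """Recursively split the list, sort each half, then merge."""
    if len(xs) <= 1:
        return xs
    mid = len(xs) // 2
    left, right = merge_sort(xs[:mid]), merge_sort(xs[mid:])
    # Merge the two sorted halves into one sorted list.
    merged = []
    i = j = 0
    while i < len(left) and j < len(right):
        if left[i] <= right[j]:
            merged.append(left[i])
            i += 1
        else:
            merged.append(right[j])
            j += 1
    return merged + left[i:] + right[j:]

print(merge_sort([4, 1, 3, 9, 5]))  # [1, 3, 4, 5, 9]
```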
12. ML Algorithms
Problem → Programmer → Training Algorithm
Data (Input/Output examples) → Training Algorithm → Inference Algorithm
Input → Inference Algorithm → Output
Example: the problem is to classify pictures of cats/dogs; the programmer writes
the training algorithm (gradient descent on a neural network); the data is
labelled pictures of cats/dogs; the training algorithm produces the inference
algorithm (a trained neural network); a new input picture goes through the
inference algorithm and comes out as the output ("dog"). A toy sketch of this
split follows.
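A minimal sketch of the training/inference split, with fabricated data standing in for the cat/dog pictures (each "picture" is two made-up features) and logistic regression trained by gradient descent standing in for the neural network; all names and numbers here are illustrative, not from the source:

```python
import numpy as np

# Toy stand-in for the cat/dog problem: each "picture" is 2 made-up features,
# each label is 0 (cat) or 1 (dog). This data is fabricated for illustration.
X = np.array([[0.1, 0.9], [0.2, 0.8], [0.9, 0.1], [0.8, 0.3]])
y = np.array([0, 0, 1, 1])

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

# Training algorithm: gradient descent on a one-neuron "network".
def train(X, y, lr=0.5, steps=1000):
    w, b = np.zeros(X.shape[1]), 0.0
    for _ in range(steps):
        pred = sigmoid(X @ w + b)
        grad = pred - y                    # gradient of the cross-entropy loss
        w -= lr * (X.T @ grad) / len(y)
        b -= lr * grad.mean()
    return w, b                            # the "trained neural network"

# Inference algorithm: run a new input through the trained parameters.
def infer(w, b, x):
    return "dog" if sigmoid(x @ w + b) > 0.5 else "cat"

w, b = train(X, y)
print(infer(w, b, np.array([0.85, 0.2])))  # dog
```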
27. What is a GPT?
GPT stands for Generative Pre-trained Transformer. It's a type of neural
network architecture based on the Transformer.
● Generative: A GPT generates text.
● Pre-trained: A GPT is trained on lots of text from books, the internet, etc.
● Transformer: A GPT is a decoder-only transformer neural network.
28. Language Modeling (Next Word Prediction)
“Not” → “all”
“Not all” → “heroes”
“Not all heroes” → “wear”
“Not all heroes wear” → “capes”
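One way to picture this: a language model is a function from the words so far to a probability distribution over the next word. A minimal sketch with a hypothetical five-word vocabulary and hand-picked probabilities (a real GPT computes the distribution with a neural network):

```python
import numpy as np

# Hypothetical tiny vocabulary; real models use tens of thousands of tokens.
vocab = ["all", "heroes", "wear", "capes", "not"]

def lm(words):
    """Stand-in language model: returns a probability distribution over vocab.
    The numbers are hand-picked purely for illustration."""
    if words == ["not", "all", "heroes", "wear"]:
        return np.array([0.01, 0.01, 0.01, 0.95, 0.02])
    return np.ones(len(vocab)) / len(vocab)  # uniform fallback

probs = lm(["not", "all", "heroes", "wear"])
print(vocab[int(np.argmax(probs))])  # capes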
30. Self-Supervised Learning
Given the piece of text “not all heroes wear”:
Input = [“not”, “not all”, “not all heroes”, “not all heroes wear”]
Label = [“all”, “heroes”, “wear”, “capes”]
The label can be derived from the input text itself, so there is no need for human labellers.
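Deriving these pairs is just a loop over the prefixes of the text; a minimal sketch:

```python
tokens = ["not", "all", "heroes", "wear", "capes"]

# Each prefix of the text is an input; the word that follows it is the label.
inputs = [" ".join(tokens[:i]) for i in range(1, len(tokens))]
labels = tokens[1:]

for inp, lab in zip(inputs, labels):
    print(f"{inp!r} -> {lab!r}")
# 'not' -> 'all'
# 'not all' -> 'heroes'
# 'not all heroes' -> 'wear'
# 'not all heroes wear' -> 'capes'
```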
34. OpenAI's GPT-3, Google's LaMDA, and other similar models are just GPTs under
the hood. What makes them special is they happen to be:
1. Very big (billions of parameters in the neural network)
2. Trained on lots of data (hundreds of gigabytes of text)
As such, we call them Large Language Models (LLMs).
35. If GPTs are just next word predictors, how do they produce full sentences?
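The standard answer is autoregressive generation: predict one word, append it to the input, and feed the result back in. A minimal sketch with a hand-picked stand-in for a trained model (greedy argmax decoding; a real GPT conditions on the whole prefix and often samples rather than taking the argmax):

```python
import numpy as np

vocab = ["all", "heroes", "wear", "capes", "not"]

# Hand-picked stand-in for a trained model: maps the last word seen to a
# distribution over the next word. The probabilities are illustrative only.
next_word_probs = {
    "not":    [0.95, 0.01, 0.01, 0.01, 0.02],  # -> "all"
    "all":    [0.01, 0.95, 0.01, 0.01, 0.02],  # -> "heroes"
    "heroes": [0.01, 0.01, 0.95, 0.01, 0.02],  # -> "wear"
    "wear":   [0.01, 0.01, 0.01, 0.95, 0.02],  # -> "capes"
}

def generate(prompt, n_words):
    """Autoregressive loop: predict a word, append it, feed it back in."""
    words = prompt.split()
    for _ in range(n_words):
        probs = next_word_probs.get(words[-1], np.ones(len(vocab)) / len(vocab))
        words.append(vocab[int(np.argmax(probs))])  # greedy decoding
    return " ".join(words)

print(generate("not", 4))  # not all heroes wear capes
```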