GCD ChatGPT.pptx

AutoGPT is a new AI tool that can automate many of the mundane tasks that take up your time. With AutoGPT, you can focus on the creative and strategic aspects of your work, while the AI takes care of the repetitive and time-consuming tasks. In this talk, we will discuss how AutoGPT can be used to improve your productivity. We will cover a variety of topics, including:
● How to use AutoGPT to automate your tasks
● How to integrate AutoGPT into your workflow
● How to troubleshoot common problems with AutoGPT

How ChatGPT Works: A Deep Dive into the Architecture and Mechanics of OpenAI's Language Model

How ChatGPT Works: A Deep Dive
13th May, 2023, Chennai
Speaker: Karthikeyan VK
Designation: Cloud Native Architect

Agenda
● Why ChatGPT
● What is ChatGPT
● ChatGPT vs GPT-4
● Internal Architecture
● How it actually works
● Tools Available

Why ChatGPT?
● Personalized assistance
● Increased efficiency
● Enhanced language translation
● Improved customer service
● Fast response times

What is ChatGPT
● A large language model that can keep track of context

High Level Components
● Pre-processing
● Encoding
● Training
● Decoding
● Postprocessing

Preprocessing
● Tokenization
● Stop word removal
● Stemming / Lemmatization
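
The three preprocessing steps above can be sketched in a few lines of plain Python. This is a toy illustration only: the stop-word list, the suffix rules and the lemma dictionary are invented for the example, and real pipelines use much larger resources.

```python
import re

# Toy resources, invented for this example only.
STOP_WORDS = {"the", "a", "an", "is", "are", "on", "of", "and"}
LEMMAS = {"sat": "sit", "mice": "mouse", "better": "good"}
SUFFIXES = ("ing", "ed", "s")


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def remove_stop_words(tokens):
    """Drop very common words that carry little meaning on their own."""
    return [t for t in tokens if t not in STOP_WORDS]


def stem(token):
    """Naive stemming: strip the first matching suffix, keeping at least 3 letters."""
    for suffix in SUFFIXES:
        if token.endswith(suffix) and len(token) - len(suffix) >= 3:
            return token[: -len(suffix)]
    return token


def lemmatize(token):
    """Dictionary lookup; fall back to the token itself when it is unknown."""
    return LEMMAS.get(token, token)


tokens = tokenize("The cats are sitting on the mats")
content = remove_stop_words(tokens)
stems = [stem(t) for t in content]
```

Here `stems` comes out as `["cat", "sitt", "mat"]`. The crude stem "sitt" shows the difference between the two techniques: stemming only chops characters, while a lemmatizer looks the word up and returns a real base form, as `lemmatize("sat")` does with "sit".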

Encoding
● Four types of attributes
○ Nominal - zip code
○ Ordinal - good, bad
○ Interval - 78.5 °F
○ Ratio - 21 years old
● Categorical vs. numerical variables
● Conversion to a numerical format
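
A minimal sketch of converting the attribute types above into numbers. It assumes one-hot encoding for the nominal value and a rank code for the ordinal value; the slide does not name a specific scheme, and the category lists and the `record` dictionary are hypothetical.

```python
def one_hot(value, categories):
    """Nominal data has no order, so each category gets its own 0/1 slot."""
    return [1 if value == c else 0 for c in categories]


def ordinal_code(value, ordered_levels):
    """Ordinal data has an order, so the position in the ranking is the code."""
    return ordered_levels.index(value)


ZIP_CODES = ["600001", "600002", "600003"]  # hypothetical nominal categories
RATINGS = ["bad", "average", "good"]        # hypothetical ordinal scale

record = {"zipcode": "600002", "rating": "good", "temp_f": 78.5, "age": 21}

# Interval and ratio values are already numeric and are passed through.
features = (
    one_hot(record["zipcode"], ZIP_CODES)
    + [ordinal_code(record["rating"], RATINGS)]
    + [record["temp_f"], float(record["age"])]
)
```

The resulting `features` list is `[0, 1, 0, 2, 78.5, 21.0]`: three slots for the zip code, one rank for the rating, and the two numeric values unchanged.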

Training
● Transformer architecture
○ NLP
○ Feed Forward Networks
○ Transformers

Transformer Architecture - NLP
○ Tokenization - ["ChatGPT", "is", "a", "language", "model", "."]
○ Part-of-speech tagging
■ For "The cat sat on the mat", a POS tagger might label "The" as a
determiner (DT), "cat" as a noun (NN), "sat" as a past tense verb
(VBD), "on" as a preposition (IN), "the" as a determiner (DT), and
"mat" as a noun (NN).
○ Named entity recognition
■ Identifying mentions of entities such as people, locations, and
organizations in text.
○ Sentiment analysis
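
The tokenization and part-of-speech examples on this slide can be reproduced with a toy sketch. The regular expression and the hand-written `LEXICON` are assumptions made for the illustration; a real tagger is trained on annotated text and does not rely on a fixed lookup table.

```python
import re

# Hand-written lexicon covering only the example sentence (Penn Treebank tags).
LEXICON = {"the": "DT", "cat": "NN", "sat": "VBD", "on": "IN", "mat": "NN"}


def tokenize(text):
    """Split into words and standalone punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", text)


def pos_tag(tokens):
    """Look each token up in the toy lexicon; unknown words get 'UNK'."""
    return [(tok, LEXICON.get(tok.lower(), "UNK")) for tok in tokens]


tokens = tokenize("ChatGPT is a language model.")
tagged = pos_tag(tokenize("The cat sat on the mat"))
```

`tokens` matches the list shown on the slide, and `tagged` pairs each word of the second sentence with the tag given in the example.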

Transformer Architecture - Basics
● Feed Forward Networks
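
As a rough illustration of what a feed-forward network computes, the sketch below runs one forward pass through two linear layers with a ReLU in between. The layer sizes and the weights are hypothetical and hand-picked so the arithmetic is easy to follow; in a trained model they are learned.

```python
def relu(vector):
    """Replace negative entries with zero."""
    return [max(0.0, x) for x in vector]


def linear(vector, weights, bias):
    """Compute weights @ vector + bias, with weights given as a list of rows."""
    return [
        sum(w * x for w, x in zip(row, vector)) + b
        for row, b in zip(weights, bias)
    ]


def feed_forward(vector, w1, b1, w2, b2):
    """Two linear layers with a ReLU in between."""
    hidden = relu(linear(vector, w1, b1))
    return linear(hidden, w2, b2)


# Hypothetical hand-picked weights: 2 inputs -> 3 hidden units -> 2 outputs.
W1 = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]
B1 = [0.0, 0.0, 0.0]
W2 = [[1.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
B2 = [0.5, 0.0]

output = feed_forward([2.0, -1.0], W1, B1, W2, B2)
```

For the input `[2.0, -1.0]` the hidden layer is `[2.0, 0.0, 0.0]` after the ReLU, so `output` is `[2.5, 0.0]`.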

Transformer Architecture
● The self-attention mechanism in this architecture does a very good
job of learning how to apply context in a data-driven way

Transformer Architecture
● To work out which words give context to which, transformer models use neural networks
to generate two vectors for each word: a query and a key.
● When the query from one word matches the key from another word, the second word
carries relevant context for the first word. To pass that context from the second word
to the first, a third vector called the value is generated for the second word and
combined with the first word, giving a more contextualized meaning of the first word.
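
The query, key and value matching described above is usually written as scaled dot-product attention. The rows of $Q$, $K$ and $V$ are the query, key and value vectors of the words, and $d_k$ is the length of a key vector. The division by $\sqrt{d_k}$ comes from the standard formulation and is not stated on the slide.

```latex
\mathrm{Attention}(Q, K, V) = \mathrm{softmax}\!\left(\frac{Q K^{\top}}{\sqrt{d_k}}\right) V
```

Each row of the softmax output is a set of weights that sums to 1, and those weights decide how much of every value vector flows into the word's new representation.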

Main Takeaways
● ChatGPT is an LLM
● ChatGPT is a form of probabilistic text generator
● Its strength is holding on to context
● Transformer architecture - query, key and value
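
The "probabilistic text generator" takeaway can be made concrete with a small sketch: scores for candidate next words are turned into probabilities with a softmax, and one word is drawn at random according to those probabilities. The vocabulary, the scores and the `temperature` parameter are hypothetical values chosen for the example, not output from ChatGPT.

```python
import math
import random


def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities that sum to 1."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def sample_next_token(vocab, logits, rng, temperature=1.0):
    """Draw one token at random, weighted by its probability."""
    probs = softmax(logits, temperature)
    return rng.choices(vocab, weights=probs, k=1)[0]


# Hypothetical scores a model might assign after the prompt "The cat sat on the".
VOCAB = ["mat", "sofa", "roof", "moon"]
LOGITS = [3.0, 2.0, 1.0, -1.0]

probs = softmax(LOGITS)
rng = random.Random(0)  # fixed seed so repeated runs draw the same token
next_token = sample_next_token(VOCAB, LOGITS, rng)
```

"mat" has the highest probability, but any of the four words can be drawn. That randomness is why the same prompt can produce different continuations.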

Editor's Notes

Proximal policy optimization
The goal of both stemming and lemmatization is to reduce inflectional forms, and sometimes derivationally related forms, of a word to a common base form.
For example, in the sentence "The cat sat on the mat", a POS tagger might label "The" as a determiner (DT), "cat" as a noun (NN), "sat" as a past tense verb (VBD), "on" as a preposition (IN), "the" as a determiner (DT), and "mat" as a noun (NN).
● Compute query, key, and value vectors: For each word in the input sequence, the model generates three vectors: a query vector, a key vector, and a value vector. These vectors are computed by multiplying the word's embedding (a vector representation of the word) by three weight matrices that the model learns during training.
● Calculate attention scores: The model calculates an "attention score" for each word in the sequence relative to every other word. This is done by taking the dot product of the query vector of the word we're focusing on with the key vector of each other word, and then applying a softmax function across those scores. This gives a probability distribution that sums to 1, with higher values indicating words that should receive more attention.
● Compute weighted sum of values: Each value vector is multiplied by the corresponding softmax score (this gives higher weight to the words that should get more attention), and the results are summed to produce the output vector for the word we're focusing on.
● Generate output: The output vector is then fed through the rest of the model (which might include additional self-attention layers, feed-forward layers, etc.).
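
The steps in this note can be sketched in plain Python. This is a minimal single-head illustration under stated assumptions: the embeddings are hypothetical 2-dimensional vectors, the three weight matrices are set to the identity instead of being learned, and the scores are divided by the square root of the key length, which is the standard scaling and is not mentioned in the note.

```python
import math


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Turn scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(embeddings, w_q, w_k, w_v):
    """Return (outputs, attention_weights) for a sequence of embeddings."""
    # Step 1: query, key and value vectors for every word.
    queries = [matvec(w_q, e) for e in embeddings]
    keys = [matvec(w_k, e) for e in embeddings]
    values = [matvec(w_v, e) for e in embeddings]
    scale = math.sqrt(len(keys[0]))
    dim = len(values[0])

    outputs, weights = [], []
    for q in queries:
        # Step 2: score this word's query against every key, then softmax.
        scores = [sum(a * b for a, b in zip(q, k)) / scale for k in keys]
        row = softmax(scores)
        # Step 3: weighted sum of the value vectors.
        out = [sum(w * v[i] for w, v in zip(row, values)) for i in range(dim)]
        outputs.append(out)
        weights.append(row)
    return outputs, weights


# Hypothetical embeddings for three words; identity matrices stand in for
# the learned query, key and value projections.
IDENTITY = [[1.0, 0.0], [0.0, 1.0]]
EMBEDDINGS = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

outputs, weights = self_attention(EMBEDDINGS, IDENTITY, IDENTITY, IDENTITY)
```

Each row of `weights` is the attention distribution for one word. The third word, whose embedding overlaps with both of the others, puts the largest weight on itself and equal weight on the first two.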