SlideShare a Scribd company logo
1 of 48
Download to read offline
GENERATIVE AI CON
AMAZON BEDROCK
Guido Nebiolo,AWS Ambassador @ Reply
23 November2023
(WITH REAL EXAMPLES INSIDE 😉)
2
• 20 years developer
• 10 years paid developer
• 8 years paid developer on AWS (mainly)
• 3 years paid to teach developing on AWS
(and other topics)
WHOAMI
aws sts get-caller-identity
3
THE AI REVOLUTION
4
WHAT IS GENERATIVE AI?
AI
ML
DL
Gen AI
Generative AI generates new
content for a variety of tasks
leveraging pretrained foundation
models that can be customized
with small fractions of data.
5
HOW LLMS WORKS?
PROBABILITY
I am truly excited
6
HOW LLMS WORKS?
PROBABILITY
I am truly excited to (61.91%)
about (21.72%)
for (4.87%)
and (2.93%)
that (1.98%)
7
HOW LLMS WORKS?
PROBABILITY
I am truly excited to be (25.43%)
announce (9.46%)
share (8.18%)
have (7.49%)
see (5.45%)
8
HOW LLMS WORKS?
PROBABILITY
I am truly excited to be a (16.57%)
joining (10.49%)
part (9.70%)
able (7.72%)
working (5.76%)
9
HOW LLMS WORKS?
PROBABILITY
I am truly excited to be a part (82.15%)
member (4.32%)
new (1.03%)
guest (0.88%)
partner (0.34%)
10
HOW LLMS WORKS?
PROBABILITY
I am truly excited to be a part of (99.66%)
nn (0.05%)
the (0.04%)
this (0.02%)
o (0.02%)
11
HOW LLMS WORKS?
PROBABILITY
I am truly excited to be a part of the (46.85%)
this (20.92%)
such (4.73%)
a (4.05%)
an (1.52%)
12
HOW LLMS WORKS?
PROBABILITY
I am truly excited to be a part of the team (5.32%)
(1.81%)
amazing (0.83%)
new (0.69%)
community (0.61%)
13
HOW LLMS WORKS?
PROBABILITY
I am truly excited to be a part of the team
14
FMS ON AWS
Out-of-the-Box
Managed
Model-as-a-Service
Managed ML Dev
Tooling
Proprietary Models
Provides a ready-to-use
solution with predefined
configurations, requiring
minimal setup and
customization
Build GenAI applications
on fully managed models
with choice of FMs
Tune or use publicly
available or open-source
models as is on managed
model
Build custom models from
scratch
15
KEY FEATURES OF BEDROCK
Accelerate development of
generative AI applications
using FMs through an API,
without managing
infrastructure
Choose FMs from Amazon,
AI21 Labs, Anthropic, Cohere,
and Stability AI to find the right
FM for your use case
Privately customize FMs using
your organization’s data
Multilingual LLMs for text
generation in Spanish,
French, German,
Portuguese, Italian, and
Dutch
LLM f or thoughtf ul
dialogue, content
creation, complex
reasoning, creativ ity ,
and coding, based on
Constitutional AI and
harmlessness training
Powerf ul and v ersatile
language models that
can be used f or a wide
range of natural
language processing
tasks. Optimized f or
dialogue use case
Generation of unique,
realistic, high-quality
images, art, logos, and
designs
Text summarization,
generation,
classif ication, open-
ended Q&A, inf ormation
extraction, embeddings
and search
JURASSIC CLAUDE LLAMA SDXL TITAN
Text generation model
f or business applications
and embeddings model
f or search, clustering, or
classif ication in 100+
languages
COMMAND
19
EMERGING GENERATIVE AI MODEL PATTERNS
Coherence
|
context
learning
Complexity | Time to market
In-context learning
using foundational
models
Model fine-tuning
using foundational
models
Training your own
model
#1: Contextual prompt engineering
#2: Retrieval augmented generation (RAG)
#3: Model fine-tuning
#4: Training models
PROMPT ENGINEERING
21
UNDERSTANDING PROMPT ENGINEERING
Summarize the following technical sentence:
Tags: generative ai, security, blogpost
Sentence: Security has been a hot topic since the
birth of Generative AI🔥. From the beginning, AWS
states that security is a shared responsibility
between us and them...
Summary:
22
UNDERSTANDING PROMPT ENGINEERING
INSTRUCTION
INPUT DATA
OUTPUT INDICATOR
CONTEXT
Summarize the following
technical sentence
23
UNDERSTANDING PROMPT ENGINEERING
INSTRUCTION
INPUT DATA
OUTPUT INDICATOR
CONTEXT
Summarize the following
technical sentence
Summary:
24
UNDERSTANDING PROMPT ENGINEERING
INSTRUCTION
INPUT DATA
OUTPUT INDICATOR
CONTEXT
Summarize the following
technical sentence
Sentence: Security has been a
hot topic since the birth of
Generative AI🔥. From the
beginning, AWS states that…
Summary:
25
UNDERSTANDING PROMPT ENGINEERING
INSTRUCTION
INPUT DATA
OUTPUT INDICATOR
CONTEXT
Summarize the following
technical sentence
Sentence: Security has been a
hot topic since the birth of
Generative AI🔥. From the
beginning, AWS states that…
Summary:
Tags: generative ai, security,
blogpost
26
UNDERSTANDING PROMPT ENGINEERING
Summarize the following technical sentence:
Tags: generative ai, security, blogpost
Sentence: Security has been a hot topic since the
birth of Generative AI🔥. From the beginning, AWS
states that security is a shared responsibility
between us and them...
Summary:
INSTRUCTION
INPUT DATA
OUTPUT INDICATOR
CONTEXT
Summarize the following
technical sentence
Sentence: Security has been a
hot topic since the birth of
Generative AI🔥. From the
beginning, AWS states that…
Summary:
Tags: generative ai, security,
blogpost
27
UNDERSTANDING PROMPT ENGINEERING
PLEASE
28
UNDERSTANDING PROMPT ENGINEERING
Please summarize the following technical sentence:
Tags: generative ai, security, blogpost
Sentence: Security has been a hot topic since the
birth of Generative AI🔥. From the beginning, AWS
states that security is a shared responsibility
between us and them...
Summary:
29
INFERENCE
PARAMETERS
30 Prompt: Captures the beauty of a tropical beach on a hot, sunny day.Include palm trees, crystal-clear waters.
INFERENCE
PARAMETERS
Higher the value means more
randomness.
TEMPERATURE
31 Prompt: Serene winter wonderland,showcasing a snow-covered forest with glistening trees, a frozen lake, and the peaceful,
cold atmosphere
INFERENCE
PARAMETERS
Higher the value means it will
only looks at a subset of tokens
whose probability adds up to a
certain threshold (Top P).
TOP P
32 Prompt: Cozy mountain cabin surrounded by a snowy, alpine landscape, with smoke rising from the chimney and a sky full of
stars on a freezing night.
INFERENCE
PARAMETERS
Similar to Top P, but, instead of
working in percentage, it
specifies an absolute number of
tokens.
TOP K
33
PROMPT ENGINEERING
TECHNIQUES
ONE-SHOT LEARNING FEW-SHOT LEARNING
34
ZERO-SHOT LEARNING
PROMPT ENGINEERING TECHNIQUES
35
ZERO SHOT DEMO
PromptEngineering
we didn't provide the
model with any
examples of text
alongside their
classifications, the
LLM already
understands
"sentiment"
36
FEW SHOT DEMO
PromptEngineering
Few-shot prompting
can be used as a
technique to enable
in-context learning
where we provide
demonstrations in
the prompt to steer
the model to better
performance.
37
FEW SHOT DEMO
PromptEngineering
few-shot prompting
is not enough to get
reliable responses
for this type of
reasoning problem.
enables complexreasoning capabilities through
intermediate reasoning steps.
CHAIN-OF-THOUGHT
generate knowledge to be used as part of the
prompt.
GENERATED KNOWLEDGE
38
MORE PROMPT ENGINEERING
TECHNIQUES
… AND MANY OTHERS
RETRIEVAL AUGMENTED
GENERATION
FMs knowledge is freezed at the time of of model
training.
POINT IN TIME
Generation of text that is not grounded in
accurate or real-world information.
HALLUCINATION
40
WHY RAG?
Retrieval Augmented Generation
41
UNDERSTANDING RAG
RAG’s internal knowledge
can be easily altered or
even supplemented on the
fly, controlling what RAG
knows and doesn’t know.
Retrieval Augmented
Generation (RAG) is a machine
learning approach that combines
elements of both retrieval-based
models and generative models to
improve the performance of
natural language understanding
and generation tasks.
Retrieval Augmented Generation
42
HIGH LEVEL DESIGN
RAG Architecture
High level:
1. Document
ingestion
2. Document
retrieval
3. Prompt
augmentation
Data
Sources
Embedder
Vector
Storage
Prompt Embedder
Retrieval
Engine
Prompt +
Docs
LLM
Output
43
UNDERSTANDING RAG
Retrieval Augmented Generation
44
RAG DEMO
Retrieval Augmented Generation
Prompt
Output
45
RAG DEMO
Retrieval Augmented Generation
Prompt + Context
Output
FINE TUNING
47
INTRODUCTION TO FINE TUNING
FOUNDATION
MODEL
TRAINING
DATA
CUSTOM
MODEL
48
FINE TUNING WITH BEDROCK
FOUNDATION
MODEL
TRAINING
DATA
CUSTOM
MODEL
49
TAKE-AWAYS
• To get better results, give as many details as possible to LLMs.
• Use RAG to cut training costs and decrease TTM when delivering POC
or MVP.
• Consider fine-tuning LLMs instead of giving them too many examples to
learn from.
(How many shots can an LLM handle?)
• Go on and build something, best learning path is hands-one experience.
Be part of the revolution!
Q&A
THANK YOU!

More Related Content

What's hot

Global Azure Bootcamp Pune 2023 - Lead the AI era with Microsoft Azure.pdf
Global Azure Bootcamp Pune 2023 -  Lead the AI era with Microsoft Azure.pdfGlobal Azure Bootcamp Pune 2023 -  Lead the AI era with Microsoft Azure.pdf
Global Azure Bootcamp Pune 2023 - Lead the AI era with Microsoft Azure.pdfAroh Shukla
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scaleMaxim Salnikov
 
Generative AI Use-cases for Enterprise - First Session
Generative AI Use-cases for Enterprise - First SessionGenerative AI Use-cases for Enterprise - First Session
Generative AI Use-cases for Enterprise - First SessionGene Leybzon
 
Best Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI ServiceBest Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI ServiceKumton Suttiraksiri
 
presentation.pdf
presentation.pdfpresentation.pdf
presentation.pdfcaa28steve
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfPremNaraindas1
 
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬VINCI Digital - Industrial IoT (IIoT) Strategic Advisory
 
Github Copilot vs Amazon CodeWhisperer for Java developers at JCON 2023
Github Copilot vs Amazon CodeWhisperer for Java developers at JCON 2023Github Copilot vs Amazon CodeWhisperer for Java developers at JCON 2023
Github Copilot vs Amazon CodeWhisperer for Java developers at JCON 2023Vadym Kazulkin
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of usMassimo Ferre'
 
LLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team StructureLLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team StructureAggregage
 
Using Generative AI
Using Generative AIUsing Generative AI
Using Generative AIMark DeLoura
 
AI Transformation
AI TransformationAI Transformation
AI TransformationLiming Zhu
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsAdventureWorld5
 
Generative AI - The New Reality: How Key Players Are Progressing
Generative AI - The New Reality: How Key Players Are Progressing Generative AI - The New Reality: How Key Players Are Progressing
Generative AI - The New Reality: How Key Players Are Progressing Vishal Sharma
 
Let's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchersLet's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchersSteven Van Vaerenbergh
 
[Machine Learning 15minutes! #61] Azure OpenAI Service
[Machine Learning 15minutes! #61] Azure OpenAI Service[Machine Learning 15minutes! #61] Azure OpenAI Service
[Machine Learning 15minutes! #61] Azure OpenAI ServiceNaoki (Neo) SATO
 
A Framework for Navigating Generative Artificial Intelligence for Enterprise
A Framework for Navigating Generative Artificial Intelligence for EnterpriseA Framework for Navigating Generative Artificial Intelligence for Enterprise
A Framework for Navigating Generative Artificial Intelligence for EnterpriseRocketSource
 

What's hot (20)

Global Azure Bootcamp Pune 2023 - Lead the AI era with Microsoft Azure.pdf
Global Azure Bootcamp Pune 2023 -  Lead the AI era with Microsoft Azure.pdfGlobal Azure Bootcamp Pune 2023 -  Lead the AI era with Microsoft Azure.pdf
Global Azure Bootcamp Pune 2023 - Lead the AI era with Microsoft Azure.pdf
 
Using the power of Generative AI at scale
Using the power of Generative AI at scaleUsing the power of Generative AI at scale
Using the power of Generative AI at scale
 
Generative AI Use-cases for Enterprise - First Session
Generative AI Use-cases for Enterprise - First SessionGenerative AI Use-cases for Enterprise - First Session
Generative AI Use-cases for Enterprise - First Session
 
OpenAI-Copilot-ChatGPT.pptx
OpenAI-Copilot-ChatGPT.pptxOpenAI-Copilot-ChatGPT.pptx
OpenAI-Copilot-ChatGPT.pptx
 
Best Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI ServiceBest Practice on using Azure OpenAI Service
Best Practice on using Azure OpenAI Service
 
presentation.pdf
presentation.pdfpresentation.pdf
presentation.pdf
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
 
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
𝐆𝐞𝐧𝐞𝐫𝐚𝐭𝐢𝐯𝐞 𝐀𝐈: 𝐂𝐡𝐚𝐧𝐠𝐢𝐧𝐠 𝐇𝐨𝐰 𝐁𝐮𝐬𝐢𝐧𝐞𝐬𝐬 𝐈𝐧𝐧𝐨𝐯𝐚𝐭𝐞𝐬 𝐚𝐧𝐝 𝐎𝐩𝐞𝐫𝐚𝐭𝐞𝐬
 
Github Copilot vs Amazon CodeWhisperer for Java developers at JCON 2023
Github Copilot vs Amazon CodeWhisperer for Java developers at JCON 2023Github Copilot vs Amazon CodeWhisperer for Java developers at JCON 2023
Github Copilot vs Amazon CodeWhisperer for Java developers at JCON 2023
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of us
 
The-CxO-Guide-to.pdf
The-CxO-Guide-to.pdfThe-CxO-Guide-to.pdf
The-CxO-Guide-to.pdf
 
LLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team StructureLLMs in Production: Tooling, Process, and Team Structure
LLMs in Production: Tooling, Process, and Team Structure
 
Using Generative AI
Using Generative AIUsing Generative AI
Using Generative AI
 
ChatGPT, Generative AI and Microsoft Copilot: Step Into the Future - Geoff Ab...
ChatGPT, Generative AI and Microsoft Copilot: Step Into the Future - Geoff Ab...ChatGPT, Generative AI and Microsoft Copilot: Step Into the Future - Geoff Ab...
ChatGPT, Generative AI and Microsoft Copilot: Step Into the Future - Geoff Ab...
 
AI Transformation
AI TransformationAI Transformation
AI Transformation
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language models
 
Generative AI - The New Reality: How Key Players Are Progressing
Generative AI - The New Reality: How Key Players Are Progressing Generative AI - The New Reality: How Key Players Are Progressing
Generative AI - The New Reality: How Key Players Are Progressing
 
Let's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchersLet's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchers
 
[Machine Learning 15minutes! #61] Azure OpenAI Service
[Machine Learning 15minutes! #61] Azure OpenAI Service[Machine Learning 15minutes! #61] Azure OpenAI Service
[Machine Learning 15minutes! #61] Azure OpenAI Service
 
A Framework for Navigating Generative Artificial Intelligence for Enterprise
A Framework for Navigating Generative Artificial Intelligence for EnterpriseA Framework for Navigating Generative Artificial Intelligence for Enterprise
A Framework for Navigating Generative Artificial Intelligence for Enterprise
 

Similar to Generative AI con Amazon Bedrock.pdf

re:cap Generative AI journey with Bedrock
re:cap Generative AI journey  with Bedrockre:cap Generative AI journey  with Bedrock
re:cap Generative AI journey with BedrockPhilipBasford
 
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon Web Services
 
Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)Julien SIMON
 
Bagels & Bytes: Data Scientist Event
Bagels & Bytes: Data Scientist EventBagels & Bytes: Data Scientist Event
Bagels & Bytes: Data Scientist EventAmazon Web Services
 
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfSviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfAmazon Web Services
 
Defend against adversarial AI using Adversarial Robustness Toolbox
Defend against adversarial AI using Adversarial Robustness Toolbox Defend against adversarial AI using Adversarial Robustness Toolbox
Defend against adversarial AI using Adversarial Robustness Toolbox Animesh Singh
 
雲端推動的人工智能革命
雲端推動的人工智能革命雲端推動的人工智能革命
雲端推動的人工智能革命Amazon Web Services
 
Build, train and deploy ML models at scale.pdf
Build, train and deploy ML models at scale.pdfBuild, train and deploy ML models at scale.pdf
Build, train and deploy ML models at scale.pdfAmazon Web Services
 
Build, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdfBuild, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdfAmazon Web Services
 
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scalaSviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scalaAmazon Web Services
 
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...e-dialog GmbH
 
Google Cloud: Data Analysis and Machine Learningn Technologies
Google Cloud: Data Analysis and Machine Learningn Technologies Google Cloud: Data Analysis and Machine Learningn Technologies
Google Cloud: Data Analysis and Machine Learningn Technologies Andrés Leonardo Martinez Ortiz
 
Big Data LDN 2018: SHAPING AN AI-DRIVEN FUTURE WITH AUGMENTED INTELLIGENCE FO...
Big Data LDN 2018: SHAPING AN AI-DRIVEN FUTURE WITH AUGMENTED INTELLIGENCE FO...Big Data LDN 2018: SHAPING AN AI-DRIVEN FUTURE WITH AUGMENTED INTELLIGENCE FO...
Big Data LDN 2018: SHAPING AN AI-DRIVEN FUTURE WITH AUGMENTED INTELLIGENCE FO...Matt Stubbs
 
Machine Learning in azione con Amazon SageMaker
Machine Learning in azione con Amazon SageMakerMachine Learning in azione con Amazon SageMaker
Machine Learning in azione con Amazon SageMakerAmazon Web Services
 
Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)Julien SIMON
 
Build in 2019 建立分佈式、開放式、數據中心的人工智慧數據驅動平台
Build in 2019 建立分佈式、開放式、數據中心的人工智慧數據驅動平台Build in 2019 建立分佈式、開放式、數據中心的人工智慧數據驅動平台
Build in 2019 建立分佈式、開放式、數據中心的人工智慧數據驅動平台Amazon Web Services
 
Simplify Machine Learning with the Deep Learning AMI | AWS Floor28
Simplify Machine Learning with the Deep Learning AMI | AWS Floor28Simplify Machine Learning with the Deep Learning AMI | AWS Floor28
Simplify Machine Learning with the Deep Learning AMI | AWS Floor28Amazon Web Services
 
From Notebook to production with Amazon SageMaker
From Notebook to production with Amazon SageMakerFrom Notebook to production with Amazon SageMaker
From Notebook to production with Amazon SageMakerAmazon Web Services
 

Similar to Generative AI con Amazon Bedrock.pdf (20)

re:cap Generative AI journey with Bedrock
re:cap Generative AI journey  with Bedrockre:cap Generative AI journey  with Bedrock
re:cap Generative AI journey with Bedrock
 
AWS_Meetup_BLR_July_22_Social.pdf
AWS_Meetup_BLR_July_22_Social.pdfAWS_Meetup_BLR_July_22_Social.pdf
AWS_Meetup_BLR_July_22_Social.pdf
 
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
 
Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)
 
Bagels & Bytes: Data Scientist Event
Bagels & Bytes: Data Scientist EventBagels & Bytes: Data Scientist Event
Bagels & Bytes: Data Scientist Event
 
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfSviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
 
Defend against adversarial AI using Adversarial Robustness Toolbox
Defend against adversarial AI using Adversarial Robustness Toolbox Defend against adversarial AI using Adversarial Robustness Toolbox
Defend against adversarial AI using Adversarial Robustness Toolbox
 
TechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptxTechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptx
 
雲端推動的人工智能革命
雲端推動的人工智能革命雲端推動的人工智能革命
雲端推動的人工智能革命
 
Build, train and deploy ML models at scale.pdf
Build, train and deploy ML models at scale.pdfBuild, train and deploy ML models at scale.pdf
Build, train and deploy ML models at scale.pdf
 
Build, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdfBuild, train, and deploy ML models at scale.pdf
Build, train, and deploy ML models at scale.pdf
 
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scalaSviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
 
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
Google Analytics Konferenz 2018_Machine Learning / AI mit Google_Lukman Ramse...
 
Google Cloud: Data Analysis and Machine Learningn Technologies
Google Cloud: Data Analysis and Machine Learningn Technologies Google Cloud: Data Analysis and Machine Learningn Technologies
Google Cloud: Data Analysis and Machine Learningn Technologies
 
Big Data LDN 2018: SHAPING AN AI-DRIVEN FUTURE WITH AUGMENTED INTELLIGENCE FO...
Big Data LDN 2018: SHAPING AN AI-DRIVEN FUTURE WITH AUGMENTED INTELLIGENCE FO...Big Data LDN 2018: SHAPING AN AI-DRIVEN FUTURE WITH AUGMENTED INTELLIGENCE FO...
Big Data LDN 2018: SHAPING AN AI-DRIVEN FUTURE WITH AUGMENTED INTELLIGENCE FO...
 
Machine Learning in azione con Amazon SageMaker
Machine Learning in azione con Amazon SageMakerMachine Learning in azione con Amazon SageMaker
Machine Learning in azione con Amazon SageMaker
 
Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)
 
Build in 2019 建立分佈式、開放式、數據中心的人工智慧數據驅動平台
Build in 2019 建立分佈式、開放式、數據中心的人工智慧數據驅動平台Build in 2019 建立分佈式、開放式、數據中心的人工智慧數據驅動平台
Build in 2019 建立分佈式、開放式、數據中心的人工智慧數據驅動平台
 
Simplify Machine Learning with the Deep Learning AMI | AWS Floor28
Simplify Machine Learning with the Deep Learning AMI | AWS Floor28Simplify Machine Learning with the Deep Learning AMI | AWS Floor28
Simplify Machine Learning with the Deep Learning AMI | AWS Floor28
 
From Notebook to production with Amazon SageMaker
From Notebook to production with Amazon SageMakerFrom Notebook to production with Amazon SageMaker
From Notebook to production with Amazon SageMaker
 

Recently uploaded

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfjimielynbastida
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024BookNet Canada
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 

Recently uploaded (20)

"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort ServiceHot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
Hot Sexy call girls in Panjabi Bagh 🔝 9953056974 🔝 Delhi escort Service
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Science&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdfScience&tech:THE INFORMATION AGE STS.pdf
Science&tech:THE INFORMATION AGE STS.pdf
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Unblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen FramesUnblocking The Main Thread Solving ANRs and Frozen Frames
Unblocking The Main Thread Solving ANRs and Frozen Frames
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
New from BookNet Canada for 2024: BNC BiblioShare - Tech Forum 2024
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 

Generative AI con Amazon Bedrock.pdf

  • 1. GENERATIVE AI CON AMAZON BEDROCK Guido Nebiolo,AWS Ambassador @ Reply 23 November2023 (WITH REAL EXAMPLES INSIDE 😉)
  • 2. 2 • 20 years developer • 10 years paid developer • 8 years paid developer on AWS (mainly) • 3 years paid to teach developing on AWS (and other topics) WHOAMI aws sts get-caller-identity
  • 4. 4 WHAT IS GENERATIVE AI? AI ML DL Gen AI Generative AI generates new content for a variety of tasks leveraging pretrained foundation models that can be customized with small fractions of data.
  • 6. 6 HOW LLMS WORKS? PROBABILITY I am truly excited to (61.91%) about (21.72%) for (4.87%) and (2.93%) that (1.98%)
  • 7. 7 HOW LLMS WORKS? PROBABILITY I am truly excited to be (25.43%) announce (9.46%) share (8.18%) have (7.49%) see (5.45%)
  • 8. 8 HOW LLMS WORKS? PROBABILITY I am truly excited to be a (16.57%) joining (10.49%) part (9.70%) able (7.72%) working (5.76%)
  • 9. 9 HOW LLMS WORKS? PROBABILITY I am truly excited to be a part (82.15%) member (4.32%) new (1.03%) guest (0.88%) partner (0.34%)
  • 10. 10 HOW LLMS WORKS? PROBABILITY I am truly excited to be a part of (99.66%) nn (0.05%) the (0.04%) this (0.02%) o (0.02%)
  • 11. 11 HOW LLMS WORKS? PROBABILITY I am truly excited to be a part of the (46.85%) this (20.92%) such (4.73%) a (4.05%) an (1.52%)
  • 12. 12 HOW LLMS WORKS? PROBABILITY I am truly excited to be a part of the team (5.32%) (1.81%) amazing (0.83%) new (0.69%) community (0.61%)
  • 13. 13 HOW LLMS WORKS? PROBABILITY I am truly excited to be a part of the team
  • 14. 14 FMS ON AWS Out-of-the-Box Managed Model-as-a-Service Managed ML Dev Tooling Proprietary Models Provides a ready-to-use solution with predefined configurations, requiring minimal setup and customization Build GenAI applications on fully managed models with choice of FMs Tune or use publicly available or open-source models as is on managed model Build custom models from scratch
  • 15. 15 KEY FEATURES OF BEDROCK Accelerate development of generative AI applications using FMs through an API, without managing infrastructure Choose FMs from Amazon, AI21 Labs, Anthropic, Cohere, and Stability AI to find the right FM for your use case Privately customize FMs using your organization’s data Multilingual LLMs for text generation in Spanish, French, German, Portuguese, Italian, and Dutch LLM f or thoughtf ul dialogue, content creation, complex reasoning, creativ ity , and coding, based on Constitutional AI and harmlessness training Powerf ul and v ersatile language models that can be used f or a wide range of natural language processing tasks. Optimized f or dialogue use case Generation of unique, realistic, high-quality images, art, logos, and designs Text summarization, generation, classif ication, open- ended Q&A, inf ormation extraction, embeddings and search JURASSIC CLAUDE LLAMA SDXL TITAN Text generation model f or business applications and embeddings model f or search, clustering, or classif ication in 100+ languages COMMAND
  • 16. 19 EMERGING GENERATIVE AI MODEL PATTERNS Coherence | context learning Complexity | Time to market In-context learning using foundational models Model fine-tuning using foundational models Training your own model #1: Contextual prompt engineering #2: Retrieval augmented generation (RAG) #3: Model fine-tuning #4: Training models
  • 18. 21 UNDERSTANDING PROMPT ENGINEERING Summarize the following technical sentence: Tags: generative ai, security, blogpost Sentence: Security has been a hot topic since the birth of Generative AI🔥. From the beginning, AWS states that security is a shared responsibility between us and them... Summary:
  • 19. 22 UNDERSTANDING PROMPT ENGINEERING INSTRUCTION INPUT DATA OUTPUT INDICATOR CONTEXT Summarize the following technical sentence
  • 20. 23 UNDERSTANDING PROMPT ENGINEERING INSTRUCTION INPUT DATA OUTPUT INDICATOR CONTEXT Summarize the following technical sentence Summary:
  • 21. 24 UNDERSTANDING PROMPT ENGINEERING INSTRUCTION INPUT DATA OUTPUT INDICATOR CONTEXT Summarize the following technical sentence Sentence: Security has been a hot topic since the birth of Generative AI🔥. From the beginning, AWS states that… Summary:
  • 22. 25 UNDERSTANDING PROMPT ENGINEERING INSTRUCTION INPUT DATA OUTPUT INDICATOR CONTEXT Summarize the following technical sentence Sentence: Security has been a hot topic since the birth of Generative AI🔥. From the beginning, AWS states that… Summary: Tags: generative ai, security, blogpost
  • 23. 26 UNDERSTANDING PROMPT ENGINEERING Summarize the following technical sentence: Tags: generative ai, security, blogpost Sentence: Security has been a hot topic since the birth of Generative AI🔥. From the beginning, AWS states that security is a shared responsibility between us and them... Summary:
  • 24. INSTRUCTION INPUT DATA OUTPUT INDICATOR CONTEXT Summarize the following technical sentence Sentence: Security has been a hot topic since the birth of Generative AI🔥. From the beginning, AWS states that… Summary: Tags: generative ai, security, blogpost 27 UNDERSTANDING PROMPT ENGINEERING PLEASE
  • 25. 28 UNDERSTANDING PROMPT ENGINEERING Please summarize the following technical sentence: Tags: generative ai, security, blogpost Sentence: Security has been a hot topic since the birth of Generative AI🔥. From the beginning, AWS states that security is a shared responsibility between us and them... Summary:
  • 27. 30 Prompt: Captures the beauty of a tropical beach on a hot, sunny day.Include palm trees, crystal-clear waters. INFERENCE PARAMETERS Higher the value means more randomness. TEMPERATURE
  • 28. 31 Prompt: Serene winter wonderland,showcasing a snow-covered forest with glistening trees, a frozen lake, and the peaceful, cold atmosphere INFERENCE PARAMETERS Higher the value means it will only looks at a subset of tokens whose probability adds up to a certain threshold (Top P). TOP P
  • 29. 32 Prompt: Cozy mountain cabin surrounded by a snowy, alpine landscape, with smoke rising from the chimney and a sky full of stars on a freezing night. INFERENCE PARAMETERS Similar to Top P, but, instead of working in percentage, it specifies an absolute number of tokens. TOP K
  • 31. ONE-SHOT LEARNING FEW-SHOT LEARNING 34 ZERO-SHOT LEARNING PROMPT ENGINEERING TECHNIQUES
  • 32. 35 ZERO SHOT DEMO PromptEngineering we didn't provide the model with any examples of text alongside their classifications, the LLM already understands "sentiment"
  • 33. 36 FEW SHOT DEMO PromptEngineering Few-shot prompting can be used as a technique to enable in-context learning where we provide demonstrations in the prompt to steer the model to better performance.
  • 34. 37 FEW SHOT DEMO PromptEngineering few-shot prompting is not enough to get reliable responses for this type of reasoning problem.
  • 35. enables complexreasoning capabilities through intermediate reasoning steps. CHAIN-OF-THOUGHT generate knowledge to be used as part of the prompt. GENERATED KNOWLEDGE 38 MORE PROMPT ENGINEERING TECHNIQUES … AND MANY OTHERS
  • 37. FMs knowledge is freezed at the time of of model training. POINT IN TIME Generation of text that is not grounded in accurate or real-world information. HALLUCINATION 40 WHY RAG? Retrieval Augmented Generation
  • 38. 41 UNDERSTANDING RAG RAG’s internal knowledge can be easily altered or even supplemented on the fly, controlling what RAG knows and doesn’t know. Retrieval Augmented Generation (RAG) is a machine learning approach that combines elements of both retrieval-based models and generative models to improve the performance of natural language understanding and generation tasks. Retrieval Augmented Generation
  • 39. 42 HIGH LEVEL DESIGN RAG Architecture High level: 1. Document ingestion 2. Document retrieval 3. Prompt augmentation Data Sources Embedder Vector Storage Prompt Embedder Retrieval Engine Prompt + Docs LLM Output
  • 41. 44 RAG DEMO Retrieval Augmented Generation Prompt Output
  • 42. 45 RAG DEMO Retrieval Augmented Generation Prompt + Context Output
  • 44. 47 INTRODUCTION TO FINE TUNING FOUNDATION MODEL TRAINING DATA CUSTOM MODEL
  • 45. 48 FINE TUNING WITH BEDROCK FOUNDATION MODEL TRAINING DATA CUSTOM MODEL
  • 46. 49 TAKE-AWAYS • To get better results, give as many details as possible to LLMs. • Use RAG to cut training costs and decrease TTM when delivering POC or MVP. • Consider fine-tuning LLMs instead of giving them too many examples to learn from. (How many shots can an LLM handle?) • Go on and build something, best learning path is hands-one experience. Be part of the revolution!
  • 47. Q&A