SlideShare a Scribd company logo
1 of 77
LLMs in Production:
Tooling, Process, and Team Structure
December 6th, 2023
10:30am PST
Have a question or comment for our
panelists?
Use this QR code to engage with our
speakers, or visit the link in the chat!
Having an audio issue?
Try dialing in by phone!
Dial: +1 312 626 6799
Webinar ID: 819 1469 6007
Passcode: 385318
Closed Captioning is available
for this webinar!
Our Panelists
Tony Karrer
Founder & CEO TechEmpower,
Founder & CTO Aggregage
Greg Loughnane
Founder & CEO of
AI Makerspace
Chris Alexiuk
Co-Founder & CTO at
AI Makerspace
🎯ALIGNING OUR AIM
BY THE END OF TODAY...
Understand processes for building and
improving production LLM applications
Overview of industry-standard tooling
How to leverage LangSmith
OVERVIEW
LLM Ops, LLM OS, and “The New Stack”
Leading Tooling
Meet LangSmith
Conclusions, Q&A
LLM OPS
What is LLM Ops?
BUILDING LLM APPLICATIONS
ENTERPRISE BUILDS
Synthetic data, closed-source models
1.
Baseline performance
a.
Open-source models
2.
Add your private data
a.
Iterate
3.
Optimize models, data, metrics,
inference, efficiency
a.
NEXT-LEVEL
❓Some additional questions:
On premise hardware?
What scale and speed?
Training proprietary LLM?
Small Language Models (SLMs)
Efficiency
Transparency
Accuracy
Security
PROTOTYPING
🧩PROTOTYPING LLM APPS
Prompt Engineering
1.
Question Answering Systems
2.
Fine-Tuning Models
3.
“THE NEW STACK AND OPS FOR AI”
#LLM OPS
USER EXPERIENCE
Control for uncertainty (PE)
Build guardrails for steerability
and safety (Harm/Help)
MODEL CONSISTENCY
Constrain model behavior (FT)
Ground the model (RAG)
EVALUATING PERF
Create evaluation suites (RAGAS)
Use model-graded evals (GPT-4)
LATENCY AND COST
Use semantic caching (Prompts)
Route to cheaper models (FT)
RAG, FROM LANGCHAIN
LEADING TOOLING
OUR LLM OPS CURRICULUM
🧑‍💻Building LLM Applications in Pure Python
1.
🔗LangChain Powered RAG and Advanced Retrieval
2.
🦙Open-Source Production RAG with LlamaIndex
3.
🕴️Agents, 🧑‍💻Hackathon, and 🧑‍🏫Demo Day!
4.
OUR LLM OPS CURRICULUM
LLMs
Evaluation
Visibility
Infrastructure
Vector Database
Embedding Models
User Interface
Deployment
LLMs: OpenAI GPT-4-Turbo, Mistral 7B
Evaluation: RAGAS, Built-in metrics
Visibility: Weights and Biases
Infrastructure: LangChain LlamaIndex
Vector Database: Pinecone FAISS
Embedding Models: OpenAI Ada MTEB Leaderboard
User Interface: Chainlit
Deployment: Hugging Face Amazon Bedrock
OPEN LEADERBOARDS
BASIC RAG +
ADVANCED RAG W/
LANGSMITH
HALLUCINATIONS
Confident responses that are false.
FACT CHECKING
RETREIVAL AUGMENTED GENERATION
Retrieval
Finds references in your documents
Augmented
Adds references to your prompts
Generation
Improves answers to questions!
SPECIALIZED DOMAINS
Jargon, e.g.;
Legal, healthcare, financial, insurance,
government, research
Alignment
With common language that humans use
RETRIEVAL AUGMENTED
GENERATION
OVERVIEW
RAG
=
DENSE VECTOR RETRIEVAL
(R)
+
IN-CONTEXT LEARNING
(AG)
🧩3 EASY PIECES TO RETRIEVAL
Ask a question
1.
Search database for stuff similar to question
2.
Return the stuff
3.
📇INDEX (THE DATABASE)
Split docs into chunks
1.
Create embeddings for each chunk
2.
Store embeddings in vector store index
3.
Embeddings Vector Store Index
Documents
Raw Source
Documents Chunked Documents
[0.1,0.4,-0.6,...]
[0.2,0.3,-0.4,...]
[0.8,0.3,-0.1,...]
🐕RETRIEVERS
Embeddings Vector Store Index
Documents
Raw Source
Documents Chunked Documents
[0.1,0.4,-0.6,...]
Query
INPUT
[0.1,0.4,-0.6,...]
Find Nearest Neighbors
Context: From source 1
Context: From source 2
Context: From source
🐕
[0.2,0.3,-0.4,...]
[0.8,0.3,-0.1,...]
[0.1, 0.4, -0.6, ...]
Ryan was ...
Query
Find Nearest
Neighbours
(cosine similarity)
Vector Database
App Logic
INPUT
“Query...”
Embedding Model
[0.1, 0.4, -0.6, ...]
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Query
Embedding Model Chat Model
Prompt Templates
INPUT
“Query...”
Find Nearest
Neighbours
(cosine similarity)
Vector Database
App Logic
Embedding Model Chat Model
Vector Store
Find Nearest
Neighbours
(cosine similarity)
Return document(s)
from
Nearest Neighbours
[0.1, 0.4, -0.6, ...]
Prompt Templates
Vector Database
App Logic App Logic
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Context: ref 1
Context: ref 2
Context: ref 3
Context: ref 4
Ryan was ...
Query
INPUT
“Query”
Embedding Model Chat Model
Vector Store
Find Nearest
Neighbours
(cosine similarity)
Return document(s)
from
Nearest Neighbours
[0.1, 0.4, -0.6, ...]
Prompt Templates
Vector Database
App Logic App Logic
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Context: ref 1
Context: ref 2
Context: ref 3
Context: ref 4
Answer
Query
INPUT
OUTPUT
“Query”
Embedding Model Chat Model
Vector Store
Find Nearest
Neighbours
(cosine similarity)
Return document(s)
from
Nearest Neighbours
[0.1, 0.4, -0.6, ...]
Prompt Templates
Vector Database
App Logic App Logic
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Context: ref 1
Context: ref 2
Context: ref 3
Context: ref 4
Answer
Query
INPUT
OUTPUT
“Query”
Dense Vector Retrieval
In-Context Learning
Embedding Model Chat Model
Vector Store
Find Nearest
Neighbours
(cosine similarity)
Return document(s)
from
Nearest Neighbours
[0.1, 0.4, -0.6, ...]
Prompt Templates
Vector Database
App Logic App Logic
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Context: ref 1
Context: ref 2
Context: ref 3
Context: ref 4
Answer
Query
INPUT
OUTPUT
“Query”
Dense Vector Retrieval
In-Context Learning
TODAY’S BUILD
DATA
Rich Content
Hard to search
OUR MODELS
EMBEDDING & CHAT (LLM) MODEL
Chat Model (e.g.
, LLM)
e.g,. OpenAI GPT-4
Embeddi
ng Model
e.g., Cohere V3
Embedding Model Chat Model
OUR TOOLING
Vector Database
Qdrant
Infrasturucture & Evaluation
LangChain
LangSmith
DATABASE AND INFRASTRUCTURE
Vector Database
🔗LANGCHAIN
“The real power
comes when you can
combine them with
other sources of
computation or
knowledge.”
~ Harrison Chase
Creator of LangChain
WHAT IS IT?
WHAT IS IT?
WHAT IS IT?
Search OpenAI blog for top k resources, rerank
1.
Ask specific questions related to content
2.
Return answers to questions with sources
3.
OpenAI RAG Flow
OPENAI RAG 👨‍💻
Presented By
Chris Alexiuk, LLM Wizard 🪄
HOW DO I IMPROVE?
PROMPT ENGINEERING
Check system-level prompting
and one-shot/few-shot examples
for alignment with your task
e.g., varies
RAG
“Are you pulling the right
references?
e.g., context recall
FINE-TUNING EMBEDDINGS
Is your model understanding
domain-specific language?
e.g., hit rate
AGENTS
Is your model reasoning the way
a human would?
e.g., ???
THE RISE OF THE AI
ENGINEER
THE AGE OF THE AI ENGINEER
“A wide range of AI tasks that used to
take 5 years and a research team to
accomplish in 2013, now just require API
docs and a spare afternoon in 2023.”
It is now possible to build what used
to take months in a single day!
DATA SCIENTISTS!
Enhance Retrieval (and thus Generation!)
Fine-Tuning
Embeddings
Chat Models
Evaluation
CONCLUSIONS
Best-practice tools are out there!
LangSmith-like tooling is the most comprehensive
Building
Prompt Engineering, RAG, Fine-Tuning
Improvement
Depends on Building!
Eval varies
Lots of work for data scientist and AI Engineers!
Q&A
Tony Karrer
Founder & CEO TechEmpower,
Founder & CTO Aggregage
Dr. Greg Loughnane
Founder & CEO of
AI Makerspace
Chris Alexiuk
Co-Founder & CTO at
AI Makerspace
Tara Dwyer
Webinar Manager
/in/tonykarrer/
aggregage.com
/in/gregloughnane/
aimakerspace.io
/in/csalexiuk/
aimakerspace.io
/in/taradwyer/
artificialintelligencezone.com
JOIN THE GENERATIVE AI FOR TECHNOLOGY LEADERS LINKEDIN GROUP
FOR THOUGHTFUL DISCUSSION AND Q&A! VISIT THE LINK OR SCAN THE QR CODE!
bit.ly/genaitechleaders

More Related Content

What's hot

Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxSri Ambati
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfPremNaraindas1
 
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Sri Ambati
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfDung Hoang
 
Generative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdfGenerative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdfLiming Zhu
 
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)Naoki (Neo) SATO
 
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPTAutomate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPTAnant Corporation
 
Use Case Patterns for LLM Applications (1).pdf
Use Case Patterns for LLM Applications (1).pdfUse Case Patterns for LLM Applications (1).pdf
Use Case Patterns for LLM Applications (1).pdfM Waleed Kadous
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsAdventureWorld5
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Mihai Criveti
 
An Introduction to Generative AI - May 18, 2023
An Introduction  to Generative AI - May 18, 2023An Introduction  to Generative AI - May 18, 2023
An Introduction to Generative AI - May 18, 2023CoriFaklaris1
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of usMassimo Ferre'
 
How do OpenAI GPT Models Work - Misconceptions and Tips for Developers
How do OpenAI GPT Models Work - Misconceptions and Tips for DevelopersHow do OpenAI GPT Models Work - Misconceptions and Tips for Developers
How do OpenAI GPT Models Work - Misconceptions and Tips for DevelopersIvo Andreev
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...David Talby
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AICMassociates
 
The current state of generative AI
The current state of generative AIThe current state of generative AI
The current state of generative AIBenjaminlapid1
 
Using Generative AI
Using Generative AIUsing Generative AI
Using Generative AIMark DeLoura
 
Generative AI Use cases for Enterprise - Second Session
Generative AI Use cases for Enterprise - Second SessionGenerative AI Use cases for Enterprise - Second Session
Generative AI Use cases for Enterprise - Second SessionGene Leybzon
 

What's hot (20)

Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptx
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
 
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdfExploring Opportunities in the Generative AI Value Chain.pdf
Exploring Opportunities in the Generative AI Value Chain.pdf
 
Generative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdfGenerative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdf
 
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
Microsoft + OpenAI: Recent Updates (Machine Learning 15minutes! Broadcast #74)
 
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPTAutomate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
Automate your Job and Business with ChatGPT #3 - Fundamentals of LLM/GPT
 
LLMs Bootcamp
LLMs BootcampLLMs Bootcamp
LLMs Bootcamp
 
Use Case Patterns for LLM Applications (1).pdf
Use Case Patterns for LLM Applications (1).pdfUse Case Patterns for LLM Applications (1).pdf
Use Case Patterns for LLM Applications (1).pdf
 
Intro to LLMs
Intro to LLMsIntro to LLMs
Intro to LLMs
 
generative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language modelsgenerative-ai-fundamentals and Large language models
generative-ai-fundamentals and Large language models
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
 
An Introduction to Generative AI - May 18, 2023
An Introduction  to Generative AI - May 18, 2023An Introduction  to Generative AI - May 18, 2023
An Introduction to Generative AI - May 18, 2023
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of us
 
How do OpenAI GPT Models Work - Misconceptions and Tips for Developers
How do OpenAI GPT Models Work - Misconceptions and Tips for DevelopersHow do OpenAI GPT Models Work - Misconceptions and Tips for Developers
How do OpenAI GPT Models Work - Misconceptions and Tips for Developers
 
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
Large Language Models, No-Code, and Responsible AI - Trends in Applied NLP in...
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AI
 
The current state of generative AI
The current state of generative AIThe current state of generative AI
The current state of generative AI
 
Using Generative AI
Using Generative AIUsing Generative AI
Using Generative AI
 
Generative AI Use cases for Enterprise - Second Session
Generative AI Use cases for Enterprise - Second SessionGenerative AI Use cases for Enterprise - Second Session
Generative AI Use cases for Enterprise - Second Session
 

Similar to LLMs in Production: Tooling, Process, and Team Structure

The Semantic Knowledge Graph
The Semantic Knowledge GraphThe Semantic Knowledge Graph
The Semantic Knowledge GraphTrey Grainger
 
How Azure helps to build better business processes and customer experiences w...
How Azure helps to build better business processes and customer experiences w...How Azure helps to build better business processes and customer experiences w...
How Azure helps to build better business processes and customer experiences w...Maxim Salnikov
 
Building and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache AirflowBuilding and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache AirflowKaxil Naik
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesMax Irwin
 
Building Generative AI-infused apps: what's possible and how to start
Building Generative AI-infused apps: what's possible and how to startBuilding Generative AI-infused apps: what's possible and how to start
Building Generative AI-infused apps: what's possible and how to startMaxim Salnikov
 
Tensors for topic modeling and deep learning on AWS Sagemaker
Tensors for topic modeling and deep learning on AWS SagemakerTensors for topic modeling and deep learning on AWS Sagemaker
Tensors for topic modeling and deep learning on AWS SagemakerAnima Anandkumar
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionSri Ambati
 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeFishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeArangoDB Database
 
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...Amazon Web Services
 
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017Amazon Web Services
 
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...Lviv Startup Club
 
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...Daniel Zivkovic
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data ScienceDataWorks Summit
 
AI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using PythonAI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using Pythonamyiris
 
The need for sophistication in modern search engine implementations
The need for sophistication in modern search engine implementationsThe need for sophistication in modern search engine implementations
The need for sophistication in modern search engine implementationsBen DeMott
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital.AI
 
Ssas dmx ile kurum içi verilerin i̇şlenmesi
Ssas dmx ile kurum içi verilerin i̇şlenmesiSsas dmx ile kurum içi verilerin i̇şlenmesi
Ssas dmx ile kurum içi verilerin i̇şlenmesiKoray Kocabas
 
Discovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsDiscovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsGabriel Moreira
 

Similar to LLMs in Production: Tooling, Process, and Team Structure (20)

TechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptxTechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptx
 
The Semantic Knowledge Graph
The Semantic Knowledge GraphThe Semantic Knowledge Graph
The Semantic Knowledge Graph
 
How Azure helps to build better business processes and customer experiences w...
How Azure helps to build better business processes and customer experiences w...How Azure helps to build better business processes and customer experiences w...
How Azure helps to build better business processes and customer experiences w...
 
Building and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache AirflowBuilding and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache Airflow
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
 
Building Generative AI-infused apps: what's possible and how to start
Building Generative AI-infused apps: what's possible and how to startBuilding Generative AI-infused apps: what's possible and how to start
Building Generative AI-infused apps: what's possible and how to start
 
Tensors for topic modeling and deep learning on AWS Sagemaker
Tensors for topic modeling and deep learning on AWS SagemakerTensors for topic modeling and deep learning on AWS Sagemaker
Tensors for topic modeling and deep learning on AWS Sagemaker
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for Production
 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeFishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake
 
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
 
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
 
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
 
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
AI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using PythonAI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using Python
 
The need for sophistication in modern search engine implementations
The need for sophistication in modern search engine implementationsThe need for sophistication in modern search engine implementations
The need for sophistication in modern search engine implementations
 
introduction Azure OpenAI by Usama wahab khan
introduction  Azure OpenAI by Usama wahab khanintroduction  Azure OpenAI by Usama wahab khan
introduction Azure OpenAI by Usama wahab khan
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Ssas dmx ile kurum içi verilerin i̇şlenmesi
Ssas dmx ile kurum içi verilerin i̇şlenmesiSsas dmx ile kurum içi verilerin i̇şlenmesi
Ssas dmx ile kurum içi verilerin i̇şlenmesi
 
Discovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsDiscovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender Systems
 

More from Aggregage

Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessAggregage
 
How Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of ReportingHow Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of ReportingAggregage
 
Planning your Restaurant's Path to Profitability
Planning your Restaurant's Path to ProfitabilityPlanning your Restaurant's Path to Profitability
Planning your Restaurant's Path to ProfitabilityAggregage
 
The Engagement Engine: Strategies for Building a High-Performance Culture
The Engagement Engine: Strategies for Building a High-Performance CultureThe Engagement Engine: Strategies for Building a High-Performance Culture
The Engagement Engine: Strategies for Building a High-Performance CultureAggregage
 
Driving Business Impact for PMs with Jon Harmer
Driving Business Impact for PMs with Jon HarmerDriving Business Impact for PMs with Jon Harmer
Driving Business Impact for PMs with Jon HarmerAggregage
 
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...Aggregage
 
The Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
The Retention Ripple Effect: Nonprofit Staff and Donor DynamicsThe Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
The Retention Ripple Effect: Nonprofit Staff and Donor DynamicsAggregage
 
Breaking the Burnout Cycle: Empowering Managers for Excellence
Breaking the Burnout Cycle: Empowering Managers for ExcellenceBreaking the Burnout Cycle: Empowering Managers for Excellence
Breaking the Burnout Cycle: Empowering Managers for ExcellenceAggregage
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityAggregage
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...Aggregage
 
How to Build an Experimentation Culture for Data-Driven Product Development
How to Build an Experimentation Culture for Data-Driven Product DevelopmentHow to Build an Experimentation Culture for Data-Driven Product Development
How to Build an Experimentation Culture for Data-Driven Product DevelopmentAggregage
 
Bridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
Bridging the Gap: The Intersection of DEI Initiatives and Employee BenefitsBridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
Bridging the Gap: The Intersection of DEI Initiatives and Employee BenefitsAggregage
 
Mapping Digital Transformation: Retail’s Strategic Shift
Mapping Digital Transformation: Retail’s Strategic ShiftMapping Digital Transformation: Retail’s Strategic Shift
Mapping Digital Transformation: Retail’s Strategic ShiftAggregage
 
AI & DEI: With Great Opportunities Comes Great HR Responsibility
AI & DEI: With Great Opportunities Comes Great HR ResponsibilityAI & DEI: With Great Opportunities Comes Great HR Responsibility
AI & DEI: With Great Opportunities Comes Great HR ResponsibilityAggregage
 
Can Brain Science Actually Help Make Your Training & Teaching "Stick"?
Can Brain Science Actually Help Make Your Training & Teaching "Stick"?Can Brain Science Actually Help Make Your Training & Teaching "Stick"?
Can Brain Science Actually Help Make Your Training & Teaching "Stick"?Aggregage
 
How Personalized Customer Experiences Drive Retail Growth and Revenue
How Personalized Customer Experiences Drive Retail Growth and RevenueHow Personalized Customer Experiences Drive Retail Growth and Revenue
How Personalized Customer Experiences Drive Retail Growth and RevenueAggregage
 
Your Expert Guide to CX Orchestration & Enhancing Customer Journeys
Your Expert Guide to CX Orchestration & Enhancing Customer JourneysYour Expert Guide to CX Orchestration & Enhancing Customer Journeys
Your Expert Guide to CX Orchestration & Enhancing Customer JourneysAggregage
 
The Resurgence of Direct Mail as a Growth Marketing Strategy
The Resurgence of Direct Mail as a Growth Marketing StrategyThe Resurgence of Direct Mail as a Growth Marketing Strategy
The Resurgence of Direct Mail as a Growth Marketing StrategyAggregage
 
Top 5 Ways to Optimize Your Loyalty Strategy
Top 5 Ways to Optimize Your Loyalty StrategyTop 5 Ways to Optimize Your Loyalty Strategy
Top 5 Ways to Optimize Your Loyalty StrategyAggregage
 
HR Pro Playbook: How to Navigate Global Workforce Expansion with Confidence
HR Pro Playbook: How to Navigate Global Workforce Expansion with ConfidenceHR Pro Playbook: How to Navigate Global Workforce Expansion with Confidence
HR Pro Playbook: How to Navigate Global Workforce Expansion with ConfidenceAggregage
 

More from Aggregage (20)

Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
How Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of ReportingHow Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of Reporting
 
Planning your Restaurant's Path to Profitability
Planning your Restaurant's Path to ProfitabilityPlanning your Restaurant's Path to Profitability
Planning your Restaurant's Path to Profitability
 
The Engagement Engine: Strategies for Building a High-Performance Culture
The Engagement Engine: Strategies for Building a High-Performance CultureThe Engagement Engine: Strategies for Building a High-Performance Culture
The Engagement Engine: Strategies for Building a High-Performance Culture
 
Driving Business Impact for PMs with Jon Harmer
Driving Business Impact for PMs with Jon HarmerDriving Business Impact for PMs with Jon Harmer
Driving Business Impact for PMs with Jon Harmer
 
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
 
The Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
The Retention Ripple Effect: Nonprofit Staff and Donor DynamicsThe Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
The Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
 
Breaking the Burnout Cycle: Empowering Managers for Excellence
Breaking the Burnout Cycle: Empowering Managers for ExcellenceBreaking the Burnout Cycle: Empowering Managers for Excellence
Breaking the Burnout Cycle: Empowering Managers for Excellence
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
 
How to Build an Experimentation Culture for Data-Driven Product Development
How to Build an Experimentation Culture for Data-Driven Product DevelopmentHow to Build an Experimentation Culture for Data-Driven Product Development
How to Build an Experimentation Culture for Data-Driven Product Development
 
Bridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
Bridging the Gap: The Intersection of DEI Initiatives and Employee BenefitsBridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
Bridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
 
Mapping Digital Transformation: Retail’s Strategic Shift
Mapping Digital Transformation: Retail’s Strategic ShiftMapping Digital Transformation: Retail’s Strategic Shift
Mapping Digital Transformation: Retail’s Strategic Shift
 
AI & DEI: With Great Opportunities Comes Great HR Responsibility
AI & DEI: With Great Opportunities Comes Great HR ResponsibilityAI & DEI: With Great Opportunities Comes Great HR Responsibility
AI & DEI: With Great Opportunities Comes Great HR Responsibility
 
Can Brain Science Actually Help Make Your Training & Teaching "Stick"?
Can Brain Science Actually Help Make Your Training & Teaching "Stick"?Can Brain Science Actually Help Make Your Training & Teaching "Stick"?
Can Brain Science Actually Help Make Your Training & Teaching "Stick"?
 
How Personalized Customer Experiences Drive Retail Growth and Revenue
How Personalized Customer Experiences Drive Retail Growth and RevenueHow Personalized Customer Experiences Drive Retail Growth and Revenue
How Personalized Customer Experiences Drive Retail Growth and Revenue
 
Your Expert Guide to CX Orchestration & Enhancing Customer Journeys
Your Expert Guide to CX Orchestration & Enhancing Customer JourneysYour Expert Guide to CX Orchestration & Enhancing Customer Journeys
Your Expert Guide to CX Orchestration & Enhancing Customer Journeys
 
The Resurgence of Direct Mail as a Growth Marketing Strategy
The Resurgence of Direct Mail as a Growth Marketing StrategyThe Resurgence of Direct Mail as a Growth Marketing Strategy
The Resurgence of Direct Mail as a Growth Marketing Strategy
 
Top 5 Ways to Optimize Your Loyalty Strategy
Top 5 Ways to Optimize Your Loyalty StrategyTop 5 Ways to Optimize Your Loyalty Strategy
Top 5 Ways to Optimize Your Loyalty Strategy
 
HR Pro Playbook: How to Navigate Global Workforce Expansion with Confidence
HR Pro Playbook: How to Navigate Global Workforce Expansion with ConfidenceHR Pro Playbook: How to Navigate Global Workforce Expansion with Confidence
HR Pro Playbook: How to Navigate Global Workforce Expansion with Confidence
 

Recently uploaded

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):comworks
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr LapshynFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostZilliz
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 

Recently uploaded (20)

Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):CloudStudio User manual (basic edition):
CloudStudio User manual (basic edition):
 
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
"Federated learning: out of reach no matter how close",Oleksandr Lapshyn
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 

LLMs in Production: Tooling, Process, and Team Structure

  • 1. LLMs in Production: Tooling, Process, and Team Structure December 6th, 2023 10:30am PST
  • 2. Have a question or comment for our panelists? Use this QR code to engage with our speakers, or visit the link in the chat! Having an audio issue? Try dialing in by phone! Dial: +1 312 626 6799 Webinar ID: 819 1469 6007 Passcode: 385318 Closed Captioning is available for this webinar!
  • 3. Our Panelists Tony Karrer Founder & CEO TechEmpower, Founder & CTO Aggregage Greg Loughnane Founder & CEO of AI Makerspace Chris Alexiuk Co-Founder & CTO at AI Makerspace
  • 5. BY THE END OF TODAY... Understand processes for building and improving production LLM applications Overview of industry-standard tooling How to leverage LangSmith
  • 6. OVERVIEW LLM Ops, LLM OS, and “The New Stack” Leading Tooling Meet LangSmith Conclusions, Q&A
  • 8. What is LLM Ops?
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 15. ENTERPRISE BUILDS Synthetic data, closed-source models 1. Baseline performance a. Open-source models 2. Add your private data a. Iterate 3. Optimize models, data, metrics, inference, efficiency a.
  • 16. NEXT-LEVEL ❓Some additional questions: On premise hardware? What scale and speed? Training proprietary LLM?
  • 17. Small Language Models (SLMs) Efficiency Transparency Accuracy Security
  • 19. 🧩PROTOTYPING LLM APPS Prompt Engineering 1. Question Answering Systems 2. Fine-Tuning Models 3.
  • 20.
  • 21.
  • 22.
  • 23. “THE NEW STACK AND OPS FOR AI” #LLM OPS
  • 24. USER EXPERIENCE Control for uncertainty (PE) Build guardrails for steerability and safety (Harm/Help)
  • 25. MODEL CONSISTENCY Constrain model behavior (FT) Ground the model (RAG)
  • 26. EVALUATING PERF Create evaluation suites (RAGAS) Use model-graded evals (GPT-4)
  • 27. LATENCY AND COST Use semantic caching (Prompts) Route to cheaper models (FT)
  • 28.
  • 29.
  • 32. OUR LLM OPS CURRICULUM 🧑‍💻Building LLM Applications in Pure Python 1. 🔗LangChain Powered RAG and Advanced Retrieval 2. 🦙Open-Source Production RAG with LlamaIndex 3. 🕴️Agents, 🧑‍💻Hackathon, and 🧑‍🏫Demo Day! 4.
  • 33. OUR LLM OPS CURRICULUM
  • 35. LLMs: OpenAI GPT-4-Turbo, Mistral 7B Evaluation: RAGAS, Built-in metrics Visibility: Weights and Biases Infrastructure: LangChain LlamaIndex Vector Database: Pinecone FAISS Embedding Models: OpenAI Ada MTEB Leaderboard User Interface: Chainlit Deployment: Hugging Face Amazon Bedrock
  • 37. BASIC RAG + ADVANCED RAG W/ LANGSMITH
  • 40. RETREIVAL AUGMENTED GENERATION Retrieval Finds references in your documents Augmented Adds references to your prompts Generation Improves answers to questions!
  • 41. SPECIALIZED DOMAINS Jargon, e.g.; Legal, healthcare, financial, insurance, government, research Alignment With common language that humans use
  • 44. 🧩3 EASY PIECES TO RETRIEVAL Ask a question 1. Search database for stuff similar to question 2. Return the stuff 3.
  • 45. 📇INDEX (THE DATABASE) Split docs into chunks 1. Create embeddings for each chunk 2. Store embeddings in vector store index 3. Embeddings Vector Store Index Documents Raw Source Documents Chunked Documents [0.1,0.4,-0.6,...] [0.2,0.3,-0.4,...] [0.8,0.3,-0.1,...]
  • 46. 🐕RETRIEVERS Embeddings Vector Store Index Documents Raw Source Documents Chunked Documents [0.1,0.4,-0.6,...] Query INPUT [0.1,0.4,-0.6,...] Find Nearest Neighbors Context: From source 1 Context: From source 2 Context: From source 🐕 [0.2,0.3,-0.4,...] [0.8,0.3,-0.1,...]
  • 47. [0.1, 0.4, -0.6, ...] Ryan was ... Query Find Nearest Neighbours (cosine similarity) Vector Database App Logic INPUT “Query...” Embedding Model
  • 48. [0.1, 0.4, -0.6, ...] Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Query Embedding Model Chat Model Prompt Templates INPUT “Query...” Find Nearest Neighbours (cosine similarity) Vector Database App Logic
  • 49. Embedding Model Chat Model Vector Store Find Nearest Neighbours (cosine similarity) Return document(s) from Nearest Neighbours [0.1, 0.4, -0.6, ...] Prompt Templates Vector Database App Logic App Logic Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Context: ref 1 Context: ref 2 Context: ref 3 Context: ref 4 Ryan was ... Query INPUT “Query”
  • 50. Embedding Model Chat Model Vector Store Find Nearest Neighbours (cosine similarity) Return document(s) from Nearest Neighbours [0.1, 0.4, -0.6, ...] Prompt Templates Vector Database App Logic App Logic Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Context: ref 1 Context: ref 2 Context: ref 3 Context: ref 4 Answer Query INPUT OUTPUT “Query”
  • 51. Embedding Model Chat Model Vector Store Find Nearest Neighbours (cosine similarity) Return document(s) from Nearest Neighbours [0.1, 0.4, -0.6, ...] Prompt Templates Vector Database App Logic App Logic Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Context: ref 1 Context: ref 2 Context: ref 3 Context: ref 4 Answer Query INPUT OUTPUT “Query” Dense Vector Retrieval In-Context Learning
  • 52. Embedding Model Chat Model Vector Store Find Nearest Neighbours (cosine similarity) Return document(s) from Nearest Neighbours [0.1, 0.4, -0.6, ...] Prompt Templates Vector Database App Logic App Logic Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Context: ref 1 Context: ref 2 Context: ref 3 Context: ref 4 Answer Query INPUT OUTPUT “Query” Dense Vector Retrieval In-Context Learning
  • 56. EMBEDDING & CHAT (LLM) MODEL Chat Model (e.g. , LLM) e.g,. OpenAI GPT-4 Embeddi ng Model e.g., Cohere V3 Embedding Model Chat Model
  • 58. Vector Database Qdrant Infrasturucture & Evaluation LangChain LangSmith DATABASE AND INFRASTRUCTURE Vector Database
  • 60. “The real power comes when you can combine them with other sources of computation or knowledge.” ~ Harrison Chase Creator of LangChain
  • 64.
  • 65. Search OpenAI blog for top k resources, rerank 1. Ask specific questions related to content 2. Return answers to questions with sources 3. OpenAI RAG Flow
  • 66. OPENAI RAG 👨‍💻 Presented By Chris Alexiuk, LLM Wizard 🪄
  • 67. HOW DO I IMPROVE?
  • 68.
  • 69. PROMPT ENGINEERING Check system-level prompting and one-shot/few-shot examples for alignment with your task e.g., varies
  • 70. RAG “Are you pulling the right references? e.g., context recall
  • 71. FINE-TUNING EMBEDDINGS Is your model understanding domain-specific language? e.g., hit rate
  • 72. AGENTS Is your model reasoning the way a human would? e.g., ???
  • 73. THE RISE OF THE AI ENGINEER
  • 74. THE AGE OF THE AI ENGINEER “A wide range of AI tasks that used to take 5 years and a research team to accomplish in 2013, now just require API docs and a spare afternoon in 2023.” It is now possible to build what used to take months in a single day!
  • 75. DATA SCIENTISTS! Enhance Retrieval (and thus Generation!) Fine-Tuning Embeddings Chat Models Evaluation
  • 76. CONCLUSIONS Best-practice tools are out there! LangSmith-like tooling is the most comprehensive Building Prompt Engineering, RAG, Fine-Tuning Improvement Depends on Building! Eval varies Lots of work for data scientist and AI Engineers!
  • 77. Q&A Tony Karrer Founder & CEO TechEmpower, Founder & CTO Aggregage Dr. Greg Loughnane Founder & CEO of AI Makerspace Chris Alexiuk Co-Founder & CTO at AI Makerspace Tara Dwyer Webinar Manager /in/tonykarrer/ aggregage.com /in/gregloughnane/ aimakerspace.io /in/csalexiuk/ aimakerspace.io /in/taradwyer/ artificialintelligencezone.com JOIN THE GENERATIVE AI FOR TECHNOLOGY LEADERS LINKEDIN GROUP FOR THOUGHTFUL DISCUSSION AND Q&A! VISIT THE LINK OR SCAN THE QR CODE! bit.ly/genaitechleaders