SlideShare a Scribd company logo
LLMs in Production:
Tooling, Process, and Team Structure
December 6th, 2023
10:30am PST
Have a question or comment for our
panelists?
Use this QR code to engage with our
speakers, or visit the link in the chat!
Having an audio issue?
Try dialing in by phone!
Dial: +1 312 626 6799
Webinar ID: 819 1469 6007
Passcode: 385318
Closed Captioning is available
for this webinar!
Our Panelists
Tony Karrer
Founder & CEO TechEmpower,
Founder & CTO Aggregage
Greg Loughnane
Founder & CEO of
AI Makerspace
Chris Alexiuk
Co-Founder & CTO at
AI Makerspace
🎯ALIGNING OUR AIM
BY THE END OF TODAY...
Understand processes for building and
improving production LLM applications
Overview of industry-standard tooling
How to leverage LangSmith
OVERVIEW
LLM Ops, LLM OS, and “The New Stack”
Leading Tooling
Meet LangSmith
Conclusions, Q&A
LLM OPS
What is LLM Ops?
BUILDING LLM APPLICATIONS
ENTERPRISE BUILDS
Synthetic data, closed-source models
1.
Baseline performance
a.
Open-source models
2.
Add your private data
a.
Iterate
3.
Optimize models, data, metrics,
inference, efficiency
a.
NEXT-LEVEL
❓Some additional questions:
On premise hardware?
What scale and speed?
Training proprietary LLM?
Small Language Models (SLMs)
Efficiency
Transparency
Accuracy
Security
PROTOTYPING
🧩PROTOTYPING LLM APPS
Prompt Engineering
1.
Question Answering Systems
2.
Fine-Tuning Models
3.
“THE NEW STACK AND OPS FOR AI”
#LLM OPS
USER EXPERIENCE
Control for uncertainty (PE)
Build guardrails for steerability
and safety (Harm/Help)
MODEL CONSISTENCY
Constrain model behavior (FT)
Ground the model (RAG)
EVALUATING PERF
Create evaluation suites (RAGAS)
Use model-graded evals (GPT-4)
LATENCY AND COST
Use semantic caching (Prompts)
Route to cheaper models (FT)
RAG, FROM LANGCHAIN
LEADING TOOLING
OUR LLM OPS CURRICULUM
🧑‍💻Building LLM Applications in Pure Python
1.
🔗LangChain Powered RAG and Advanced Retrieval
2.
🦙Open-Source Production RAG with LlamaIndex
3.
🕴️Agents, 🧑‍💻Hackathon, and 🧑‍🏫Demo Day!
4.
OUR LLM OPS CURRICULUM
LLMs
Evaluation
Visibility
Infrastructure
Vector Database
Embedding Models
User Interface
Deployment
LLMs: OpenAI GPT-4-Turbo, Mistral 7B
Evaluation: RAGAS, Built-in metrics
Visibility: Weights and Biases
Infrastructure: LangChain LlamaIndex
Vector Database: Pinecone FAISS
Embedding Models: OpenAI Ada MTEB Leaderboard
User Interface: Chainlit
Deployment: Hugging Face Amazon Bedrock
OPEN LEADERBOARDS
BASIC RAG +
ADVANCED RAG W/
LANGSMITH
HALLUCINATIONS
Confident responses that are false.
FACT CHECKING
RETREIVAL AUGMENTED GENERATION
Retrieval
Finds references in your documents
Augmented
Adds references to your prompts
Generation
Improves answers to questions!
SPECIALIZED DOMAINS
Jargon, e.g.;
Legal, healthcare, financial, insurance,
government, research
Alignment
With common language that humans use
RETRIEVAL AUGMENTED
GENERATION
OVERVIEW
RAG
=
DENSE VECTOR RETRIEVAL
(R)
+
IN-CONTEXT LEARNING
(AG)
🧩3 EASY PIECES TO RETRIEVAL
Ask a question
1.
Search database for stuff similar to question
2.
Return the stuff
3.
📇INDEX (THE DATABASE)
Split docs into chunks
1.
Create embeddings for each chunk
2.
Store embeddings in vector store index
3.
Embeddings Vector Store Index
Documents
Raw Source
Documents Chunked Documents
[0.1,0.4,-0.6,...]
[0.2,0.3,-0.4,...]
[0.8,0.3,-0.1,...]
🐕RETRIEVERS
Embeddings Vector Store Index
Documents
Raw Source
Documents Chunked Documents
[0.1,0.4,-0.6,...]
Query
INPUT
[0.1,0.4,-0.6,...]
Find Nearest Neighbors
Context: From source 1
Context: From source 2
Context: From source
🐕
[0.2,0.3,-0.4,...]
[0.8,0.3,-0.1,...]
[0.1, 0.4, -0.6, ...]
Ryan was ...
Query
Find Nearest
Neighbours
(cosine similarity)
Vector Database
App Logic
INPUT
“Query...”
Embedding Model
[0.1, 0.4, -0.6, ...]
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Query
Embedding Model Chat Model
Prompt Templates
INPUT
“Query...”
Find Nearest
Neighbours
(cosine similarity)
Vector Database
App Logic
Embedding Model Chat Model
Vector Store
Find Nearest
Neighbours
(cosine similarity)
Return document(s)
from
Nearest Neighbours
[0.1, 0.4, -0.6, ...]
Prompt Templates
Vector Database
App Logic App Logic
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Context: ref 1
Context: ref 2
Context: ref 3
Context: ref 4
Ryan was ...
Query
INPUT
“Query”
Embedding Model Chat Model
Vector Store
Find Nearest
Neighbours
(cosine similarity)
Return document(s)
from
Nearest Neighbours
[0.1, 0.4, -0.6, ...]
Prompt Templates
Vector Database
App Logic App Logic
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Context: ref 1
Context: ref 2
Context: ref 3
Context: ref 4
Answer
Query
INPUT
OUTPUT
“Query”
Embedding Model Chat Model
Vector Store
Find Nearest
Neighbours
(cosine similarity)
Return document(s)
from
Nearest Neighbours
[0.1, 0.4, -0.6, ...]
Prompt Templates
Vector Database
App Logic App Logic
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Context: ref 1
Context: ref 2
Context: ref 3
Context: ref 4
Answer
Query
INPUT
OUTPUT
“Query”
Dense Vector Retrieval
In-Context Learning
Embedding Model Chat Model
Vector Store
Find Nearest
Neighbours
(cosine similarity)
Return document(s)
from
Nearest Neighbours
[0.1, 0.4, -0.6, ...]
Prompt Templates
Vector Database
App Logic App Logic
Use the provided context to answer the user's query.
You may not answer the user's query unless there is specific
context in the following text.
If you do not know the answer, or cannot answer, please respond
with "I don't know".
Context:
{context}
User Query:
{user_query}
Context: ref 1
Context: ref 2
Context: ref 3
Context: ref 4
Answer
Query
INPUT
OUTPUT
“Query”
Dense Vector Retrieval
In-Context Learning
TODAY’S BUILD
DATA
Rich Content
Hard to search
OUR MODELS
EMBEDDING & CHAT (LLM) MODEL
Chat Model (e.g.
, LLM)
e.g,. OpenAI GPT-4
Embeddi
ng Model
e.g., Cohere V3
Embedding Model Chat Model
OUR TOOLING
Vector Database
Qdrant
Infrasturucture & Evaluation
LangChain
LangSmith
DATABASE AND INFRASTRUCTURE
Vector Database
🔗LANGCHAIN
“The real power
comes when you can
combine them with
other sources of
computation or
knowledge.”
~ Harrison Chase
Creator of LangChain
WHAT IS IT?
WHAT IS IT?
WHAT IS IT?
Search OpenAI blog for top k resources, rerank
1.
Ask specific questions related to content
2.
Return answers to questions with sources
3.
OpenAI RAG Flow
OPENAI RAG 👨‍💻
Presented By
Chris Alexiuk, LLM Wizard 🪄
HOW DO I IMPROVE?
PROMPT ENGINEERING
Check system-level prompting
and one-shot/few-shot examples
for alignment with your task
e.g., varies
RAG
“Are you pulling the right
references?
e.g., context recall
FINE-TUNING EMBEDDINGS
Is your model understanding
domain-specific language?
e.g., hit rate
AGENTS
Is your model reasoning the way
a human would?
e.g., ???
THE RISE OF THE AI
ENGINEER
THE AGE OF THE AI ENGINEER
“A wide range of AI tasks that used to
take 5 years and a research team to
accomplish in 2013, now just require API
docs and a spare afternoon in 2023.”
It is now possible to build what used
to take months in a single day!
DATA SCIENTISTS!
Enhance Retrieval (and thus Generation!)
Fine-Tuning
Embeddings
Chat Models
Evaluation
CONCLUSIONS
Best-practice tools are out there!
LangSmith-like tooling is the most comprehensive
Building
Prompt Engineering, RAG, Fine-Tuning
Improvement
Depends on Building!
Eval varies
Lots of work for data scientist and AI Engineers!
Q&A
Tony Karrer
Founder & CEO TechEmpower,
Founder & CTO Aggregage
Dr. Greg Loughnane
Founder & CEO of
AI Makerspace
Chris Alexiuk
Co-Founder & CTO at
AI Makerspace
Tara Dwyer
Webinar Manager
/in/tonykarrer/
aggregage.com
/in/gregloughnane/
aimakerspace.io
/in/csalexiuk/
aimakerspace.io
/in/taradwyer/
artificialintelligencezone.com
JOIN THE GENERATIVE AI FOR TECHNOLOGY LEADERS LINKEDIN GROUP
FOR THOUGHTFUL DISCUSSION AND Q&A! VISIT THE LINK OR SCAN THE QR CODE!
bit.ly/genaitechleaders

More Related Content

What's hot

Gen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfGen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
PhilipBasford
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
PremNaraindas1
 
Generative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdfGenerative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdf
Liming Zhu
 
Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...
Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...
Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...
Naoki (Neo) SATO
 
AZConf 2023 - Considerations for LLMOps: Running LLMs in production
AZConf 2023 - Considerations for LLMOps: Running LLMs in productionAZConf 2023 - Considerations for LLMOps: Running LLMs in production
AZConf 2023 - Considerations for LLMOps: Running LLMs in production
SARADINDU SENGUPTA
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
DianaGray10
 
Intro to Azure OpenAI Service L100 (Thai Ver).pdf
Intro to Azure OpenAI Service L100 (Thai Ver).pdfIntro to Azure OpenAI Service L100 (Thai Ver).pdf
Intro to Azure OpenAI Service L100 (Thai Ver).pdf
Korkrid Akepanidtaworn
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AI
CMassociates
 
Generative AI: Past, Present, and Future – A Practitioner's Perspective
Generative AI: Past, Present, and Future – A Practitioner's PerspectiveGenerative AI: Past, Present, and Future – A Practitioner's Perspective
Generative AI: Past, Present, and Future – A Practitioner's Perspective
Huahai Yang
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of us
Massimo Ferre'
 
ChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptxChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptx
Jesus Rodriguez
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Mihai Criveti
 
Let's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchersLet's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchers
Steven Van Vaerenbergh
 
presentation.pdf
presentation.pdfpresentation.pdf
presentation.pdf
caa28steve
 
Conversational AI and Chatbot Integrations
Conversational AI and Chatbot IntegrationsConversational AI and Chatbot Integrations
Conversational AI and Chatbot Integrations
Cristina Vidu
 
AI Transformation
AI TransformationAI Transformation
AI Transformation
Liming Zhu
 
Accelerating Path to Production for Generative AI-powered Applications
Accelerating Path to Production for Generative AI-powered ApplicationsAccelerating Path to Production for Generative AI-powered Applications
Accelerating Path to Production for Generative AI-powered Applications
HostedbyConfluent
 
Regulating Generative AI - LLMOps pipelines with Transparency
Regulating Generative AI - LLMOps pipelines with TransparencyRegulating Generative AI - LLMOps pipelines with Transparency
Regulating Generative AI - LLMOps pipelines with Transparency
Debmalya Biswas
 
Cavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AICavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures
 
Large Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdfLarge Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdf
David Rostcheck
 

What's hot (20)

Gen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdfGen AI Cognizant & AWS event presentation_12 Oct.pdf
Gen AI Cognizant & AWS event presentation_12 Oct.pdf
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdfUnlocking the Power of Generative AI An Executive's Guide.pdf
Unlocking the Power of Generative AI An Executive's Guide.pdf
 
Generative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdfGenerative-AI-in-enterprise-20230615.pdf
Generative-AI-in-enterprise-20230615.pdf
 
Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...
Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...
Microsoft Build 2023 Updates – Copilot Stack and Azure OpenAI Service (Machin...
 
AZConf 2023 - Considerations for LLMOps: Running LLMs in production
AZConf 2023 - Considerations for LLMOps: Running LLMs in productionAZConf 2023 - Considerations for LLMOps: Running LLMs in production
AZConf 2023 - Considerations for LLMOps: Running LLMs in production
 
Leveraging Generative AI & Best practices
Leveraging Generative AI & Best practicesLeveraging Generative AI & Best practices
Leveraging Generative AI & Best practices
 
Intro to Azure OpenAI Service L100 (Thai Ver).pdf
Intro to Azure OpenAI Service L100 (Thai Ver).pdfIntro to Azure OpenAI Service L100 (Thai Ver).pdf
Intro to Azure OpenAI Service L100 (Thai Ver).pdf
 
Responsible Generative AI
Responsible Generative AIResponsible Generative AI
Responsible Generative AI
 
Generative AI: Past, Present, and Future – A Practitioner's Perspective
Generative AI: Past, Present, and Future – A Practitioner's PerspectiveGenerative AI: Past, Present, and Future – A Practitioner's Perspective
Generative AI: Past, Present, and Future – A Practitioner's Perspective
 
Generative AI for the rest of us
Generative AI for the rest of usGenerative AI for the rest of us
Generative AI for the rest of us
 
ChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptxChatGPT, Foundation Models and Web3.pptx
ChatGPT, Foundation Models and Web3.pptx
 
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
Retrieval Augmented Generation in Practice: Scalable GenAI platforms with k8s...
 
Let's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchersLet's talk about GPT: A crash course in Generative AI for researchers
Let's talk about GPT: A crash course in Generative AI for researchers
 
presentation.pdf
presentation.pdfpresentation.pdf
presentation.pdf
 
Conversational AI and Chatbot Integrations
Conversational AI and Chatbot IntegrationsConversational AI and Chatbot Integrations
Conversational AI and Chatbot Integrations
 
AI Transformation
AI TransformationAI Transformation
AI Transformation
 
Accelerating Path to Production for Generative AI-powered Applications
Accelerating Path to Production for Generative AI-powered ApplicationsAccelerating Path to Production for Generative AI-powered Applications
Accelerating Path to Production for Generative AI-powered Applications
 
Regulating Generative AI - LLMOps pipelines with Transparency
Regulating Generative AI - LLMOps pipelines with TransparencyRegulating Generative AI - LLMOps pipelines with Transparency
Regulating Generative AI - LLMOps pipelines with Transparency
 
Cavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AICavalry Ventures | Deep Dive: Generative AI
Cavalry Ventures | Deep Dive: Generative AI
 
Large Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdfLarge Language Models - Chat AI.pdf
Large Language Models - Chat AI.pdf
 

Similar to LLMs in Production: Tooling, Process, and Team Structure

TechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptxTechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptx
Usama Wahab Khan Cloud, Data and AI
 
The Semantic Knowledge Graph
The Semantic Knowledge GraphThe Semantic Knowledge Graph
The Semantic Knowledge Graph
Trey Grainger
 
How Azure helps to build better business processes and customer experiences w...
How Azure helps to build better business processes and customer experiences w...How Azure helps to build better business processes and customer experiences w...
How Azure helps to build better business processes and customer experiences w...
Maxim Salnikov
 
Building and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache AirflowBuilding and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache Airflow
Kaxil Naik
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Max Irwin
 
Building Generative AI-infused apps: what's possible and how to start
Building Generative AI-infused apps: what's possible and how to startBuilding Generative AI-infused apps: what's possible and how to start
Building Generative AI-infused apps: what's possible and how to start
Maxim Salnikov
 
Tensors for topic modeling and deep learning on AWS Sagemaker
Tensors for topic modeling and deep learning on AWS SagemakerTensors for topic modeling and deep learning on AWS Sagemaker
Tensors for topic modeling and deep learning on AWS Sagemaker
Anima Anandkumar
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for Production
Sri Ambati
 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeFishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake
ArangoDB Database
 
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...Amazon Web Services
 
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
Amazon Web Services
 
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
Lviv Startup Club
 
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
Daniel Zivkovic
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
DataWorks Summit
 
AI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using PythonAI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using Python
amyiris
 
The need for sophistication in modern search engine implementations
The need for sophistication in modern search engine implementationsThe need for sophistication in modern search engine implementations
The need for sophistication in modern search engine implementations
Ben DeMott
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital.AI
 
Ssas dmx ile kurum içi verilerin i̇şlenmesi
Ssas dmx ile kurum içi verilerin i̇şlenmesiSsas dmx ile kurum içi verilerin i̇şlenmesi
Ssas dmx ile kurum içi verilerin i̇şlenmesi
Koray Kocabas
 
Discovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsDiscovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender Systems
Gabriel Moreira
 
The Relevance of the Apache Solr Semantic Knowledge Graph
The Relevance of the Apache Solr Semantic Knowledge GraphThe Relevance of the Apache Solr Semantic Knowledge Graph
The Relevance of the Apache Solr Semantic Knowledge Graph
Trey Grainger
 

Similar to LLMs in Production: Tooling, Process, and Team Structure (20)

TechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptxTechDayPakistan-Slides RAG with Cosmos DB.pptx
TechDayPakistan-Slides RAG with Cosmos DB.pptx
 
The Semantic Knowledge Graph
The Semantic Knowledge GraphThe Semantic Knowledge Graph
The Semantic Knowledge Graph
 
How Azure helps to build better business processes and customer experiences w...
How Azure helps to build better business processes and customer experiences w...How Azure helps to build better business processes and customer experiences w...
How Azure helps to build better business processes and customer experiences w...
 
Building and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache AirflowBuilding and deploying LLM applications with Apache Airflow
Building and deploying LLM applications with Apache Airflow
 
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and VocabulariesHaystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
Haystack 2018 - Algorithmic Extraction of Keywords Concepts and Vocabularies
 
Building Generative AI-infused apps: what's possible and how to start
Building Generative AI-infused apps: what's possible and how to startBuilding Generative AI-infused apps: what's possible and how to start
Building Generative AI-infused apps: what's possible and how to start
 
Tensors for topic modeling and deep learning on AWS Sagemaker
Tensors for topic modeling and deep learning on AWS SagemakerTensors for topic modeling and deep learning on AWS Sagemaker
Tensors for topic modeling and deep learning on AWS Sagemaker
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for Production
 
Fishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data LakeFishing Graphs in a Hadoop Data Lake
Fishing Graphs in a Hadoop Data Lake
 
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
An Overview of AI at AWS - Amazon Lex, Amazon Polly, Amazon Rekognition - Dev...
 
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
Overview of Artificial Intelligence at AWS - DevDay Los Angeles 2017
 
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
Vitalii Bondarenko, Pavlo Shtogryn "Cloud AI Solutions for Enterprises on AWS...
 
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
All in AI: LLM Landscape & RAG in 2024 with Mark Ryan (Google) & Jerry Liu (L...
 
The Future of Data Science
The Future of Data ScienceThe Future of Data Science
The Future of Data Science
 
AI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using PythonAI and Python: Developing a Conversational Interface using Python
AI and Python: Developing a Conversational Interface using Python
 
The need for sophistication in modern search engine implementations
The need for sophistication in modern search engine implementationsThe need for sophistication in modern search engine implementations
The need for sophistication in modern search engine implementations
 
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and SparkVital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
Vital AI MetaQL: Queries Across NoSQL, SQL, Sparql, and Spark
 
Ssas dmx ile kurum içi verilerin i̇şlenmesi
Ssas dmx ile kurum içi verilerin i̇şlenmesiSsas dmx ile kurum içi verilerin i̇şlenmesi
Ssas dmx ile kurum içi verilerin i̇şlenmesi
 
Discovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender SystemsDiscovering User's Topics of Interest in Recommender Systems
Discovering User's Topics of Interest in Recommender Systems
 
The Relevance of the Apache Solr Semantic Knowledge Graph
The Relevance of the Apache Solr Semantic Knowledge GraphThe Relevance of the Apache Solr Semantic Knowledge Graph
The Relevance of the Apache Solr Semantic Knowledge Graph
 

More from Aggregage

Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
Unlocking Employee Potential with the Power of Continuous Feedback
Unlocking Employee Potential with the Power of Continuous FeedbackUnlocking Employee Potential with the Power of Continuous Feedback
Unlocking Employee Potential with the Power of Continuous Feedback
Aggregage
 
The Key to Sustainable Energy Optimization: A Data-Driven Approach for Manufa...
The Key to Sustainable Energy Optimization: A Data-Driven Approach for Manufa...The Key to Sustainable Energy Optimization: A Data-Driven Approach for Manufa...
The Key to Sustainable Energy Optimization: A Data-Driven Approach for Manufa...
Aggregage
 
From Awareness to Action: An HR Guide to Making Accessibility Accessible
From Awareness to Action:  An HR Guide to Making Accessibility AccessibleFrom Awareness to Action:  An HR Guide to Making Accessibility Accessible
From Awareness to Action: An HR Guide to Making Accessibility Accessible
Aggregage
 
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
Aggregage
 
How to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail SuccessHow to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail Success
Aggregage
 
Sales & Marketing Alignment_ How to Synergize for Success.pptx.pdf
Sales & Marketing Alignment_ How to Synergize for Success.pptx.pdfSales & Marketing Alignment_ How to Synergize for Success.pptx.pdf
Sales & Marketing Alignment_ How to Synergize for Success.pptx.pdf
Aggregage
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
Aggregage
 
How Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of ReportingHow Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of Reporting
Aggregage
 
Planning your Restaurant's Path to Profitability
Planning your Restaurant's Path to ProfitabilityPlanning your Restaurant's Path to Profitability
Planning your Restaurant's Path to Profitability
Aggregage
 
The Engagement Engine: Strategies for Building a High-Performance Culture
The Engagement Engine: Strategies for Building a High-Performance CultureThe Engagement Engine: Strategies for Building a High-Performance Culture
The Engagement Engine: Strategies for Building a High-Performance Culture
Aggregage
 
Driving Business Impact for PMs with Jon Harmer
Driving Business Impact for PMs with Jon HarmerDriving Business Impact for PMs with Jon Harmer
Driving Business Impact for PMs with Jon Harmer
Aggregage
 
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Aggregage
 
The Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
The Retention Ripple Effect: Nonprofit Staff and Donor DynamicsThe Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
The Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
Aggregage
 
Breaking the Burnout Cycle: Empowering Managers for Excellence
Breaking the Burnout Cycle: Empowering Managers for ExcellenceBreaking the Burnout Cycle: Empowering Managers for Excellence
Breaking the Burnout Cycle: Empowering Managers for Excellence
Aggregage
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Aggregage
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
Aggregage
 
How to Build an Experimentation Culture for Data-Driven Product Development
How to Build an Experimentation Culture for Data-Driven Product DevelopmentHow to Build an Experimentation Culture for Data-Driven Product Development
How to Build an Experimentation Culture for Data-Driven Product Development
Aggregage
 
Bridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
Bridging the Gap: The Intersection of DEI Initiatives and Employee BenefitsBridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
Bridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
Aggregage
 
Mapping Digital Transformation: Retail’s Strategic Shift
Mapping Digital Transformation: Retail’s Strategic ShiftMapping Digital Transformation: Retail’s Strategic Shift
Mapping Digital Transformation: Retail’s Strategic Shift
Aggregage
 

More from Aggregage (20)

Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
Unlocking Employee Potential with the Power of Continuous Feedback
Unlocking Employee Potential with the Power of Continuous FeedbackUnlocking Employee Potential with the Power of Continuous Feedback
Unlocking Employee Potential with the Power of Continuous Feedback
 
The Key to Sustainable Energy Optimization: A Data-Driven Approach for Manufa...
The Key to Sustainable Energy Optimization: A Data-Driven Approach for Manufa...The Key to Sustainable Energy Optimization: A Data-Driven Approach for Manufa...
The Key to Sustainable Energy Optimization: A Data-Driven Approach for Manufa...
 
From Awareness to Action: An HR Guide to Making Accessibility Accessible
From Awareness to Action:  An HR Guide to Making Accessibility AccessibleFrom Awareness to Action:  An HR Guide to Making Accessibility Accessible
From Awareness to Action: An HR Guide to Making Accessibility Accessible
 
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
 
How to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail SuccessHow to Leverage Behavioral Science Insights for Direct Mail Success
How to Leverage Behavioral Science Insights for Direct Mail Success
 
Sales & Marketing Alignment_ How to Synergize for Success.pptx.pdf
Sales & Marketing Alignment_ How to Synergize for Success.pptx.pdfSales & Marketing Alignment_ How to Synergize for Success.pptx.pdf
Sales & Marketing Alignment_ How to Synergize for Success.pptx.pdf
 
Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
How Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of ReportingHow Automation is Driving Efficiency Through the Last Mile of Reporting
How Automation is Driving Efficiency Through the Last Mile of Reporting
 
Planning your Restaurant's Path to Profitability
Planning your Restaurant's Path to ProfitabilityPlanning your Restaurant's Path to Profitability
Planning your Restaurant's Path to Profitability
 
The Engagement Engine: Strategies for Building a High-Performance Culture
The Engagement Engine: Strategies for Building a High-Performance CultureThe Engagement Engine: Strategies for Building a High-Performance Culture
The Engagement Engine: Strategies for Building a High-Performance Culture
 
Driving Business Impact for PMs with Jon Harmer
Driving Business Impact for PMs with Jon HarmerDriving Business Impact for PMs with Jon Harmer
Driving Business Impact for PMs with Jon Harmer
 
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
Strategic Project Finance Essentials: A Project Manager’s Guide to Financial ...
 
The Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
The Retention Ripple Effect: Nonprofit Staff and Donor DynamicsThe Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
The Retention Ripple Effect: Nonprofit Staff and Donor Dynamics
 
Breaking the Burnout Cycle: Empowering Managers for Excellence
Breaking the Burnout Cycle: Empowering Managers for ExcellenceBreaking the Burnout Cycle: Empowering Managers for Excellence
Breaking the Burnout Cycle: Empowering Managers for Excellence
 
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for ClarityStrategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
Strategic CX: A Deep Dive into Voice of the Customer Insights for Clarity
 
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
The Data Metaverse: Unpacking the Roles, Use Cases, and Tech Trends in Data a...
 
How to Build an Experimentation Culture for Data-Driven Product Development
How to Build an Experimentation Culture for Data-Driven Product DevelopmentHow to Build an Experimentation Culture for Data-Driven Product Development
How to Build an Experimentation Culture for Data-Driven Product Development
 
Bridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
Bridging the Gap: The Intersection of DEI Initiatives and Employee BenefitsBridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
Bridging the Gap: The Intersection of DEI Initiatives and Employee Benefits
 
Mapping Digital Transformation: Retail’s Strategic Shift
Mapping Digital Transformation: Retail’s Strategic ShiftMapping Digital Transformation: Retail’s Strategic Shift
Mapping Digital Transformation: Retail’s Strategic Shift
 

Recently uploaded

Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Jeffrey Haguewood
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Product School
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
Fwdays
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
OnBoard
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
Bhaskar Mitra
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
Safe Software
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
91mobiles
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
CatarinaPereira64715
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance
 

Recently uploaded (20)

Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
Slack (or Teams) Automation for Bonterra Impact Management (fka Social Soluti...
 
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
Unsubscribed: Combat Subscription Fatigue With a Membership Mentality by Head...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi"Impact of front-end architecture on development cost", Viktor Turskyi
"Impact of front-end architecture on development cost", Viktor Turskyi
 
Leading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdfLeading Change strategies and insights for effective change management pdf 1.pdf
Leading Change strategies and insights for effective change management pdf 1.pdf
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Search and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical FuturesSearch and Society: Reimagining Information Access for Radical Futures
Search and Society: Reimagining Information Access for Radical Futures
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
Essentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with ParametersEssentials of Automations: Optimizing FME Workflows with Parameters
Essentials of Automations: Optimizing FME Workflows with Parameters
 
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdfSmart TV Buyer Insights Survey 2024 by 91mobiles.pdf
Smart TV Buyer Insights Survey 2024 by 91mobiles.pdf
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
ODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User GroupODC, Data Fabric and Architecture User Group
ODC, Data Fabric and Architecture User Group
 
FIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdfFIDO Alliance Osaka Seminar: Overview.pdf
FIDO Alliance Osaka Seminar: Overview.pdf
 

LLMs in Production: Tooling, Process, and Team Structure

  • 1. LLMs in Production: Tooling, Process, and Team Structure December 6th, 2023 10:30am PST
  • 2. Have a question or comment for our panelists? Use this QR code to engage with our speakers, or visit the link in the chat! Having an audio issue? Try dialing in by phone! Dial: +1 312 626 6799 Webinar ID: 819 1469 6007 Passcode: 385318 Closed Captioning is available for this webinar!
  • 3. Our Panelists Tony Karrer Founder & CEO TechEmpower, Founder & CTO Aggregage Greg Loughnane Founder & CEO of AI Makerspace Chris Alexiuk Co-Founder & CTO at AI Makerspace
  • 5. BY THE END OF TODAY... Understand processes for building and improving production LLM applications Overview of industry-standard tooling How to leverage LangSmith
  • 6. OVERVIEW LLM Ops, LLM OS, and “The New Stack” Leading Tooling Meet LangSmith Conclusions, Q&A
  • 8. What is LLM Ops?
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 15. ENTERPRISE BUILDS Synthetic data, closed-source models 1. Baseline performance a. Open-source models 2. Add your private data a. Iterate 3. Optimize models, data, metrics, inference, efficiency a.
  • 16. NEXT-LEVEL ❓Some additional questions: On premise hardware? What scale and speed? Training proprietary LLM?
  • 17. Small Language Models (SLMs) Efficiency Transparency Accuracy Security
  • 19. 🧩PROTOTYPING LLM APPS Prompt Engineering 1. Question Answering Systems 2. Fine-Tuning Models 3.
  • 20.
  • 21.
  • 22.
  • 23. “THE NEW STACK AND OPS FOR AI” #LLM OPS
  • 24. USER EXPERIENCE Control for uncertainty (PE) Build guardrails for steerability and safety (Harm/Help)
  • 25. MODEL CONSISTENCY Constrain model behavior (FT) Ground the model (RAG)
  • 26. EVALUATING PERF Create evaluation suites (RAGAS) Use model-graded evals (GPT-4)
  • 27. LATENCY AND COST Use semantic caching (Prompts) Route to cheaper models (FT)
  • 28.
  • 29.
  • 32. OUR LLM OPS CURRICULUM 🧑‍💻Building LLM Applications in Pure Python 1. 🔗LangChain Powered RAG and Advanced Retrieval 2. 🦙Open-Source Production RAG with LlamaIndex 3. 🕴️Agents, 🧑‍💻Hackathon, and 🧑‍🏫Demo Day! 4.
  • 33. OUR LLM OPS CURRICULUM
  • 35. LLMs: OpenAI GPT-4-Turbo, Mistral 7B Evaluation: RAGAS, Built-in metrics Visibility: Weights and Biases Infrastructure: LangChain LlamaIndex Vector Database: Pinecone FAISS Embedding Models: OpenAI Ada MTEB Leaderboard User Interface: Chainlit Deployment: Hugging Face Amazon Bedrock
  • 37. BASIC RAG + ADVANCED RAG W/ LANGSMITH
  • 40. RETREIVAL AUGMENTED GENERATION Retrieval Finds references in your documents Augmented Adds references to your prompts Generation Improves answers to questions!
  • 41. SPECIALIZED DOMAINS Jargon, e.g.; Legal, healthcare, financial, insurance, government, research Alignment With common language that humans use
  • 44. 🧩3 EASY PIECES TO RETRIEVAL Ask a question 1. Search database for stuff similar to question 2. Return the stuff 3.
  • 45. 📇INDEX (THE DATABASE) Split docs into chunks 1. Create embeddings for each chunk 2. Store embeddings in vector store index 3. Embeddings Vector Store Index Documents Raw Source Documents Chunked Documents [0.1,0.4,-0.6,...] [0.2,0.3,-0.4,...] [0.8,0.3,-0.1,...]
  • 46. 🐕RETRIEVERS Embeddings Vector Store Index Documents Raw Source Documents Chunked Documents [0.1,0.4,-0.6,...] Query INPUT [0.1,0.4,-0.6,...] Find Nearest Neighbors Context: From source 1 Context: From source 2 Context: From source 🐕 [0.2,0.3,-0.4,...] [0.8,0.3,-0.1,...]
  • 47. [0.1, 0.4, -0.6, ...] Ryan was ... Query Find Nearest Neighbours (cosine similarity) Vector Database App Logic INPUT “Query...” Embedding Model
  • 48. [0.1, 0.4, -0.6, ...] Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Query Embedding Model Chat Model Prompt Templates INPUT “Query...” Find Nearest Neighbours (cosine similarity) Vector Database App Logic
  • 49. Embedding Model Chat Model Vector Store Find Nearest Neighbours (cosine similarity) Return document(s) from Nearest Neighbours [0.1, 0.4, -0.6, ...] Prompt Templates Vector Database App Logic App Logic Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Context: ref 1 Context: ref 2 Context: ref 3 Context: ref 4 Ryan was ... Query INPUT “Query”
  • 50. Embedding Model Chat Model Vector Store Find Nearest Neighbours (cosine similarity) Return document(s) from Nearest Neighbours [0.1, 0.4, -0.6, ...] Prompt Templates Vector Database App Logic App Logic Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Context: ref 1 Context: ref 2 Context: ref 3 Context: ref 4 Answer Query INPUT OUTPUT “Query”
  • 51. Embedding Model Chat Model Vector Store Find Nearest Neighbours (cosine similarity) Return document(s) from Nearest Neighbours [0.1, 0.4, -0.6, ...] Prompt Templates Vector Database App Logic App Logic Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Context: ref 1 Context: ref 2 Context: ref 3 Context: ref 4 Answer Query INPUT OUTPUT “Query” Dense Vector Retrieval In-Context Learning
  • 52. Embedding Model Chat Model Vector Store Find Nearest Neighbours (cosine similarity) Return document(s) from Nearest Neighbours [0.1, 0.4, -0.6, ...] Prompt Templates Vector Database App Logic App Logic Use the provided context to answer the user's query. You may not answer the user's query unless there is specific context in the following text. If you do not know the answer, or cannot answer, please respond with "I don't know". Context: {context} User Query: {user_query} Context: ref 1 Context: ref 2 Context: ref 3 Context: ref 4 Answer Query INPUT OUTPUT “Query” Dense Vector Retrieval In-Context Learning
  • 56. EMBEDDING & CHAT (LLM) MODEL Chat Model (e.g. , LLM) e.g,. OpenAI GPT-4 Embeddi ng Model e.g., Cohere V3 Embedding Model Chat Model
  • 58. Vector Database Qdrant Infrasturucture & Evaluation LangChain LangSmith DATABASE AND INFRASTRUCTURE Vector Database
  • 60. “The real power comes when you can combine them with other sources of computation or knowledge.” ~ Harrison Chase Creator of LangChain
  • 64.
  • 65. Search OpenAI blog for top k resources, rerank 1. Ask specific questions related to content 2. Return answers to questions with sources 3. OpenAI RAG Flow
  • 66. OPENAI RAG 👨‍💻 Presented By Chris Alexiuk, LLM Wizard 🪄
  • 67. HOW DO I IMPROVE?
  • 68.
  • 69. PROMPT ENGINEERING Check system-level prompting and one-shot/few-shot examples for alignment with your task e.g., varies
  • 70. RAG “Are you pulling the right references? e.g., context recall
  • 71. FINE-TUNING EMBEDDINGS Is your model understanding domain-specific language? e.g., hit rate
  • 72. AGENTS Is your model reasoning the way a human would? e.g., ???
  • 73. THE RISE OF THE AI ENGINEER
  • 74. THE AGE OF THE AI ENGINEER “A wide range of AI tasks that used to take 5 years and a research team to accomplish in 2013, now just require API docs and a spare afternoon in 2023.” It is now possible to build what used to take months in a single day!
  • 75. DATA SCIENTISTS! Enhance Retrieval (and thus Generation!) Fine-Tuning Embeddings Chat Models Evaluation
  • 76. CONCLUSIONS Best-practice tools are out there! LangSmith-like tooling is the most comprehensive Building Prompt Engineering, RAG, Fine-Tuning Improvement Depends on Building! Eval varies Lots of work for data scientist and AI Engineers!
  • 77. Q&A Tony Karrer Founder & CEO TechEmpower, Founder & CTO Aggregage Dr. Greg Loughnane Founder & CEO of AI Makerspace Chris Alexiuk Co-Founder & CTO at AI Makerspace Tara Dwyer Webinar Manager /in/tonykarrer/ aggregage.com /in/gregloughnane/ aimakerspace.io /in/csalexiuk/ aimakerspace.io /in/taradwyer/ artificialintelligencezone.com JOIN THE GENERATIVE AI FOR TECHNOLOGY LEADERS LINKEDIN GROUP FOR THOUGHTFUL DISCUSSION AND Q&A! VISIT THE LINK OR SCAN THE QR CODE! bit.ly/genaitechleaders