SlideShare a Scribd company logo
1 of 26
Download to read offline
1 | © Copyright 2024 Zilliz
1
Yujian Tang | Zilliz
Introduction to LLM
Customization
2 | © Copyright 2024 Zilliz
2
Yujian Tang
Senior Developer Advocate, Zilliz
yujian@zilliz.com
https://www.linkedin.com/in/yujiantang
https://www.twitter.com/yujian_tang
Speaker
3 | © Copyright 2024 Zilliz
3
01 Introduction to LLMs
CONTENTS
03
04 Examples
02 RAG
Fine Tuning
4 | © Copyright 2024 Zilliz
4
01 Introduction to LLMs
5 | © Copyright 2024 Zilliz
5
A Basic Neural Net
6 | © Copyright 2024 Zilliz
6
A Recurrent Neural Network
7 | © Copyright 2024 Zilliz
7
A Transformer Architecture
8 | © Copyright 2024 Zilliz
8
GPT
9 | © Copyright 2024 Zilliz
9
02 RAG
10 | © Copyright 2024 Zilliz
10
RAG
RAG
Inject your data via a vector
database like Milvus/Zilliz
Query LLM
Milvus
Your Data
Primary Use Case
- Factual Recall
- Forced Data Injection
- Cost Optimization
Embed
11 | © Copyright 2024 Zilliz
11
What Does Vector Data Look Like?
12 | © Copyright 2024 Zilliz
12
Find Semantically Similar Data
Apple made profits of $97 Billion in 2023
I like to eat apple pie for profit in 2023
Apple’s bottom line increased by record numbers in 2023
13 | © Copyright 2024 Zilliz
13
But wait! There’s more!
14 | © Copyright 2024 Zilliz
14
RAG lets us inject data via semantic similarity
provided by vector databases like Milvus
15 | © Copyright 2024 Zilliz
15
03 Fine Tuning
16 | © Copyright 2024 Zilliz
16
RAG vs Fine Tuning
LLM
Fine Tuning
Augment an LLM by training it
on your data
Your Data
“New” LLM
Query
Primary Use Case
- Style transfer
- Domain specific usage
17 | © Copyright 2024 Zilliz
17
18 | © Copyright 2024 Zilliz
18
19 | © Copyright 2024 Zilliz
19
Types of Fine Tuning
- Full fine tuning
- LoRA
- QLoRA
20 | © Copyright 2024 Zilliz
20
Fine Tuning Methods
- Supervised Fine Tuning
- Direct Preference Optimization
- 𝚿 (Identity) Preference Optimization
- Odds-Ratio Preference Optimization
21 | © Copyright 2024 Zilliz
21
Fine tuning lets us train LLMs to operate in certain
styles or domains
22 | © Copyright 2024 Zilliz
22
04 Examples
Give Milvus a
Star!
23 | © Copyright 2024 Zilliz
23
RAG without OpenAI project
24 | © Copyright 2024 Zilliz
24
AI Agents Projects
25 | © Copyright 2024 Zilliz
25
Fine Tuning Library
26 | © Copyright 2024 Zilliz
26
Start building
with Zilliz Cloud today!
zilliz.com/cloud

More Related Content

Similar to Introduction to Large Language Model Customization.pdf

Linthicum state of-the-art-cloud-platforms
Linthicum state of-the-art-cloud-platformsLinthicum state of-the-art-cloud-platforms
Linthicum state of-the-art-cloud-platforms
David Linthicum
 
Hybrid Cloud Keynote
Hybrid Cloud Keynote Hybrid Cloud Keynote
Hybrid Cloud Keynote
gcamarda
 
Leading in the Cloud – Oracle Modern Solution
Leading in the Cloud – Oracle Modern SolutionLeading in the Cloud – Oracle Modern Solution
Leading in the Cloud – Oracle Modern Solution
Mohammed Mojibur Raheman
 

Similar to Introduction to Large Language Model Customization.pdf (20)

Linthicum state of-the-art-cloud-platforms
Linthicum state of-the-art-cloud-platformsLinthicum state of-the-art-cloud-platforms
Linthicum state of-the-art-cloud-platforms
 
IDC datacenter of the future : Oracle point of view
IDC datacenter of the future : Oracle point of viewIDC datacenter of the future : Oracle point of view
IDC datacenter of the future : Oracle point of view
 
PI DX 2020 Atlanta - Data Management Strategy. _ How Do You Establish a Commo...
PI DX 2020 Atlanta - Data Management Strategy. _ How Do You Establish a Commo...PI DX 2020 Atlanta - Data Management Strategy. _ How Do You Establish a Commo...
PI DX 2020 Atlanta - Data Management Strategy. _ How Do You Establish a Commo...
 
IBM i Development: Increase Accuracy and Efficiency with SEQUEL's ABSTRACT a...
 IBM i Development: Increase Accuracy and Efficiency with SEQUEL's ABSTRACT a... IBM i Development: Increase Accuracy and Efficiency with SEQUEL's ABSTRACT a...
IBM i Development: Increase Accuracy and Efficiency with SEQUEL's ABSTRACT a...
 
Hybrid Cloud Keynote
Hybrid Cloud Keynote Hybrid Cloud Keynote
Hybrid Cloud Keynote
 
Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810
Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810
Hybrid cloud-for-flexible-accelerated-and-sustainable-it16-10-051475673810
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Secure your cloud applications by building solid foundations with enterprise ...
Secure your cloud applications by building solid foundations with enterprise ...Secure your cloud applications by building solid foundations with enterprise ...
Secure your cloud applications by building solid foundations with enterprise ...
 
Secure Clouds are Happy Clouds
Secure Clouds are Happy CloudsSecure Clouds are Happy Clouds
Secure Clouds are Happy Clouds
 
Neo4j & AWS Bedrock workshop at GraphSummit London 14 Nov 2023.pptx
Neo4j & AWS Bedrock workshop at GraphSummit London 14 Nov 2023.pptxNeo4j & AWS Bedrock workshop at GraphSummit London 14 Nov 2023.pptx
Neo4j & AWS Bedrock workshop at GraphSummit London 14 Nov 2023.pptx
 
How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)
How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)
How to run Real Time processing on Big Data / Ron Zavner (GigaSpaces)
 
7 Things You Need to Know for Your Cloud-First Strategy
7 Things You Need to Know for Your Cloud-First Strategy7 Things You Need to Know for Your Cloud-First Strategy
7 Things You Need to Know for Your Cloud-First Strategy
 
Achieving digital transformation with Siebel CRM and Oracle Cloud
Achieving digital transformation with Siebel CRM and Oracle Cloud Achieving digital transformation with Siebel CRM and Oracle Cloud
Achieving digital transformation with Siebel CRM and Oracle Cloud
 
Leading in the Cloud – Oracle Modern Solution
Leading in the Cloud – Oracle Modern SolutionLeading in the Cloud – Oracle Modern Solution
Leading in the Cloud – Oracle Modern Solution
 
A New Day for Oracle Analytics
A New Day for Oracle AnalyticsA New Day for Oracle Analytics
A New Day for Oracle Analytics
 
RightScale Webinar: Hybrid Cloud Fundamentals and Lessons Learned
RightScale Webinar: Hybrid Cloud Fundamentals and Lessons LearnedRightScale Webinar: Hybrid Cloud Fundamentals and Lessons Learned
RightScale Webinar: Hybrid Cloud Fundamentals and Lessons Learned
 
Final business intelligence in the cloud
Final   business intelligence in the cloudFinal   business intelligence in the cloud
Final business intelligence in the cloud
 
Unlocking Big Data Insights with MySQL
Unlocking Big Data Insights with MySQLUnlocking Big Data Insights with MySQL
Unlocking Big Data Insights with MySQL
 
Neo4j Keynote: The Art of the Possible with Graph Technology
Neo4j Keynote: The Art of the Possible with Graph TechnologyNeo4j Keynote: The Art of the Possible with Graph Technology
Neo4j Keynote: The Art of the Possible with Graph Technology
 
Decoding Cloud for the Non-IT Executive
Decoding Cloud for the Non-IT ExecutiveDecoding Cloud for the Non-IT Executive
Decoding Cloud for the Non-IT Executive
 

More from Zilliz

More from Zilliz (14)

Emergent Methods: Multilingual narrative tracking in the news - real-time exp...
Emergent Methods: Multilingual narrative tracking in the news - real-time exp...Emergent Methods: Multilingual narrative tracking in the news - real-time exp...
Emergent Methods: Multilingual narrative tracking in the news - real-time exp...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Zilliz - Overview of Generative models in ML
Zilliz - Overview of Generative models in MLZilliz - Overview of Generative models in ML
Zilliz - Overview of Generative models in ML
 
Integrating Multimodal AI in Your Apps with Floom
Integrating Multimodal AI in Your Apps with FloomIntegrating Multimodal AI in Your Apps with Floom
Integrating Multimodal AI in Your Apps with Floom
 
Build streaming LLM with Timeplus and Zilliz
Build streaming LLM with Timeplus and ZillizBuild streaming LLM with Timeplus and Zilliz
Build streaming LLM with Timeplus and Zilliz
 
Beyond Retrieval Augmented Generation (RAG): Vector Databases
Beyond Retrieval Augmented Generation (RAG): Vector DatabasesBeyond Retrieval Augmented Generation (RAG): Vector Databases
Beyond Retrieval Augmented Generation (RAG): Vector Databases
 
Voyage AI: cutting-edge embeddings and rerankers for search and RAG
Voyage AI: cutting-edge embeddings and rerankers for search and RAGVoyage AI: cutting-edge embeddings and rerankers for search and RAG
Voyage AI: cutting-edge embeddings and rerankers for search and RAG
 
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage CostLeverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
Leverage Zilliz Serverless - Up to 50X Saving for Your Vector Storage Cost
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Fact vs. Fiction: Autodetecting Hallucinations in LLMs
Fact vs. Fiction: Autodetecting Hallucinations in LLMsFact vs. Fiction: Autodetecting Hallucinations in LLMs
Fact vs. Fiction: Autodetecting Hallucinations in LLMs
 
VectorDB Schema Design 101 - Considerations for Building a Scalable and Perfo...
VectorDB Schema Design 101 - Considerations for Building a Scalable and Perfo...VectorDB Schema Design 101 - Considerations for Building a Scalable and Perfo...
VectorDB Schema Design 101 - Considerations for Building a Scalable and Perfo...
 
Voyage AI Embedding Models for Retrieval Augmented Generation
Voyage AI Embedding Models for Retrieval Augmented GenerationVoyage AI Embedding Models for Retrieval Augmented Generation
Voyage AI Embedding Models for Retrieval Augmented Generation
 
Chat with your data, privately and locally
Chat with your data, privately and locallyChat with your data, privately and locally
Chat with your data, privately and locally
 
Introducing Milvus and new features in 2.4 release
Introducing Milvus and new features in 2.4 releaseIntroducing Milvus and new features in 2.4 release
Introducing Milvus and new features in 2.4 release
 

Recently uploaded

Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
panagenda
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
FIDO Alliance
 

Recently uploaded (20)

Design and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data ScienceDesign and Development of a Provenance Capture Platform for Data Science
Design and Development of a Provenance Capture Platform for Data Science
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Vector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptxVector Search @ sw2con for slideshare.pptx
Vector Search @ sw2con for slideshare.pptx
 
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
Easier, Faster, and More Powerful – Alles Neu macht der Mai -Wir durchleuchte...
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
JavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate GuideJavaScript Usage Statistics 2024 - The Ultimate Guide
JavaScript Usage Statistics 2024 - The Ultimate Guide
 
The Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and InsightThe Zero-ETL Approach: Enhancing Data Agility and Insight
The Zero-ETL Approach: Enhancing Data Agility and Insight
 
2024 May Patch Tuesday
2024 May Patch Tuesday2024 May Patch Tuesday
2024 May Patch Tuesday
 
Oauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoftOauth 2.0 Introduction and Flows with MuleSoft
Oauth 2.0 Introduction and Flows with MuleSoft
 
WebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM PerformanceWebAssembly is Key to Better LLM Performance
WebAssembly is Key to Better LLM Performance
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
ChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps ProductivityChatGPT and Beyond - Elevating DevOps Productivity
ChatGPT and Beyond - Elevating DevOps Productivity
 
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptxHarnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
Harnessing Passkeys in the Battle Against AI-Powered Cyber Threats.pptx
 
Top 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development CompaniesTop 10 CodeIgniter Development Companies
Top 10 CodeIgniter Development Companies
 
Introduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptxIntroduction to FIDO Authentication and Passkeys.pptx
Introduction to FIDO Authentication and Passkeys.pptx
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
How to Check GPS Location with a Live Tracker in Pakistan
How to Check GPS Location with a Live Tracker in PakistanHow to Check GPS Location with a Live Tracker in Pakistan
How to Check GPS Location with a Live Tracker in Pakistan
 
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on ThanabotsContinuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
Continuing Bonds Through AI: A Hermeneutic Reflection on Thanabots
 

Introduction to Large Language Model Customization.pdf

  • 1. 1 | © Copyright 2024 Zilliz 1 Yujian Tang | Zilliz Introduction to LLM Customization
  • 2. 2 | © Copyright 2024 Zilliz 2 Yujian Tang Senior Developer Advocate, Zilliz yujian@zilliz.com https://www.linkedin.com/in/yujiantang https://www.twitter.com/yujian_tang Speaker
  • 3. 3 | © Copyright 2024 Zilliz 3 01 Introduction to LLMs CONTENTS 03 04 Examples 02 RAG Fine Tuning
  • 4. 4 | © Copyright 2024 Zilliz 4 01 Introduction to LLMs
  • 5. 5 | © Copyright 2024 Zilliz 5 A Basic Neural Net
  • 6. 6 | © Copyright 2024 Zilliz 6 A Recurrent Neural Network
  • 7. 7 | © Copyright 2024 Zilliz 7 A Transformer Architecture
  • 8. 8 | © Copyright 2024 Zilliz 8 GPT
  • 9. 9 | © Copyright 2024 Zilliz 9 02 RAG
  • 10. 10 | © Copyright 2024 Zilliz 10 RAG RAG Inject your data via a vector database like Milvus/Zilliz Query LLM Milvus Your Data Primary Use Case - Factual Recall - Forced Data Injection - Cost Optimization Embed
  • 11. 11 | © Copyright 2024 Zilliz 11 What Does Vector Data Look Like?
  • 12. 12 | © Copyright 2024 Zilliz 12 Find Semantically Similar Data Apple made profits of $97 Billion in 2023 I like to eat apple pie for profit in 2023 Apple’s bottom line increased by record numbers in 2023
  • 13. 13 | © Copyright 2024 Zilliz 13 But wait! There’s more!
  • 14. 14 | © Copyright 2024 Zilliz 14 RAG lets us inject data via semantic similarity provided by vector databases like Milvus
  • 15. 15 | © Copyright 2024 Zilliz 15 03 Fine Tuning
  • 16. 16 | © Copyright 2024 Zilliz 16 RAG vs Fine Tuning LLM Fine Tuning Augment an LLM by training it on your data Your Data “New” LLM Query Primary Use Case - Style transfer - Domain specific usage
  • 17. 17 | © Copyright 2024 Zilliz 17
  • 18. 18 | © Copyright 2024 Zilliz 18
  • 19. 19 | © Copyright 2024 Zilliz 19 Types of Fine Tuning - Full fine tuning - LoRA - QLoRA
  • 20. 20 | © Copyright 2024 Zilliz 20 Fine Tuning Methods - Supervised Fine Tuning - Direct Preference Optimization - 𝚿 (Identity) Preference Optimization - Odds-Ratio Preference Optimization
  • 21. 21 | © Copyright 2024 Zilliz 21 Fine tuning lets us train LLMs to operate in certain styles or domains
  • 22. 22 | © Copyright 2024 Zilliz 22 04 Examples Give Milvus a Star!
  • 23. 23 | © Copyright 2024 Zilliz 23 RAG without OpenAI project
  • 24. 24 | © Copyright 2024 Zilliz 24 AI Agents Projects
  • 25. 25 | © Copyright 2024 Zilliz 25 Fine Tuning Library
  • 26. 26 | © Copyright 2024 Zilliz 26 Start building with Zilliz Cloud today! zilliz.com/cloud