SlideShare a Scribd company logo
1 of 17
H2O.ai Confidential
SANDEEP SINGH
Head of Applied AI, Beans.AI
H2O.ai Confidential
Building LLM Solutions using Open Source and Closed Source Solutions
in Coherent Manner
H2O.ai Confidential
Agenda
Introduction
Ecosystem of Open Source LLM Tools
The Significance of blending Open and Closed
Source LLMs
Open Source LLMs: Features and Benefits
Closed Source LLMs: Advantages and Use-Cases
Integration Strategies: Methodologies
Case Study Highlights and Learning from the
Industry Dataset Enrichment.
Low to No Code Fine-Tuning Techniques
Conclusion
v
H2O.ai Confidential
About Me
• Head of Applied AI/Computer Vision, Beans AI
• Beans AI based out of Palo Alto, CA
• We are Location Intelligence Platform.
• Hyper-Accurate Maps, Much more accurate than
Google, Apple and Bing for apartments.
• Computer Vision and Image based Synthesis is
inherent part of Innovation at Beans Maps
• I deal with Satellite Imagery, Location Data,
Convexity Optimization domains at my day to day
job.
• Holds Masters from Georgia Tech.
v
H2O.ai Confidential
Ecosystem of LLMs
GPT-4
PaLM
SageMaker
Neo
IBM
Watson
Salesforce
Einstein
Wu Dao 2.0
Clarifai
Cohere
Anthropic
Claude
MT-NLG
LLaMA 2
Falcon-
40B/180B
Vicuna 33B
MPT-30B
GPT-NeoX-
20B
CodeGen
GPT-J
OPT-175B
BLOOM
Baichuan-
13B
Proprietary/Closed Source Open Source
v
H2O.ai Confidential
Open Source LLMs: Benefits
● Enhanced data security and privacy: Self-hosted deployment
● Cost savings: No licensing/subscription fees and no API calls expenditure.
● No External Dependency: No reliance on select few vendor avoiding lock-in.
● Code transparency and Constructive Collaboration/Validation: Underlying code and methodologies are
vetted for functionality by community.
● Language Model Customization: Domain Adaptation is more manageable with open-source LLMs by Fine-
tuning.
● Active Community Support: Often thriving communities , Quicker issue resolution, access to resources and
collaborative.
● Fosters innovation: Open-source LLMs encourage innovation by enabling organizations to experiment and
build upon existing models.
● Boon for Startups: Leverage models as a foundation for creative and unique applications.
v
H2O.ai Confidential
Closed Source LLMs: Advantages
● Support and Reliability: Vendor Support, Professional assistance, Maintenance, Troubleshooting, SLA Requirements
● Customization for Specific Business Needs: Accommodate Unique requirements of a business
● Security and Data Privacy: May offer more robust security features and privacy assurances critical for industries with
sensitive data.
● Performance: Regular optimizations and enhancements for better performance for specific tasks or industry
● Integration with Proprietary Systems: Dedicated tooling support to use existing proprietary software stacks within an
organization to avoid extensive re-engineering.
● Compliance and Liability: For regulated industries greater assurance of compliance.
● Guardrails Ownership: Responsibility for compliance often falls on the vendor, reducing the legal and financial risks for
the user.
● Continuous Development and Updates: Dedicated teams to keep up-to-date cutting edge
● Commercial Viability: Better viability for businesses with limited resource and investments, Enable quicker feature
developments.
v
H2O.ai Confidential
Best of Both Worlds
● Number of options available.
● Possibility of exploiting more than one solution.
● In-house Performance comparison for “your” task, Not just a benchmark.
● Different baselines for particular domain adaptation.
● Amount of fine tuning needs are not same for similar tasks.
● Possibility of using specific LLM solution for specific task in pipeline.
● Combinations available to hyper ensemble these LLM solutions.
● Ability to pick and choose LLMs without affecting other LLMs in play.
v
H2O.ai Confidential
Integration Strategies
● Grunualize the task at hand:
Break LLM “initiative” into LLM “tasks”
● Categorize the tasks by Stochasticity Tolerance and Criticality:
Different LLM solutions pose varying degree of temperature sensitivity.
● Less tolerant tasks are candidate for Proprietary Off-the-shelf solutions.
● More tolerant tasks are candidate for Open Source with no or less fine tuning needs.
v
H2O.ai Confidential
Case Study Highlights
● At Beans.AI, we use combination of approaches like:
Few tasks are achieved using Prompt Engineering/RAG based approach.
Few tasks are achieved using Limited to moderate Fine Tuning.
● Both Closed source and Open Source LLMs are used.
● Responses from Closed Source LLMs are used by Open Source LLMs and vice versa in pipeline.
● Used for automated support, Insights from dashboard, automated email order etc.
v
H2O.ai Confidential
Dataset Enrichment
● No, NOT THAT data enrichment!
● Most of the time:
For “your” purpose, you need “your” data.
● “Your” data is limited by:
Quantity, Quality and Variety
● LLMs are used to overcome:
Quantity: By creating more samples of data
Quality: By working with humans in the loop type setup
Variety: By revising and rewriting intents in many different possible ways.
v
H2O.ai Confidential
Dataset Enrichment(cond.)
Example:
● Task: Question Answering Bot for your particular app. Say: Delivery Support App.
● Interaction: Delivery Driver asks a question in the app and expects “how-to” type response.
Question: How do I mark an address not deliverable in the app?
Candidate Answer: Explains the steps to do the same.
● Current Training Data: Set of Questions and Answers in knowledge article.
Enrichment Step:
Prompt engineered app to create variations of your domain specific questions as:
“Ask the above question in 20 different ways”
All these new 20 ways of asking the “same” question, create new training examples for you.
v
H2O.ai Confidential
Dataset Enrichment(cond.)
All of these questions below ask the EXACT same thing!
● What's the process for labeling an address as undeliverable within the application?
● Can you guide me through the steps to indicate that an address is non-deliverable in the app?
● How can I flag an address as undeliverable when using the app?
● What is the method for setting an address to 'not deliverable' status in the application?
● Is there a way to mark an address as 'cannot be delivered to' in the app interface?
● Could you explain how to designate an address as not deliverable on the app?
● I'm looking to mark an address as non-deliverable in the app; how do I do that?
● How does one go about indicating that an address is not serviceable in the app?
● In the app, what are the steps to mark an address as one that can't be delivered to?
● What’s the procedure to flag an address as 'not deliverable' in the app's system?
v
H2O.ai Confidential
Case Optimization
● We at Beans.AI use LLMs o analyze the pipeline to be used on the fly.
● E.g.
Super Set of Tasks For the Jobs:
Task 1, Task 2, Task 3, Task 4, Task 5, Task 6
Set of Tasks actually need for “job” instance:
Task 2, Task 4 and Task 6 Only.
● Proprietary LLMs with higher reasoning and guardrails is used to find the Tasks needed to be run.
● Short-listed Tasks can be run with actual sensitive data uses locally deployed Open Source LLM.
v
H2O.ai Confidential
Low to No Code Fine-Tuning Techniques
● H2O LLM Studio: Equivalent to Stable Diffusion’s Automatic1111 or ComfyUI.
● Fine-tune Open Source LLMs without any coding, While can extensible with code.
● GUI specially for LLMs.
● Support for hyperparameters specific to finetune of LLMs.
● Support Low-Rank Adaptation (LoRA) and lower quantization to achieve lean memory footprint.
● Model Performance Tracking in UI.
● Test the fine-tuned model by testing it to get instant feedback.
● Most Important Enabler: Almost touch-less export to Hugging Face Hub.
v
H2O.ai Confidential
Low to No Code Fine-Tuning Techniques(cond.)
My first fine-tuning using LLM Studio took almost same time as this presentation!
H2O.ai Confidential

More Related Content

Similar to Building LLM Solutions using Open Source and Closed Source Solutions in Coherent Manner

Sergio Juarez, Elemica – “From Big Data to Value: The Power of Master Data Ma...
Sergio Juarez, Elemica – “From Big Data to Value: The Power of Master Data Ma...Sergio Juarez, Elemica – “From Big Data to Value: The Power of Master Data Ma...
Sergio Juarez, Elemica – “From Big Data to Value: The Power of Master Data Ma...Elemica
 
What are the Best Practices for Enterprise Software Applications?
What are the Best Practices for Enterprise Software Applications?What are the Best Practices for Enterprise Software Applications?
What are the Best Practices for Enterprise Software Applications?BoTree Technologies
 
Harish software engineer (rpa) 4+ yrs exp
Harish software engineer (rpa) 4+ yrs expHarish software engineer (rpa) 4+ yrs exp
Harish software engineer (rpa) 4+ yrs expHarish M
 
Evolving to Cloud-Native - Anand Rao
Evolving to Cloud-Native - Anand RaoEvolving to Cloud-Native - Anand Rao
Evolving to Cloud-Native - Anand RaoVMware Tanzu
 
Mendix Essentials Presentatie Gerolf Roovers26/08/2011
Mendix Essentials Presentatie Gerolf Roovers26/08/2011Mendix Essentials Presentatie Gerolf Roovers26/08/2011
Mendix Essentials Presentatie Gerolf Roovers26/08/2011Mendix
 
Assure - Complete Standardization for HP ALM
Assure - Complete Standardization for HP ALMAssure - Complete Standardization for HP ALM
Assure - Complete Standardization for HP ALMAssure
 
A Comprehensive Breakdown of Low Code, No Code and Traditional Development.pdf
A Comprehensive Breakdown of Low Code, No Code and Traditional Development.pdfA Comprehensive Breakdown of Low Code, No Code and Traditional Development.pdf
A Comprehensive Breakdown of Low Code, No Code and Traditional Development.pdfExpert App Devs
 
THE OPEN SOURCE OPPORTUNITY: Monetizing Open Source Though Partnerships
THE OPEN SOURCE OPPORTUNITY: Monetizing Open Source Though PartnershipsTHE OPEN SOURCE OPPORTUNITY: Monetizing Open Source Though Partnerships
THE OPEN SOURCE OPPORTUNITY: Monetizing Open Source Though Partnershipscyberopticgroup
 
CollabSphere 2020 - ANA101 - Domino Application Strategy Key insights for suc...
CollabSphere 2020 - ANA101 - Domino Application Strategy Key insights for suc...CollabSphere 2020 - ANA101 - Domino Application Strategy Key insights for suc...
CollabSphere 2020 - ANA101 - Domino Application Strategy Key insights for suc...panagenda
 
[APIdays Singapore 2019] Managing the API lifecycle with Open Source Technolo...
[APIdays Singapore 2019] Managing the API lifecycle with Open Source Technolo...[APIdays Singapore 2019] Managing the API lifecycle with Open Source Technolo...
[APIdays Singapore 2019] Managing the API lifecycle with Open Source Technolo...WSO2
 
Data science tools of the trade
Data science tools of the tradeData science tools of the trade
Data science tools of the tradeFangda Wang
 
How to become a Software Engineer Carrier Path for Software Developer
How to become a Software Engineer Carrier Path for Software DeveloperHow to become a Software Engineer Carrier Path for Software Developer
How to become a Software Engineer Carrier Path for Software Developerjeetendra mandal
 
Introduce Test Harness for Direct To Consumer Solutions.pdf
Introduce Test Harness for Direct To Consumer Solutions.pdfIntroduce Test Harness for Direct To Consumer Solutions.pdf
Introduce Test Harness for Direct To Consumer Solutions.pdfKnoldus Inc.
 

Similar to Building LLM Solutions using Open Source and Closed Source Solutions in Coherent Manner (20)

Sergio Juarez, Elemica – “From Big Data to Value: The Power of Master Data Ma...
Sergio Juarez, Elemica – “From Big Data to Value: The Power of Master Data Ma...Sergio Juarez, Elemica – “From Big Data to Value: The Power of Master Data Ma...
Sergio Juarez, Elemica – “From Big Data to Value: The Power of Master Data Ma...
 
prod-dev-management.pptx
prod-dev-management.pptxprod-dev-management.pptx
prod-dev-management.pptx
 
What are the Best Practices for Enterprise Software Applications?
What are the Best Practices for Enterprise Software Applications?What are the Best Practices for Enterprise Software Applications?
What are the Best Practices for Enterprise Software Applications?
 
Harish software engineer (rpa) 4+ yrs exp
Harish software engineer (rpa) 4+ yrs expHarish software engineer (rpa) 4+ yrs exp
Harish software engineer (rpa) 4+ yrs exp
 
Pooja_resume_5.10
Pooja_resume_5.10Pooja_resume_5.10
Pooja_resume_5.10
 
Evolving to Cloud-Native - Anand Rao
Evolving to Cloud-Native - Anand RaoEvolving to Cloud-Native - Anand Rao
Evolving to Cloud-Native - Anand Rao
 
BUDDY White Paper
BUDDY White PaperBUDDY White Paper
BUDDY White Paper
 
Mendix Essentials Presentatie Gerolf Roovers26/08/2011
Mendix Essentials Presentatie Gerolf Roovers26/08/2011Mendix Essentials Presentatie Gerolf Roovers26/08/2011
Mendix Essentials Presentatie Gerolf Roovers26/08/2011
 
Assure - Complete Standardization for HP ALM
Assure - Complete Standardization for HP ALMAssure - Complete Standardization for HP ALM
Assure - Complete Standardization for HP ALM
 
A Comprehensive Breakdown of Low Code, No Code and Traditional Development.pdf
A Comprehensive Breakdown of Low Code, No Code and Traditional Development.pdfA Comprehensive Breakdown of Low Code, No Code and Traditional Development.pdf
A Comprehensive Breakdown of Low Code, No Code and Traditional Development.pdf
 
THE OPEN SOURCE OPPORTUNITY: Monetizing Open Source Though Partnerships
THE OPEN SOURCE OPPORTUNITY: Monetizing Open Source Though PartnershipsTHE OPEN SOURCE OPPORTUNITY: Monetizing Open Source Though Partnerships
THE OPEN SOURCE OPPORTUNITY: Monetizing Open Source Though Partnerships
 
CollabSphere 2020 - ANA101 - Domino Application Strategy Key insights for suc...
CollabSphere 2020 - ANA101 - Domino Application Strategy Key insights for suc...CollabSphere 2020 - ANA101 - Domino Application Strategy Key insights for suc...
CollabSphere 2020 - ANA101 - Domino Application Strategy Key insights for suc...
 
[APIdays Singapore 2019] Managing the API lifecycle with Open Source Technolo...
[APIdays Singapore 2019] Managing the API lifecycle with Open Source Technolo...[APIdays Singapore 2019] Managing the API lifecycle with Open Source Technolo...
[APIdays Singapore 2019] Managing the API lifecycle with Open Source Technolo...
 
mohan_p1
mohan_p1mohan_p1
mohan_p1
 
Data science tools of the trade
Data science tools of the tradeData science tools of the trade
Data science tools of the trade
 
BVT_Swamy_Abap_4
BVT_Swamy_Abap_4BVT_Swamy_Abap_4
BVT_Swamy_Abap_4
 
How to become a Software Engineer Carrier Path for Software Developer
How to become a Software Engineer Carrier Path for Software DeveloperHow to become a Software Engineer Carrier Path for Software Developer
How to become a Software Engineer Carrier Path for Software Developer
 
Introduce Test Harness for Direct To Consumer Solutions.pdf
Introduce Test Harness for Direct To Consumer Solutions.pdfIntroduce Test Harness for Direct To Consumer Solutions.pdf
Introduce Test Harness for Direct To Consumer Solutions.pdf
 
Karuna Resume
Karuna ResumeKaruna Resume
Karuna Resume
 
DevOps for Enterprise Systems - Rosalind Radcliffe
DevOps for Enterprise Systems - Rosalind RadcliffeDevOps for Enterprise Systems - Rosalind Radcliffe
DevOps for Enterprise Systems - Rosalind Radcliffe
 

More from Sri Ambati

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DaySri Ambati
 
Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxSri Ambati
 
AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek Sri Ambati
 
LLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thSri Ambati
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionSri Ambati
 
Risk Management for LLMs
Risk Management for LLMsRisk Management for LLMs
Risk Management for LLMsSri Ambati
 
Open-Source AI: Community is the Way
Open-Source AI: Community is the WayOpen-Source AI: Community is the Way
Open-Source AI: Community is the WaySri Ambati
 
Building Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OBuilding Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OSri Ambati
 
Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Sri Ambati
 
Cutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersCutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersSri Ambati
 
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Sri Ambati
 
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Sri Ambati
 
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...Sri Ambati
 
LLM Interpretability
LLM Interpretability LLM Interpretability
LLM Interpretability Sri Ambati
 
Never Reply to an Email Again
Never Reply to an Email AgainNever Reply to an Email Again
Never Reply to an Email AgainSri Ambati
 
Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Sri Ambati
 
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...Sri Ambati
 
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...Sri Ambati
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneySri Ambati
 
ML Model Deployment and Scoring on the Edge with Automatic ML & DF
ML Model Deployment and Scoring on the Edge with Automatic ML & DFML Model Deployment and Scoring on the Edge with Automatic ML & DF
ML Model Deployment and Scoring on the Edge with Automatic ML & DFSri Ambati
 

More from Sri Ambati (20)

H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo DayH2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
H2O.ai CEO/Founder: Sri Ambati Keynote at Wells Fargo Day
 
Generative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptxGenerative AI Masterclass - Model Risk Management.pptx
Generative AI Masterclass - Model Risk Management.pptx
 
AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek AI and the Future of Software Development: A Sneak Peek
AI and the Future of Software Development: A Sneak Peek
 
LLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5thLLMOps: Match report from the top of the 5th
LLMOps: Match report from the top of the 5th
 
Building, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for ProductionBuilding, Evaluating, and Optimizing your RAG App for Production
Building, Evaluating, and Optimizing your RAG App for Production
 
Risk Management for LLMs
Risk Management for LLMsRisk Management for LLMs
Risk Management for LLMs
 
Open-Source AI: Community is the Way
Open-Source AI: Community is the WayOpen-Source AI: Community is the Way
Open-Source AI: Community is the Way
 
Building Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2OBuilding Custom GenAI Apps at H2O
Building Custom GenAI Apps at H2O
 
Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical Applied Gen AI for the Finance Vertical
Applied Gen AI for the Finance Vertical
 
Cutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM PapersCutting Edge Tricks from LLM Papers
Cutting Edge Tricks from LLM Papers
 
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
Practitioner's Guide to LLMs: Exploring Use Cases and a Glimpse Beyond Curren...
 
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
Open Source h2oGPT with Retrieval Augmented Generation (RAG), Web Search, and...
 
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
KGM Mastering Classification and Regression with LLMs: Insights from Kaggle C...
 
LLM Interpretability
LLM Interpretability LLM Interpretability
LLM Interpretability
 
Never Reply to an Email Again
Never Reply to an Email AgainNever Reply to an Email Again
Never Reply to an Email Again
 
Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)Introducción al Aprendizaje Automatico con H2O-3 (1)
Introducción al Aprendizaje Automatico con H2O-3 (1)
 
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
From Rapid Prototypes to an end-to-end Model Deployment: an AI Hedge Fund Use...
 
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
AI Foundations Course Module 1 - Shifting to the Next Step in Your AI Transfo...
 
AI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation JourneyAI Foundations Course Module 1 - An AI Transformation Journey
AI Foundations Course Module 1 - An AI Transformation Journey
 
ML Model Deployment and Scoring on the Edge with Automatic ML & DF
ML Model Deployment and Scoring on the Edge with Automatic ML & DFML Model Deployment and Scoring on the Edge with Automatic ML & DF
ML Model Deployment and Scoring on the Edge with Automatic ML & DF
 

Recently uploaded

Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxMarkSteadman7
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringWSO2
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistandanishmna97
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformWSO2
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Victor Rentea
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAnitaRaj43
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Victor Rentea
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAndrey Devyatkin
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Orbitshub
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard37
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamUiPathCommunity
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....rightmanforbloodline
 

Recently uploaded (20)

Simplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptxSimplifying Mobile A11y Presentation.pptx
Simplifying Mobile A11y Presentation.pptx
 
Choreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software EngineeringChoreo: Empowering the Future of Enterprise Software Engineering
Choreo: Empowering the Future of Enterprise Software Engineering
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data PlatformLess Is More: Utilizing Ballerina to Architect a Cloud Data Platform
Less Is More: Utilizing Ballerina to Architect a Cloud Data Platform
 
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
Modular Monolith - a Practical Alternative to Microservices @ Devoxx UK 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
AI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by AnitarajAI in Action: Real World Use Cases by Anitaraj
AI in Action: Real World Use Cases by Anitaraj
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
JohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptxJohnPollard-hybrid-app-RailsConf2024.pptx
JohnPollard-hybrid-app-RailsConf2024.pptx
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
TEST BANK For Principles of Anatomy and Physiology, 16th Edition by Gerard J....
 

Building LLM Solutions using Open Source and Closed Source Solutions in Coherent Manner

  • 1. H2O.ai Confidential SANDEEP SINGH Head of Applied AI, Beans.AI
  • 2. H2O.ai Confidential Building LLM Solutions using Open Source and Closed Source Solutions in Coherent Manner
  • 3. H2O.ai Confidential Agenda Introduction Ecosystem of Open Source LLM Tools The Significance of blending Open and Closed Source LLMs Open Source LLMs: Features and Benefits Closed Source LLMs: Advantages and Use-Cases Integration Strategies: Methodologies Case Study Highlights and Learning from the Industry Dataset Enrichment. Low to No Code Fine-Tuning Techniques Conclusion
  • 4. v H2O.ai Confidential About Me • Head of Applied AI/Computer Vision, Beans AI • Beans AI based out of Palo Alto, CA • We are Location Intelligence Platform. • Hyper-Accurate Maps, Much more accurate than Google, Apple and Bing for apartments. • Computer Vision and Image based Synthesis is inherent part of Innovation at Beans Maps • I deal with Satellite Imagery, Location Data, Convexity Optimization domains at my day to day job. • Holds Masters from Georgia Tech.
  • 5. v H2O.ai Confidential Ecosystem of LLMs GPT-4 PaLM SageMaker Neo IBM Watson Salesforce Einstein Wu Dao 2.0 Clarifai Cohere Anthropic Claude MT-NLG LLaMA 2 Falcon- 40B/180B Vicuna 33B MPT-30B GPT-NeoX- 20B CodeGen GPT-J OPT-175B BLOOM Baichuan- 13B Proprietary/Closed Source Open Source
  • 6. v H2O.ai Confidential Open Source LLMs: Benefits ● Enhanced data security and privacy: Self-hosted deployment ● Cost savings: No licensing/subscription fees and no API calls expenditure. ● No External Dependency: No reliance on select few vendor avoiding lock-in. ● Code transparency and Constructive Collaboration/Validation: Underlying code and methodologies are vetted for functionality by community. ● Language Model Customization: Domain Adaptation is more manageable with open-source LLMs by Fine- tuning. ● Active Community Support: Often thriving communities , Quicker issue resolution, access to resources and collaborative. ● Fosters innovation: Open-source LLMs encourage innovation by enabling organizations to experiment and build upon existing models. ● Boon for Startups: Leverage models as a foundation for creative and unique applications.
  • 7. v H2O.ai Confidential Closed Source LLMs: Advantages ● Support and Reliability: Vendor Support, Professional assistance, Maintenance, Troubleshooting, SLA Requirements ● Customization for Specific Business Needs: Accommodate Unique requirements of a business ● Security and Data Privacy: May offer more robust security features and privacy assurances critical for industries with sensitive data. ● Performance: Regular optimizations and enhancements for better performance for specific tasks or industry ● Integration with Proprietary Systems: Dedicated tooling support to use existing proprietary software stacks within an organization to avoid extensive re-engineering. ● Compliance and Liability: For regulated industries greater assurance of compliance. ● Guardrails Ownership: Responsibility for compliance often falls on the vendor, reducing the legal and financial risks for the user. ● Continuous Development and Updates: Dedicated teams to keep up-to-date cutting edge ● Commercial Viability: Better viability for businesses with limited resource and investments, Enable quicker feature developments.
  • 8. v H2O.ai Confidential Best of Both Worlds ● Number of options available. ● Possibility of exploiting more than one solution. ● In-house Performance comparison for “your” task, Not just a benchmark. ● Different baselines for particular domain adaptation. ● Amount of fine tuning needs are not same for similar tasks. ● Possibility of using specific LLM solution for specific task in pipeline. ● Combinations available to hyper ensemble these LLM solutions. ● Ability to pick and choose LLMs without affecting other LLMs in play.
  • 9. v H2O.ai Confidential Integration Strategies ● Grunualize the task at hand: Break LLM “initiative” into LLM “tasks” ● Categorize the tasks by Stochasticity Tolerance and Criticality: Different LLM solutions pose varying degree of temperature sensitivity. ● Less tolerant tasks are candidate for Proprietary Off-the-shelf solutions. ● More tolerant tasks are candidate for Open Source with no or less fine tuning needs.
  • 10. v H2O.ai Confidential Case Study Highlights ● At Beans.AI, we use combination of approaches like: Few tasks are achieved using Prompt Engineering/RAG based approach. Few tasks are achieved using Limited to moderate Fine Tuning. ● Both Closed source and Open Source LLMs are used. ● Responses from Closed Source LLMs are used by Open Source LLMs and vice versa in pipeline. ● Used for automated support, Insights from dashboard, automated email order etc.
  • 11. v H2O.ai Confidential Dataset Enrichment ● No, NOT THAT data enrichment! ● Most of the time: For “your” purpose, you need “your” data. ● “Your” data is limited by: Quantity, Quality and Variety ● LLMs are used to overcome: Quantity: By creating more samples of data Quality: By working with humans in the loop type setup Variety: By revising and rewriting intents in many different possible ways.
  • 12. v H2O.ai Confidential Dataset Enrichment(cond.) Example: ● Task: Question Answering Bot for your particular app. Say: Delivery Support App. ● Interaction: Delivery Driver asks a question in the app and expects “how-to” type response. Question: How do I mark an address not deliverable in the app? Candidate Answer: Explains the steps to do the same. ● Current Training Data: Set of Questions and Answers in knowledge article. Enrichment Step: Prompt engineered app to create variations of your domain specific questions as: “Ask the above question in 20 different ways” All these new 20 ways of asking the “same” question, create new training examples for you.
  • 13. v H2O.ai Confidential Dataset Enrichment(cond.) All of these questions below ask the EXACT same thing! ● What's the process for labeling an address as undeliverable within the application? ● Can you guide me through the steps to indicate that an address is non-deliverable in the app? ● How can I flag an address as undeliverable when using the app? ● What is the method for setting an address to 'not deliverable' status in the application? ● Is there a way to mark an address as 'cannot be delivered to' in the app interface? ● Could you explain how to designate an address as not deliverable on the app? ● I'm looking to mark an address as non-deliverable in the app; how do I do that? ● How does one go about indicating that an address is not serviceable in the app? ● In the app, what are the steps to mark an address as one that can't be delivered to? ● What’s the procedure to flag an address as 'not deliverable' in the app's system?
  • 14. v H2O.ai Confidential Case Optimization ● We at Beans.AI use LLMs o analyze the pipeline to be used on the fly. ● E.g. Super Set of Tasks For the Jobs: Task 1, Task 2, Task 3, Task 4, Task 5, Task 6 Set of Tasks actually need for “job” instance: Task 2, Task 4 and Task 6 Only. ● Proprietary LLMs with higher reasoning and guardrails is used to find the Tasks needed to be run. ● Short-listed Tasks can be run with actual sensitive data uses locally deployed Open Source LLM.
  • 15. v H2O.ai Confidential Low to No Code Fine-Tuning Techniques ● H2O LLM Studio: Equivalent to Stable Diffusion’s Automatic1111 or ComfyUI. ● Fine-tune Open Source LLMs without any coding, While can extensible with code. ● GUI specially for LLMs. ● Support for hyperparameters specific to finetune of LLMs. ● Support Low-Rank Adaptation (LoRA) and lower quantization to achieve lean memory footprint. ● Model Performance Tracking in UI. ● Test the fine-tuned model by testing it to get instant feedback. ● Most Important Enabler: Almost touch-less export to Hugging Face Hub.
  • 16. v H2O.ai Confidential Low to No Code Fine-Tuning Techniques(cond.) My first fine-tuning using LLM Studio took almost same time as this presentation!