Decoding ML/AI
A Practical Guide For Startups To
Drive Growth And Innovation
Denver Startup Week, 2023
PRESENTING SPONSOR
TITLE SPONSORS
TRACK SPONSORS
HEADLINE SPONSORS
PARTNER SPONSORS MEMBER SPONSORS
- CBRE
- Colorado Sun
- The Commons on Champa
- DAT Software and Analytics
- Denver Place
- Expansive Workspace
- Greenspoon Marder
- Halliburton Labs
- Jake Jabs Center for Entrepreneurship
- Molson Coors
- MSU Denver
- Park Central
- Polsinelli
- Tea with Tae
- Bounce Back
- Caruso
- Credera
- Doyle Group
- Industrious
- Initial Capacity
- MLJ Insurance
- Nexton
- Spectrum
- WeWork
© 2022 Ibotta, Inc. Proprietary and confidential, not to be shared without Ibotta’s express consent.
Argie Angeleas
Group Product Manager - @ibotta
Taylor Names
Principal ML Engineer - @ibotta
Matt Reynolds
Principal ML Platform Engineer - @ibotta
Gartner Predicts That 85%
Of ML Projects Will Fail
Denver, CO - 2023
Edtech Startup
People convert to registered users, BUT they don’t watch courses
Course CTR (click-through rate) stayed flat after a strong start
© 2023 Ibotta, Inc. Proprietary and confidential, not to be shared without Ibotta’s express consent.
Product
Where To Start?
Understand The Problem... Remove Ambiguity... But How?
3 Important Questions
› What is the problem we are trying to solve?
› Are there business constraints?
› What does a successful outcome look like?
Let The Fun Begin Step 0
Define The Problem Spaces
› People can’t find relevant coursework, and therefore they are not starting courses
› People find relevant courses, but the content of each course does not meet their expectations
› People can’t understand how to use our website, and that’s why they are not starting courses
Great, But What’s Next?
Peel the onion
› Is this an ML problem?
› What is the project value?
› What is the business expectation?
Let The Fun Begin Vol. 1
Define The Hypotheses
Secret #1: Don’t Do It In A Dark Room On Your Own
Involve Subject Matter Experts (Business, Engineering, Data Science, Product)
Define hypotheses that can solve the problem
ML is different from traditional software: it is hypothesis-based
Let The Fun Begin Vol. 2
Ideate solutions
Earn trust through outcomes
Let The Fun Begin Vol. 3
Define Your Metrics
Secret #2: Don’t do it in a dark room on your own
Success Metric:
Click through rate
Guardrail Metrics:
› First course completion rate
› Sponsored content click through rate
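One way to make the success and guardrail metrics above concrete is to compute them per experiment arm and decline to call a win when a guardrail regresses. This is a minimal sketch: the `ArmStats` fields, the 2% relative tolerance, and all the counts are illustrative assumptions, not figures from the talk, and it does not test for statistical significance.

```python
from dataclasses import dataclass


@dataclass
class ArmStats:
    """Raw counts for one experiment arm (all numbers below are made up)."""
    impressions: int
    clicks: int
    first_courses_started: int
    first_courses_completed: int
    sponsored_impressions: int
    sponsored_clicks: int


def rate(numerator: int, denominator: int) -> float:
    """Safe ratio: returns 0.0 when there is no traffic."""
    return numerator / denominator if denominator else 0.0


def metrics(arm: ArmStats) -> dict:
    """Success metric (ctr) plus the two guardrail metrics."""
    return {
        "ctr": rate(arm.clicks, arm.impressions),
        "first_course_completion_rate": rate(
            arm.first_courses_completed, arm.first_courses_started
        ),
        "sponsored_ctr": rate(arm.sponsored_clicks, arm.sponsored_impressions),
    }


def evaluate(control: ArmStats, treatment: ArmStats,
             guardrail_tolerance: float = 0.02) -> dict:
    """Report CTR lift; each guardrail may drop by at most
    guardrail_tolerance (relative) versus control."""
    c, t = metrics(control), metrics(treatment)
    guardrails_ok = all(
        t[name] >= c[name] * (1 - guardrail_tolerance)
        for name in ("first_course_completion_rate", "sponsored_ctr")
    )
    ctr_lift = (t["ctr"] - c["ctr"]) / c["ctr"] if c["ctr"] else 0.0
    return {"ctr_lift": ctr_lift, "guardrails_ok": guardrails_ok}


control = ArmStats(10_000, 500, 400, 200, 2_000, 60)
treatment = ArmStats(10_000, 600, 480, 245, 2_000, 59)
result = evaluate(control, treatment)
print(result)
```

Agreeing on the tolerance with business stakeholders before the test starts keeps the guardrail decision from being argued after the fact.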
Data Science
Your First Model Will Fail
Before model experimentation starts, there is uncertainty around how effectively the business problem can be solved
› Should we expect a 2% lift in click-through rate or 20%?
Uncertainty around which data will be predictive
› Example hypothesis: a student’s geographic location will help predict relevant courses
› Many hypotheses won’t pan out!
Model experimentation process should be optimized for rapid iteration
› Input data + modeling approach = success?
› Don’t over-polish a solution that isn’t guaranteed to succeed
How Will I Know If My Model Works?
In A Perfect World, Get Model Predictions In Front Of Real Customers & Measure Business Impact!
A/B Testing: split customers between treatment (ML powered) and control (status-quo)
Cons:
› Large engineering investment (reliable production-ready solution)
› Risk to business (can we afford to see a significant drop in business metrics in our test population?)
› Slow and difficult to get conclusive results (if you have a small user base)
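To see why a small user base makes A/B results slow, it can help to sketch the arithmetic. The code below assumes a standard two-proportion z-test and the usual normal-approximation sample size formula; the talk does not prescribe a specific test, and the function names and the 5% baseline click-through rate are hypothetical.

```python
import math
from statistics import NormalDist


def two_proportion_z_test(clicks_a, n_a, clicks_b, n_b):
    """Two-sided z-test for a difference between two click-through rates.
    Returns (z, p_value); arm A is control, arm B is treatment."""
    p_a, p_b = clicks_a / n_a, clicks_b / n_b
    pooled = (clicks_a + clicks_b) / (n_a + n_b)
    se = math.sqrt(pooled * (1 - pooled) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value


def users_needed_per_arm(baseline_rate, relative_lift, alpha=0.05, power=0.8):
    """Approximate users per arm needed to detect a given relative lift."""
    p1 = baseline_rate
    p2 = baseline_rate * (1 + relative_lift)
    z_alpha = NormalDist().inv_cdf(1 - alpha / 2)
    z_beta = NormalDist().inv_cdf(power)
    variance = p1 * (1 - p1) + p2 * (1 - p2)
    return math.ceil((z_alpha + z_beta) ** 2 * variance / (p2 - p1) ** 2)


# A 2% lift needs far more traffic than a 20% lift at the same baseline.
n_for_small_lift = users_needed_per_arm(0.05, 0.02)
n_for_large_lift = users_needed_per_arm(0.05, 0.20)
print(n_for_small_lift, n_for_large_lift)

z, p_value = two_proportion_z_test(500, 10_000, 600, 10_000)
print(round(z, 2), round(p_value, 4))
```

Because the required sample grows with the inverse square of the lift you want to detect, a startup with a few thousand users can usually confirm only large effects in a reasonable time.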
Is there a lower cost and lower risk alternative for rapid model evaluation?
Yes! Evaluate model performance on historical data
› Commonly referred to as offline testing or backtesting
Estimating Performance On Historical Data
01 Can you directly measure the business impact metric with different model approaches on past data?
› If yes, do this!
02 What if it is infeasible to measure the business metric on historical data?
› Are there proxy metrics that are well correlated with the critical business impact metrics?
• Example: click-through rate -> how high did the model rank the items that were clicked?
03 Remember to compare performance against a reasonable heuristic!
› Does my model generate more clicks than content sorted in order of general popularity?
04 Don’t over-optimize “offline” performance. Get in front of real users!
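The proxy metric and the heuristic comparison above can be sketched together. This example assumes mean reciprocal rank (MRR) as the "how high did the model rank the clicked item" measure, which is one common choice and not necessarily the one the speakers used. The course names, click logs, and model rankings are invented for illustration.

```python
from collections import Counter


def mean_reciprocal_rank(rankings, clicked):
    """Average of 1/rank of the clicked item across sessions.
    A session scores 0 if the clicked item was not ranked at all."""
    total = 0.0
    for ranked_items, clicked_item in zip(rankings, clicked):
        if clicked_item in ranked_items:
            total += 1.0 / (ranked_items.index(clicked_item) + 1)
    return total / len(clicked) if clicked else 0.0


def popularity_ranking(training_clicks):
    """Heuristic baseline: items sorted by past click count, same for everyone."""
    return [item for item, _ in Counter(training_clicks).most_common()]


# Older clicks build the baseline; later clicks are held out for evaluation.
training_clicks = ["python", "python", "sql", "python", "sql", "excel"]
holdout_clicks = ["sql", "excel", "python"]

baseline = popularity_ranking(training_clicks)
baseline_rankings = [baseline] * len(holdout_clicks)

# Hypothetical per-session rankings produced by a candidate model.
model_rankings = [
    ["sql", "python", "excel"],
    ["python", "excel", "sql"],
    ["python", "sql", "excel"],
]

baseline_mrr = mean_reciprocal_rank(baseline_rankings, holdout_clicks)
model_mrr = mean_reciprocal_rank(model_rankings, holdout_clicks)
print(f"baseline={baseline_mrr:.3f} model={model_mrr:.3f}")
```

Splitting by time (train on older clicks, evaluate on newer ones) keeps the backtest closer to how the model would be used in production.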
What Does It Take To Build An Accurate Model?
Clean, representative, and predictive input (training) data
› Clean: outliers and erroneous data need to be handled with care
› Representative: are the business scenarios where the model will add value represented in your training set?
› Predictive: talk with SMEs about what data should be sourced and transformed
The right model for the right problem
Rules of thumb:
› Tabular data: ensemble decision trees (XGBoost, LightGBM, random forest)
› Text/computer vision/audio: open source pre-trained neural networks
Model training is less than 10% of the total project effort!
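As one illustration of handling outliers "with care", the sketch below flags suspicious values for review instead of silently dropping them. It assumes the common 1.5 × IQR rule, which is a swapped-in convention and not something the talk specifies. The `watch_minutes` values are invented.

```python
from statistics import quantiles


def split_outliers(values, k=1.5):
    """Separate values outside [Q1 - k*IQR, Q3 + k*IQR] so they can be
    reviewed with a subject matter expert before any are discarded."""
    q1, _, q3 = quantiles(values, n=4)
    low = q1 - k * (q3 - q1)
    high = q3 + k * (q3 - q1)
    kept = [v for v in values if low <= v <= high]
    flagged = [v for v in values if v < low or v > high]
    return kept, flagged


# Hypothetical minutes watched per session; 900 and -5 look erroneous.
watch_minutes = [12, 15, 14, 13, 16, 15, 14, 900, -5]
kept, flagged = split_outliers(watch_minutes)
print("kept:", kept)
print("flagged for review:", flagged)
```

Whether a flagged value is an error or a genuine power user is a business question, which is another reason to keep SMEs in the loop.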
Is My Model Ready For The Big Leagues?
Am I reasonably confident that my model will add value & do no harm?
› Outperforms a heuristic according to business metric or reasonable proxy metric
Was my methodology of evaluation sound?
› Always peer review code/analysis/conclusions!
Is my model able to be reasonably supported in a production environment?
› Overly complex models can be difficult to productionize and expensive to maintain
If so, SHIP IT!
Machine Learning Infrastructure
OK, I Have A Model, Now What?
You Need To Figure Out The Best Way To Get It Into Production
1 How “fresh” does the model need to be?
2 What’s the simplest way to get the results?
3 How often do you need to perform predictions?
4 Who do you need to create predictions for?
If you haven’t been talking to the engineers who will use the results, start now
Start with batch predictions
Starting With Batch Predictions Allows You To Avoid All This:
› Real time feature store (features = data used by model to generate predictions)
› Model serving code
› Additional infrastructure
› Latency concerns
› Resiliency issues
Follow Basic Engineering Best Practices
Software Engineering Teams (Should!) Do This, Copy Them
› Use source control
› Have others review changes
› Be consistent – use naming conventions
› Parameterize for different environments/versions
› Provide a defined interface for your model predictions
› Reduce manual input in pipelines – aim for CI/CD
What Could This Look Like?
Source Control:
› Consider data scientist model work (generally Jupyter notebooks)*
› Scripts for model data generation, training, batch predictions and storing results
Cloud storage:
› Standardized naming for model artifacts: folder by environment and training date
Compute instance/Serverless:
› Run scripts on a regular time schedule or some automated trigger
Database:
› Prediction results - include a model version column
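A batch job along these lines could be as small as the sketch below. Everything specific here is a hypothetical stand-in: the `models/<env>/...` path layout, the `course_predictions` table, and the `score` function that takes the place of a real trained model. SQLite is used only so the example runs without external services.

```python
import sqlite3
from datetime import date


def artifact_path(env: str, model_name: str, training_date: date, version: str) -> str:
    """Standardized artifact location: folder by environment, then training date.
    The layout is an illustrative convention, not one from the talk."""
    return (
        f"models/{env}/{model_name}/{training_date.isoformat()}/"
        f"{model_name}-{version}.bin"
    )


def score(user_id: int, course_id: int) -> float:
    """Stand-in for a real model's predict call (assumption for this sketch)."""
    return ((user_id * 31 + course_id * 17) % 100) / 100


def run_batch_predictions(conn, user_ids, course_ids, model_version, top_k=3):
    """Score every user/course pair and store the top_k courses per user,
    tagging each row with the model version that produced it."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS course_predictions (
               user_id INTEGER, course_id INTEGER, score REAL,
               model_version TEXT, predicted_on TEXT)"""
    )
    today = date.today().isoformat()
    rows = []
    for user_id in user_ids:
        ranked = sorted(course_ids, key=lambda c: score(user_id, c), reverse=True)
        rows += [
            (user_id, c, score(user_id, c), model_version, today)
            for c in ranked[:top_k]
        ]
    conn.executemany("INSERT INTO course_predictions VALUES (?, ?, ?, ?, ?)", rows)
    conn.commit()
    return len(rows)


conn = sqlite3.connect(":memory:")
written = run_batch_predictions(
    conn, user_ids=[1, 2], course_ids=[10, 11, 12, 13], model_version="v1"
)
print(written, artifact_path("prod", "course_ranker", date(2023, 9, 18), "v1"))
```

The model version column lets downstream engineers compare or roll back predictions without asking which training run produced them.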
How Can I Save Time?
› Avoid scope creep (stick to your defined goals)
› Keep it simple (don’t try to build real-time out of the gate)
› Use managed services (don’t build everything from scratch)
› Get dedicated staff involved for end-to-end support (project should be self-sufficient)
› Follow best practices (use standard model approaches)
› Use what you already have (data engineering, service tools)
› Avoid analysis paralysis (beats baseline = start test)
› Hire all-rounders (no research-only data scientists)
What Should I Avoid Cutting Corners On?
› Get other teams involved early (upstream and downstream)
› Don’t skimp on automation
› Regularly review (both internal for design/code and stakeholders for progress)
› Provide an API interface (reduce coupling)
› Basic monitoring (did training fail? Did my predictions job run?)
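Basic monitoring can start as a single check that runs after each scheduled job. A minimal sketch, assuming the job records its last success time and the number of rows it wrote; the function name, the 26-hour limit, and the timestamps are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def check_job_health(last_success, now, max_age, rows_written, min_rows=1):
    """Return a list of human-readable problems; an empty list means healthy."""
    problems = []
    if last_success is None:
        problems.append("job has never succeeded")
    elif now - last_success > max_age:
        problems.append(
            f"last success was {now - last_success} ago (limit {max_age})"
        )
    if rows_written < min_rows:
        problems.append(
            f"only {rows_written} rows written (expected at least {min_rows})"
        )
    return problems


now = datetime(2023, 9, 20, 8, 0, tzinfo=timezone.utc)
daily_limit = timedelta(hours=26)  # daily job plus some slack

healthy = check_job_health(now - timedelta(hours=3), now, daily_limit, rows_written=5_000)
stale = check_job_health(now - timedelta(days=3), now, daily_limit, rows_written=5_000)
never_ran = check_job_health(None, now, daily_limit, rows_written=0)
print(healthy, stale, never_ran, sep="\n")
```

Routing a non-empty result to the team's existing alert channel is usually enough for a first version.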
Conclusions
› Collaboration works wonders → business <> engineering <> data science <> product
› Utilizing a structured approach to problem definition increases the likelihood of success
› Even simple models can create impact that gets the business raving
Thank You
Questions?

ML Guide Helps Startups Drive Growth

  • 1. Decoding ML/AI A Practical Guide For Startups To Drive Growth And Innovation Denver Startup Week, 2023
  • 3. HEADLINE SPONSORS PARTNER SPONSORS MEMBER SPONSORS - CBRE - Colorado Sun - The Commons on Champa - DAT Software and Analytics - Denver Place - Expansive Workspace - Greenspoon Marder - Halliburton Labs - Jake Jabs Center for Entrepreneurship - Molson Coors - MSU Denver - Park Central - Polsinelli - Tea with Tae - Bounce Back - Caruso - Credera - Doyle Group - Industrious - Initial Capacity - MLJ Insurance - Nexton - Spectrum - WeWork
  • 4. © 2022 Ibotta, Inc. Proprietary and confidential, not to be shared without Ibotta’s express consent. 2.0 addition Argie Angeleas Group Product Manager - @ibotta Taylor Names Principal ML Engineer - @ibotta Matt Reynolds Principal ML Platform Engineer - @ibotta
  • 5. Gartner Predicts That 85% Of ML Projects Will Fail
  • 6.
  • 8. Edtech Startup People convert to registered users, BUT they don’t watch courses Course CTR (Click-through-rate) after a strong start stayed flat
  • 9. © 2023 Ibotta, Inc. Proprietary and confidential, not to be shared without Ibotta’s express consent. Product
  • 10. Where To Start? 3 Important Questions What is the problem we are trying to solve? Are there business constraints? Understand The Problem.. Remove Ambiguity.. But How? What does a successful outcome like?
  • 11. Let The Fun Begin Step.0 Define The Problem Spaces People can’t find relevant coursework and therefore they are not starting courses People find relevant courses, but the content of each course does not meet their expectations People can’t understand how to use our website, and that’s why they are not starting courses
  • 12. Great, But What’s Next? Peel the onion Is this an ML problem? What is the project value? What is the expectation in business?
  • 13. Let The Fun Begin Vol. 1 Define The Hypotheses Secret #1: Don’t Do It In A Dark Room On Your Own Involve Subject Matter Experts (Business, Engineering, Data Science, Product) Define hypotheses that can solve the problem ML is different than traditional software - it is hypotheses based
  • 14. © 2023 Ibotta, Inc. Proprietary and confidential, not to be shared without Ibotta’s express consent. Let the fun begin vol. 2 Ideate solutions Earn trust through outcomes
  • 15. Let The Fun Begin Vol. 3 Define Your Metrics Secret #2: Don’t do it in a dark room on your own Success Metric: Click through rate Guardrail Metrics: › First course completion rate › Sponsored content click through rate
  • 16. © 2023 Ibotta, Inc. Proprietary and confidential, not to be shared without Ibotta’s express consent. Data Science
  • 17. Your First Model Will Fail Before model experimentation starts, there is uncertainty around how effectively the business problem can be solved › Should we expect a 2% lift in click through rate or 20%? Uncertainty around which data will be predictive › Example hypothesis: student’s geographic location will help predict relevant courses › Many hypotheses won’t pan out! Model experimentation process should be optimized for rapid iteration › Input data + modeling approach = success? › Don’t over polish a solution that isn’t guaranteed to succeed
  • 18. How Will I Know If My Model Works? In A Perfect World, Get Model Predictions In Front Of Real Customers & Measure Business Impact! A/B Testing: split customers between treatment (ML powered) and control (status-quo) Cons: › Large engineering investment (reliable production-ready solution) › Risk to business (can we afford to see a significant drop in business metrics in our test population?) › Slow and difficult to get conclusive results (if you have a small user base) Is there a lower cost and lower risk alternative for rapid model evaluation? Yes! Evaluate model performance on historical data › Commonly referred to as offline testing or backtesting
  • 19. Estimating Performance On Historical Data. 01 Can you directly measure the business impact metric with different model approaches on past data? › If yes, do this! 02 What if it is infeasible to measure the business metric on historical data? › Are there proxy metrics that are well correlated with the critical business impact metrics? • Example: click-through rate -> how high did the model rank the items that were clicked? 03 Remember to compare performance against a reasonable heuristic! › Does my model generate more clicks than content sorted in order of general popularity? 04 Don’t over-optimize “offline” performance. Get in front of real users!
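One way to turn "how high did the model rank the clicked items" into a number is mean reciprocal rank (MRR), compared against a popularity-sorted baseline. The sketch below is illustrative only: the choice of MRR as the proxy metric, the course names, the click logs, and the model rankings are all made-up assumptions, not data from the talk.

```python
from collections import Counter


def reciprocal_rank(ranked_items, clicked_item):
    """Return 1/rank of the clicked item (1-based), or 0.0 if it was not ranked."""
    for position, item in enumerate(ranked_items, start=1):
        if item == clicked_item:
            return 1.0 / position
    return 0.0


def mean_reciprocal_rank(rankings, clicks):
    """Average reciprocal rank over sessions; higher means clicks were ranked higher."""
    return sum(reciprocal_rank(r, c) for r, c in zip(rankings, clicks)) / len(clicks)


# Hypothetical click log from an earlier period, used only to build the baseline.
earlier_clicks = ["python", "python", "sql", "python", "stats", "sql"]
popularity_ranking = [course for course, _ in Counter(earlier_clicks).most_common()]

# Hypothetical evaluation sessions: the course each student actually clicked.
historical_clicks = ["python", "sql", "python", "stats", "sql", "design"]

# Hypothetical personalized rankings a candidate model produced for those sessions.
model_rankings = [
    ["python", "sql", "stats"],
    ["sql", "python", "design"],
    ["sql", "python", "stats"],
    ["stats", "python", "sql"],
    ["python", "sql", "design"],
    ["python", "design", "sql"],
]

# The heuristic shows everyone the same popularity-sorted list.
baseline_mrr = mean_reciprocal_rank([popularity_ranking] * len(historical_clicks),
                                    historical_clicks)
model_mrr = mean_reciprocal_rank(model_rankings, historical_clicks)
```

Building the popularity ranking from an earlier period keeps the baseline from seeing the clicks it is evaluated on.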
  • 20. What Does It Take To Build An Accurate Model? Clean, representative, and predictive input (training) data: › Clean: outliers and erroneous data need to be handled with care › Representative: are the business scenarios where the model will add value represented in your training set? › Predictive: talk with SMEs about what data should be sourced and transformed. The right model for the right problem. Rules of thumb: › Tabular data: ensemble decision trees (XGBoost, LightGBM, random forest) › Text/computer vision/audio: open-source pre-trained neural networks. Model training is less than 10% of the total project effort!
  • 21. Is My Model Ready For The Big Leagues? Am I reasonably confident that my model will add value and do no harm? It outperforms a heuristic according to the business metric or a reasonable proxy metric. Was my methodology of evaluation sound? Always peer review code, analysis, and conclusions! Is my model able to be reasonably supported in a production environment? Overly complex models can be difficult to productionize and expensive to maintain. If so, SHIP IT!
  • 22. Machine Learning Infrastructure
  • 23. OK, I Have A Model, Now What? You need to figure out the best way to get it into production. 1. How “fresh” does the model need to be? 2. What’s the simplest way to get the results? 3. How often do you need to perform predictions? 4. Who do you need to create predictions for? If you haven’t been talking to the engineers who will use the results, start now.
  • 24. Start With Batch Predictions. Starting with batch predictions allows you to avoid all this: › Real-time feature store (features = data used by the model to generate predictions) › Model serving code › Additional infrastructure › Latency concerns › Resiliency issues
  • 25. Follow Basic Engineering Best Practices. Software engineering teams (should!) do this, copy them: › Use source control › Have others review changes › Be consistent – use naming conventions › Parameterize for different environments/versions › Provide a defined interface for your model predictions › Reduce manual input in pipelines – aim for CI/CD
  • 26. What Could This Look Like? Source control: › Consider data scientist model work (generally Jupyter notebook)* › Scripts for model data generation, training, batch predictions and storing results. Cloud storage: standardized naming of model artifacts, with folders by environment and training date. Compute instance/serverless: run scripts on a regular time schedule or some automated trigger. Database: prediction results, including a model version column.
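The storage and database conventions on this slide can be sketched in a few lines. Everything specific here is an assumption for illustration: the path layout, the `course_predictions` table and its columns, the model name, and the use of an in-memory SQLite database in place of a real cloud bucket and production database.

```python
import sqlite3
from datetime import date


def artifact_path(env: str, model_name: str, training_date: date) -> str:
    """Build a standardized artifact location: environment, model, training date."""
    return f"{env}/{model_name}/{training_date.isoformat()}/model.pkl"


def store_predictions(conn, rows, model_version: str) -> None:
    """Write batch predictions, tagging every row with the model version."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS course_predictions ("
        "user_id TEXT, course_id TEXT, score REAL, "
        "model_version TEXT, predicted_on TEXT)"
    )
    conn.executemany(
        "INSERT INTO course_predictions VALUES (?, ?, ?, ?, ?)",
        [(user, course, score, model_version, date.today().isoformat())
         for user, course, score in rows],
    )
    conn.commit()


# Hypothetical values; a scheduled job would supply real ones.
path = artifact_path("prod", "course_ranker", date(2023, 9, 18))
conn = sqlite3.connect(":memory:")
store_predictions(
    conn,
    [("u1", "python-101", 0.91), ("u2", "sql-basics", 0.64)],
    model_version="2023-09-18",
)
stored = conn.execute(
    "SELECT user_id, course_id, model_version "
    "FROM course_predictions ORDER BY user_id"
).fetchall()
```

Keeping the model version next to each prediction makes it possible to trace a bad recommendation back to the artifact that produced it.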
  • 27. How Can I Save Time? › Avoid scope creep (stick to your defined goals) › Keep it simple (don’t try to build real-time out of the gate) › Use managed services (don’t build everything from scratch) › Get dedicated staff involved for end-to-end support (project should be self-sufficient) › Follow best practices (use standard model approaches) › Use what you already have (data engineering, service tools) › Avoid analysis paralysis (beats baseline = start test) › Hire all-rounders (no research-only data scientists)
  • 28. What Should I Avoid Cutting Corners On? › Get other teams involved early (upstream and downstream) › Don’t skimp on automation › Regularly review (both internally for design/code and with stakeholders for progress) › Provide an API interface (reduce coupling) › Basic monitoring (did training fail? Did my predictions job run?)
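The "did training fail? Did my predictions job run?" check can start as a simple freshness test on each job's last successful run. This is a minimal sketch under stated assumptions: the job names, timestamps, and 36-hour threshold are hypothetical, and in practice the timestamps would come from a scheduler or job log.

```python
from datetime import datetime, timedelta


def stale_jobs(last_success, now, max_age):
    """Return the names of jobs whose last successful run is missing or too old."""
    return sorted(
        name
        for name, finished_at in last_success.items()
        if finished_at is None or now - finished_at > max_age
    )


# Hypothetical job history; None means the job has never succeeded.
now = datetime(2023, 9, 20, 8, 0)
last_success = {
    "training": datetime(2023, 9, 19, 2, 0),
    "batch_predictions": datetime(2023, 9, 17, 3, 0),
    "feature_generation": None,
}

alerts = stale_jobs(last_success, now, max_age=timedelta(hours=36))
```

A non-empty `alerts` list is the signal to notify whoever owns the pipeline.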
  • 29. Conclusions. Collaboration works wonders → business <> engineering <> data science <> product. Utilizing a structured approach to problem definition increases the likelihood of success. Even simple models can create impact that gets the business raving.