
Adopting software design practices for better machine learning


Jeff McGehee, Senior Data Scientist and IoT Practice Lead, Very


  1. Jeff McGehee, Data Scientist, IoT Practice Lead. The Process is Everything: an interpretation of Google’s Rules of Machine Learning.
  2. The Theory
  3. It’s easy to optimize your model loss, but it’s hard to optimize value delivered.
  4. Introducing SoDQoP
     Speed of Delivery:
     ● Time to market
     ● Average speed of new features (agility)
     Quality of Product:
     ● User experience
     ○ Reliability
     ○ Availability
     ○ Ease of use
     ○ Valuable features
  5. SoDQoP Over Time [chart: SoDQoP (0–100) vs. time]
  6. SoDQoP Over Time (cont’d) [chart: SoDQoP (0–100) vs. time]
  7. Aggregate Team SoDQoP [diagram: People, Tooling, Process; labels: GPU, Tensor Processing, Research First, Losers, Winners, Google’s Rules of ML]
  8. Process is the place where machine learning has the most room for improvement.
  9. Process
  10. From (Computer) Science to (Software) Engineering: Scientists make discoveries. Engineers develop predictable processes for aggregating and leveraging those discoveries in the world at large.
  11. Machine Learning Engineering: Build lean. Be clever with machine learning APIs. Make it easy to improve your model.
  12. Build Lean
      In General:
      ● Understand what you’re measuring, and why.
      ● Fail fast.
      ● Take “ship early and ship often” seriously. Don’t waste time chasing a few percentage points of accuracy on a feature that users haven’t even been exposed to yet.
      Things We Do:
      ● Two weeks or less to validate ML as a viable solution.*
      ○ Jupyter, Pandas, scikit-learn, TensorFlow, Keras, Matplotlib/Seaborn.
      ● If you haven’t failed, ship what you have.
      ○ Serverless framework (AWS Lambda, S3, SageMaker, Batch).
      ● Iterate (rapidly) as needed.*
      *Should be led by “Understand what you’re measuring, and why.”
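The “two weeks or less” viability check above can be sketched with scikit-learn: fit a cheap baseline and cross-validate before investing in tuning. The synthetic dataset and the go/no-go threshold below are illustrative assumptions, not from the talk.

```python
# Minimal sketch of a fast "is ML viable here?" check using scikit-learn.
# The synthetic data stands in for a real labeled dataset; the 0.70
# threshold is an illustrative go/no-go criterion.
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score

X, y = make_classification(n_samples=1000, n_features=20, random_state=0)

# A simple baseline model: if this can't beat the bar convincingly,
# fail fast and rethink before spending weeks on accuracy chasing.
baseline = RandomForestClassifier(n_estimators=100, random_state=0)
scores = cross_val_score(baseline, X, y, cv=5, scoring="accuracy")

print(f"cv accuracy: {scores.mean():.3f} +/- {scores.std():.3f}")
viable = scores.mean() > 0.70  # illustrative threshold
```

The point is the shape of the loop, not the model choice: a cross-validated baseline gives an honest estimate in minutes, which is what makes “ship what you have” a defensible decision.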
  13. Be Clever with ML APIs
      In General:
      ● Don’t reinvent the wheel.
      ● Have a deep understanding of accuracy requirements.
      ● Build custom solutions where they will have the highest impact.
      Things We Do:
      ● Wrap “noisy” API models with a Bayesian inference engine tuned to improve the desired accuracy metric.
      ● Obtain features from API models (e.g. object detection), and pass those to a final model.
  14. Make it easy to improve your model(s)
      In General:
      ● Record your predictions, along with ground truth (if possible).
      ● Build features that allow users to label or correct predictions.
      ● Collect data that might be leveraged for future models.
      Things We Do:
      ● AWS Lambda endpoints for receiving data related to model feedback.
      ● High test coverage to facilitate agility around changing the model.
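The “Lambda endpoint for model feedback” pattern might look roughly like the handler sketch below. The event schema and the in-memory log (standing in for S3 or a database) are assumptions for illustration, not the talk’s actual implementation.

```python
import json
import time

# Sketch of a feedback-collection endpoint in the style of an AWS Lambda
# handler. FEEDBACK_LOG stands in for a durable store (e.g. S3); the
# body fields are an assumed schema.
FEEDBACK_LOG = []

def handler(event, context=None):
    body = json.loads(event["body"])
    record = {
        "prediction_id": body["prediction_id"],
        "predicted": body["predicted"],
        "ground_truth": body.get("ground_truth"),  # may arrive later
        "received_at": time.time(),
    }
    FEEDBACK_LOG.append(record)  # real code: persist to S3 / a DB
    return {"statusCode": 200, "body": json.dumps({"stored": True})}

# Example call, shaped like an API Gateway proxy event:
response = handler({"body": json.dumps({
    "prediction_id": "abc123",
    "predicted": "cat",
    "ground_truth": "dog",
})})
```

Recording the prediction alongside its eventual ground truth is what turns user corrections into labeled training data, which is the whole point of the slide.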
  15. Questions?