Dr June Andrews
ML Playbook
Getting Machine Learning Products into Production …
… bit more of an art than a science.
“For every idea in production, there are ~7 ideas that did not make the cut.”
Ellsworth Kelly
Optimize and refine products by cycling multiple times with improvements.
Machine Learning Product Development Cycle - Phases
Extending Boehm Spiral Model from Software Engineering
[Cycle diagram: Plan, Build, Test, Learn, Release]
Works for Social Networks, Consumer Web and Industrial Internet of Things.
Diverse Experiences over Years Encapsulated in the ML Playbook
Today - Demonstrate the ML Playbook in Action 3 Times
Example Playbook cycles:
Build
Improve
Prototype
Example of a Build Product - Augmenting Aviation Monitoring
Ellsworth Kelly
… you can tell how challenging a project is -
based on the graveyard of previous attempts.
Aviation: History of Innovation & Excellence
Best Practices & Decision Design Have Evolved over 15+ Years of Monitoring Fleets
[Cycle diagram: Plan. First stage: Source Ideas]
The idea came from previous work and was immediately
prioritized to build, without a prototype.

Model Performance Success Metrics:
๏ Coverage x %
๏ Accuracy y %
๏ Low Latency < z seconds
Comprehensive Success Metrics:
๏ Usability is intuitive & transparent
๏ Delivered on time
๏ Maintains current history of excellence
๏ Collaborators enjoy working with Wise
[Plan stages, after Source Ideas: Prioritize, Define Success, Design, Roadmap]
[Cycle diagram: Plan. First stage: Source Ideas]
Make it easy on yourself & proportional to
the success metrics.

Minor variations in how the ML product is
designed lead to drastically different
difficulties of implementation. Explore
design decisions around:

๏ Critical Paths of existing systems

๏ Augment v Automate
๏ UI - number of results, placement,
text length, …
New Workflow Preserves Baseline Reliability with Increasing Speed & Accuracy
Delivered an Augmented Human Interpretable Model
Increased the Power of the ML Models by Demanding More
Handling All Alerts was Key Design Choice
Competitive Machine Learning Requires Strategic Talent Contributions
Augmenting Fleet Monitor was an Orchestration of Aligning Talents with Needs
… the Roadmap was detailed.
[Cycle diagram: Plan, Build. First stage: Source Ideas]
We love this stage!

Everything else is very linear - Build is
where it can get messy and enthralling.
There are plenty of great explanations
and ‘wheels’ with more detail. Remember
- building is not the end goal.

Proactively communicate complications
& delays to project stakeholders.
[Build stages: Find & ETL Data, Feature Engineer, Build & Train, Optimize]
[Cycle diagram: Plan, Build. First stage: Source Ideas]
Lessons Learned working with Time Series:

๏ Time is not the same for all engines.
Normalize by Life Cycle.
๏ Inferring missing data from nearby
sensor readings, so that simpler models
that assume even time steps can be used,
is a bad idea.
๏ Create features resilient to missing data.
๏ Take the time to be clever. More data is
not an option.
(Deep Learning?) There have been more flights in a
year than websites about Dogs on Google - but far
fewer maintenance alerts than there are T-Rex
Dogs.
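
One possible reading of the time series lessons above, as a minimal sketch rather than the method used in the talk. The function names, the use of `None` to mark a missing reading, and the interpretation of "Normalize by Life Cycle" as position within a known cycle count are all assumptions made for illustration.

```python
from statistics import median


def normalize_by_life_cycle(cycle_index, total_cycles):
    """Map an engine's cycle count to a 0..1 position in its life cycle.

    Assumption: total_cycles is known or estimated elsewhere.
    """
    if total_cycles <= 0:
        raise ValueError("total_cycles must be positive")
    return cycle_index / total_cycles


def resilient_features(readings):
    """Summarize a window of sensor readings without imputing gaps.

    Missing readings are marked with None and are skipped, not filled in.
    The fraction of missing readings is itself reported as a feature.
    """
    present = [r for r in readings if r is not None]
    n = len(readings)
    features = {
        "missing_fraction": (n - len(present)) / n if n else 1.0,
        "median": None,
        "range": None,
    }
    if present:
        features["median"] = median(present)
        features["range"] = max(present) - min(present)
    return features


# Hypothetical window with two dropped readings.
window = [10.0, None, 12.0, None, 11.0]
print(resilient_features(window))
print(normalize_by_life_cycle(250, 1000))
```

The features are computed only from readings that exist, so no assumption of even time steps is needed.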
Machine Learning Experts Transform Domain Knowledge into Model Inputs
Aviation Experts are a Strategic Advantage in IIoT
[Cycle diagram: Plan, Build, Test. First stage: Source Ideas]
Third party support with
MTurk, Crowdflower ….

The largest companies tend to
use in-house solutions:
Pinterest uses Sofia, Stitch Fix
has Stylists, and Facebook and
Google famously have
in-house teams.

We worked with in-house
Aviation Experts.
[Test stages: Evaluate, Dog Food, Productionize, A/B Test]
[Cycle diagram: Plan, Build, Test. First stage: Source Ideas]
Productionizing is becoming much easier!

Airbnb has Airflow

Netflix has Spinnaker+

Wise has Wise Factory 2.0

Cloud providers are now starting to offer this too:

Google Cloud with SavedModel

AWS with SageMaker

This lowers hiring requirements.
[Cycle diagram: Plan, Build, Test, Learn. First stage: Source Ideas]
Overall learning for GE -
break from the idea of one
task, one model and one
alert, one model. There are
cross learnings from
common alerts that are
pertinent for rare alerts.
Learnings from Aviation
are being used in Power.

Tactical learning - found
there was a difference in
performance between
offline and online systems
due to data delays.
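
The tactical learning above can be illustrated with a small sketch. It assumes, purely for illustration, that each reading reaches the online system a fixed `delay` after it is recorded; the function name and the sample readings are hypothetical.

```python
def available_at(readings, decision_time, delay):
    """Readings an online system would have seen at `decision_time`,
    if each (timestamp, value) reading arrives `delay` time units
    after it is recorded."""
    return [(t, v) for t, v in readings if t + delay <= decision_time]


# Hypothetical sensor readings as (timestamp, value) pairs.
readings = [(0, 1.0), (1, 1.2), (2, 5.0), (3, 5.1)]

# An offline evaluation with no delay sees everything up to time 3.
offline_view = available_at(readings, decision_time=3, delay=0)

# An online system with a delay of 2 has only seen the first two readings.
online_view = available_at(readings, decision_time=3, delay=2)

print(len(offline_view), len(online_view))
```

Replaying the delay during offline evaluation is one way to make offline and online performance comparable.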
[Learn stages: Analyze, Product Review, Evangelize]
Resist letting the Data
Used and Scope of the
model reflect your
organization.
bonkersworld.net
[Cycle diagram: Plan, Build, Test, Learn, Release. First stage: Source Ideas]
Nontrivial - expect this
ML Product to be in
production for 2 to 25 years.

Plan accordingly with
documentation, multiple
supporting teams & Black
Swan simulations.
[Release stages: Release 100%, Retrain & Maintain]
Example of an Improve Product - Increase Coverage by 20%
Ellsworth Kelly
… there’s always an improvement to be made.
Is it worth the effort is another question.
[Cycle diagram: Plan. First stage: Source Ideas]
Project Prioritization - books have been written.
Let’s distill:

๏ Gather all ideas. 

๏ Measure each idea by {Impact, Cost, Strategy} 

๏ Use physics and math principles for estimating
cost and error propagation for upper and lower
bounds.

๏ Keep records of assumptions and results,
update assumptions as more is learned.
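
The distilled steps above can be sketched as follows. This is a hypothetical illustration, not the talk's scoring method: the `Idea` fields, the interval-style bounds on impact per day, the rule of ranking strategic ideas first, and every number are assumptions.

```python
from dataclasses import dataclass


@dataclass
class Idea:
    name: str
    impact: tuple      # (low, high) estimated impact, in arbitrary units
    cost_days: tuple   # (low, high) estimated cost in working days
    strategic: bool


def impact_per_day(idea):
    """Lower and upper bounds on impact per day of effort.

    The worst case pairs the lowest impact with the highest cost;
    the best case pairs the highest impact with the lowest cost.
    """
    impact_low, impact_high = idea.impact
    cost_low, cost_high = idea.cost_days
    if cost_low <= 0:
        raise ValueError("cost bounds must be positive")
    return (impact_low / cost_high, impact_high / cost_low)


def prioritize(ideas):
    """Strategic ideas first, then by worst-case impact per day."""
    return sorted(
        ideas,
        key=lambda i: (i.strategic, impact_per_day(i)[0]),
        reverse=True,
    )


# Hypothetical ideas and estimates.
ideas = [
    Idea("tune parameters", impact=(1.0, 2.0), cost_days=(2, 4), strategic=False),
    Idea("debug missing data", impact=(2.0, 6.0), cost_days=(8, 12), strategic=True),
    Idea("train new model", impact=(5.0, 15.0), cost_days=(20, 40), strategic=False),
]

for idea in prioritize(ideas):
    low, high = impact_per_day(idea)
    print(f"{idea.name}: {low:.2f} to {high:.2f} impact/day")
```

Keeping the bounds as explicit fields makes it easy to record the assumptions behind each estimate and revise them as more is learned.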
[Cycle diagram: Plan. First stage: Source Ideas]
Idea                                  Estimated Impact   Cost                 Strategic
Tuning of Window Parameters           z%                 3 days               No
Debugging of missing data             z%                 2 weeks              Yes
Train new model on uncovered alerts   z%                 1 month+             No
Add more Time Series Features         z%                 2 days per feature   Yes
[Cycle diagram: Plan, Build. First stage: Source Ideas]
Building Improvements is very quick for scalable
models. Scalable engineering allows an arbitrary
number of features. Scalable math allows inclusion
of additional features with predictable or safe
outcomes (don’t mix logs and multiplications, don’t
mix embeddings with clustering).



Improvement features are often bucketed together.
Balance ‘fix it while I’m here’ with feature creep.
There will be another cycle; ideas can always be
added for prioritization in the next cycle.
[Cycle diagram: Plan, Build, Test. First stage: Source Ideas]
The Test stage should be
straightforward.
[Cycle diagram: Plan, Build, Test, Learn. First stage: Source Ideas]
Even straightforward
feature additions offer a
chance to learn:

New Feature Family?

Suspicious behavior?

Bugs?

Improvements for faster
iteration?
[Cycle diagram: Plan, Build, Test, Learn, Release. First stage: Source Ideas]
Should be
straightforward. Update
documentation,
monitoring, dependencies
…
Example of Prototype Product - Human in the Loop Clustering
Ellsworth Kelly
…no recommendation here is domain specific.
Let’s test that.
Pinterest: History of Investing in Tools for Scalable Data Science
Project - Understand how users are engaging with Link Domains. Make tool reusable.
[Cycle diagram: Plan. First stage: Source Ideas]
How are Users Engaging with Link Domains?

Success Metrics:

๏ Human Interpretable Answer
๏ Representative of Entire Domain
๏ Easily Reused
๏ Done in 2 weeks
[Cycle diagram: Plan. First stage: Source Ideas]
๏ Human Interpretable Answer
๏ Representative of Entire Domain
๏ Easily Reused
๏ Done in 2 weeks
๏ Reuse existing code and features
๏ Use Python Notebook
๏ Extend a Human in the Loop Algorithm
๏ Ask some work of the user
[Cycle diagram: Plan, Build. First stage: Source Ideas]
Another time series!

Focused time on
visualization and human
interaction components.

(Reused ETL jobs and basic features; used K-means.)
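
A minimal sketch of what a human-in-the-loop K-means step could look like. The slides only say that K-means was used and that some work is asked of the user; the choice of letting the user pick example series as initial centroids, the function names, and the sample data are assumptions for illustration.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length series."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_vector(vectors):
    """Element-wise mean of a non-empty list of equal-length series."""
    return [sum(column) / len(vectors) for column in zip(*vectors)]


def kmeans(series, seeds, iterations=10):
    """Plain K-means. `seeds` are user-chosen example series used as the
    initial centroids (the human-in-the-loop step assumed here)."""
    centroids = [list(s) for s in seeds]
    assignments = []
    for _ in range(iterations):
        # Assign each series to its nearest centroid.
        assignments = [
            min(range(len(centroids)),
                key=lambda k: squared_distance(row, centroids[k]))
            for row in series
        ]
        # Move each centroid to the mean of its members.
        for k in range(len(centroids)):
            members = [row for row, a in zip(series, assignments) if a == k]
            if members:  # keep the old centroid if a cluster is empty
                centroids[k] = mean_vector(members)
    return assignments, centroids


# Hypothetical engagement time series, one row per link domain.
engagement = [
    [1, 1, 1, 1],   # flat
    [1, 2, 3, 4],   # growing
    [2, 2, 2, 2],   # flat
    [0, 2, 4, 6],   # growing
]
# The user picks one example of each pattern they care about.
seeds = [engagement[0], engagement[1]]
labels, centers = kmeans(engagement, seeds)
print(labels)
```

Seeding with user-chosen examples keeps the clusters tied to patterns a person can name, which fits the human-interpretable success metric.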
[Cycle diagram: Plan, Build, Test. First stage: Source Ideas]
[Cycle diagram: Plan, Build, Test, Learn. First stage: Source Ideas]
๏ Identifiable skill gaps:
๏ Cherry Picking
๏ Biased Exploration
๏ Replication Crisis of
Science extends to
Data Science
[Cycle diagram: Plan, Build, Test, Learn, Release. First stage: Source Ideas]
Code was
documented &
committed.
Conclusions
Ellsworth Kelly
… all together now.
[Cycle diagram: Plan, Build, Test, Learn, Release]
Works for prototyping,
building & improving ML
Products.

Use cycles to focus
efforts, avoid feature
creep & set expectations.
[Cycle diagram: Plan. First stage: Source Ideas]
Good companies
make good
decisions. Great
companies forgo
good decisions to
make great ones. 

Proper planning
leads to efficiently
executing great
ideas.
[Cycle diagram: Plan, Build. First stage: Source Ideas]
Building &
Optimizing models
is largely a solved
problem.

Identify & prioritize
unsolved problems
early on.
[Cycle diagram: Plan, Build, Test. First stage: Source Ideas]
Testing should
match the diversity
of the user base &
the gravity of the
ML Product’s
responsibility.
[Cycle diagram: Plan, Build, Test, Learn. First stage: Source Ideas]
Take the time to
learn.

Small changes in
complex
environments often
reveal cascading
effects.
[Cycle diagram: Plan, Build, Test, Learn, Release. First stage: Source Ideas]
Model life
expectancy is
increasing.

Model Robustness
should follow.
[Cycle diagram: Plan, Build, Test, Learn, Release. First stage: Source Ideas]
All stages should
be addressed, even
if they are skipped.

A modification to
an upstream stage
triggers changes to
all downstream
stages.
[Cycle diagram. First stage: Source Ideas]
Each stage may
involve input from
many roles including
users & customers,
but each stage should
have an Owner.

Who owns Prioritize
is a reflection of
{Product, Data,
Engineering, Design}
- Driven Companies.
[Diagram legend, stage owners per company: Product, Data Science, Data Engineering, ML Engineering, Software Engineering]
Philosophical Thoughts
Ellsworth Kelly
[Cycle diagram: Plan, Build, Test, Learn, Release. First stage: Source Ideas]
Platforms will
automate many
stages.

ML teams will
focus efforts on
stages that yield a
competitive
advantage.
The Turing Test tests a mastery of communication.
The Bob Test tests a mastery of communication, data synthesis & problem solving.
Challenge to AI - The Bob Test
Bob gathers input from the monitoring team, synthesizes the data & calls the airline.
Bob works with experts from the airline to determine root cause & design safe plans
of action for quick resolution.
ML + IIoT Involves an Ongoing Negotiation to Figure Out What is Possible
1/3 of the World’s Power is Generated by GE.
60% of Airplane Engines are made by GE.
…
[Diagram: Preventative Maintenance, Failure/Anomaly Detection, Assistive Diagnosis & Treatment, Systems Optimization]
5 Months to Augment Aviation — 2 Months to Augment Power
First Order Effects are When Innovations for Planes Help Planes.
Second Order Effects are When Innovations for Planes Help Power.
Third Order Effects are …
www.MLplaybook.com
/DrAndrews
Thank You

More Related Content

What's hot

Announcing Databricks Cloud (Spark Summit 2014)
Announcing Databricks Cloud (Spark Summit 2014)Announcing Databricks Cloud (Spark Summit 2014)
Announcing Databricks Cloud (Spark Summit 2014)
Databricks
 
Databricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With DataDatabricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With Data
Databricks
 
Aligning The Business Model to Technology Landscapes Enterprise Systems Arch...
Aligning The Business Model to  Technology Landscapes Enterprise Systems Arch...Aligning The Business Model to  Technology Landscapes Enterprise Systems Arch...
Aligning The Business Model to Technology Landscapes Enterprise Systems Arch...
Daljit Banger
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
DATAVERSITY
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
Databricks
 
Data Modeling on Azure for Analytics
Data Modeling on Azure for AnalyticsData Modeling on Azure for Analytics
Data Modeling on Azure for Analytics
Ike Ellis
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
DATAVERSITY
 
Building an integrated data strategy
Building an integrated data strategyBuilding an integrated data strategy
Building an integrated data strategy
Lucas Modesto
 
8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy
Silicon Valley Data Science
 
Data Mesh
Data MeshData Mesh
Accelerate and modernize your data pipelines
Accelerate and modernize your data pipelinesAccelerate and modernize your data pipelines
Accelerate and modernize your data pipelines
Paul Van Siclen
 
Business Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected ApproachBusiness Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected Approach
DATAVERSITY
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
Denodo
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
DATAVERSITY
 
Data Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation CriteriaData Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation Criteria
ScyllaDB
 
Data Architecture for Solutions.pdf
Data Architecture for Solutions.pdfData Architecture for Solutions.pdf
Data Architecture for Solutions.pdf
Alan McSweeney
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera

Cloudera, Inc.
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
Kent Graziano
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
Databricks
 
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesBest Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
Eric Kavanagh
 

What's hot (20)

Announcing Databricks Cloud (Spark Summit 2014)
Announcing Databricks Cloud (Spark Summit 2014)Announcing Databricks Cloud (Spark Summit 2014)
Announcing Databricks Cloud (Spark Summit 2014)
 
Databricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With DataDatabricks: A Tool That Empowers You To Do More With Data
Databricks: A Tool That Empowers You To Do More With Data
 
Aligning The Business Model to Technology Landscapes Enterprise Systems Arch...
Aligning The Business Model to  Technology Landscapes Enterprise Systems Arch...Aligning The Business Model to  Technology Landscapes Enterprise Systems Arch...
Aligning The Business Model to Technology Landscapes Enterprise Systems Arch...
 
Enterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data ArchitectureEnterprise Architecture vs. Data Architecture
Enterprise Architecture vs. Data Architecture
 
Architect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh ArchitectureArchitect’s Open-Source Guide for a Data Mesh Architecture
Architect’s Open-Source Guide for a Data Mesh Architecture
 
Data Modeling on Azure for Analytics
Data Modeling on Azure for AnalyticsData Modeling on Azure for Analytics
Data Modeling on Azure for Analytics
 
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
Data Architecture, Solution Architecture, Platform Architecture — What’s the ...
 
Building an integrated data strategy
Building an integrated data strategyBuilding an integrated data strategy
Building an integrated data strategy
 
8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy8 Steps to Creating a Data Strategy
8 Steps to Creating a Data Strategy
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Accelerate and modernize your data pipelines
Accelerate and modernize your data pipelinesAccelerate and modernize your data pipelines
Accelerate and modernize your data pipelines
 
Business Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected ApproachBusiness Intelligence & Data Analytics– An Architected Approach
Business Intelligence & Data Analytics– An Architected Approach
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 
Data Strategy Best Practices
Data Strategy Best PracticesData Strategy Best Practices
Data Strategy Best Practices
 
Data Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation CriteriaData Platform Architecture Principles and Evaluation Criteria
Data Platform Architecture Principles and Evaluation Criteria
 
Data Architecture for Solutions.pdf
Data Architecture for Solutions.pdfData Architecture for Solutions.pdf
Data Architecture for Solutions.pdf
 
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera
Supercharge Splunk with Cloudera

Supercharge Splunk with Cloudera

 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
Active Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with AlationActive Governance Across the Delta Lake with Alation
Active Governance Across the Delta Lake with Alation
 
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data PipelinesBest Practices in DataOps: How to Create Agile, Automated Data Pipelines
Best Practices in DataOps: How to Create Agile, Automated Data Pipelines
 

Similar to ML Playbook

Dr. June Andrews, Principal Data Scientist, Wise.io, From GE Digital at MLcon...
Dr. June Andrews, Principal Data Scientist, Wise.io, From GE Digital at MLcon...Dr. June Andrews, Principal Data Scientist, Wise.io, From GE Digital at MLcon...
Dr. June Andrews, Principal Data Scientist, Wise.io, From GE Digital at MLcon...
MLconf
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of Things
June Andrews
 
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
June Andrews
 
Platform approach to scaling machine learning across the enterprise
Platform approach to scaling machine learning across the enterprisePlatform approach to scaling machine learning across the enterprise
Platform approach to scaling machine learning across the enterprise
Olalekan Fuad Elesin
 
Jan Bosch | Agile Product Development: From Hunch to Hard Data
Jan Bosch | Agile Product Development: From Hunch to Hard DataJan Bosch | Agile Product Development: From Hunch to Hard Data
Jan Bosch | Agile Product Development: From Hunch to Hard Data
Optimizely
 
Agile Cafe Boulder - Panelist and keynote slides
Agile Cafe Boulder - Panelist and keynote slidesAgile Cafe Boulder - Panelist and keynote slides
Agile Cafe Boulder - Panelist and keynote slides
Cloud Elements
 
Introduction to Agile
Introduction to AgileIntroduction to Agile
Introduction to Agile
Charlotte Chang
 
AGILE2017 Top 10 Takeaways by Synerzip
AGILE2017 Top 10 Takeaways by SynerzipAGILE2017 Top 10 Takeaways by Synerzip
AGILE2017 Top 10 Takeaways by Synerzip
Synerzip
 
Final synerzip-agile2017-top10-v1
Final synerzip-agile2017-top10-v1Final synerzip-agile2017-top10-v1
Final synerzip-agile2017-top10-v1
Hemant Elhence
 
SigOpt at O'Reilly - Best Practices for Scaling Modeling Platforms
SigOpt at O'Reilly - Best Practices for Scaling Modeling PlatformsSigOpt at O'Reilly - Best Practices for Scaling Modeling Platforms
SigOpt at O'Reilly - Best Practices for Scaling Modeling Platforms
SigOpt
 
Agile Engineering Practices
Agile Engineering PracticesAgile Engineering Practices
Agile Engineering Practices
Vernon Stinebaker
 
Continuous Intelligence Workshop
Continuous Intelligence WorkshopContinuous Intelligence Workshop
Continuous Intelligence Workshop
David Tan
 
MongoDB World 2018: How an Idea Becomes a MongoDB Feature
MongoDB World 2018: How an Idea Becomes a MongoDB FeatureMongoDB World 2018: How an Idea Becomes a MongoDB Feature
MongoDB World 2018: How an Idea Becomes a MongoDB Feature
MongoDB
 
[DSC Croatia 22] The Ethos of Process in ML - Leanne Fitzpatrick
[DSC Croatia 22] The Ethos of Process in ML - Leanne Fitzpatrick[DSC Croatia 22] The Ethos of Process in ML - Leanne Fitzpatrick
[DSC Croatia 22] The Ethos of Process in ML - Leanne Fitzpatrick
DataScienceConferenc1
 
Data Governance in an Agile SCRUM Lean MVP World
Data Governance in an Agile SCRUM Lean MVP WorldData Governance in an Agile SCRUM Lean MVP World
Data Governance in an Agile SCRUM Lean MVP World
DATAVERSITY
 
What can DesignOps do for you? by Carol Smith at TLMUX in Montreal
What can DesignOps do for you? by Carol Smith at TLMUX in MontrealWhat can DesignOps do for you? by Carol Smith at TLMUX in Montreal
What can DesignOps do for you? by Carol Smith at TLMUX in Montreal
Carol Smith
 
E xtreme programming for etl and data analytics final
E xtreme programming for etl and data analytics finalE xtreme programming for etl and data analytics final
E xtreme programming for etl and data analytics final
Prafulla Girgaonkar
 
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
NadinaLisbon1
 
Certification Study Group - NLP & Recommendation Systems on GCP Session 5
Certification Study Group - NLP & Recommendation Systems on GCP Session 5Certification Study Group - NLP & Recommendation Systems on GCP Session 5
Certification Study Group - NLP & Recommendation Systems on GCP Session 5
gdgsurrey
 

Similar to ML Playbook (20)

Dr. June Andrews, Principal Data Scientist, Wise.io, From GE Digital at MLcon...
Dr. June Andrews, Principal Data Scientist, Wise.io, From GE Digital at MLcon...Dr. June Andrews, Principal Data Scientist, Wise.io, From GE Digital at MLcon...
Dr. June Andrews, Principal Data Scientist, Wise.io, From GE Digital at MLcon...
 
Counter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of ThingsCounter Intuitive Machine Learning for the Industrial Internet of Things
Counter Intuitive Machine Learning for the Industrial Internet of Things
 
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
Critical turbine maintenance: Monitoring and diagnosing planes and power plan...
 
Platform approach to scaling machine learning across the enterprise
Platform approach to scaling machine learning across the enterprisePlatform approach to scaling machine learning across the enterprise
Platform approach to scaling machine learning across the enterprise
 
Jan Bosch | Agile Product Development: From Hunch to Hard Data
Jan Bosch | Agile Product Development: From Hunch to Hard DataJan Bosch | Agile Product Development: From Hunch to Hard Data
Jan Bosch | Agile Product Development: From Hunch to Hard Data
 
Agile Cafe Boulder - Panelist and keynote slides
Agile Cafe Boulder - Panelist and keynote slidesAgile Cafe Boulder - Panelist and keynote slides
Agile Cafe Boulder - Panelist and keynote slides
 
Introduction to Agile
Introduction to AgileIntroduction to Agile
Introduction to Agile
 
AGILE2017 Top 10 Takeaways by Synerzip
AGILE2017 Top 10 Takeaways by SynerzipAGILE2017 Top 10 Takeaways by Synerzip
AGILE2017 Top 10 Takeaways by Synerzip
 
Final synerzip-agile2017-top10-v1
Final synerzip-agile2017-top10-v1Final synerzip-agile2017-top10-v1
Final synerzip-agile2017-top10-v1
 
SigOpt at O'Reilly - Best Practices for Scaling Modeling Platforms
SigOpt at O'Reilly - Best Practices for Scaling Modeling PlatformsSigOpt at O'Reilly - Best Practices for Scaling Modeling Platforms
SigOpt at O'Reilly - Best Practices for Scaling Modeling Platforms
 
Agile Engineering Practices
Agile Engineering PracticesAgile Engineering Practices
Agile Engineering Practices
 
Continuous Intelligence Workshop
Continuous Intelligence WorkshopContinuous Intelligence Workshop
Continuous Intelligence Workshop
 
MongoDB World 2018: How an Idea Becomes a MongoDB Feature
MongoDB World 2018: How an Idea Becomes a MongoDB FeatureMongoDB World 2018: How an Idea Becomes a MongoDB Feature
MongoDB World 2018: How an Idea Becomes a MongoDB Feature
 
[DSC Croatia 22] The Ethos of Process in ML - Leanne Fitzpatrick
[DSC Croatia 22] The Ethos of Process in ML - Leanne Fitzpatrick[DSC Croatia 22] The Ethos of Process in ML - Leanne Fitzpatrick
[DSC Croatia 22] The Ethos of Process in ML - Leanne Fitzpatrick
 
Agile webinar pack (2)
Agile webinar pack (2)Agile webinar pack (2)
Agile webinar pack (2)
 
Data Governance in an Agile SCRUM Lean MVP World
Data Governance in an Agile SCRUM Lean MVP WorldData Governance in an Agile SCRUM Lean MVP World
Data Governance in an Agile SCRUM Lean MVP World
 
What can DesignOps do for you? by Carol Smith at TLMUX in Montreal
What can DesignOps do for you? by Carol Smith at TLMUX in MontrealWhat can DesignOps do for you? by Carol Smith at TLMUX in Montreal
What can DesignOps do for you? by Carol Smith at TLMUX in Montreal
 
E xtreme programming for etl and data analytics final
E xtreme programming for etl and data analytics finalE xtreme programming for etl and data analytics final
E xtreme programming for etl and data analytics final
 
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
Salesforce Architect Group, Frederick, United States July 2023 - Generative A...
 
Certification Study Group - NLP & Recommendation Systems on GCP Session 5
Certification Study Group - NLP & Recommendation Systems on GCP Session 5Certification Study Group - NLP & Recommendation Systems on GCP Session 5
ML Playbook

  • 2. Getting Machine Learning Products into Production … bit more of an art than a science. “For every idea in production, there are ~7 ideas that did not make the cut.” Ellsworth Kelly
  • 3. Optimize and refine products by cycling multiple times with improvements. Machine Learning Product Development Cycle - Phases: Plan, Build, Test, Learn, Release. Extending Boehm Spiral Model from Software Engineering.
  • 4. [Plan, Build, Test, Learn, Release] Works for Social Networks, Consumer Web and Industrial Internet of Things. Diverse Experiences over Years Encapsulated in the ML Playbook.
  • 5. Today - Demonstrate the ML Playbook in Action 3 Times. Example Playbook cycles: Build, Improve, Prototype.
  • 6. Example of a Build Product - Augmenting Aviation Monitoring. Ellsworth Kelly. … you can tell how challenging a project is - based on the graveyard of previous attempts.
  • 7. Aviation: History of Innovation & Excellence. Best Practices & Decision Design have Evolved over 15+ Years of Monitoring Fleets.
  • 8. [Plan: Source Ideas, Prioritize, Define Success, Design Roadmap] Learned from previous work. Immediately prioritized to build without prototype. Model Performance Success Metrics: ๏ Coverage x % ๏ Accuracy y % ๏ Low Latency < z seconds. Comprehensive Success Metrics: ๏ Usability is intuitive & transparent ๏ Delivered on time ๏ Maintains current history of excellence ๏ Collaborators enjoy working with Wise
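The model performance metrics on slide 8 can be made concrete with a small sketch. It assumes a model that may decline to answer for some alerts (a prediction of None), with coverage taken as the share of alerts that receive a prediction and accuracy measured only on covered alerts. These definitions and the function name are illustrative assumptions; the slides do not define the metrics or give values for x, y and z.

```python
def coverage_and_accuracy(predictions, truths):
    """Return (coverage, accuracy) for a model that may abstain.

    predictions: list where None means the model gave no answer (assumed convention).
    truths: list of ground-truth labels, same length as predictions.
    """
    if len(predictions) != len(truths):
        raise ValueError("predictions and truths must have equal length")
    # Keep only the alerts the model actually answered.
    covered = [(p, t) for p, t in zip(predictions, truths) if p is not None]
    coverage = len(covered) / len(predictions) if predictions else 0.0
    accuracy = sum(p == t for p, t in covered) / len(covered) if covered else 0.0
    return coverage, accuracy


# Hypothetical labels, for illustration only.
example_predictions = ["a", None, "b", "b"]
example_truths = ["a", "a", "b", "a"]
example_coverage, example_accuracy = coverage_and_accuracy(
    example_predictions, example_truths
)
```

Reporting the two numbers separately keeps the trade-off visible: a model can raise accuracy by answering fewer alerts, which lowers coverage.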
  • 9. [Plan] Make it easy on yourself & proportional to the success metrics. Minor variations in how the ML product is designed lead to drastically different difficulties of implementation. Explore design decisions around: ๏ Critical Paths of existing systems ๏ Augment v Automate ๏ UI - number of results, placement, text length, …
  • 10. New Workflow Preserves Baseline Reliability with Increasing Speed & Accuracy. Delivered an Augmented Human Interpretable Model.
  • 11. Increased the Power of the ML Models by Demanding More. Handling All Alerts was a Key Design Choice.
  • 12. Competitive Machine Learning Requires Strategic Talent Contributions. Augmenting Fleet Monitor was an Orchestration of Aligning Talents with Needs … the Roadmap was detailed.
  • 13. [Plan; Build: Find & ETL Data, Feature Engineer, Build & Train, Optimize] We love this stage! Everything else is very linear - Build is where it can get messy and enthralling. There are plenty of great explanations and ‘wheels’ with more detail. Remember - building is not the end goal. Proactively communicate complications & delays to project stakeholders.
  • 14. [Plan, Build] Lessons Learned working with Time Series: ๏ Time is not the same for all engines. Normalize by Life Cycle. ๏ Inferring missing data from nearby sensor readings, so that simpler models that assume even time steps can be used, is a bad idea. ๏ Create features resilient to missing data. ๏ Take the time to be clever. More data is not an option. (Deep Learning?) There have been more flights in a year than websites about Dogs on Google - but far fewer maintenance alerts than there are T-Rex Dogs.
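Two of the time series lessons on slide 14 (normalize by life cycle, create features resilient to missing data) can be sketched as follows. This is a minimal illustration, not the method used in the talk: it assumes each engine's readings arrive as (cycle, value) pairs where value may be None, and that an expected life in cycles is known per engine. The function names, the bucket count and the sample readings are hypothetical.

```python
from statistics import mean


def life_fraction(cycle, expected_life_cycles):
    """Map an engine's cycle count onto a 0..1 life-cycle axis (clamped)."""
    return min(max(cycle / expected_life_cycles, 0.0), 1.0)


def bucket_features(readings, expected_life_cycles, n_buckets=4):
    """Mean sensor value per life-cycle bucket, skipping missing readings.

    A bucket with no observed values stays None instead of being imputed
    from neighbouring readings.
    """
    buckets = [[] for _ in range(n_buckets)]
    for cycle, value in readings:
        if value is None:
            continue  # do not infer the missing reading
        frac = life_fraction(cycle, expected_life_cycles)
        idx = min(int(frac * n_buckets), n_buckets - 1)
        buckets[idx].append(value)
    return [mean(b) if b else None for b in buckets]


def observed_fraction(readings):
    """Share of readings that are present; usable as a feature in itself."""
    if not readings:
        return 0.0
    return sum(value is not None for _, value in readings) / len(readings)


# Hypothetical readings for one engine with an assumed life of 1000 cycles.
example_readings = [(0, 1.0), (100, None), (250, 3.0), (300, 5.0), (900, 7.0)]
example_features = bucket_features(example_readings, expected_life_cycles=1000)
```

Because the features are indexed by life-cycle fraction rather than wall-clock time, engines of different ages become comparable, and uneven sampling only changes how many readings fall into each bucket.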
  • 15. Machine Learning Experts Transform Domain Knowledge into Model Inputs. Aviation Experts are a Strategic Advantage in IIoT.
  • 16. [Plan; Build; Test: Productionize, A/B Test, Evaluate, Dog Food] Third party support with MTurk, Crowdflower …. Largest companies tend to use in-house solutions: Pinterest uses Sofia, Stitch Fix has Stylists, Facebook and Google famously have in-house teams. We worked with in-house Aviation Experts.
  • 17. [Plan, Build, Test] Productionizing is becoming much easier! Airbnb has Airflow, Netflix has Spinnaker+, Wise has Wise Factory 2.0. Clouds are now starting to offer: Google Cloud with SavedModel, AWS with SageMaker. Lowering hiring requirements.
  • 18. [Plan; Build; Test; Learn: Analyze, Product Review, Evangelize] Overall learning for GE - break from the idea of one task one model, one alert one model. There are cross learnings from common alerts that are pertinent for rare alerts. Learnings from Aviation are being used in Power. Tactical learning - found there was a difference in performance between offline and online systems due to data delays.
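The tactical learning on slide 18 (offline and online performance differ because of data delays) can be reproduced offline by tracking when each reading became available, not just when it was measured. The sketch below is one hypothetical way to do that; the talk does not describe how the gap was found or handled, and the Reading fields and function names are assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Reading:
    event_time: float    # when the sensor measured the value
    arrival_time: float  # when the value became available to the model
    value: float


def visible_at(readings, prediction_time):
    """Readings an online system would actually hold at prediction_time."""
    return [r for r in readings if r.arrival_time <= prediction_time]


def latest_value(readings, prediction_time, respect_delay=True):
    """Most recent value by event time, with or without arrival delays."""
    if respect_delay:
        pool = visible_at(readings, prediction_time)
    else:
        # Naive offline view: everything measured so far is assumed available.
        pool = [r for r in readings if r.event_time <= prediction_time]
    if not pool:
        return None
    return max(pool, key=lambda r: r.event_time).value


# Hypothetical readings: the second one was measured at t=2 but arrived at t=6.
example = [Reading(1.0, 1.5, 10.0), Reading(2.0, 6.0, 20.0)]
online_view = latest_value(example, prediction_time=3.0)
offline_view = latest_value(example, prediction_time=3.0, respect_delay=False)
```

When the two views disagree, as they do here, an offline evaluation built on the naive view will overstate what the online system can see.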
  • 19. [Learn] Resist letting the Data Used and Scope of the model reflect your organization. bonkersworld.net
  • 20. [Plan; Build; Test; Learn; Release: Release 100%, Retrain & Maintain] Non-trivial - expect this ML Product to be in production for 2 to 25 years. Plan accordingly with documentation, multiple teams supporting & Black Swan simulations.
  • 21. Example of an Improve Product - Increase Coverage by 20%. Ellsworth Kelly. … there’s always an improvement to be made. Whether it is worth the effort is another question.
  • 22. [Plan] Project Prioritization - books have been written. Let’s distill: ๏ Gather all ideas. ๏ Measure each idea by {Impact, Cost, Strategy}. ๏ Use physics and math principles for estimating cost and error propagation for upper and lower bounds. ๏ Keep records of assumptions and results; update assumptions as more is learned.
  • 23. [Plan] Idea: Estimated Impact / Cost / Strategic
      Tuning of Window Parameters: z% / 3 days / No
      Debugging of missing data: z% / 2 weeks / Yes
      Train new model on uncovered alerts: z% / 1 month+ / No
      Add more Time Series Features: z% / 2 days per feature / Yes
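One way to apply the prioritization advice on slides 22 and 23 is to carry lower and upper estimates for impact and cost and propagate them through a division. The sketch below is hypothetical: the numeric ranges are placeholders (the slides only give z% for impact), and ranking strategic ideas first is an assumed policy, not something the talk prescribes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Idea:
    name: str
    impact: tuple     # (low, high) estimated impact, placeholder units
    cost_days: tuple  # (low, high) estimated cost in working days
    strategic: bool


def impact_per_day_bounds(idea):
    """Interval division: the worst case is low impact at high cost."""
    impact_low, impact_high = idea.impact
    cost_low, cost_high = idea.cost_days
    return impact_low / cost_high, impact_high / cost_low


def rank(ideas):
    """Strategic ideas first (assumed policy), then pessimistic impact per day."""
    return sorted(
        ideas,
        key=lambda idea: (idea.strategic, impact_per_day_bounds(idea)[0]),
        reverse=True,
    )


# Placeholder estimates, not figures from the talk.
ideas = [
    Idea("Tune window parameters", (1.0, 2.0), (2, 4), False),
    Idea("Debug missing data", (2.0, 6.0), (8, 12), True),
    Idea("Add one time series feature", (1.0, 1.5), (1, 3), True),
]
ranked_names = [idea.name for idea in rank(ideas)]
```

Keeping the ranges alongside the ranking is a simple record of the assumptions, so they can be updated as more is learned in later cycles.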
  • 24. [Plan, Build] Building Improvements is very quick for scalable models. Scalable engineering allows an arbitrary number of features. Scalable math allows inclusion of additional features with predictable or safe outcomes (don’t mix logs and multiplications, don’t mix embeddings with clustering). Improvement features are often bucketed together. Balance ‘fix it while I’m here’ with feature creep. There will be another cycle; ideas can always be added for prioritization in the next cycle.
  • 25. [Plan, Build, Test] Test stage should be straightforward.
  • 26. [Plan, Build, Test, Learn] Even straightforward feature additions offer a chance to learn: New Feature Family? Suspicious behavior? Bugs? Improvements for faster iteration?
  • 27. [Plan, Build, Test, Learn, Release] Should be straightforward. Update documentation, monitoring, dependencies …
  • 28. Example of Prototype Product - Human in the Loop Clustering. Ellsworth Kelly. … no recommendation here is domain specific. Let’s test that.
  • 29. Pinterest: History of Investing in Tools for Scalable Data Science. Project - Understand how users are engaging with Link Domains. Make tool reusable.
  • 30. [Plan] How are Users Engaging with Link Domains? Success Metrics: ๏ Human Interpretable Answer ๏ Representative of Entire Domain ๏ Easily Reused ๏ Done in 2 weeks
  • 31. [Plan] ๏ Human Interpretable Answer ๏ Representative of Entire Domain ๏ Easily Reused ๏ Done in 2 weeks ๏ Reuse existing code and features ๏ Use Python Notebook ๏ Extend a Human in the Loop Algorithm ๏ Ask some work of the user
  • 32. [Plan, Build] Another time series! Focused time on visualization and human interaction components (reused ETL jobs, basic features, used K-means).
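Slide 32 mentions K-means inside a human in the loop workflow without giving details. As a rough sketch under stated assumptions, the code below runs plain Lloyd's algorithm from caller-supplied starting centers and then applies a reviewer's decision to merge two clusters. The merge step, the function names and the sample points are hypothetical and are not taken from the Pinterest tool.

```python
import math


def assign(points, centers):
    """Index of the nearest center for every point (Euclidean distance)."""
    return [
        min(range(len(centers)), key=lambda k: math.dist(p, centers[k]))
        for p in points
    ]


def update(points, labels, centers):
    """Mean of each cluster; an empty cluster keeps its previous center."""
    new_centers = []
    for k, old in enumerate(centers):
        members = [p for p, lab in zip(points, labels) if lab == k]
        if not members:
            new_centers.append(old)
            continue
        new_centers.append(
            tuple(sum(m[d] for m in members) / len(members) for d in range(len(old)))
        )
    return new_centers


def kmeans(points, init_centers, max_iter=100):
    """Lloyd's algorithm: alternate assignment and center updates."""
    centers = list(init_centers)
    labels = assign(points, centers)
    for _ in range(max_iter):
        centers = update(points, labels, centers)
        new_labels = assign(points, centers)
        if new_labels == labels:
            break  # assignments are stable
        labels = new_labels
    return labels, centers


def merge_clusters(labels, keep, drop):
    """Human in the loop step: a reviewer decides two clusters are one."""
    return [keep if lab == drop else lab for lab in labels]


# Hypothetical two-dimensional feature vectors.
example_points = [(0.0, 0.0), (0.0, 2.0), (10.0, 10.0), (10.0, 12.0)]
example_labels, example_centers = kmeans(
    example_points, init_centers=[(0.0, 0.0), (10.0, 10.0)]
)
```

Passing the starting centers explicitly keeps the run deterministic, which makes a reviewer's merge decisions repeatable when the notebook is rerun.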
  • 34. [Plan, Build, Test, Learn] ๏ Identifiable skill gaps: Cherry Picking, Biased Exploration ๏ Replication Crisis of Science extends to Data Science
  • 35. [Plan, Build, Test, Learn, Release] Code was documented & committed.
  • 37. [Plan, Build, Test, Learn, Release] Works for prototyping, building & improving ML Products. Use cycles to focus efforts, avoid feature creep & set expectations.
  • 38. [Plan] Good companies make good decisions. Great companies forgo good decisions to make great ones. Proper planning leads to efficiently executing great ideas.
  • 39. [Plan, Build] Building & Optimizing models is largely a solved problem. Identify & prioritize unsolved problems early on.
  • 40. [Plan, Build, Test] Testing should match the diversity of the user base & the gravity of the ML Product’s responsibility.
  • 41. [Plan, Build, Test, Learn] Take the time to learn. Small changes in complex environments often reveal cascading effects.
  • 42. [Plan, Build, Test, Learn, Release] Model life expectancy is increasing. Model Robustness should follow.
  • 43. [Plan, Build, Test, Learn, Release] All stages should be addressed, even if they are skipped. A modification to an upstream stage triggers changes to all downstream stages.
  • 44. Each stage may involve input from many roles including users & customers, but each stage should have an Owner. Who owns Prioritize is a reflection of {Product, Data, Engineering, Design} - Driven Companies. Legend: Product, Data Science, Data Engineering, ML Engineering, Software Engineering; Per Company.
  • 46. [Plan, Build, Test, Learn, Release] Platforms will automate many stages. ML teams will focus efforts on stages that yield a competitive advantage.
  • 47. Challenge to AI - The Bob Test. The Turing Test tests a mastery of communication. The Bob Test tests a mastery of communication, data synthesis & problem solving. Bob gathers input from the monitoring team, synthesizes the data & calls the airline. Bob works with experts from the airline to determine root cause & design safe plans of action for quick resolution.
  • 48. ML + IIoT Involves an Ongoing Negotiation to Figure Out What is Possible. 1/3 of the World’s Power is Generated by GE. 60% of Airplane Engines are made by GE. … Preventative Maintenance; Failure/Anomaly Detection; Assistive Diagnosis & Treatment; Systems Optimization.
  • 49. 5 Months to Augment Aviation, 2 Months to Augment Power. First Order Effects are When Innovations for Planes Help Planes. Second Order Effects are When Innovations for Planes Help Power. Third Order Effects are …