Managing Machines: The New AI Dev Stack

•

1 like•1,647 views

Machine learning drove massive growth at consumer internet companies over the last decade, and this was enabled by open software, datasets, and AI research. For many problems, machine learning will produce better, faster, and more repeatable decisions at scale. Unfortunately, building and maintaining these systems is still extremely difficult and expensive. As more machine learning software moves to production, many of our traditional tools and best practices in software development will change. Pete Skomoroch walks you through what you need to know as we shift from a world of deterministic programs to systems that give unpredictable results on ever-changing training data. To navigate this world powered by nondeterministic data-dependent programs, we’ll also need a new development stack to help us write, test, deploy, and monitor machine learning software. Presented at OSCON Portland July 18, 2019

Technology

Managing Machines
Peter Skomoroch - @peteskomoroch
July 18, 2019

Better, Faster Decisions at Scale
• Machine learning drove massive growth at
consumer internet companies over the last
decade
• A wave of AI startups and vertical machine
learning applications are emerging across
other industries
• For many problems, machine learning
makes better, faster, and more repeatable
decisions at scale

AI Products
Automated systems that collect & learn from data to make user facing decisions with ML

Beyond Code: Open Data, Papers
• The mission of Papers With
Code is to create a free and
open resource with machine
learning papers, code and
evaluation tables.
• https://PapersWithCode.com
• https://arXiv.org
• http://commoncrawl.org
• https://voice.mozilla.org
Source: @paperswithcode

Machine Learning is Still Difficult
• Hard to know which models will work
• ML needs lots of labelled training data
• ML is prone to fail in unexpected ways
• Development progress is often episodic
and unpredictable
• Many research & production systems still
held together by duct tape, notebooks
• Input data & dependencies change, so
past model runs are difficult to reproduce

Machine Learning Dev Process
1. Problem definition and framing
2. Model design
3. Data collection, labelling, and cleaning
4. Feature engineering
5. Model training and offline validation
6. Model deployment, monitoring &
online training
Problem
Definition
Model
Design
Data
Collection &
Cleaning
Feature
Engineering
Model
Training
Deployment
& Monitoring

Better Tools for AI Developers
Source: @mrogati

Data Quality & Standardization
• Guide user input when you can
• Use auto suggest fields vs. free text
• Validate user inputs, emails
• Collect user tags, votes, ratings
• Track impressions, queries, clicks
• Sessionize & de-identify logs
• Disambiguate and annotate entities
• GDPR, Compliance, retention
Source: @brookLYNevery1

Training & Model Management
Source: @OpenAI @weights_biases

Machine Learning Testing
• Real world data changes over time
• Model tests and benchmarks get stale
• Maintain a suite of model input test fixtures
• Test for silent failures & output shifts
• Decouple testing of your data & infrastructure
from testing of your ML code
• Train models with sample data and run
through entire ML pipeline
Source: Machine Learning Testing: Survey, Landscapes and Horizons

Monitoring: Data, Features, Models
• Diversity of ever-changing data sources
• Data issues cascade through the ML stack
• Features also change, drift, break over time
• Model accuracy can decay over time
• Need data monitoring to spot statistical
changes in data inputs, anomalies
• Unlike traditional applications, model
performance is non-deterministic
Source: @fiddlerlabs

Filling Out the AI Dev Stack
Data & Model Versioning
Data & Model Monitoring
Model Training
Model Evaluation
Model Deployment
Visualization & Exploration
IDEs & Collaboration
Data Labelling
Feature Management
Parameter Tuning
Testing
Debugging
Profiling
Explainability
Data Anomaly Detection
We need better tools for:

More Resources
• Redpoint’s ML Workflow Landscape
• Awesome production machine learning
• Berkeley’s Full Stack Deep Learning Bootcamp
• O’Reilly: AI adoption is being fueled by an improved tool ecosystem
• Machine Learning Testing: Survey, Landscapes and Horizons
• What’s your ML Test Score? A rubric for ML production systems
• AllenAI: Writing Code for NLP Research
• Models and microservices should be running on the same continuous delivery stack
• Google’s Rules of Machine Learning: Best Practices for ML Engineering
• AWS Managing Machine Learning Projects

What's hot

How to classify documents automatically using NLPSkyl.ai

Krish Swamy + Balaji Gopalakrishnan, Wells Fargo - Building a World Class Dat...Sri Ambati

Watson Analytics - Специалист по обработке данных "в коробке"Irina Podlevskikh

Scaling Training Data for AI ApplicationsApplause

New professional careers in dataDavid Rostcheck

Anatomy of a data science projectAdam Sroka

Why Machine Learning OperationsPrem Naraindas

The Other 99% of a Data Science ProjectEugene Mandel

Cloud Computing ServicesBigDataCloud

Product Management 101 for Data and Analytics Ravi Padaki

Robert Coop, Stanley Black & Decker - Optimizing Manufacturing with Driverles...Sri Ambati

Lightning talk :IBM Content Analytics with Enterprise Search - Wolfgang Junglucenerevolution

Are you ready for Data science? A 12 point testBertil Hatt

Set the Hiring Managers’ Expectations: Using Big Data to answer Big Questions...Textkernel

Data-driven design (UX Antwerp 24/09/19)Peter Vermaercke

Growing Your SharePoint Toddler to a Mature SolutionDavid J Pileggi Jr

Big Data CareersSteven Miller

Big Data, Machine Learning till Business IntelligenceDinesh Kumar

Devon_Granillo resume finalDevon Granillo

Get Behind the Wheel with H2O Driverless AI - Hands on Lab - H2O World San Fr...Sri Ambati

What's hot (20)

How to classify documents automatically using NLP

Krish Swamy + Balaji Gopalakrishnan, Wells Fargo - Building a World Class Dat...

Watson Analytics - Специалист по обработке данных "в коробке"

Scaling Training Data for AI Applications

New professional careers in data

Anatomy of a data science project

Why Machine Learning Operations

The Other 99% of a Data Science Project

Cloud Computing Services

Product Management 101 for Data and Analytics

Robert Coop, Stanley Black & Decker - Optimizing Manufacturing with Driverles...

Lightning talk :IBM Content Analytics with Enterprise Search - Wolfgang Jung

Are you ready for Data science? A 12 point test

Set the Hiring Managers’ Expectations: Using Big Data to answer Big Questions...

Data-driven design (UX Antwerp 24/09/19)

Growing Your SharePoint Toddler to a Mature Solution

Big Data Careers

Big Data, Machine Learning till Business Intelligence

Devon_Granillo resume final

Get Behind the Wheel with H2O Driverless AI - Hands on Lab - H2O World San Fr...

Similar to Managing Machines: The New AI Dev Stack

Making Data Science Scalable - 5 Lessons LearnedLaurenz Wuttke

Design Like a Pro: Machine Learning BasicsInductive Automation

Mykola Mykytenko: MLOps: your way from nonsense to valuable effect (approache...Lviv Startup Club

MLOps and Data Quality: Deploying Reliable ML Models in ProductionProvectus

Introducción al Machine Learning AutomáticoSri Ambati

How to use continual learning in your ML modelscnvrg.io AI OS - Hands-on ML Workshops

Cnvrg webinar continual learningMaya Perry

Democratizing Data Science in the EnterpriseJesus Rodriguez

Machine learning systems for engineersCameron Joannidis

Machine learningSaravanan Subburayal

MLOps Virtual Event: Automating ML at ScaleDatabricks

Architecting for Data ScienceJohann Schleier-Smith

Artificial Intelligence As a ServiceJohn Liu

ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...DATAVERSITY

If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...Dell World

Productionising Machine Learning ModelsTash Bickley

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...Sri Ambati

Data engineering track module 2Pranav Prakash

Enterprise Machine Learning Governance Terence Siganakis

Similar to Managing Machines: The New AI Dev Stack (20)

Making Data Science Scalable - 5 Lessons Learned

Design Like a Pro: Machine Learning Basics

Mykola Mykytenko: MLOps: your way from nonsense to valuable effect (approache...

MLOps and Data Quality: Deploying Reliable ML Models in Production

Introducción al Machine Learning Automático

How to use continual learning in your ML models

Cnvrg webinar continual learning

Democratizing Data Science in the Enterprise

Machine learning systems for engineers

Machine learning

MLOps Virtual Event: Automating ML at Scale

Architecting for Data Science

Artificial Intelligence As a Service

ADV Slides: What the Aspiring or New Data Scientist Needs to Know About the E...

If You Are Not Embedding Analytics Into Your Day To Day Processes, You Are Do...

Productionising Machine Learning Models

Design Patterns for Machine Learning in Production - Sergei Izrailev, Chief D...

Data engineering track module 2

Enterprise Machine Learning Governance

Recently uploaded

How to Remove Document Management Hurdles with X-Docs?XfilesPro

How to convert PDF to text with Nanonetsnaman860154

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

Install Stable Diffusion in windows machinePadma Pradeep

Slack Application Development 101 Slidespraypatel2

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Pigging Solutions Piggable Sweeping ElbowsPigging Solutions

The transition to renewables in India.pdfCompetition Advisory Services (India) LLP

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent

Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group

Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren

Key Features Of Token Development (1).pptxLBM Solutions

CloudStudio User manual (basic edition):comworks

Pigging Solutions in Pet Food ManufacturingPigging Solutions

GenCyber Cyber Security Day PresentationMichael W. Hawkins

08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited

Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst

Recently uploaded (20)

How to Remove Document Management Hurdles with X-Docs?

How to convert PDF to text with Nanonets

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

Install Stable Diffusion in windows machine

Slack Application Development 101 Slides

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

Pigging Solutions Piggable Sweeping Elbows

The transition to renewables in India.pdf

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

08448380779 Call Girls In Greater Kailash - I Women Seeking Men

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...

Next-generation AAM aircraft unveiled by Supernal, S-A2

Advanced Test Driven-Development @ php[tek] 2024

Key Features Of Token Development (1).pptx

CloudStudio User manual (basic edition):

Pigging Solutions in Pet Food Manufacturing

GenCyber Cyber Security Day Presentation

08448380779 Call Girls In Diplomatic Enclave Women Seeking Men

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365

Human Factors of XR: Using Human Factors to Design XR Systems

Managing Machines: The New AI Dev Stack

1. Managing Machines Peter Skomoroch - @peteskomoroch July 18, 2019

2. Better, Faster Decisions at Scale • Machine learning drove massive growth at consumer internet companies over the last decade • A wave of AI startups and vertical machine learning applications are emerging across other industries • For many problems, machine learning makes better, faster, and more repeatable decisions at scale

3. AI Products Automated systems that collect & learn from data to make user facing decisions with ML

4. Open Source Foundations for AI

5. Beyond Code: Open Data, Papers • The mission of Papers With Code is to create a free and open resource with machine learning papers, code and evaluation tables. • https://PapersWithCode.com • https://arXiv.org • http://commoncrawl.org • https://voice.mozilla.org Source: @paperswithcode

6. Machine Learning is Still Difficult • Hard to know which models will work • ML needs lots of labelled training data • ML is prone to fail in unexpected ways • Development progress is often episodic and unpredictable • Many research & production systems still held together by duct tape, notebooks • Input data & dependencies change, so past model runs are difficult to reproduce

7. Machine Learning Dev Process 1. Problem definition and framing 2. Model design 3. Data collection, labelling, and cleaning 4. Feature engineering 5. Model training and offline validation 6. Model deployment, monitoring & online training Problem Definition Model Design Data Collection & Cleaning Feature Engineering Model Training Deployment & Monitoring

8. Better Tools for AI Developers Source: @mrogati

9. Data Quality & Standardization • Guide user input when you can • Use auto suggest fields vs. free text • Validate user inputs, emails • Collect user tags, votes, ratings • Track impressions, queries, clicks • Sessionize & de-identify logs • Disambiguate and annotate entities • GDPR, Compliance, retention Source: @brookLYNevery1

10. Training & Model Management Source: @OpenAI @weights_biases

11. Machine Learning Testing • Real world data changes over time • Model tests and benchmarks get stale • Maintain a suite of model input test fixtures • Test for silent failures & output shifts • Decouple testing of your data & infrastructure from testing of your ML code • Train models with sample data and run through entire ML pipeline Source: Machine Learning Testing: Survey, Landscapes and Horizons

12. Monitoring: Data, Features, Models • Diversity of ever-changing data sources • Data issues cascade through the ML stack • Features also change, drift, break over time • Model accuracy can decay over time • Need data monitoring to spot statistical changes in data inputs, anomalies • Unlike traditional applications, model performance is non-deterministic Source: @fiddlerlabs

13. Troubleshooting ML

14. Filling Out the AI Dev Stack Data & Model Versioning Data & Model Monitoring Model Training Model Evaluation Model Deployment Visualization & Exploration IDEs & Collaboration Data Labelling Feature Management Parameter Tuning Testing Debugging Profiling Explainability Data Anomaly Detection We need better tools for:

15.

16. More Resources • Redpoint’s ML Workflow Landscape • Awesome production machine learning • Berkeley’s Full Stack Deep Learning Bootcamp • O’Reilly: AI adoption is being fueled by an improved tool ecosystem • Machine Learning Testing: Survey, Landscapes and Horizons • What’s your ML Test Score? A rubric for ML production systems • AllenAI: Writing Code for NLP Research • Models and microservices should be running on the same continuous delivery stack • Google’s Rules of Machine Learning: Best Practices for ML Engineering • AWS Managing Machine Learning Projects

Managing Machines: The New AI Dev Stack

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to Managing Machines: The New AI Dev Stack

Similar to Managing Machines: The New AI Dev Stack (20)

More from Peter Skomoroch

More from Peter Skomoroch (13)

Recently uploaded

Recently uploaded (20)

Managing Machines: The New AI Dev Stack