MLflow: A Platform for Production Machine Learning
Matei Zaharia
Databricks and Stanford University
@matei_zaharia
ML in Production is Different from ML Research

ML Products:
• Focus: reliably solving a business problem
• Data is often the top challenge (for models, try many common ones)
• Must continuously deploy, monitor & retrain models to maintain quality
• Need new tools to enable this process! (reproducibility, monitoring, …)

ML Research & Courses:
• Focus: designing a good model
• Data is provided and ready to use (e.g. benchmark dataset)
• No need to deploy, monitor, or retrain
• Tools for model design & evaluation (e.g. TensorFlow, PyTorch, …)
Response: ML Platforms
Facebook FBLearner, Uber Michelangelo, Google TFX, …
+ Standardize the data prep / training / deploy cycle: if you work within the platform, you get these benefits
– Limited to a few algorithms or frameworks
– Tied to each company’s infrastructure
Can we provide similar benefits in an open manner?
MLflow: an open source machine learning platform
• Works with any ML library, algorithm, language, etc.
• Open interface design (use with any code you already have)

Components:
• Tracking: record and query experiments (code, data, configs, results)
• Projects: packaging format for reproducible runs and workflows
• Models: general format that standardizes deployment paths
• Model Registry (new): centralized model management, review & sharing
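As a taste of the Models component, any logged model loads behind the same Python interface regardless of the framework that produced it. A minimal sketch, assuming run_id refers to a run that logged a model under the artifact path "model":

import mlflow.pyfunc

# The same URI scheme and predict() call work for Keras, sklearn, and so on.
model = mlflow.pyfunc.load_model(f"runs:/{run_id}/model")
predictions = model.predict(input_df)  # input_df is a pandas DataFrame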
Community
• 158 contributors from >50 companies
• Integrated in RStudio, Azure ML, Faculty.ai, Neptune, Splice
• 900k downloads/month on PyPI
MLflow Tracking
Track parameters, metrics, output files & code version:

import mlflow

data = load_text(file)
ngrams = extract_ngrams(data, N=n)
model = train_model(ngrams, learning_rate=lr)
score = compute_accuracy(model)

mlflow.log_param("data_file", file)
mlflow.log_param("n", n)
mlflow.log_param("learning_rate", lr)
mlflow.log_metric("score", score)
mlflow.keras.log_model(model, "model")  # "model" is the artifact path

Browse the logged runs in the web UI:

$ mlflow ui
Alternatively, for supported libraries a single call records the same parameters, metrics, and model automatically, replacing the manual logging above:

mlflow.keras.autolog()
MLflow UI: Inspecting Runs
MLflow Model Registry
GitHub-like environment for organizing & reviewing models

[Diagram: model developers push models into the Model Registry, where reviewers and CI/CD tools vet them before downstream users and REST serving consume them]
Released in MLflow 1.4
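A minimal sketch of the Registry workflow through the Python API (the model name is hypothetical, and the stage-transition call assumes a recent MLflow release):

import mlflow
from mlflow.tracking import MlflowClient

# Register a logged model under a central name; each call creates a new version.
result = mlflow.register_model(f"runs:/{run_id}/model", "ChurnModel")

# After review, promote the new version to Production.
client = MlflowClient()
client.transition_model_version_stage(
    name="ChurnModel", version=result.version, stage="Production")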
Interesting MLflow Use Cases
1) Massive number of independent models
• Company wants to train a separate model for each {facility, chemical processing machine, household, …}
• Solution: a large Spark job that runs an AutoML library for each task, plus MLflow for managing & selecting models (see the sketch below)
• ML scientists can’t look at each model ⇒ need hands-free ML!

Example: millions of models trained on terabytes of data per day
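A minimal sketch of this pattern in PySpark (train_best_model is a hypothetical stand-in for the AutoML library, df is assumed to be a Spark DataFrame keyed by device_id, and applyInPandas requires Spark 3.x):

import mlflow
import pandas as pd

def train_one(pdf: pd.DataFrame) -> pd.DataFrame:
    # Fit one model on a single device's data (hypothetical AutoML helper).
    device = pdf["device_id"].iloc[0]
    model, score = train_best_model(pdf)
    # Log each model as its own MLflow run so the best ones can be queried later.
    with mlflow.start_run():
        mlflow.log_param("device_id", device)
        mlflow.log_metric("score", score)
        mlflow.sklearn.log_model(model, "model")
    return pd.DataFrame({"device_id": [device], "score": [score]})

# One model per device, trained in parallel across the cluster.
results = (df.groupBy("device_id")
             .applyInPandas(train_one, schema="device_id string, score double"))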
Interesting MLflow Use Cases
2) Big data analytics on model training results
• ML developer wants to analyze the results of multiple runs interactively, possibly slicing across data points
• Solution: Pandas & SQL interfaces to MLflow tracking data, e.g.:

df = mlflow.search_runs([experiment_id], "metrics.loss < 2.5")
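The returned object is an ordinary pandas DataFrame, so runs can be sliced and aggregated interactively. A short sketch (column names follow MLflow's params./metrics. prefixing; the grouping key is just an example):

import mlflow

# One row per run matching the filter, with params and metrics as columns.
df = mlflow.search_runs([experiment_id], "metrics.loss < 2.5")

# e.g. find the best loss achieved for each learning rate tried.
best = df.groupby("params.learning_rate")["metrics.loss"].min()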
Conclusion
Turning ML into reliable products is hard and requires a new class of systems (ML platforms)
Try MLflow at mlflow.org
Join the MLOps workshop at MLSys 2020
