This document summarizes a presentation on using MLFlow and Kubernetes to build an enterprise machine learning platform. It covers the challenges that motivated the platform, such as the lack of model management and cumbersome deployments; a solution that abstracts data pipelines into modular components to standardize workflows; the use of MLFlow to package models and track experiments; and Kubernetes with Kubeflow to deploy models at scale. A demo shows model serving implemented with these tools.
2. Nick Pinckernell, Comcast Applied AI Research
Utilizing MLFlow and
Kubernetes to build an
Enterprise ML Platform
#UnifiedAnalytics #SparkAISummit
3. Topics
• Example of data pipeline abstraction
– Why: modular components and reuse are important for abstracting complex systems
• Ways to package and track ML projects and experiments
– Why: consistency and reproducibility are key for scale
• How Comcast uses Kubeflow to serve and deploy models and pipelines
– Why: a tangible example to help you brainstorm about your organization's requirements
4. Challenges and motivations
• Before, there was no
– model management or tracking
– standardization for model packaging or deployments
• Cumbersome deployment process
– Deployment required code rewrite from research to operations
– Days or weeks to deploy
• Response and tradeoff: restrict model complexity
5. Requirements
Minimum requirements from our organization
• Zero code refactoring or rewriting between research-ready models and production
• Easier experiment and model tracking
• Researchers need to deploy their own models
• A/B testing for quick model enhancement testing in production
• Ability to modularize and inject custom metrics and workflows at each step
6. Solution – existing technologies
[Diagram: how existing technologies fit together, from research through model serving to images and containers]
7. Data pipeline abstraction
• Determine use cases
• Identify commonalities for modularization
• Abstract interfaces
• Automate configuration
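The slide lists the steps but no code; a minimal sketch of what such a pipeline abstraction might look like is below. The `Stage`, `Normalize`, and `Pipeline` names are hypothetical illustrations, not Comcast's actual interfaces: each stage is a modular, configurable component behind a common interface, so pipelines can be assembled from reusable parts.

```python
from abc import ABC, abstractmethod
from typing import Any, Dict, List

class Stage(ABC):
    """One modular step in a data pipeline (hypothetical interface)."""
    def __init__(self, config: Dict[str, Any]):
        self.config = config  # automated configuration would populate this

    @abstractmethod
    def run(self, record: Dict[str, Any]) -> Dict[str, Any]:
        ...

class Normalize(Stage):
    """Example stage: scale a numeric field by a configured factor."""
    def run(self, record: Dict[str, Any]) -> Dict[str, Any]:
        field = self.config["field"]
        record[field] = record[field] / self.config["scale"]
        return record

class Pipeline:
    """Chains stages; each stage is reusable across use cases."""
    def __init__(self, stages: List[Stage]):
        self.stages = stages

    def run(self, record: Dict[str, Any]) -> Dict[str, Any]:
        for stage in self.stages:
            record = stage.run(record)
        return record

pipeline = Pipeline([Normalize({"field": "score", "scale": 10.0})])
print(pipeline.run({"score": 42.0}))  # {'score': 4.2}
```

Injecting custom metrics or workflows at each step (one of the requirements above) would amount to wrapping or subclassing `run`.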
21. Packaging and tracking
1. Researchers code and train models with Databricks and Spark
2. Experiments tracked with MLFlow
3. Packaging and model tracking with MLFlow and Kubeflow
• MLFlow standard packaging formats
• scikit-learn
• h2o
• TensorFlow
• more
29. Model serving with Kubeflow
Considerations and requirements
• Resilient
• Highly available
• Rate limiting
• Shadow deployments
• Auto-scaling (WIP)
Ambassador
http://www.getambassador.io
30. Throughput
• Static number of replicas
– Determined after constant and burst load testing with Locust
31. DEMO
A demonstration of
• MLFlow experiments
– Serving the chosen model
• Implementation of components
– Consumer pod
– Model pod
– Producer logic (to simulate real requests)
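The demo's components can be sketched in miniature. This stdlib-only illustration stands in for the real setup: `stub_model` plays the model pod, the consumer plays the consumer pod, and the producer simulates incoming requests; the actual pods would communicate over the network rather than an in-process queue, and the threshold model is invented for the example.

```python
import queue
import threading

def stub_model(features):
    """Stand-in for the model pod: a trivial threshold 'prediction'."""
    return 1 if sum(features) > 1.0 else 0

def producer(q, n):
    """Producer logic: simulate n incoming scoring requests."""
    for i in range(n):
        q.put({"id": i, "features": [0.4 * i, 0.3]})
    q.put(None)  # sentinel: no more requests

def consumer(q, results):
    """Consumer pod logic: pull requests and invoke the model."""
    while True:
        msg = q.get()
        if msg is None:
            break
        results[msg["id"]] = stub_model(msg["features"])

q = queue.Queue()
results = {}
t = threading.Thread(target=consumer, args=(q, results))
t.start()
producer(q, 4)
t.join()
print(results)  # {0: 0, 1: 0, 2: 1, 3: 1}
```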