SlideShare a Scribd company logo
Revamping
ML Pipelines
with MLOps
2
Presented by
Sameer Mahajan
Principal Architect
Sameer Mahajan has 25 years of experience in the
software industry. He has worked for companies
like Microsoft and Symantec across areas like
machine learning, storage, cloud, big data,
networking and analytics in the United States &
India.
Sameer holds 9 US patents and is an alumnus of IIT
Bombay and Georgia Tech. He not only conducts
hands-on workshops and seminars but also
participates in panel discussions in upcoming
technologies like machine learning and big data.
Sameer is one of the mentors for the Machine
Learning Foundations course at Coursera.
• Background
• ML Lifecycle
• Challenges with ML Productization
• Examples of end-to-end ML platforms
• MLOps Best Practices
• MLOps Methodologies
• Build, Retrain and Release Pipelines
• Mlflow and demo
Agenda
• Airflow demo
• Model Serving Pipeline
• Tensorflow Model Serving
• Tensorflow js demo
• TFX-based MLOps system on Google Cloud
• Azure MLOps
• Conclusion
• Q & A
• ML spend will reach $57.6 billion by 2021
• More and more ML systems are going into production
• Gartner 2019 Survey suggests that
i. 59% have AI deployed today
ii. The average number of deployed AI projects to increase to 35 by 2022
• Streamline ML lifecycle
• Machine Learning Operations
• Started traction in 2018
Background
ML Lifecycle
Process model – Option B
• Reality we are
trying to
model
• Source of data
Ingest data from
sensors, devices,
databases
• Cleanse and
transform data
• Signal processing
Visual analytics
to capture trends
indicative of
underlying
model processes
Use the models in
the real world
applications and
processes for
predictions, insights
etc.
Data Engineering
Data
Capturing
Data
Preparation
Data
Visualization
Machine
Learning
Train models that
reflect the real-
world phenomena
InferenceWorld
Challenges
• Dealing with data, models and code
• Deployment and automation
• Collaboration : data engineers, data
scientists, ML engineers, business analysts,
operations
• Continuous Integration (CI), Deployment
(CD), Training (CT)
• Reproducibility of results
• Transformations
• Hyperparameters
• Initializers
• Hardware
More Challenges
• Complex pipelines
1. Ensemble
2. Retraining
3. Transfer learning
4. Multiple prediction pipelines in
parallel (Canary)
• Self-updating ML pipelines
• Governance : tracing failed result back
to data or code
• Scalability
Examples of end-to-end ML platforms
1. Uber’s Michaelangelo
2. Facebook’s FBLearner
3. Google has TFX
4. Airbnb has BigHead
5. Databricks introduced mlflow which is now open source
6. Sagemaker
7. Azure
8. Datarobot
9. Polyaxon and KubeFlow
• Background
• ML Lifecycle
• Challenges with ML Productization
• Examples of end-to-end ML platforms
• MLOps Best Practices
• MLOps Methodologies
• Build, Retrain and Release Pipelines
• Mlflow and demo
Agenda
• Airflow demo
• Model Serving Pipeline
• Tensorflow Model Serving
• Tensorflow js demo
• TFX-based MLOps system on Google Cloud
• Azure MLOps
• Conclusion
• Q & A
Best Practices
• Data Pipeline: Discoverable and Accessible Data - data lake, data mesh
• Versioned control: github, Data Science Version Control (DVC), mlflow Projects
• Data Exploration: Jupyter,
pandas, numpy, seaborn
• ML: scikit-learn
• CI/CD: Jenkins
• Packaging: Docker
• Orchestrator: Airflow, Kubernetes
• Monitoring: ELK, Prometheus
Methodologies
1. Combination of DevOps (CI/CD), Software Engineering and ML
2. ML experiments are captured as runs
3. Each run captures all its steps, its data, parameters, hyper parameters, code,
initializers, model evaluations, artifacts like trained models and business results
after deployment
4. Packaging a model: container
Closer look at some pipelines
1. Build pipeline
• Triggered on schedule or when new code checked in / data becomes available
• Building code and running unit tests
• Data tests: schema and distribution conformance
2. Retrain pipeline
• Triggered on a schedule or when new data becomes available
• Train, evaluate and register model
3. Release pipeline
• Triggered every time a new artifact is available
• Package, test, deploy to production, start monitoring
• Background
• ML Lifecycle
• Challenges with ML Productization
• Examples of end-to-end ML platforms
• MLOps Best Practices
• MLOps Methodologies
• Build, Retrain and Release Pipelines
• Mlflow and demo
Agenda
• Airflow demo
• Model Serving Pipeline
• Tensorflow Model Serving
• Tensorflow js demo
• TFX-based MLOps system on Google Cloud
• Azure MLOps
• Conclusion
• Q & A
MLflow Tracking
Record and query
experiments: code, data,
config, and results
mlflow
MLflow Projects
Package data science code in
a format to reproduce runs on
any platform
MLflow Models
Deploy machine learning
models in diverse serving
environments
MLflow Registry
Store, annotate, discover,
and manage models in a
central repository
mlflow demo
Airflow demo
Model serving
Embedded model
1. Serialized pickle file
2. Language agnostic exchange formats like PMML, PFA and ONNX
3. H2O exports a POJO in a JAR
Separate service
1. Cloud providers’ tools and SDKs wrapping models
2. Kubeflow
3. mlflow models
Published as data
1. Typically used in streaming / real time scenarios
Tensorflow.js model serving demo
1. Open google chrome
2. Open chrome://apps/
3. Start web server
4. RockPaperScissorsTensorflow.jsDemo (based on a courser assignment)
5. Open http://127.0.0.1:8887 in chrome
6. Open developer tools
7. Demo retraining and predictions
• Background
• ML Lifecycle
• Challenges with ML Productization
• Examples of end-to-end ML platforms
• MLOps Best Practices
• MLOps Methodologies
• Build, Retrain and Release Pipelines
• Mlflow and demo
Agenda
• Airflow demo
• Model Serving Pipeline
• Tensorflow Model Serving
• Tensorflow js demo
• TFX-based MLOps system on Google Cloud
• Azure MLOps
• Conclusion
• Q & A
TFX-based MLOps system on Google Cloud
MLOps using Azure Machine Learning
• Evolving field
• Applying learning from other fields like DevOps, Software Engineering
• Taking holistic view
• Upcoming tools and practices
• Key in making ML productization successful
Conclusion
• https://www.brighttalk.com/webcast/6793/365540/machine-learning-operations-mlops-deploy-at-scale
• https://www.linkedin.com/pulse/mlops-101-modern-operation-machine-learning-payam-mokhtarian/
• https://en.wikipedia.org/wiki/MLOps
• https://cloud.google.com/solutions/machine-learning/architecture-for-mlops-using-tfx-kubeflow-pipelines-and-cloud-build
• https://docs.microsoft.com/en-us/samples/microsoft/mlopspython/mlops-with-azure-ml/
• https://ibm-cloud-architecture.github.io/refarch-data-ai-analytics/methodology/MLops/
• https://www.kdnuggets.com/2018/04/operational-machine-learning-successful-mlops.html
• https://www.pgs-soft.com/blog/more-effective-machine-learning-production-with-mlops/
• https://www.xenonstack.com/blog/mlops/
• https://towardsdatascience.com/mlops-reducing-the-technical-debt-of-machine-learning-dac528ef39de
• https://towardsdatascience.com/ml-ops-challenges-solutions-and-future-trends-d2e59b74dc6b
References
• https://www.bristoldatascientists.org/wp-content/uploads/sites/5/2020/02/Luke-The-Future-of-MLOps.pdf
• https://www.cognilytica.com/2020/03/03/ml-model-management-and-operations-2020-mlops/
• https://martinfowler.com/articles/data-monolith-to-mesh.html
• https://emilygorcenski.com/post/data-versioning/
• https://papers.nips.cc/paper/5656-hidden-technical-debt-in-machine-learning-systems.pdf
• https://martinfowler.com/articles/cd4ml.html
• https://mlflow.org/docs/latest/index.html
• https://airflow.apache.org/docs/stable/
• https://www.tensorflow.org/
• https://www.coursera.org/specializations/tensorflow-in-practice
• https://www.coursera.org/specializations/tensorflow-data-and-deployment
References (contd.)
Thank you!
sameer.mahajan@gslab.com
For more information please reach out to me at
https://in.linkedin.com/in/sameersmahajan

More Related Content

What's hot

MLOps with serverless architectures (October 2018)
MLOps with serverless architectures (October 2018)MLOps with serverless architectures (October 2018)
MLOps with serverless architectures (October 2018)
Julien SIMON
 
MLOps in action
MLOps in actionMLOps in action
MLOps in action
Pieter de Bruin
 
[AI] ML Operationalization with Microsoft Azure
[AI] ML Operationalization with Microsoft Azure[AI] ML Operationalization with Microsoft Azure
[AI] ML Operationalization with Microsoft Azure
Korkrid Akepanidtaworn
 
“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps
Rui Quintino
 
Richard Coffey (x18140785) - Research in Computing CA2
Richard Coffey (x18140785) - Research in Computing CA2Richard Coffey (x18140785) - Research in Computing CA2
Richard Coffey (x18140785) - Research in Computing CA2
Richard Coffey
 
Model versioning done right: A ModelDB 2.0 Walkthrough
Model versioning done right: A ModelDB 2.0 WalkthroughModel versioning done right: A ModelDB 2.0 Walkthrough
Model versioning done right: A ModelDB 2.0 Walkthrough
Manasi Vartak
 
Use MLflow to manage and deploy Machine Learning model on Spark
Use MLflow to manage and deploy Machine Learning model on Spark Use MLflow to manage and deploy Machine Learning model on Spark
Use MLflow to manage and deploy Machine Learning model on Spark
Herman Wu
 
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full LifecycleMLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
Databricks
 
The A-Z of Data: Introduction to MLOps
The A-Z of Data: Introduction to MLOpsThe A-Z of Data: Introduction to MLOps
The A-Z of Data: Introduction to MLOps
DataPhoenix
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
Provectus
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
Jordan Birdsell
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
Bill Liu
 
Continuous Delivery of ML-Enabled Pipelines on Databricks using MLflow
Continuous Delivery of ML-Enabled Pipelines on Databricks using MLflowContinuous Delivery of ML-Enabled Pipelines on Databricks using MLflow
Continuous Delivery of ML-Enabled Pipelines on Databricks using MLflow
Databricks
 
What's Next for MLflow in 2019
What's Next for MLflow in 2019What's Next for MLflow in 2019
What's Next for MLflow in 2019
Anyscale
 
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Databricks
 
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML PlatformHow to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
Databricks
 
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & KubeflowMLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
Jan Kirenz
 
Magdalena Stenius: MLOPS Will Change Machine Learning
Magdalena Stenius: MLOPS Will Change Machine LearningMagdalena Stenius: MLOPS Will Change Machine Learning
Magdalena Stenius: MLOPS Will Change Machine Learning
Lviv Startup Club
 
DAIS Europe Nov. 2020 presentation on MLflow Model Serving
DAIS Europe Nov. 2020 presentation on MLflow Model ServingDAIS Europe Nov. 2020 presentation on MLflow Model Serving
DAIS Europe Nov. 2020 presentation on MLflow Model Serving
amesar0
 
MLOps Using MLflow
MLOps Using MLflowMLOps Using MLflow
MLOps Using MLflow
Databricks
 

What's hot (20)

MLOps with serverless architectures (October 2018)
MLOps with serverless architectures (October 2018)MLOps with serverless architectures (October 2018)
MLOps with serverless architectures (October 2018)
 
MLOps in action
MLOps in actionMLOps in action
MLOps in action
 
[AI] ML Operationalization with Microsoft Azure
[AI] ML Operationalization with Microsoft Azure[AI] ML Operationalization with Microsoft Azure
[AI] ML Operationalization with Microsoft Azure
 
“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps
 
Richard Coffey (x18140785) - Research in Computing CA2
Richard Coffey (x18140785) - Research in Computing CA2Richard Coffey (x18140785) - Research in Computing CA2
Richard Coffey (x18140785) - Research in Computing CA2
 
Model versioning done right: A ModelDB 2.0 Walkthrough
Model versioning done right: A ModelDB 2.0 WalkthroughModel versioning done right: A ModelDB 2.0 Walkthrough
Model versioning done right: A ModelDB 2.0 Walkthrough
 
Use MLflow to manage and deploy Machine Learning model on Spark
Use MLflow to manage and deploy Machine Learning model on Spark Use MLflow to manage and deploy Machine Learning model on Spark
Use MLflow to manage and deploy Machine Learning model on Spark
 
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full LifecycleMLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
MLOps Virtual Event | Building Machine Learning Platforms for the Full Lifecycle
 
The A-Z of Data: Introduction to MLOps
The A-Z of Data: Introduction to MLOpsThe A-Z of Data: Introduction to MLOps
The A-Z of Data: Introduction to MLOps
 
MLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in ProductionMLOps and Data Quality: Deploying Reliable ML Models in Production
MLOps and Data Quality: Deploying Reliable ML Models in Production
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
 
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
AISF19 - Building Scalable, Kubernetes-Native ML/AI Pipelines with TFX, KubeF...
 
Continuous Delivery of ML-Enabled Pipelines on Databricks using MLflow
Continuous Delivery of ML-Enabled Pipelines on Databricks using MLflowContinuous Delivery of ML-Enabled Pipelines on Databricks using MLflow
Continuous Delivery of ML-Enabled Pipelines on Databricks using MLflow
 
What's Next for MLflow in 2019
What's Next for MLflow in 2019What's Next for MLflow in 2019
What's Next for MLflow in 2019
 
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
Developing ML-enabled Data Pipelines on Databricks using IDE & CI/CD at Runta...
 
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML PlatformHow to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
 
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & KubeflowMLOps - Build pipelines with Tensor Flow Extended & Kubeflow
MLOps - Build pipelines with Tensor Flow Extended & Kubeflow
 
Magdalena Stenius: MLOPS Will Change Machine Learning
Magdalena Stenius: MLOPS Will Change Machine LearningMagdalena Stenius: MLOPS Will Change Machine Learning
Magdalena Stenius: MLOPS Will Change Machine Learning
 
DAIS Europe Nov. 2020 presentation on MLflow Model Serving
DAIS Europe Nov. 2020 presentation on MLflow Model ServingDAIS Europe Nov. 2020 presentation on MLflow Model Serving
DAIS Europe Nov. 2020 presentation on MLflow Model Serving
 
MLOps Using MLflow
MLOps Using MLflowMLOps Using MLflow
MLOps Using MLflow
 

Similar to Nasscom ml ops webinar

MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
 MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ... MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
Databricks
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
Pythonsevilla2019 - Introduction to MLFlow
Pythonsevilla2019 - Introduction to MLFlowPythonsevilla2019 - Introduction to MLFlow
Pythonsevilla2019 - Introduction to MLFlow
Fernando Ortega Gallego
 
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
MLflow: Infrastructure for a Complete Machine Learning Life CycleMLflow: Infrastructure for a Complete Machine Learning Life Cycle
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
Databricks
 
Introduction to MLflow
Introduction to MLflowIntroduction to MLflow
Introduction to MLflow
Databricks
 
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
MLflow: Infrastructure for a Complete Machine Learning Life CycleMLflow: Infrastructure for a Complete Machine Learning Life Cycle
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
Databricks
 
Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...
Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...
Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...
Databricks
 
Scaling up Machine Learning Development
Scaling up Machine Learning DevelopmentScaling up Machine Learning Development
Scaling up Machine Learning Development
Matei Zaharia
 
MLflow with Databricks
MLflow with DatabricksMLflow with Databricks
MLflow with Databricks
Liangjun Jiang
 
Mlflow with databricks
Mlflow with databricksMlflow with databricks
Mlflow with databricks
Liangjun Jiang
 
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
DataScienceConferenc1
 
Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...
Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...
Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...
AbishekSubramanian2
 
A Collaborative Data Science Development Workflow
A Collaborative Data Science Development WorkflowA Collaborative Data Science Development Workflow
A Collaborative Data Science Development Workflow
Databricks
 
Experimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsExperimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOps
Databricks
 
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Sotrender
 
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
DataScienceConferenc1
 
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Databricks
 
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
Sandesh Rao
 
Legion - AI Runtime Platform
Legion -  AI Runtime PlatformLegion -  AI Runtime Platform
Legion - AI Runtime Platform
Alexey Kharlamov
 
MLOps with Kubeflow
MLOps with Kubeflow MLOps with Kubeflow
MLOps with Kubeflow
Saurabh Kaushik
 

Similar to Nasscom ml ops webinar (20)

MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
 MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ... MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
 
Pythonsevilla2019 - Introduction to MLFlow
Pythonsevilla2019 - Introduction to MLFlowPythonsevilla2019 - Introduction to MLFlow
Pythonsevilla2019 - Introduction to MLFlow
 
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
MLflow: Infrastructure for a Complete Machine Learning Life CycleMLflow: Infrastructure for a Complete Machine Learning Life Cycle
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
 
Introduction to MLflow
Introduction to MLflowIntroduction to MLflow
Introduction to MLflow
 
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
MLflow: Infrastructure for a Complete Machine Learning Life CycleMLflow: Infrastructure for a Complete Machine Learning Life Cycle
MLflow: Infrastructure for a Complete Machine Learning Life Cycle
 
Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...
Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...
Advanced MLflow: Multi-Step Workflows, Hyperparameter Tuning and Integrating ...
 
Scaling up Machine Learning Development
Scaling up Machine Learning DevelopmentScaling up Machine Learning Development
Scaling up Machine Learning Development
 
MLflow with Databricks
MLflow with DatabricksMLflow with Databricks
MLflow with Databricks
 
Mlflow with databricks
Mlflow with databricksMlflow with databricks
Mlflow with databricks
 
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
[DSC Europe 23] Milos Grubjesic Empowering Business with Pepsico s Advanced M...
 
Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...
Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...
Unlocking MLOps Potential: Streamlining Machine Learning Lifecycle with Datab...
 
A Collaborative Data Science Development Workflow
A Collaborative Data Science Development WorkflowA Collaborative Data Science Development Workflow
A Collaborative Data Science Development Workflow
 
Experimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOpsExperimentation to Industrialization: Implementing MLOps
Experimentation to Industrialization: Implementing MLOps
 
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
Trenowanie i wdrażanie modeli uczenia maszynowego z wykorzystaniem Google Clo...
 
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
[DSC Europe 23] Petar Zecevic - ML in Production on Databricks
 
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
Building a MLOps Platform Around MLflow to Enable Model Productionalization i...
 
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
AutoML - Heralding a New Era of Machine Learning - CASOUG Oct 2021
 
Legion - AI Runtime Platform
Legion -  AI Runtime PlatformLegion -  AI Runtime Platform
Legion - AI Runtime Platform
 
MLOps with Kubeflow
MLOps with Kubeflow MLOps with Kubeflow
MLOps with Kubeflow
 

More from Sameer Mahajan

blockchainpost.pptx
blockchainpost.pptxblockchainpost.pptx
blockchainpost.pptx
Sameer Mahajan
 
Crypto101.pptx
Crypto101.pptxCrypto101.pptx
Crypto101.pptx
Sameer Mahajan
 
Software Architect Track
Software Architect TrackSoftware Architect Track
Software Architect Track
Sameer Mahajan
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
Sameer Mahajan
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
Sameer Mahajan
 
Computer Networking 101
Computer Networking 101Computer Networking 101
Computer Networking 101
Sameer Mahajan
 
Apache spark
Apache sparkApache spark
Apache spark
Sameer Mahajan
 
Feature engineering
Feature engineeringFeature engineering
Feature engineering
Sameer Mahajan
 
Leapfrog in deep learning
Leapfrog in deep learningLeapfrog in deep learning
Leapfrog in deep learning
Sameer Mahajan
 

More from Sameer Mahajan (9)

blockchainpost.pptx
blockchainpost.pptxblockchainpost.pptx
blockchainpost.pptx
 
Crypto101.pptx
Crypto101.pptxCrypto101.pptx
Crypto101.pptx
 
Software Architect Track
Software Architect TrackSoftware Architect Track
Software Architect Track
 
Introduction to Machine Learning
Introduction to Machine LearningIntroduction to Machine Learning
Introduction to Machine Learning
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Computer Networking 101
Computer Networking 101Computer Networking 101
Computer Networking 101
 
Apache spark
Apache sparkApache spark
Apache spark
 
Feature engineering
Feature engineeringFeature engineering
Feature engineering
 
Leapfrog in deep learning
Leapfrog in deep learningLeapfrog in deep learning
Leapfrog in deep learning
 

Recently uploaded

Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems S.M.S.A.
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
KAMESHS29
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
Daiki Mogmet Ito
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
tolgahangng
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
innovationoecd
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
DianaGray10
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
panagenda
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Safe Software
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
Claudio Di Ciccio
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
Matthew Sinclair
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
Tomaz Bratanic
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
Zilliz
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
Kumud Singh
 

Recently uploaded (20)

Uni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdfUni Systems Copilot event_05062024_C.Vlachos.pdf
Uni Systems Copilot event_05062024_C.Vlachos.pdf
 
RESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for studentsRESUME BUILDER APPLICATION Project for students
RESUME BUILDER APPLICATION Project for students
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
How to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For FlutterHow to use Firebase Data Connect For Flutter
How to use Firebase Data Connect For Flutter
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
Serial Arm Control in Real Time Presentation
Serial Arm Control in Real Time PresentationSerial Arm Control in Real Time Presentation
Serial Arm Control in Real Time Presentation
 
Presentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of GermanyPresentation of the OECD Artificial Intelligence Review of Germany
Presentation of the OECD Artificial Intelligence Review of Germany
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1Communications Mining Series - Zero to Hero - Session 1
Communications Mining Series - Zero to Hero - Session 1
 
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAUHCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
HCL Notes und Domino Lizenzkostenreduzierung in der Welt von DLAU
 
Driving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success StoryDriving Business Innovation: Latest Generative AI Advancements & Success Story
Driving Business Innovation: Latest Generative AI Advancements & Success Story
 
“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”“I’m still / I’m still / Chaining from the Block”
“I’m still / I’m still / Chaining from the Block”
 
20240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 202420240609 QFM020 Irresponsible AI Reading List May 2024
20240609 QFM020 Irresponsible AI Reading List May 2024
 
GraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracyGraphRAG for Life Science to increase LLM accuracy
GraphRAG for Life Science to increase LLM accuracy
 
Building Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and MilvusBuilding Production Ready Search Pipelines with Spark and Milvus
Building Production Ready Search Pipelines with Spark and Milvus
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
Infrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI modelsInfrastructure Challenges in Scaling RAG with Custom AI models
Infrastructure Challenges in Scaling RAG with Custom AI models
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
Mind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AIMind map of terminologies used in context of Generative AI
Mind map of terminologies used in context of Generative AI
 

Nasscom ml ops webinar

  • 2. 2 Presented by Sameer Mahajan Principal Architect Sameer Mahajan has 25 years of experience in the software industry. He has worked for companies like Microsoft and Symantec across areas like machine learning, storage, cloud, big data, networking and analytics in the United States & India. Sameer holds 9 US patents and is an alumnus of IIT Bombay and Georgia Tech. He not only conducts hands-on workshops and seminars but also participates in panel discussions in upcoming technologies like machine learning and big data. Sameer is one of the mentors for the Machine Learning Foundations course at Coursera.
  • 3. • Background • ML Lifecycle • Challenges with ML Productization • Examples of end-to-end ML platforms • MLOps Best Practices • MLOps Methodologies • Build, Retrain and Release Pipelines • Mlflow and demo Agenda • Airflow demo • Model Serving Pipeline • Tensorflow Model Serving • Tensorflow js demo • TFX-based MLOps system on Google Cloud • Azure MLOps • Conclusion • Q & A
  • 4. • ML spend will reach $57.6 billion by 2021 • More and more ML systems are going into production • Gartner 2019 Survey suggests that i. 59% have AI deployed today ii. The average number of deployed AI projects to increase to 35 by 2022 • Streamline ML lifecycle • Machine Learning Operations • Started traction in 2018 Background
  • 6. Process model – Option B • Reality we are trying to model • Source of data Ingest data from sensors, devices, databases • Cleanse and transform data • Signal processing Visual analytics to capture trends indicative of underlying model processes Use the models in the real world applications and processes for predictions, insights etc. Data Engineering Data Capturing Data Preparation Data Visualization Machine Learning Train models that reflect the real- world phenomena InferenceWorld
  • 7. Challenges • Dealing with data, models and code • Deployment and automation • Collaboration : data engineers, data scientists, ML engineers, business analysts, operations • Continuous Integration (CI), Deployment (CD), Training (CT) • Reproducibility of results • Transformations • Hyperparameters • Initializers • Hardware
  • 8. More Challenges • Complex pipelines 1. Ensemble 2. Retraining 3. Transfer learning 4. Multiple prediction pipelines in parallel (Canary) • Self-updating ML pipelines • Governance : tracing failed result back to data or code • Scalability
  • 9. Examples of end-to-end ML platforms 1. Uber’s Michaelangelo 2. Facebook’s FBLearner 3. Google has TFX 4. Airbnb has BigHead 5. Databricks introduced mlflow which is now open source 6. Sagemaker 7. Azure 8. Datarobot 9. Polyaxon and KubeFlow
  • 10. • Background • ML Lifecycle • Challenges with ML Productization • Examples of end-to-end ML platforms • MLOps Best Practices • MLOps Methodologies • Build, Retrain and Release Pipelines • Mlflow and demo Agenda • Airflow demo • Model Serving Pipeline • Tensorflow Model Serving • Tensorflow js demo • TFX-based MLOps system on Google Cloud • Azure MLOps • Conclusion • Q & A
  • 11. Best Practices • Data Pipeline: Discoverable and Accessible Data - data lake, data mesh • Versioned control: github, Data Science Version Control (DVC), mlflow Projects • Data Exploration: Jupyter, pandas, numpy, seaborn • ML: scikit-learn • CI/CD: Jenkins • Packaging: Docker • Orchestrator: Airflow, Kubernetes • Monitoring: ELK, Prometheus
  • 12. Methodologies 1. Combination of DevOps (CI/CD), Software Engineering and ML 2. ML experiments are captured as runs 3. Each run captures all its steps, its data, parameters, hyper parameters, code, initializers, model evaluations, artifacts like trained models and business results after deployment 4. Packaging a model: container
  • 13. Closer look at some pipelines 1. Build pipeline • Triggered on schedule or when new code checked in / data becomes available • Building code and running unit tests • Data tests: schema and distribution conformance 2. Retrain pipeline • Triggered on a schedule or when new data becomes available • Train, evaluate and register model 3. Release pipeline • Triggered every time a new artifact is available • Package, test, deploy to production, start monitoring
  • 14. • Background • ML Lifecycle • Challenges with ML Productization • Examples of end-to-end ML platforms • MLOps Best Practices • MLOps Methodologies • Build, Retrain and Release Pipelines • Mlflow and demo Agenda • Airflow demo • Model Serving Pipeline • Tensorflow Model Serving • Tensorflow js demo • TFX-based MLOps system on Google Cloud • Azure MLOps • Conclusion • Q & A
  • 15. MLflow Tracking Record and query experiments: code, data, config, and results mlflow MLflow Projects Package data science code in a format to reproduce runs on any platform MLflow Models Deploy machine learning models in diverse serving environments MLflow Registry Store, annotate, discover, and manage models in a central repository
  • 18. Model serving Embedded model 1. Serialized pickle file 2. Language agnostic exchange formats like PMML, PFA and ONNX 3. H2O exports a POJO in a JAR Separate service 1. Cloud providers’ tools and SDKs wrapping models 2. Kubeflow 3. mlflow models Published as data 1. Typically used in streaming / real time scenarios
  • 19.
  • 20. Tensorflow.js model serving demo 1. Open google chrome 2. Open chrome://apps/ 3. Start web server 4. RockPaperScissorsTensorflow.jsDemo (based on a courser assignment) 5. Open http://127.0.0.1:8887 in chrome 6. Open developer tools 7. Demo retraining and predictions
  • 21. • Background • ML Lifecycle • Challenges with ML Productization • Examples of end-to-end ML platforms • MLOps Best Practices • MLOps Methodologies • Build, Retrain and Release Pipelines • Mlflow and demo Agenda • Airflow demo • Model Serving Pipeline • Tensorflow Model Serving • Tensorflow js demo • TFX-based MLOps system on Google Cloud • Azure MLOps • Conclusion • Q & A
  • 22. TFX-based MLOps system on Google Cloud
  • 23. MLOps using Azure Machine Learning
  • 24. • Evolving field • Applying learning from other fields like DevOps, Software Engineering • Taking holistic view • Upcoming tools and practices • Key in making ML productization successful Conclusion
  • 25. • https://www.brighttalk.com/webcast/6793/365540/machine-learning-operations-mlops-deploy-at-scale • https://www.linkedin.com/pulse/mlops-101-modern-operation-machine-learning-payam-mokhtarian/ • https://en.wikipedia.org/wiki/MLOps • https://cloud.google.com/solutions/machine-learning/architecture-for-mlops-using-tfx-kubeflow-pipelines-and-cloud-build • https://docs.microsoft.com/en-us/samples/microsoft/mlopspython/mlops-with-azure-ml/ • https://ibm-cloud-architecture.github.io/refarch-data-ai-analytics/methodology/MLops/ • https://www.kdnuggets.com/2018/04/operational-machine-learning-successful-mlops.html • https://www.pgs-soft.com/blog/more-effective-machine-learning-production-with-mlops/ • https://www.xenonstack.com/blog/mlops/ • https://towardsdatascience.com/mlops-reducing-the-technical-debt-of-machine-learning-dac528ef39de • https://towardsdatascience.com/ml-ops-challenges-solutions-and-future-trends-d2e59b74dc6b References
  • 26. • https://www.bristoldatascientists.org/wp-content/uploads/sites/5/2020/02/Luke-The-Future-of-MLOps.pdf • https://www.cognilytica.com/2020/03/03/ml-model-management-and-operations-2020-mlops/ • https://martinfowler.com/articles/data-monolith-to-mesh.html • https://emilygorcenski.com/post/data-versioning/ • https://papers.nips.cc/paper/5656-hidden-technical-debt-in-machine-learning-systems.pdf • https://martinfowler.com/articles/cd4ml.html • https://mlflow.org/docs/latest/index.html • https://airflow.apache.org/docs/stable/ • https://www.tensorflow.org/ • https://www.coursera.org/specializations/tensorflow-in-practice • https://www.coursera.org/specializations/tensorflow-data-and-deployment References (contd.)
  • 27. Thank you! sameer.mahajan@gslab.com For more information please reach out to me at https://in.linkedin.com/in/sameersmahajan