SlideShare a Scribd company logo
• PUBLIC 公開
Well Architected ML Platforms for
Data Science
Reliable Machine Learning Lifecycle
Goal Definition
Business Understanding
Data Understanding
Data Preparation
Modeling
Evaluation
Deployment
Maintenance
Outliers
Model Regression
Recall Feedback (Re-train)
Battle tested CRISP-DM Model for ML Implementations
• PUBLIC 公開
https://d1.awsstatic.com/whitepapers/architecture/wellarchitected-Machine-Learning-Lens.pdf
Platform Capabilities
1 Autoscaling Cloud-native ML/AI Platform with Identity Access
Management and Data Classification
2 Data collection and Curation for Business Analytics Reporting
and ML Model Preparation
3 Self-service Training and Experimentation for Forecasting and
Simulation
4
Prediction performance and Feedback from drift of forecast
Decisions
Insights
Data Lake
IAM
PaaS/DbaaS
5
ML as a Service Model Deployment
• PUBLIC 公開
• PUBLIC 公開
ML Ops Platform – Technical Architecture1
• PUBLIC 公開
ML Ops Platform – Technical Architecture2
Data Sources
Customer
Mart
Sales
Mart
Sales
Mart
Customer
Mart
RISKS
Mart
Sales
Mart
Account
Salesforce
SAS
Click
Stream
Speech
Telematic
s
Customer Bus Ops Risk Profitability Click Stream
Data Sourcing &
Data Wrangling
Data
Scientists
ML Engineers
AWS
Data and Model
Exploration
Spark Mlib
JupyterHub
Notebooks
SparkMagic
Livy
Rest
API
EMR Spark EMR Presto
Tensor flow
H2O
Sparkling
Water
Git Dvc ML Flow
Model Registry
(Versioning)
Training Validation
S3
S3
Data
Collection
Data
Curation
Model Repository
Model Inference
S3
Model Deployment
ML Ops Deployment pipeline
Jenkins
Advanced Analytics Engine
S3
Feature Engineering
Api
Release
Predict
Api
EKS
Jenkin
s
Click
Stream
• PUBLIC 公開
Heterogenous
data sources
Interoperability
and Integration
Schema
Management
using Parquet
Data discovery
Catalog
Data Lineage
and
Classification
Data
Governance and
Privacy
• PUBLIC 公開
•System Appreciation and
Discovery
•Gap Analysis
•Component Design
•Team structure and RACI
Plan
•Milestones and Timelines
•Infrastructure
•Data Systems
•Security
•Governance
Build •Socialize
•Demo
•Training
•Benchmark
•Pilot
Evangelize
•Transform
•Dual Support
•Bridge
•Decommission Legacy
Adopt •Release Management
•Warranty
•Operational Support
•Design Lifecycle
•Iterative Releases
Run
• PUBLIC 公開
• PUBLIC 公開
• PUBLIC 公開
DOCKERIZED FLASK
API
EKS CONTAINER
REGISTRY
REDUCE API BUILD
DEPENDENCIES
ENABLE FARGATE
SERVERLESS GATEWAY
• PUBLIC 公開
Streamlined Data collection
Version controlled Feature Engineering
Collaborative Discovery of features
Distributed Training with Validation
Reliable ML as a service
Prediction performance
Drift monitoring
Model governance and fairness
ML Ops Platform Goals
• PUBLIC 公開
1.Data Science
Teams
•Conceptualiz
ation
•Requirement
s
•Prototype
•Design
Review
1.ML Platform
Team
oJAD - Joint
Application
Design
oDesign
Approval
(JIRA)
oModel
Development
oCoding
oTraining
oCross
Validation
oAPI
Development
Team
oAPI
Requirements
oAPI Security
oAPI Catalog
oAPI
Integration
oDevops Team
oCICD for
Data pipelines
oCICD for
Training
oCICD for
Model API
oInfra Team
oEKS
oNetwork
oMonitoring
o*Governance
Team
oModel
Governance
oModel
Fairness
oModel
Monitoring
oData Science
Teams
oModel
monitoring
oModel
feedback
• PUBLIC 公開
1.Model
Conceptualization
oBusiness need
oMarket Research
o Customer Feedback
oBusiness value
1.Model
Requirements
oBusiness potential -
Inputs from Model
Conceptualization
oData sources
oCoordinating
Customer and
Technical requirements
oPrototype
1.Model Prototype
oData Collection
oData Curation
oFeature Engineering
oTraining and
Prediction
1.Model Design
Review
oJAD - Joint
Application Design
oDesign Approval
1.Model
Development
oOperational
dependencies
oIntegration
dependencies
oCoding
oContinuous
Training
oFeature Engineering
oTraining
oCross validation
oTracking
• PUBLIC 公開
1.Model
DevOps
oUnit tests
oCode coverage
oCI/CD
1.Model Quality
Assurance
oModel
performance
oSystem
performance
1.Model
Deployment
oCloud native
oContainerized
oData Pipeline
oML Service
oMonitoring
and Logging
1.Model
Integration
oService
oData Feed
oDashboard
and
Visualization
1.Model
Management
oModel
Repository
oModel Catalog
oModel
monitoring
oSimulation
1.Model
Governance
and Fairness
oAudit
oExplanation
oTrace back
1.Model
Feedback
oRecall and
Bias
oTraining
Outliers
oContinuous
Training
• PUBLIC 公開
Monitoring
Inputs and
decision outputs
Model fairness
characteristics
Overfitting,
skew, bias
Concept Drift
Outliers
Revised
Training
• PUBLIC 公開

More Related Content

What's hot

MLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at ScaleMLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at Scale
Databricks
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
Jordan Birdsell
 
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
SlideTeam
 
Emeli Dral (Evidently AI) – Analyze it: production monitoring for machine lea...
Emeli Dral (Evidently AI) – Analyze it: production monitoring for machine lea...Emeli Dral (Evidently AI) – Analyze it: production monitoring for machine lea...
Emeli Dral (Evidently AI) – Analyze it: production monitoring for machine lea...
Codiax
 
Databricks Overview for MLOps
Databricks Overview for MLOpsDatabricks Overview for MLOps
Databricks Overview for MLOps
Databricks
 
ETL Process
ETL ProcessETL Process
ETL Process
Rashmi Bhat
 
Ml ops intro session
Ml ops   intro sessionMl ops   intro session
Ml ops intro session
Avinash Patil
 
From Data Science to MLOps
From Data Science to MLOpsFrom Data Science to MLOps
From Data Science to MLOps
Carl W. Handlin
 
“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps
Rui Quintino
 
Introducing MLOps.pdf
Introducing MLOps.pdfIntroducing MLOps.pdf
Introducing MLOps.pdf
Dr. Anish Cheriyan (PhD)
 
Automated Machine Learning
Automated Machine LearningAutomated Machine Learning
Automated Machine Learning
Yuriy Guts
 
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
 MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ... MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
Databricks
 
Vertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsVertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflows
Márton Kodok
 
The relation in between ITIL, Cobit, Togaf and CMMI.
The relation in between ITIL, Cobit, Togaf and CMMI.The relation in between ITIL, Cobit, Togaf and CMMI.
The relation in between ITIL, Cobit, Togaf and CMMI.Mücahit Sidimi
 
NLP techniques for log analysis
NLP techniques for log analysisNLP techniques for log analysis
NLP techniques for log analysis
Jacob Perkins
 
Release Management
Release Management Release Management
Release Management
Vyom Labs
 
Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)
Adrien Blind
 
Ml ops past_present_future
Ml ops past_present_futureMl ops past_present_future
Ml ops past_present_future
Nisha Talagala
 
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future VisionMLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
BATbern
 
Machine Learning Operations & Azure
Machine Learning Operations & AzureMachine Learning Operations & Azure

What's hot (20)

MLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at ScaleMLOps Virtual Event: Automating ML at Scale
MLOps Virtual Event: Automating ML at Scale
 
MLOps - The Assembly Line of ML
MLOps - The Assembly Line of MLMLOps - The Assembly Line of ML
MLOps - The Assembly Line of ML
 
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
Devops Strategy Roadmap Lifecycle Ppt Powerpoint Presentation Slides Complete...
 
Emeli Dral (Evidently AI) – Analyze it: production monitoring for machine lea...
Emeli Dral (Evidently AI) – Analyze it: production monitoring for machine lea...Emeli Dral (Evidently AI) – Analyze it: production monitoring for machine lea...
Emeli Dral (Evidently AI) – Analyze it: production monitoring for machine lea...
 
Databricks Overview for MLOps
Databricks Overview for MLOpsDatabricks Overview for MLOps
Databricks Overview for MLOps
 
ETL Process
ETL ProcessETL Process
ETL Process
 
Ml ops intro session
Ml ops   intro sessionMl ops   intro session
Ml ops intro session
 
From Data Science to MLOps
From Data Science to MLOpsFrom Data Science to MLOps
From Data Science to MLOps
 
“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps“Houston, we have a model...” Introduction to MLOps
“Houston, we have a model...” Introduction to MLOps
 
Introducing MLOps.pdf
Introducing MLOps.pdfIntroducing MLOps.pdf
Introducing MLOps.pdf
 
Automated Machine Learning
Automated Machine LearningAutomated Machine Learning
Automated Machine Learning
 
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
 MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ... MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
MLflow: Infrastructure for a Complete Machine Learning Life Cycle with Mani ...
 
Vertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflowsVertex AI: Pipelines for your MLOps workflows
Vertex AI: Pipelines for your MLOps workflows
 
The relation in between ITIL, Cobit, Togaf and CMMI.
The relation in between ITIL, Cobit, Togaf and CMMI.The relation in between ITIL, Cobit, Togaf and CMMI.
The relation in between ITIL, Cobit, Togaf and CMMI.
 
NLP techniques for log analysis
NLP techniques for log analysisNLP techniques for log analysis
NLP techniques for log analysis
 
Release Management
Release Management Release Management
Release Management
 
Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)
 
Ml ops past_present_future
Ml ops past_present_futureMl ops past_present_future
Ml ops past_present_future
 
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future VisionMLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
MLOps journey at Swisscom: AI Use Cases, Architecture and Future Vision
 
Machine Learning Operations & Azure
Machine Learning Operations & AzureMachine Learning Operations & Azure
Machine Learning Operations & Azure
 

Similar to Well architected ML platforms for Enterprise Data Science

Operationalizing Machine Learning at Scale at Starbucks
Operationalizing Machine Learning at Scale at StarbucksOperationalizing Machine Learning at Scale at Starbucks
Operationalizing Machine Learning at Scale at Starbucks
Databricks
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
DATAVERSITY
 
.Net development with Azure Machine Learning (AzureML) Nov 2014
.Net development with Azure Machine Learning (AzureML) Nov 2014.Net development with Azure Machine Learning (AzureML) Nov 2014
.Net development with Azure Machine Learning (AzureML) Nov 2014
Mark Tabladillo
 
Marlabs Services Capabilities Overview
Marlabs Services Capabilities OverviewMarlabs Services Capabilities Overview
Marlabs Services Capabilities Overview
Marlabs
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
Karan Sachdeva
 
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Mark Tabladillo
 
Overview DYN365O
Overview DYN365OOverview DYN365O
Overview DYN365O
Juan Fabian
 
Mohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with KubeflowMohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with Kubeflow
Lviv Startup Club
 
Mohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with KubeflowMohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with Kubeflow
Edunomica
 
Building your first Analysis Services Tabular BI Semantic model with SQL Serv...
Building your first Analysis Services Tabular BI Semantic model with SQL Serv...Building your first Analysis Services Tabular BI Semantic model with SQL Serv...
Building your first Analysis Services Tabular BI Semantic model with SQL Serv...
Microsoft TechNet - Belgium and Luxembourg
 
Marlabs Capabilities Overview: Banking and Finance
Marlabs Capabilities Overview: Banking and Finance Marlabs Capabilities Overview: Banking and Finance
Marlabs Capabilities Overview: Banking and Finance
Marlabs
 
Marlabs Capabilities Overview: Telecom
Marlabs Capabilities Overview: Telecom Marlabs Capabilities Overview: Telecom
Marlabs Capabilities Overview: Telecom
Marlabs
 
Achieving Massive Concurrency & Sub-second Query Latency on Cloud Warehouses ...
Achieving Massive Concurrency & Sub-second Query Latency on Cloud Warehouses ...Achieving Massive Concurrency & Sub-second Query Latency on Cloud Warehouses ...
Achieving Massive Concurrency & Sub-second Query Latency on Cloud Warehouses ...
Alluxio, Inc.
 
AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics
Ruben Pertusa Lopez
 
Candra_CollinsCV112016
Candra_CollinsCV112016Candra_CollinsCV112016
Candra_CollinsCV112016Candra Collins
 
Marlabs Capabilities: Retail
Marlabs Capabilities: Retail Marlabs Capabilities: Retail
Marlabs Capabilities: Retail
Marlabs
 
Marlabs Capabilities Overview: Energy and Utilities
Marlabs Capabilities Overview: Energy and UtilitiesMarlabs Capabilities Overview: Energy and Utilities
Marlabs Capabilities Overview: Energy and Utilities
Marlabs
 
ICML'16 Scaling ML System@Twitter
ICML'16 Scaling ML System@TwitterICML'16 Scaling ML System@Twitter
ICML'16 Scaling ML System@TwitterJack Xiaojiang Guo
 
Marlabs Capabilities Overview: Education and Media - Publishing
Marlabs Capabilities Overview: Education and Media - Publishing Marlabs Capabilities Overview: Education and Media - Publishing
Marlabs Capabilities Overview: Education and Media - Publishing
Marlabs
 
Azure ml and dynamics 365
Azure ml and dynamics 365Azure ml and dynamics 365
Azure ml and dynamics 365
Jivtesh Singh
 

Similar to Well architected ML platforms for Enterprise Data Science (20)

Operationalizing Machine Learning at Scale at Starbucks
Operationalizing Machine Learning at Scale at StarbucksOperationalizing Machine Learning at Scale at Starbucks
Operationalizing Machine Learning at Scale at Starbucks
 
MLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive AdvantageMLOps – Applying DevOps to Competitive Advantage
MLOps – Applying DevOps to Competitive Advantage
 
.Net development with Azure Machine Learning (AzureML) Nov 2014
.Net development with Azure Machine Learning (AzureML) Nov 2014.Net development with Azure Machine Learning (AzureML) Nov 2014
.Net development with Azure Machine Learning (AzureML) Nov 2014
 
Marlabs Services Capabilities Overview
Marlabs Services Capabilities OverviewMarlabs Services Capabilities Overview
Marlabs Services Capabilities Overview
 
ICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data ScienceICP for Data- Enterprise platform for AI, ML and Data Science
ICP for Data- Enterprise platform for AI, ML and Data Science
 
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
Insider's introduction to microsoft azure machine learning: 201411 Seattle Bu...
 
Overview DYN365O
Overview DYN365OOverview DYN365O
Overview DYN365O
 
Mohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with KubeflowMohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with Kubeflow
 
Mohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with KubeflowMohamed Sabri: Operationalize machine learning with Kubeflow
Mohamed Sabri: Operationalize machine learning with Kubeflow
 
Building your first Analysis Services Tabular BI Semantic model with SQL Serv...
Building your first Analysis Services Tabular BI Semantic model with SQL Serv...Building your first Analysis Services Tabular BI Semantic model with SQL Serv...
Building your first Analysis Services Tabular BI Semantic model with SQL Serv...
 
Marlabs Capabilities Overview: Banking and Finance
Marlabs Capabilities Overview: Banking and Finance Marlabs Capabilities Overview: Banking and Finance
Marlabs Capabilities Overview: Banking and Finance
 
Marlabs Capabilities Overview: Telecom
Marlabs Capabilities Overview: Telecom Marlabs Capabilities Overview: Telecom
Marlabs Capabilities Overview: Telecom
 
Achieving Massive Concurrency & Sub-second Query Latency on Cloud Warehouses ...
Achieving Massive Concurrency & Sub-second Query Latency on Cloud Warehouses ...Achieving Massive Concurrency & Sub-second Query Latency on Cloud Warehouses ...
Achieving Massive Concurrency & Sub-second Query Latency on Cloud Warehouses ...
 
AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics AzureML Welcome to the future of Predictive Analytics
AzureML Welcome to the future of Predictive Analytics
 
Candra_CollinsCV112016
Candra_CollinsCV112016Candra_CollinsCV112016
Candra_CollinsCV112016
 
Marlabs Capabilities: Retail
Marlabs Capabilities: Retail Marlabs Capabilities: Retail
Marlabs Capabilities: Retail
 
Marlabs Capabilities Overview: Energy and Utilities
Marlabs Capabilities Overview: Energy and UtilitiesMarlabs Capabilities Overview: Energy and Utilities
Marlabs Capabilities Overview: Energy and Utilities
 
ICML'16 Scaling ML System@Twitter
ICML'16 Scaling ML System@TwitterICML'16 Scaling ML System@Twitter
ICML'16 Scaling ML System@Twitter
 
Marlabs Capabilities Overview: Education and Media - Publishing
Marlabs Capabilities Overview: Education and Media - Publishing Marlabs Capabilities Overview: Education and Media - Publishing
Marlabs Capabilities Overview: Education and Media - Publishing
 
Azure ml and dynamics 365
Azure ml and dynamics 365Azure ml and dynamics 365
Azure ml and dynamics 365
 

Recently uploaded

In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
Juraj Vysvader
 
Corporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMSCorporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMS
Tendenci - The Open Source AMS (Association Management Software)
 
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamOpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
takuyayamamoto1800
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
Donna Lenk
 
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdfEnhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Jay Das
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Globus
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
abdulrafaychaudhry
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
e20449
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Globus
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
NYGGS Automation Suite
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Shahin Sheidaei
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Globus
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
Ortus Solutions, Corp
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
Fermin Galan
 
May Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdfMay Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdf
Adele Miller
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
Globus
 
Large Language Models and the End of Programming
Large Language Models and the End of ProgrammingLarge Language Models and the End of Programming
Large Language Models and the End of Programming
Matt Welsh
 
2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx
Georgi Kodinov
 
Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
Ortus Solutions, Corp
 

Recently uploaded (20)

In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
In 2015, I used to write extensions for Joomla, WordPress, phpBB3, etc and I ...
 
Corporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMSCorporate Management | Session 3 of 3 | Tendenci AMS
Corporate Management | Session 3 of 3 | Tendenci AMS
 
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoamOpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
OpenFOAM solver for Helmholtz equation, helmholtzFoam / helmholtzBubbleFoam
 
Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"Navigating the Metaverse: A Journey into Virtual Evolution"
Navigating the Metaverse: A Journey into Virtual Evolution"
 
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdfEnhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
Enhancing Project Management Efficiency_ Leveraging AI Tools like ChatGPT.pdf
 
Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...Developing Distributed High-performance Computing Capabilities of an Open Sci...
Developing Distributed High-performance Computing Capabilities of an Open Sci...
 
Lecture 1 Introduction to games development
Lecture 1 Introduction to games developmentLecture 1 Introduction to games development
Lecture 1 Introduction to games development
 
Graphic Design Crash Course for beginners
Graphic Design Crash Course for beginnersGraphic Design Crash Course for beginners
Graphic Design Crash Course for beginners
 
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
Exploring Innovations in Data Repository Solutions - Insights from the U.S. G...
 
Enterprise Resource Planning System in Telangana
Enterprise Resource Planning System in TelanganaEnterprise Resource Planning System in Telangana
Enterprise Resource Planning System in Telangana
 
Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024Globus Connect Server Deep Dive - GlobusWorld 2024
Globus Connect Server Deep Dive - GlobusWorld 2024
 
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
Gamify Your Mind; The Secret Sauce to Delivering Success, Continuously Improv...
 
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
Innovating Inference - Remote Triggering of Large Language Models on HPC Clus...
 
BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024BoxLang: Review our Visionary Licenses of 2024
BoxLang: Review our Visionary Licenses of 2024
 
Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604Orion Context Broker introduction 20240604
Orion Context Broker introduction 20240604
 
May Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdfMay Marketo Masterclass, London MUG May 22 2024.pdf
May Marketo Masterclass, London MUG May 22 2024.pdf
 
Enhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdfEnhancing Research Orchestration Capabilities at ORNL.pdf
Enhancing Research Orchestration Capabilities at ORNL.pdf
 
Large Language Models and the End of Programming
Large Language Models and the End of ProgrammingLarge Language Models and the End of Programming
Large Language Models and the End of Programming
 
2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx2024 RoOUG Security model for the cloud.pptx
2024 RoOUG Security model for the cloud.pptx
 
Into the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdfInto the Box 2024 - Keynote Day 2 Slides.pdf
Into the Box 2024 - Keynote Day 2 Slides.pdf
 

Well architected ML platforms for Enterprise Data Science

  • 1. • PUBLIC 公開 Well Architected ML Platforms for Data Science Reliable Machine Learning Lifecycle
  • 2. Goal Definition Business Understanding Data Understanding Data Preparation Modeling Evaluation Deployment Maintenance Outliers Model Regression Recall Feedback (Re-train) Battle tested CRISP-DM Model for ML Implementations
  • 4. Platform Capabilities 1 Autoscaling Cloud-native ML/AI Platform with Identity Access Management and Data Classification 2 Data collection and Curation for Business Analytics Reporting and ML Model Preparation 3 Self-service Training and Experimentation for Forecasting and Simulation 4 Prediction performance and Feedback from drift of forecast Decisions Insights Data Lake IAM PaaS/DbaaS 5 ML as a Service Model Deployment
  • 6. • PUBLIC 公開 ML Ops Platform – Technical Architecture1
  • 7. • PUBLIC 公開 ML Ops Platform – Technical Architecture2 Data Sources Customer Mart Sales Mart Sales Mart Customer Mart RISKS Mart Sales Mart Account Salesforce SAS Click Stream Speech Telematic s Customer Bus Ops Risk Profitability Click Stream Data Sourcing & Data Wrangling Data Scientists ML Engineers AWS Data and Model Exploration Spark Mlib JupyterHub Notebooks SparkMagic Livy Rest API EMR Spark EMR Presto Tensor flow H2O Sparkling Water Git Dvc ML Flow Model Registry (Versioning) Training Validation S3 S3 Data Collection Data Curation Model Repository Model Inference S3 Model Deployment ML Ops Deployment pipeline Jenkins Advanced Analytics Engine S3 Feature Engineering Api Release Predict Api EKS Jenkin s Click Stream
  • 8. • PUBLIC 公開 Heterogenous data sources Interoperability and Integration Schema Management using Parquet Data discovery Catalog Data Lineage and Classification Data Governance and Privacy
  • 9. • PUBLIC 公開 •System Appreciation and Discovery •Gap Analysis •Component Design •Team structure and RACI Plan •Milestones and Timelines •Infrastructure •Data Systems •Security •Governance Build •Socialize •Demo •Training •Benchmark •Pilot Evangelize •Transform •Dual Support •Bridge •Decommission Legacy Adopt •Release Management •Warranty •Operational Support •Design Lifecycle •Iterative Releases Run
  • 12. • PUBLIC 公開 DOCKERIZED FLASK API EKS CONTAINER REGISTRY REDUCE API BUILD DEPENDENCIES ENABLE FARGATE SERVERLESS GATEWAY
  • 13. • PUBLIC 公開 Streamlined Data collection Version controlled Feature Engineering Collaborative Discovery of features Distributed Training with Validation Reliable ML as a service Prediction performance Drift monitoring Model governance and fairness ML Ops Platform Goals
  • 14. • PUBLIC 公開 1.Data Science Teams •Conceptualiz ation •Requirement s •Prototype •Design Review 1.ML Platform Team oJAD - Joint Application Design oDesign Approval (JIRA) oModel Development oCoding oTraining oCross Validation oAPI Development Team oAPI Requirements oAPI Security oAPI Catalog oAPI Integration oDevops Team oCICD for Data pipelines oCICD for Training oCICD for Model API oInfra Team oEKS oNetwork oMonitoring o*Governance Team oModel Governance oModel Fairness oModel Monitoring oData Science Teams oModel monitoring oModel feedback
  • 15. • PUBLIC 公開 1.Model Conceptualization oBusiness need oMarket Research o Customer Feedback oBusiness value 1.Model Requirements oBusiness potential - Inputs from Model Conceptualization oData sources oCoordinating Customer and Technical requirements oPrototype 1.Model Prototype oData Collection oData Curation oFeature Engineering oTraining and Prediction 1.Model Design Review oJAD - Joint Application Design oDesign Approval 1.Model Development oOperational dependencies oIntegration dependencies oCoding oContinuous Training oFeature Engineering oTraining oCross validation oTracking
  • 16. • PUBLIC 公開 1.Model DevOps oUnit tests oCode coverage oCI/CD 1.Model Quality Assurance oModel performance oSystem performance 1.Model Deployment oCloud native oContainerized oData Pipeline oML Service oMonitoring and Logging 1.Model Integration oService oData Feed oDashboard and Visualization 1.Model Management oModel Repository oModel Catalog oModel monitoring oSimulation 1.Model Governance and Fairness oAudit oExplanation oTrace back 1.Model Feedback oRecall and Bias oTraining Outliers oContinuous Training
  • 17. • PUBLIC 公開 Monitoring Inputs and decision outputs Model fairness characteristics Overfitting, skew, bias Concept Drift Outliers Revised Training