SlideShare a Scribd company logo
Accelerate ML Lifecycle with
Kubernetes and Containerized
Data Science Tools
April 16th, 2020
1
Abhinav Joshi & Tushar Katarki
Red Hat
2
Abhinav Joshi
Senior Manager, Red Hat OpenShift Product Marketing
19+ yrs IT experience, 2 yrs at Red Hat, ex VMware, NetApp, Cisco
Email: abhjoshi@redhat.com
LinkedIn: https://www.linkedin.com/in/abhinavjoshi/
Tushar Katarki
Senior Manager, Red Hat OpenShift Product Management
20 yrs IT experience, 8 yrs at Red Hat, Ex Oracle/Sun, Polycom, etc
Email: tkatarki@redhat.com
LinkedIn: https://www.linkedin.com/in/katarki/
What we’ll
discuss today
3
● Desired AI/ML architecture & execution challenges
● Why containers, Kubernetes, and DevOps for AI/ML
● Enterprise Kubernetes Platform examples
● Real world deployment use cases
4
Desired Architecture and
execution challenges
5
AI/ML lifecycle and key personas
App developer
IT operations
Set
goals
Gather and
prepare data
Develop ML
model
Deploy ML
models in app
dev process
Implement
Apps &
Inference
ML models
Monitoring &
Management
Data engineer
Business
leadership
Data scientists
ML Engineer
ML/DL and DevOps Tools (e.g. TensorFlow, Jupyter Notebooks, Python, Seldon, etc.)
Desired Conceptual Architecture
6
ML/DL data pipeline and sources (databases, data lake, etc.)
Compute acceleration (GPU, FPGA, TPU)
Hybrid, multi cloud platform with self service capabilities
Set
goals
Gather and
prepare data
Develop ML
model
Deploy ML models
in app dev process
Implement
Apps & inference
ML models
monitoring &
management
Infrastructure
Virtual Private Public Hybrid EdgePhysical
AI/ML execution challenges
7
Lots of data is
collected, but finding
and preparing the right
data is difficult.
Readily usable data
lacking
Lack of key skills make it
difficult to find and
secure talent to
maintain operations.
Talent
shortage
No rapid availability of
infrastructure and
software tools slows data
scientists and developers
Unavailability of
infrastructure & software
Unable to implement
quickly due to slow,
manual and siloed
operations.
Lack of collaboration
across teams
Containers, Kubernetes, and DevOps can help!
What does a Data Scientist care about?
As a Data Scientist, I want a
“self-service cloud like” experience
for my Machine Learning projects,
where I can access a rich set of
modelling tools, data, and
computational resources, share and
collaborate with colleagues, and
deliver my work into production
with speed, agility and repeatability
to drive business value!
Self service portal to access ML
tools and access sources
ML model deployment
in app dev process
Data Scientists care less about infrastructure platform unless it integrates with their
ML tooling, and provides them the agility, flexibility, portability, & scalability.
8
ML Modelling / hardware
acceleration
Inferencing w/ hardware
acceleration
ML/DL and DevOps Tools (e.g. TensorFlow, Jupyter Notebooks, Python, Seldon, etc.)
Containers, Kubernetes, and DevOps as part of the Hybrid, Multi Cloud Platform
9
ML/DL data sources - databases (SQL, NoSQL, etc.), data lake, etc.
Compute acceleration (GPU, FPGA, TPU)
Hybrid, multi cloud platform with self service capabilities
Set
goals
Gather and
prepare data
Develop ML
model
Deploy ML models
in app dev process
Implement
Apps & inference
ML models
monitoring &
management
Infrastructure
Virtual Private Public Hybrid EdgePhysical
● Containers
● Kubernetes
● DevOps
10
Containers,
Kubernetes, and
DevOps help
accelerate your
AI/ML initiatives.
Why containers,
Kubernetes, and DevOps
for AI/ML?
Containers
Basic units that make AI/ML
programs shareable and portable
across hybrid cloud
Choice: Containers contain all your ML
frameworks and tools
Sharing: Container images can be shared and
iterated in flexible ways
Immutable & Portable: Contain once and run
them anywhere with integrity
Versioning: Incremental changes are tracked
Fast & Efficient: They are Linux processes!
Security: Process isolation and resource
control
Container Host Operating System
Container
App
Supporting Files &
Runtime
Container
App
...
Container
App
...
App
Kubernetes
▸ Centralizes compute resources and
provides a consistent experience across the
data center, cloud, and edge
▸ Resource management for compute
resources (including GPUs and FPGA)
▸ Workload scheduling and management
▸ Multi tenancy and quotas enforcement
▸ Networking and storage abstractions
Kubernetes is the de facto container
management platform for the hybrid, multi
cloudFoundation for the Hybrid, Multi Cloud
Platform w/ self service capabilities for
Data Scientists, Developers, etc.
Self-service,
Automation, CI/CD
Boosts speed, efficiency and
productivity
▸ Jupyter Notebooks running on Kubernetes form
the basis for self-service
▸ Source-2-image automatically converts a
notebook into a container image that is ready to
be deployed
▸ Kubernetes Operators provide automation and
lifecycle management for the containers
▸ CI/CD makes rapid, incremental and iterative
change possible; Open source technologies such
as Argo, Tekton, Jenkins and Spinnaker in
conjunction with Kubernetes make this happen
▸ ‘Serverless’ technologies such as Knative will
enable AI/ML users to spend more time
developing their models
Image source: https://www.brainvire.com/devops/
Data
Engineering
Easy, self-service and repeatable
Data sources: Kubernetes Persistent Volumes and
S3 object store makes access to storage easy and
standardized
Data pipes: Kubernetes Networking and
ServiceMesh provides the data connectivity - high
bandwidth, low latency that is secure
Data streaming and manipulation: Tools such as
Apache Spark, Kafka, Presto etc. can run natively and
can be accessed as a service
Data governance: With open source technologies
like Open Policy Agent (OPA)
Deploying into
production
To deliver business value and
redeem the promise of AI in the
enterprise
Containerize models and expose the service
with an REST API using the microservices
pattern - ServiceMesh (such as ISTIO) makes
this easy !
Models are incorporated in a data pipeline
Jobs (batch or real-time) with tools such as
Spark, Kafka and Argo
Models are delivered into existing
application workflow as binaries: PMML,
ONNX, Pickle
Monitoring model performance and drift
with open source tools native to Kubernetes i.e.
Prometheus and Grafana
CI/CD to drive continuous change and
improvement in production
15
Stitching all of this together into an…...
“Enterprise Kubernetes and Container Platform for AI/ML”
DatacenterLaptop
ANY
INFRASTRUCTURE
Intelligent
Applications
ENTERPRISE CONTAINER HOST(S)
CONTAINER ORCHESTRATION AND MANAGEMENT
(KUBERNETES)
AI/ML/DL & DevOps tool chain
Machine Learning
Modeling
Data Pipeline
Open Source community ML
toolkit for Kubernetes
16
17
Examples of Enterprise
Kubernetes and
Container Platforms
Red Hat OpenShift Kubernetes Platform
EXISTING
AUTOMATION
TOOLSETS
SCM
(GIT)
CI/CD
DATA SCIENTIST
Deploy ML on
any cloud
18
Developer Productivity
Cluster Services
Automated Ops ⠇Over-The-Air Updates ⠇Monitoring ⠇Logging ⠇Registry ⠇Networking ⠇Router ⠇KubeVirt ⠇OLM ⠇Helm
Red Hat Enterprise Linux & RHEL CoreOS
Kubernetes
Developer CLI ⠇VS Code
extensions ⠇IDE Plugins
Code Ready Workspaces
CodeReady Containers
Service Mesh ⠇Serverless
Builds ⠇CI/CD Pipelines
Full Stack Logging
Chargeback
Databases ⠇Languages
Runtimes ⠇Integration
Business Automation
100+ ISV Services
Platform Services Application Services Developer Services
Physical
Virtual Private cloud Public cloud
Build Cloud-Native AppsManage Workloads
Multi-cluster Management
Discovery ⠇Policy ⠇Compliance ⠇Configuration ⠇Workloads
Managed cloud
(Azure, AWS, IBM, Red Hat)
Windows Server
Nodes
Expose ML as
services, load
balanced and
scalable
Compute
Resources
on-demand
Best of SDLC
ML in
Production
Open source community project
● Open Source AI/ML Tooling
● Open source Red Hat
technologies e.g. OpenShift
● Automated deployment of open
source AI/ML tooling with
Kubernetes Operators
● https://www.opendatahub.io
Open Data Hub - “Data and AI Platform for the Hybrid Cloud”
Relationship between Kubeflow and Open Data Hub project
ML-as-a-service platform based on OpenShift,
Ceph storage, Kafka, JupyterHub and Spark
Home for k8s community to share
operators for various apps/tools
20
21
Examples of containerized data science on Red Hat OpenShift
Connected Drive &
Autonomous Driving
Data driven diagnosis
Data driven diagnosis
Democratize data science for oil
and gas exploration
Containerized Apache
Spark
Healthcare and public sector Automotive Financial Oil and gas
Discover Financial
ServicesJupyter notebooks as a service
Ministry of Defence (Israel)
RBC Bank
ML/DL with Jupyter Notebook on Enterprise Kubernetes
Platform
22
DATA SCIENTIST
ML/DL
Model
test &
iteration
Model deployed
into production via
Inference server
Data Sources Access to
multiple data
sources
Kubernetes Platform
Kubernetes Platform
Integrated
GPU access
Integrated
GPU access
Example Data Science Delivery Model on OpenShift
23
Source: https://assets.openshift.com/hubfs/OpenShift-Commons-SF-Agile-Data-Science-ExxonMobil.pdf
● kubeflow.org
● openshift.com/ai-ml
● opendatahub.io
24
RESOURCES
Summary
25
Containers, Kubernetes, &
DevOps can help
Agility, self-service, hybrid
cloud portability, scalability,
flexibility, automation
AI/ML benefits businesses
AI-powered intelligent
applications help achieve
key business goals, but
execution challenges
exists
Enterprise Kubernetes &
Container Platforms make it real
Allows leveraging the
benefits of containers,
Kubernetes, DevOps, and
accelerate delivery of
AI-powered apps
Abhinav Joshi: abhjoshi@redhat.com
Tushar Katarki: tkatarki@redhat.com
Thank you
26

More Related Content

What's hot

Leveraging IoT as part of your digital transformation
Leveraging IoT as part of your digital transformationLeveraging IoT as part of your digital transformation
Leveraging IoT as part of your digital transformation
John Archer
 
Big data + cloud computing glossary for community
Big data + cloud computing glossary for communityBig data + cloud computing glossary for community
Big data + cloud computing glossary for community
Kumar Chinnakali
 
Natalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platformNatalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platform
matteo mazzeri
 
Azure Digital Twins 2.0
Azure Digital Twins 2.0Azure Digital Twins 2.0
Azure Digital Twins 2.0
Marco Parenzan
 
Cloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive ApplicationsCloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive Applications
VMware Tanzu
 
Continus sql with sql stream builder
Continus sql with sql stream builderContinus sql with sql stream builder
Continus sql with sql stream builder
Timothy Spann
 
CNCF and Cloud Native Intro
CNCF and Cloud Native IntroCNCF and Cloud Native Intro
CNCF and Cloud Native Intro
Cloud Native Bangalore
 
Introducción al Machine Learning Automático
Introducción al Machine Learning AutomáticoIntroducción al Machine Learning Automático
Introducción al Machine Learning Automático
Sri Ambati
 
Quantum Computing: The next new technology in computing
Quantum Computing: The next new technology in computingQuantum Computing: The next new technology in computing
Quantum Computing: The next new technology in computing
Data Con LA
 
Very large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDLVery large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDL
DESMOND YUEN
 
What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3
DataWorks Summit
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
John Archer
 
Modernizing your Application Architecture with Microservices
Modernizing your Application Architecture with MicroservicesModernizing your Application Architecture with Microservices
Modernizing your Application Architecture with Microservices
confluent
 
Big data on google cloud
Big data on google cloudBig data on google cloud
Big data on google cloud
Tu Pham
 
Webinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containersWebinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containers
Mesosphere Inc.
 
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
DataWorks Summit
 
When HPC meet ML/DL: Manage HPC Data Center with Kubernetes
When HPC meet ML/DL: Manage HPC Data Center with KubernetesWhen HPC meet ML/DL: Manage HPC Data Center with Kubernetes
When HPC meet ML/DL: Manage HPC Data Center with Kubernetes
Yong Feng
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics Zoo
Jason Dai
 
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - HighlightsKubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
Krishna-Kumar
 
Hardware in Space
Hardware in SpaceHardware in Space
Hardware in Space
Alison B. Lowndes
 

What's hot (20)

Leveraging IoT as part of your digital transformation
Leveraging IoT as part of your digital transformationLeveraging IoT as part of your digital transformation
Leveraging IoT as part of your digital transformation
 
Big data + cloud computing glossary for community
Big data + cloud computing glossary for communityBig data + cloud computing glossary for community
Big data + cloud computing glossary for community
 
Natalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platformNatalie Godec - AirFlow and GCP: tomorrow's health service data platform
Natalie Godec - AirFlow and GCP: tomorrow's health service data platform
 
Azure Digital Twins 2.0
Azure Digital Twins 2.0Azure Digital Twins 2.0
Azure Digital Twins 2.0
 
Cloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive ApplicationsCloud-Native Patterns for Data-Intensive Applications
Cloud-Native Patterns for Data-Intensive Applications
 
Continus sql with sql stream builder
Continus sql with sql stream builderContinus sql with sql stream builder
Continus sql with sql stream builder
 
CNCF and Cloud Native Intro
CNCF and Cloud Native IntroCNCF and Cloud Native Intro
CNCF and Cloud Native Intro
 
Introducción al Machine Learning Automático
Introducción al Machine Learning AutomáticoIntroducción al Machine Learning Automático
Introducción al Machine Learning Automático
 
Quantum Computing: The next new technology in computing
Quantum Computing: The next new technology in computingQuantum Computing: The next new technology in computing
Quantum Computing: The next new technology in computing
 
Very large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDLVery large scale distributed deep learning on BigDL
Very large scale distributed deep learning on BigDL
 
What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3What’s new in Apache Spark 2.3
What’s new in Apache Spark 2.3
 
Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes Democratizing Data Science on Kubernetes
Democratizing Data Science on Kubernetes
 
Modernizing your Application Architecture with Microservices
Modernizing your Application Architecture with MicroservicesModernizing your Application Architecture with Microservices
Modernizing your Application Architecture with Microservices
 
Big data on google cloud
Big data on google cloudBig data on google cloud
Big data on google cloud
 
Webinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containersWebinar: Déployez facilement Kubernetes & vos containers
Webinar: Déployez facilement Kubernetes & vos containers
 
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
Making the Most of Data in Multiple Data Sources (with Virtual Data Lakes)
 
When HPC meet ML/DL: Manage HPC Data Center with Kubernetes
When HPC meet ML/DL: Manage HPC Data Center with KubernetesWhen HPC meet ML/DL: Manage HPC Data Center with Kubernetes
When HPC meet ML/DL: Manage HPC Data Center with Kubernetes
 
End-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics ZooEnd-to-End Big Data AI with Analytics Zoo
End-to-End Big Data AI with Analytics Zoo
 
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - HighlightsKubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
KubeCon + CloudNativeCon Barcelona and Shanghai 2019 - Highlights
 
Hardware in Space
Hardware in SpaceHardware in Space
Hardware in Space
 

Similar to ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Data Science Tools

DDDP 2019 - Brown to Green
DDDP 2019  - Brown to GreenDDDP 2019  - Brown to Green
DDDP 2019 - Brown to Green
John Archer
 
Red hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyRed hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategy
Orgad Kimchi
 
Infrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload DeploymentInfrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload Deployment
Databricks
 
NextGenML
NextGenML NextGenML
IBM RedHat OCP Vs xKS.pptx
IBM RedHat OCP Vs xKS.pptxIBM RedHat OCP Vs xKS.pptx
IBM RedHat OCP Vs xKS.pptx
ssuser666667
 
Building Cloud Native Applications with Oracle Autonomous Database.
Building Cloud Native Applications with Oracle Autonomous Database.Building Cloud Native Applications with Oracle Autonomous Database.
Building Cloud Native Applications with Oracle Autonomous Database.
Oracle Developers
 
03_aiops-1.pptx
03_aiops-1.pptx03_aiops-1.pptx
03_aiops-1.pptx
FarazulHoda2
 
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Animesh Singh
 
From OpenStack.... towards an Open cloud architecture
From OpenStack.... towards an Open cloud architecture From OpenStack.... towards an Open cloud architecture
From OpenStack.... towards an Open cloud architecture
Claude Riousset
 
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Akash Tandon
 
Simplify DevOps with Microservices and Mobile Backends.pptx
Simplify DevOps with Microservices and Mobile Backends.pptxSimplify DevOps with Microservices and Mobile Backends.pptx
Simplify DevOps with Microservices and Mobile Backends.pptx
ssuser5faa791
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
Provectus
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 edition
David Talby
 
OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)
ConSol Consulting & Solutions Software GmbH
 
The Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and StreamingThe Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and Streaming
Timothy Spann
 
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaSOpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
Daniel Krook
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
DataWorks Summit
 
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
ScyllaDB
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
Data Science Milan
 
CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018
Krishna-Kumar
 

Similar to ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Data Science Tools (20)

DDDP 2019 - Brown to Green
DDDP 2019  - Brown to GreenDDDP 2019  - Brown to Green
DDDP 2019 - Brown to Green
 
Red hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategyRed hat's updates on the cloud & infrastructure strategy
Red hat's updates on the cloud & infrastructure strategy
 
Infrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload DeploymentInfrastructure Agnostic Machine Learning Workload Deployment
Infrastructure Agnostic Machine Learning Workload Deployment
 
NextGenML
NextGenML NextGenML
NextGenML
 
IBM RedHat OCP Vs xKS.pptx
IBM RedHat OCP Vs xKS.pptxIBM RedHat OCP Vs xKS.pptx
IBM RedHat OCP Vs xKS.pptx
 
Building Cloud Native Applications with Oracle Autonomous Database.
Building Cloud Native Applications with Oracle Autonomous Database.Building Cloud Native Applications with Oracle Autonomous Database.
Building Cloud Native Applications with Oracle Autonomous Database.
 
03_aiops-1.pptx
03_aiops-1.pptx03_aiops-1.pptx
03_aiops-1.pptx
 
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]
 
From OpenStack.... towards an Open cloud architecture
From OpenStack.... towards an Open cloud architecture From OpenStack.... towards an Open cloud architecture
From OpenStack.... towards an Open cloud architecture
 
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
Kubeflow: portable and scalable machine learning using Jupyterhub and Kuberne...
 
Simplify DevOps with Microservices and Mobile Backends.pptx
Simplify DevOps with Microservices and Mobile Backends.pptxSimplify DevOps with Microservices and Mobile Backends.pptx
Simplify DevOps with Microservices and Mobile Backends.pptx
 
Feature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine LearningFeature Store as a Data Foundation for Machine Learning
Feature Store as a Data Foundation for Machine Learning
 
Architecting an Open Source AI Platform 2018 edition
Architecting an Open Source AI Platform   2018 editionArchitecting an Open Source AI Platform   2018 edition
Architecting an Open Source AI Platform 2018 edition
 
OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)OpenShift Meetup - Summit 2021 (Part 1)
OpenShift Meetup - Summit 2021 (Part 1)
 
The Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and StreamingThe Never Landing Stream with HTAP and Streaming
The Never Landing Stream with HTAP and Streaming
 
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaSOpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
OpenStack and Cloud Foundry - Pair the leading open source IaaS and PaaS
 
Scaling Data Science on Big Data
Scaling Data Science on Big DataScaling Data Science on Big Data
Scaling Data Science on Big Data
 
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
Simplifying the Creation of Machine Learning Workflow Pipelines for IoT Appli...
 
Serverless machine learning architectures at Helixa
Serverless machine learning architectures at HelixaServerless machine learning architectures at Helixa
Serverless machine learning architectures at Helixa
 
CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018CNCF Introduction - Feb 2018
CNCF Introduction - Feb 2018
 

Recently uploaded

FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
sonjaschweigert1
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
Peter Spielvogel
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
James Anderson
 
UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..
UiPathCommunity
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Aggregage
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
DianaGray10
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
Jen Stirrup
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
nkrafacyberclub
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
Sri Ambati
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 

Recently uploaded (20)

FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...A tale of scale & speed: How the US Navy is enabling software delivery from l...
A tale of scale & speed: How the US Navy is enabling software delivery from l...
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdfSAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
SAP Sapphire 2024 - ASUG301 building better apps with SAP Fiori.pdf
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
Alt. GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using ...
 
UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..UiPath Community Day Dubai: AI at Work..
UiPath Community Day Dubai: AI at Work..
 
Generative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to ProductionGenerative AI Deep Dive: Advancing from Proof of Concept to Production
Generative AI Deep Dive: Advancing from Proof of Concept to Production
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4UiPath Test Automation using UiPath Test Suite series, part 4
UiPath Test Automation using UiPath Test Suite series, part 4
 
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...The Metaverse and AI: how can decision-makers harness the Metaverse for their...
The Metaverse and AI: how can decision-makers harness the Metaverse for their...
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptxSecstrike : Reverse Engineering & Pwnable tools for CTF.pptx
Secstrike : Reverse Engineering & Pwnable tools for CTF.pptx
 
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in O...
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 

ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Data Science Tools

  • 1. Accelerate ML Lifecycle with Kubernetes and Containerized Data Science Tools April 16th, 2020 1 Abhinav Joshi & Tushar Katarki Red Hat
  • 2. 2 Abhinav Joshi Senior Manager, Red Hat OpenShift Product Marketing 19+ yrs IT experience, 2 yrs at Red Hat, ex VMware, NetApp, Cisco Email: abhjoshi@redhat.com LinkedIn: https://www.linkedin.com/in/abhinavjoshi/ Tushar Katarki Senior Manager, Red Hat OpenShift Product Management 20 yrs IT experience, 8 yrs at Red Hat, Ex Oracle/Sun, Polycom, etc Email: tkatarki@redhat.com LinkedIn: https://www.linkedin.com/in/katarki/
  • 3. What we’ll discuss today 3 ● Desired AI/ML architecture & execution challenges ● Why containers, Kubernetes, and DevOps for AI/ML ● Enterprise Kubernetes Platform examples ● Real world deployment use cases
  • 5. 5 AI/ML lifecycle and key personas App developer IT operations Set goals Gather and prepare data Develop ML model Deploy ML models in app dev process Implement Apps & Inference ML models Monitoring & Management Data engineer Business leadership Data scientists ML Engineer
  • 6. ML/DL and DevOps Tools (e.g. TensorFlow, Jupyter Notebooks, Python, Seldon, etc.) Desired Conceptual Architecture 6 ML/DL data pipeline and sources (databases, data lake, etc.) Compute acceleration (GPU, FPGA, TPU) Hybrid, multi cloud platform with self service capabilities Set goals Gather and prepare data Develop ML model Deploy ML models in app dev process Implement Apps & inference ML models monitoring & management Infrastructure Virtual Private Public Hybrid EdgePhysical
  • 7. AI/ML execution challenges 7 Lots of data is collected, but finding and preparing the right data is difficult. Readily usable data lacking Lack of key skills make it difficult to find and secure talent to maintain operations. Talent shortage No rapid availability of infrastructure and software tools slows data scientists and developers Unavailability of infrastructure & software Unable to implement quickly due to slow, manual and siloed operations. Lack of collaboration across teams Containers, Kubernetes, and DevOps can help!
  • 8. What does a Data Scientist care about? As a Data Scientist, I want a “self-service cloud like” experience for my Machine Learning projects, where I can access a rich set of modelling tools, data, and computational resources, share and collaborate with colleagues, and deliver my work into production with speed, agility and repeatability to drive business value! Self service portal to access ML tools and access sources ML model deployment in app dev process Data Scientists care less about infrastructure platform unless it integrates with their ML tooling, and provides them the agility, flexibility, portability, & scalability. 8 ML Modelling / hardware acceleration Inferencing w/ hardware acceleration
  • 9. ML/DL and DevOps Tools (e.g. TensorFlow, Jupyter Notebooks, Python, Seldon, etc.) Containers, Kubernetes, and DevOps as part of the Hybrid, Multi Cloud Platform 9 ML/DL data sources - databases (SQL, NoSQL, etc.), data lake, etc. Compute acceleration (GPU, FPGA, TPU) Hybrid, multi cloud platform with self service capabilities Set goals Gather and prepare data Develop ML model Deploy ML models in app dev process Implement Apps & inference ML models monitoring & management Infrastructure Virtual Private Public Hybrid EdgePhysical ● Containers ● Kubernetes ● DevOps
  • 10. 10 Containers, Kubernetes, and DevOps help accelerate your AI/ML initiatives. Why containers, Kubernetes, and DevOps for AI/ML?
  • 11. Containers Basic units that make AI/ML programs shareable and portable across hybrid cloud Choice: Containers contain all your ML frameworks and tools Sharing: Container images can be shared and iterated in flexible ways Immutable & Portable: Contain once and run them anywhere with integrity Versioning: Incremental changes are tracked Fast & Efficient: They are Linux processes! Security: Process isolation and resource control Container Host Operating System Container App Supporting Files & Runtime Container App ... Container App ... App
  • 12. Kubernetes ▸ Centralizes compute resources and provides a consistent experience across the data center, cloud, and edge ▸ Resource management for compute resources (including GPUs and FPGA) ▸ Workload scheduling and management ▸ Multi tenancy and quotas enforcement ▸ Networking and storage abstractions Kubernetes is the de facto container management platform for the hybrid, multi cloudFoundation for the Hybrid, Multi Cloud Platform w/ self service capabilities for Data Scientists, Developers, etc.
  • 13. Self-service, Automation, CI/CD Boosts speed, efficiency and productivity ▸ Jupyter Notebooks running on Kubernetes form the basis for self-service ▸ Source-2-image automatically converts a notebook into a container image that is ready to be deployed ▸ Kubernetes Operators provide automation and lifecycle management for the containers ▸ CI/CD makes rapid, incremental and iterative change possible; Open source technologies such as Argo, Tekton, Jenkins and Spinnaker in conjunction with Kubernetes make this happen ▸ ‘Serverless’ technologies such as Knative will enable AI/ML users to spend more time developing their models Image source: https://www.brainvire.com/devops/
  • 14. Data Engineering Easy, self-service and repeatable Data sources: Kubernetes Persistent Volumes and S3 object store makes access to storage easy and standardized Data pipes: Kubernetes Networking and ServiceMesh provides the data connectivity - high bandwidth, low latency that is secure Data streaming and manipulation: Tools such as Apache Spark, Kafka, Presto etc. can run natively and can be accessed as a service Data governance: With open source technologies like Open Policy Agent (OPA)
  • 15. Deploying into production To deliver business value and redeem the promise of AI in the enterprise Containerize models and expose the service with an REST API using the microservices pattern - ServiceMesh (such as ISTIO) makes this easy ! Models are incorporated in a data pipeline Jobs (batch or real-time) with tools such as Spark, Kafka and Argo Models are delivered into existing application workflow as binaries: PMML, ONNX, Pickle Monitoring model performance and drift with open source tools native to Kubernetes i.e. Prometheus and Grafana CI/CD to drive continuous change and improvement in production 15
  • 16. Stitching all of this together into an…... “Enterprise Kubernetes and Container Platform for AI/ML” DatacenterLaptop ANY INFRASTRUCTURE Intelligent Applications ENTERPRISE CONTAINER HOST(S) CONTAINER ORCHESTRATION AND MANAGEMENT (KUBERNETES) AI/ML/DL & DevOps tool chain Machine Learning Modeling Data Pipeline Open Source community ML toolkit for Kubernetes 16
  • 17. 17 Examples of Enterprise Kubernetes and Container Platforms
  • 18. Red Hat OpenShift Kubernetes Platform EXISTING AUTOMATION TOOLSETS SCM (GIT) CI/CD DATA SCIENTIST Deploy ML on any cloud 18 Developer Productivity Cluster Services Automated Ops ⠇Over-The-Air Updates ⠇Monitoring ⠇Logging ⠇Registry ⠇Networking ⠇Router ⠇KubeVirt ⠇OLM ⠇Helm Red Hat Enterprise Linux & RHEL CoreOS Kubernetes Developer CLI ⠇VS Code extensions ⠇IDE Plugins Code Ready Workspaces CodeReady Containers Service Mesh ⠇Serverless Builds ⠇CI/CD Pipelines Full Stack Logging Chargeback Databases ⠇Languages Runtimes ⠇Integration Business Automation 100+ ISV Services Platform Services Application Services Developer Services Physical Virtual Private cloud Public cloud Build Cloud-Native AppsManage Workloads Multi-cluster Management Discovery ⠇Policy ⠇Compliance ⠇Configuration ⠇Workloads Managed cloud (Azure, AWS, IBM, Red Hat) Windows Server Nodes Expose ML as services, load balanced and scalable Compute Resources on-demand Best of SDLC ML in Production
  • 19. Open source community project ● Open Source AI/ML Tooling ● Open source Red Hat technologies e.g. OpenShift ● Automated deployment of open source AI/ML tooling with Kubernetes Operators ● https://www.opendatahub.io Open Data Hub - “Data and AI Platform for the Hybrid Cloud”
  • 20. Relationship between Kubeflow and Open Data Hub project ML-as-a-service platform based on OpenShift, Ceph storage, Kafka, JupyterHub and Spark Home for k8s community to share operators for various apps/tools 20
  • 21. 21 Examples of containerized data science on Red Hat OpenShift Connected Drive & Autonomous Driving Data driven diagnosis Data driven diagnosis Democratize data science for oil and gas exploration Containerized Apache Spark Healthcare and public sector Automotive Financial Oil and gas Discover Financial ServicesJupyter notebooks as a service Ministry of Defence (Israel) RBC Bank
  • 22. ML/DL with Jupyter Notebook on Enterprise Kubernetes Platform 22 DATA SCIENTIST ML/DL Model test & iteration Model deployed into production via Inference server Data Sources Access to multiple data sources Kubernetes Platform Kubernetes Platform Integrated GPU access Integrated GPU access
  • 23. Example Data Science Delivery Model on OpenShift 23 Source: https://assets.openshift.com/hubfs/OpenShift-Commons-SF-Agile-Data-Science-ExxonMobil.pdf
  • 24. ● kubeflow.org ● openshift.com/ai-ml ● opendatahub.io 24 RESOURCES
  • 25. Summary 25 Containers, Kubernetes, & DevOps can help Agility, self-service, hybrid cloud portability, scalability, flexibility, automation AI/ML benefits businesses AI-powered intelligent applications help achieve key business goals, but execution challenges exists Enterprise Kubernetes & Container Platforms make it real Allows leveraging the benefits of containers, Kubernetes, DevOps, and accelerate delivery of AI-powered apps
  • 26. Abhinav Joshi: abhjoshi@redhat.com Tushar Katarki: tkatarki@redhat.com Thank you 26