SlideShare a Scribd company logo
1 of 61
Download to read offline
Hybrid Cloud, Kubeflow and Tensorflow
Extended [TFX]
Session by:
Hybrid Cloud, Kubeflow and TFX
Animesh Singh
Chief Architect
Data and AI Platform
CODAIT
Tommy Li
Software Engineer
CODAIT
Pete MacKinnon
Principal Software Engineer,
RedHat AICoE
Center for Open Source
Data and AI
Technologies (CODAIT)
Code – Build and improve practical frameworks to
enable more developers to realize immediate
value.
Content – Showcase solutions for complex and
real-world AI problems.
Community – Bring developers and data
scientists to engage with IBM
Improving Enterprise AI lifecycle in
Open Source
•  Team	contributes	to	over	10	open	source	projects	
•  Committers	in	Kubeflow,	Spark,	Tensorflow,	PyTorch,	ONNX…	
•  17	committers	and	many	contributors	in	Apache	projects	
•  Speakers	at	over	100	conferences,	meetups,	unconferences	and	more
CODAIT
codait.org
↳ codait.org
The Machine Learning Workflow
4
Perception
And the ML workflow spans teams …
…and is much more complex….
Data
cleansing
Data
analysis
Data
transformation
Data
validation
Data
splitting
Data
prep
Building
a model
Model
validation
Training
at scale
Model
creation
Deploying Serving
Monitoring &
Logging
Finetune &
improvements
Rollout
Training
optimization Model
Model
Data
Data
Data
ingestion
EdgeCloud
Hybrid Clouds, TFX and Kubeflow Pipelines
Hybrid
Cloud
Hybrid Cloud
Definition:
A cloud that is:
•  Inclusive of on-prem and public
•  Multicloud
•  Open
•  Secure
•  Managed
To scale enterprise workloads across the globe
Compute Network Storage
{Infrastructure
Cloud / On-Prem
10
Compute Network Storage
{Infrastructure
Cloud / On-Prem
{Platform Security Kubernetes DevOps CFApp Services
11
Compute Network Storage
{Infrastructure
Cloud / On-Prem
{Platform
Databases Analytics Governance
{Data
Security Kubernetes DevOps CFApp Services
12
Compute Network Storage
{Infrastructure
Cloud / On-Prem
{Platform
Databases Analytics Governance
{Data
Watson TensorFlow Machine learning
{AI
Security Kubernetes DevOps CFApp Services
13
Compute Network Storage
{Infrastructure
Cloud / On-Prem
{Platform
Databases Analytics Governance
TensorFlow Machine learning
Security Kubernetes DevOps CFApp Services
14
Watson
{Data
{AI
Compute Network Storage
{Infrastructure
Cloud / On-Prem
{Platform
Databases Analytics Governance
TensorFlow Machine learning
Public Cloud
Private Cloud
Security Kubernetes DevOps CFApp Services
{Infrastructure
{Platform
15
Watson
{Data
{AI
Hybrid Clouds, TFX and Kubeflow Pipelines
Private
Cloud
OpenShift is our Private Cloud for ML
Workloads
EXISTING
AUTOMATION
TOOLSETS
SCM
(GIT)
CI/CD
SERVICE LAYER
PERSISTEN
T
STORAGE
REGISTRY
RHEL
NODE
c
RHEL
NODE
RHEL
NODE
RHEL
NODE
RHEL
NODE
RHEL
NODE
C
C
C C
C
C
C CC C
RED HAT
ENTERPRISE LINUX
MASTER
API/AUTHENTICATION
DATA STORE
SCHEDULER
HEALTH/SCALING
PHYSICAL VIRTUAL PRIVATE PUBLIC HYBRID
DATA SCIENTIST
ML deployed across
clouds, data center,
and edge
ML services,
load balanced
and scaled
ML microservices
scheduled and
orchestrated on
shared resources
Best of SDLC
ML in Production
17
Enterprise
App
ML
Training
Data
Processing
ML
Inference
CI/CD
Pipelines
Data
Engineer
Data Scientist
Infra
Engineer
App
Developer
Red Hat
OpenShift Container Storage
Baremetal
Red Hat
OpenShift Container Platform
Persona Key
Red Hat
OpenStack
Platform
Multi cloud gateway
Analytics
RDBMS
Discovery
Ceph
OPEN DATA HUB
AI PLATFORM POWERED BY OPEN
SOURCE
Data Ingest
18
We are working across the Data and AI
Lifecycle
The Open Data Hub Project
●  OpenDataHub.io
●  Meta-operator that integrates best open source AI/ML/Data projects
●  Blueprint architecture for AI/ML on OpenShift
https://opendatahub.io/docs/architecture.html
Data
Acquisition & Preparation
ML Model
Selection, Training, Testing
ML Model Deployment in
App. Dev. Process
Open Data Hub Blueprint
20
Hybrid Clouds, TFX and Kubeflow Pipelines
Public
Cloud
IBM Cloud Architecture
Infrastructure
X86, Power CPU, GPU Compute Sleds Programmable Mesh Network Flash & Spinning Storage
Container Orchestration & Networking: Kubernetes
Red Hat OpenShift on IBM Cloud
Armada API
Carrier
Cluster
Workers
Master Master
Cluster
Workers
Red Hat OpenShift on IBM Cloud
Armada API
Carrier
Cluster
Workers
Master Master
Cluster
Workers
RH OpenShift
Carrier
Cluster
Workers
Master Master
Cluster
Workers
Red Hat OpenShift on IBM Cloud
Ubuntu Linux
ARMADA API
Containerd
Kubelet
(Community)
Calico Agent
Ubuntu Linux
CARRIER
WORKER
Containerd
Kubelet
(Community)
Calico Agent
Ubuntu Linux
Containerd
Kubelet
(Community)
Calico Agent
CLUSTER
WORKER
Red Hat OpenShift on IBM Cloud
Ubuntu Linux
ARMADA API
Containerd
Kubelet
(Community)
Calico Agent
Ubuntu Linux
CARRIER
WORKER
Containerd
Kubelet
(Community)
Calico Agent
Ubuntu Linux
Containerd
Kubelet
(Community)
Calico Agent
CLUSTER
WORKER
RHEL
REDHAT
CARRIER
CRI-O
Kubelet
(OpenShift)
Calico Agent
RHEL
CRI-O
Kubelet
(OpenShift)
Calico Agent
REDHAT
WORKER
Red Hat OpenShift on IBM Cloud
Ubuntu Linux
ARMADA API
Containerd
Kubelet
(Community)
Calico Agent
Ubuntu Linux
CARRIER
WORKER
Containerd
Kubelet
(Community)
Calico Agent
Ubuntu Linux
Containerd
Kubelet
(Community)
Calico Agent
CLUSTER
WORKER
RHEL
REDHAT
CARRIER
CRI-O
Kubelet
(OpenShift)
Calico Agent
RHEL
CRI-O
Kubelet
(OpenShift)
Calico Agent
REDHAT
WORKER
Hybrid Clouds, TFX and Kubeflow Pipelines
Kubeflow
Pipelines
29
Distributed Model Training and HPO (Katib, TFJob, PyTorch Job…)
●  Addresses One of the key goals for model builder
persona:
Distributed Model Training and Hyper parameter
optimization for Tensorflow, PyTorch etc.
●  Common problems in HP optimization
○  Overfitting
○  Wrong metrics
○  Too few hyperparameters
●  Katib: a fully open source, Kubernetes-native
hyperparameter tuning service
○  Inspired by Google Vizier
○  Framework agnostic
○  Extensible algorithms
Kubernetes
Compute cluster
GPU, TPU ,CPU
Cloud Object
Storage
Model Assets.
31
Istio
Knative
KFServing
Serving and Management: KFServing
Bringing the power of Knative and Istio for serverless Model deployments
PRE-
PROCESS
PREDICT POST-
PROCESS
EXPLAIN
Manages	the	hosting	aspects	of	your	models	
	
•  KFService	-	manages	the	lifecycle	of	models	
•  Configuration	-	manages	history	of	model	
deployments.	Two	configurations	for	default	
and	canary.	
•  Revision	-	A	snapshot	of	your	model	version	
•  Config	and	image
•  Route	-	Endpoint	and	network	traffic	
management
Route Default
Configuration		
Revision	1
Revision	M	90
%
KFService	
Canary
Configuration		
Revision	1
Revision	N	10
%
KFServing: Default and Canary Configurations
Kubeflow Pipelines
-  Released to Kubeflow in Nov 2018,
integrated into KF deployment CLI and
1-click-deploy-app
-  Aimed to bring
-  Orchestration for complex ML
workflows
-  Reproducible and reliable
experimentation
-  Bridging experimentation and
operationalization
-  Composition and reusable ML
components and pipelines
Kubeflow
Pipelines
Experiment Tracking
34
•  Kubeflow offers an easy way to compare different runs of the pipeline.
•  You can create the pipeline with model training. Then run it multiple times with different parameter values,
and you’ll get accuracy and ROC AUC scores for every run compared.
•  Lot more under “Compare runs” view.
What constitutes a Kubeflow ML Pipeline
§  Containerized implementations of ML Tasks
§  Pre-built components: Just provide params or code
snippets (e.g. training code)
§  Create your own components from code or libraries
§  Use any runtime, framework, data types
§  Attach k8s objects - volumes, secrets
§  Specification of the sequence of steps
§  Specified via Python DSL
§  Inferred from data dependencies on input/output
§  Input Parameters
§  A “Run” = Pipeline invoked w/ specific parameters
§  Can be cloned with different parameters
§  Schedules
§  Invoke a single run or create a recurring scheduled
pipeline
Define Pipeline with Python SDK
@dsl.pipeline(name='Taxi	Cab	Classification	Pipeline	Example’)	
def	taxi_cab_classification(	
				output_dir,		
				project,	
				Train_data						=	'gs://bucket/train.csv',	
				Evaluation_data	=	'gs://bucket/eval.csv',	
				Target										=	'tips',		
				Learning_rate			=	0.1,	hidden_layer_size	=	'100,50’,	steps=3000):	
	
				 	tfdv	 	 	=	TfdvOp(train_data,	evaluation_data,	project,	output_dir)	
				 	preprocess	 	=	PreprocessOp(train_data,	evaluation_data,	tfdv.output[“schema”],	project,	output_dir)	
				 	training	 	=	DnnTrainerOp(preprocess.output,	tfdv.schema,	learning_rate,	hidden_layer_size,	steps,		
target,	output_dir)	
				 	tfma	 	 	=	TfmaOp(training.output,	evaluation_data,	tfdv.schema,	project,	output_dir)	
				 	deploy	 	=	TfServingDeployerOp(training.output)	
Compile and Submit Pipeline Run
dsl.compile(taxi_cab_classification,		'tfx.tar.gz')	
run	=	client.run_pipeline(	
'tfx_run',	'tfx.tar.gz',	params={'output':	‘gs://dpa22’,	'project':	‘my-project-33’})
Creating your own components
-  Ways to build reusable components for pipelines
-  Create a container with your code and write either a ContainerOp() or shareable component descriptor
-  Turn your python code into a component directly in the notebook (with or without building a container)
-  These components can be exported into a shareable format
Container
Image
Execution
Code
ContainerOp
I/O schema
Code
ContainerOp
Container
Image
I/O schema
Container build
Container build
Component
descriptor
Watson AI Operations: Kubeflow Pipelines
e.g. Watson Speech to Text Operations Kubeflow pipeline
e.g. Watson Machine Learning and Watson OpenScale Pipeline
Hybrid Clouds, TFX and Kubeflow Pipelines
TFX
What is TFX?
TL;DR
●  TFX is a platform that to deploy Tensorflow models in production
●  TFX pipelines consist of a set of integrated components
●  TFX pipelines are configured using python
●  TFX consists of components, executors, and libraries
●  TFX components are optional (and repeated)
●  TFX can be configured to run in many different ways
42
TFX has existed externally as open source
libraries.
4343
Open sourced TFX libraries (circa 2018)
TensorFlow
Data Validation
TensorFlow
Transform
TensorFlow
Model Analysis
TensorFlow
Serving
In 2019, the horizontal layers that integrate
TFX libraries as one platform were open
sourced
4444
Open sourced TFX platform (2019)
Data
Ingestion
TensorFlow
Data Validation
TensorFlow
Transform
Estimator
or Keras
Model
TensorFlow
Model Analysis
TensorFlow
Serving
Logging
Shared Utilities for Garbage Collection, Data Access Controls
Pipeline Storage
Shared Configuration Framework and Job Orchestration
Integrated Frontend for Job Management, Monitoring, Debugging, Data/Model/Evaluation Visualization
Anatomy of a Component
TFX components consist of
three main pieces:
●  Driver
●  Executor
●  Publisher
45
Anatomy of a Component
TFX includes both libraries and pipeline components. This diagram illustrates the relationships
between TFX libraries and pipeline components. TFX provides several Python packages that are
the libraries which are used to create pipeline components
46
TFX (inside the box)
47
Other runtimes
ExampleGen
StatisticsGen SchemaGen
Example
Validator
Transform Trainer
Evaluator
Model
Validator
Pusher
TFX Config
Metadata Store
Training +
Eval Data
TensorFlow
Serving
TensorFlow
Hub
TensorFlow
Lite
TensorFlow
JS
TFX Pipeline
TFX uses ml-metadata for artifact
management.
48
Trainer
Task-Aware Pipelines
Input Data
Transformed
Data
Trained
Models
Serving
System
Task- and Data-Aware Pipelines
Pipeline + Metadata Storage
Training Data
Transform TrainerTransform
Snapshot of a component.
49
Metadata Store
Trainer
Config
Last Validated
Model
New (Candidate)
Model
New Model
Model
Validator
Validation
Outcome
Pusher
New (Candidate)
Model
Validation
Outcome
What’s in the Metadata Store?
50
Trained
Models
Type definitions of Artifacts and their Properties
E.g., Models, Data, Evaluation Metrics
Trainer Execution Records (Runs) of Components
E.g., Runtime Configuration, Inputs + Outputs
Lineage Tracking Across All Executions
E.g., to recurse back to all inputs of a specific artifact
Examples of Metadata-Powered Functionality.
51
Find out which data a model was trained on Compare previous model runs
Carry-over state from previous models Re-use previously computed outputs
Hybrid Clouds, TFX and Kubeflow Pipelines
demo
https://github.com/kubeflow/kfp-tekton/tree/master/samples/kfp-tfx
Center for Open-Source Data & AI Technologies (CODAIT) / June 28, 2019 / © 2019 IBM Corporation
Model Goal: Will the customer tip more or less than 20%?
Center for Open-Source Data & AI Technologies (CODAIT) / June 28, 2019 / © 2019 IBM Corporation
TFX Taxi Pipeline
Private Cloud Public Cloud
On-prem
PVC
KubeFlow Pipelines
Istio + Kiali
Kubeflow Serving
Trigger model
deployment
Object
Storage
Hybrid Clouds, TFX and Kubeflow Pipelines
Lessons
Learnt
Recommendations for TFX and KFP
•  TFX Pipelines shall be made executable without any dependency on public cloud service e.g. GCS.
•  Apache Beam is a strong dependency in TFX. Doesn’t support S3 natively
•  TFX DSL shall support dynamically creating Persistent Volume Claims.
•  Support for mixing and matching KFP ContainerOps components with TFX ones through DSL
•  IBM IKS runs Kubernetes with containerd, Openshift uses CRIO APIs. The underlying pipeline platforms
(Argo, Airflow, Beam etc) should support them as first class citizens
•  Visualizing artifacts on the KubeFlow Pipeline UI shall not be limited to GCS by default
•  Don’t assume root privileges on OpenShift and Kube, as well as underlying storage file system.
TFX and KFP DSL
Center for Open-Source Data & AI Technologies (CODAIT) / June 28, 2019 / © 2019 IBM Corporation
DEMO CODE:
https://github.com/kubeflow/kfp-tekton/tree/master/samples/kfp-tfx
RFC for TFX and KFP DSL Merge
https://docs.google.com/document/d/1_n3q0mNOr7gUSM04yaA0e5BO9RrS0Vkh1cNCyrB07WM/edit#
Please reach out at @AnimeshSingh for any follow-on discussions
Thank You!
Session by:
Center for Open-Source Data & AI Technologies (CODAIT) / June 28, 2019 / © 2019 IBM Corporation

More Related Content

What's hot

Meetup OpenTelemetry Intro
Meetup OpenTelemetry IntroMeetup OpenTelemetry Intro
Meetup OpenTelemetry IntroDimitrisFinas1
 
Introduction to kubernetes
Introduction to kubernetesIntroduction to kubernetes
Introduction to kubernetesMichal Cwienczek
 
Start Automating InfluxDB Deployments at the Edge with balena
Start Automating InfluxDB Deployments at the Edge with balena Start Automating InfluxDB Deployments at the Edge with balena
Start Automating InfluxDB Deployments at the Edge with balena InfluxData
 
Kubernetes Networking | Kubernetes Services, Pods & Ingress Networks | Kubern...
Kubernetes Networking | Kubernetes Services, Pods & Ingress Networks | Kubern...Kubernetes Networking | Kubernetes Services, Pods & Ingress Networks | Kubern...
Kubernetes Networking | Kubernetes Services, Pods & Ingress Networks | Kubern...Edureka!
 
Podman, Buildah, and Quarkus - The Latest in Linux Containers Technologies
Podman, Buildah, and Quarkus - The Latest in Linux Containers Technologies Podman, Buildah, and Quarkus - The Latest in Linux Containers Technologies
Podman, Buildah, and Quarkus - The Latest in Linux Containers Technologies Daniel Oh
 
OpenTelemetry Introduction
OpenTelemetry Introduction OpenTelemetry Introduction
OpenTelemetry Introduction DimitrisFinas1
 
Introduction to Terraform and Google Cloud Platform
Introduction to Terraform and Google Cloud PlatformIntroduction to Terraform and Google Cloud Platform
Introduction to Terraform and Google Cloud PlatformPradeep Bhadani
 
Introduction to openshift
Introduction to openshiftIntroduction to openshift
Introduction to openshiftMamathaBusi
 
Observability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetryObservability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetryEric D. Schabell
 
Infrastructure as Code with Terraform and Ansible
Infrastructure as Code with Terraform and AnsibleInfrastructure as Code with Terraform and Ansible
Infrastructure as Code with Terraform and AnsibleDevOps Meetup Bern
 
Getting Started with Infrastructure as Code
Getting Started with Infrastructure as CodeGetting Started with Infrastructure as Code
Getting Started with Infrastructure as CodeWinWire Technologies Inc
 
State of AI Report 2023 - ONLINE presentation
State of AI Report 2023 - ONLINE presentationState of AI Report 2023 - ONLINE presentation
State of AI Report 2023 - ONLINE presentationssuser2750ef
 
Open shift 4 infra deep dive
Open shift 4    infra deep diveOpen shift 4    infra deep dive
Open shift 4 infra deep diveWinton Winton
 
OpenShift 4 installation
OpenShift 4 installationOpenShift 4 installation
OpenShift 4 installationRobert Bohne
 
Session on API auto scaling, monitoring and Log management
Session on API auto scaling, monitoring and Log managementSession on API auto scaling, monitoring and Log management
Session on API auto scaling, monitoring and Log managementpqrs1234
 
Kubernetes Networking
Kubernetes NetworkingKubernetes Networking
Kubernetes NetworkingCJ Cullen
 
Kubeflow Pipelines (with Tekton)
Kubeflow Pipelines (with Tekton)Kubeflow Pipelines (with Tekton)
Kubeflow Pipelines (with Tekton)Animesh Singh
 
Deep Dive into Kubernetes - Part 1
Deep Dive into Kubernetes - Part 1Deep Dive into Kubernetes - Part 1
Deep Dive into Kubernetes - Part 1Imesh Gunaratne
 

What's hot (20)

Meetup OpenTelemetry Intro
Meetup OpenTelemetry IntroMeetup OpenTelemetry Intro
Meetup OpenTelemetry Intro
 
Introduction to kubernetes
Introduction to kubernetesIntroduction to kubernetes
Introduction to kubernetes
 
Start Automating InfluxDB Deployments at the Edge with balena
Start Automating InfluxDB Deployments at the Edge with balena Start Automating InfluxDB Deployments at the Edge with balena
Start Automating InfluxDB Deployments at the Edge with balena
 
Quick introduction to Kubernetes
Quick introduction to KubernetesQuick introduction to Kubernetes
Quick introduction to Kubernetes
 
Kubernetes Networking | Kubernetes Services, Pods & Ingress Networks | Kubern...
Kubernetes Networking | Kubernetes Services, Pods & Ingress Networks | Kubern...Kubernetes Networking | Kubernetes Services, Pods & Ingress Networks | Kubern...
Kubernetes Networking | Kubernetes Services, Pods & Ingress Networks | Kubern...
 
Podman, Buildah, and Quarkus - The Latest in Linux Containers Technologies
Podman, Buildah, and Quarkus - The Latest in Linux Containers Technologies Podman, Buildah, and Quarkus - The Latest in Linux Containers Technologies
Podman, Buildah, and Quarkus - The Latest in Linux Containers Technologies
 
OpenTelemetry Introduction
OpenTelemetry Introduction OpenTelemetry Introduction
OpenTelemetry Introduction
 
Kubernetes 101
Kubernetes 101Kubernetes 101
Kubernetes 101
 
Introduction to Terraform and Google Cloud Platform
Introduction to Terraform and Google Cloud PlatformIntroduction to Terraform and Google Cloud Platform
Introduction to Terraform and Google Cloud Platform
 
Introduction to openshift
Introduction to openshiftIntroduction to openshift
Introduction to openshift
 
Observability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetryObservability For You and Me with OpenTelemetry
Observability For You and Me with OpenTelemetry
 
Infrastructure as Code with Terraform and Ansible
Infrastructure as Code with Terraform and AnsibleInfrastructure as Code with Terraform and Ansible
Infrastructure as Code with Terraform and Ansible
 
Getting Started with Infrastructure as Code
Getting Started with Infrastructure as CodeGetting Started with Infrastructure as Code
Getting Started with Infrastructure as Code
 
State of AI Report 2023 - ONLINE presentation
State of AI Report 2023 - ONLINE presentationState of AI Report 2023 - ONLINE presentation
State of AI Report 2023 - ONLINE presentation
 
Open shift 4 infra deep dive
Open shift 4    infra deep diveOpen shift 4    infra deep dive
Open shift 4 infra deep dive
 
OpenShift 4 installation
OpenShift 4 installationOpenShift 4 installation
OpenShift 4 installation
 
Session on API auto scaling, monitoring and Log management
Session on API auto scaling, monitoring and Log managementSession on API auto scaling, monitoring and Log management
Session on API auto scaling, monitoring and Log management
 
Kubernetes Networking
Kubernetes NetworkingKubernetes Networking
Kubernetes Networking
 
Kubeflow Pipelines (with Tekton)
Kubeflow Pipelines (with Tekton)Kubeflow Pipelines (with Tekton)
Kubeflow Pipelines (with Tekton)
 
Deep Dive into Kubernetes - Part 1
Deep Dive into Kubernetes - Part 1Deep Dive into Kubernetes - Part 1
Deep Dive into Kubernetes - Part 1
 

Similar to Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]

ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...Abhinav Joshi
 
Building an MLOps Stack for Companies at Reasonable Scale
Building an MLOps Stack for Companies at Reasonable ScaleBuilding an MLOps Stack for Companies at Reasonable Scale
Building an MLOps Stack for Companies at Reasonable ScaleMerelda
 
Advanced Model Inferencing leveraging Kubeflow Serving, KNative and Istio
Advanced Model Inferencing leveraging Kubeflow Serving, KNative and IstioAdvanced Model Inferencing leveraging Kubeflow Serving, KNative and Istio
Advanced Model Inferencing leveraging Kubeflow Serving, KNative and IstioAnimesh Singh
 
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...Henry Saputra
 
Containerized architectures for deep learning
Containerized architectures for deep learningContainerized architectures for deep learning
Containerized architectures for deep learningAntje Barth
 
Red Hat and kubernetes: awesome stuff coming your way
Red Hat and kubernetes:  awesome stuff coming your wayRed Hat and kubernetes:  awesome stuff coming your way
Red Hat and kubernetes: awesome stuff coming your wayJohannes Brännström
 
IBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptx
IBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptxIBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptx
IBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptxGeorg Ember
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLNordic APIs
 
Kostiantyn Bokhan, N-iX. CD4ML based on Azure and Kubeflow
Kostiantyn Bokhan, N-iX. CD4ML based on Azure and KubeflowKostiantyn Bokhan, N-iX. CD4ML based on Azure and Kubeflow
Kostiantyn Bokhan, N-iX. CD4ML based on Azure and KubeflowIT Arena
 
AWS re:Invent 2016: Deploying Scalable SAP Hybris Clusters using Docker (CON312)
AWS re:Invent 2016: Deploying Scalable SAP Hybris Clusters using Docker (CON312)AWS re:Invent 2016: Deploying Scalable SAP Hybris Clusters using Docker (CON312)
AWS re:Invent 2016: Deploying Scalable SAP Hybris Clusters using Docker (CON312)Amazon Web Services
 
StrongLoop Overview
StrongLoop OverviewStrongLoop Overview
StrongLoop OverviewShubhra Kar
 
AI & Machine Learning Pipelines with Knative
AI & Machine Learning Pipelines with KnativeAI & Machine Learning Pipelines with Knative
AI & Machine Learning Pipelines with KnativeAnimesh Singh
 
Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)
Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)
Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)QAware GmbH
 
Scientific Computing @ Fred Hutch
Scientific Computing @ Fred HutchScientific Computing @ Fred Hutch
Scientific Computing @ Fred HutchDirk Petersen
 
28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-PipelinesTimothy Spann
 
Oscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectOscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectPatrick Chanezon
 
Running Apache Spark Jobs Using Kubernetes
Running Apache Spark Jobs Using KubernetesRunning Apache Spark Jobs Using Kubernetes
Running Apache Spark Jobs Using KubernetesDatabricks
 

Similar to Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX] (20)

ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...ODSC East 2020   Accelerate ML Lifecycle with Kubernetes and Containerized Da...
ODSC East 2020 Accelerate ML Lifecycle with Kubernetes and Containerized Da...
 
Building an MLOps Stack for Companies at Reasonable Scale
Building an MLOps Stack for Companies at Reasonable ScaleBuilding an MLOps Stack for Companies at Reasonable Scale
Building an MLOps Stack for Companies at Reasonable Scale
 
Advanced Model Inferencing leveraging Kubeflow Serving, KNative and Istio
Advanced Model Inferencing leveraging Kubeflow Serving, KNative and IstioAdvanced Model Inferencing leveraging Kubeflow Serving, KNative and Istio
Advanced Model Inferencing leveraging Kubeflow Serving, KNative and Istio
 
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
S8277 - Introducing Krylov: AI Platform that Empowers eBay Data Science and E...
 
Containerized architectures for deep learning
Containerized architectures for deep learningContainerized architectures for deep learning
Containerized architectures for deep learning
 
Red Hat and kubernetes: awesome stuff coming your way
Red Hat and kubernetes:  awesome stuff coming your wayRed Hat and kubernetes:  awesome stuff coming your way
Red Hat and kubernetes: awesome stuff coming your way
 
IBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptx
IBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptxIBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptx
IBM BP Session - Multiple CLoud Paks and Cloud Paks Foundational Services.pptx
 
OS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of MLOS for AI: Elastic Microservices & the Next Gen of ML
OS for AI: Elastic Microservices & the Next Gen of ML
 
NextGenML
NextGenML NextGenML
NextGenML
 
Cloud computing: highlights
Cloud computing: highlightsCloud computing: highlights
Cloud computing: highlights
 
Kostiantyn Bokhan, N-iX. CD4ML based on Azure and Kubeflow
Kostiantyn Bokhan, N-iX. CD4ML based on Azure and KubeflowKostiantyn Bokhan, N-iX. CD4ML based on Azure and Kubeflow
Kostiantyn Bokhan, N-iX. CD4ML based on Azure and Kubeflow
 
AWS re:Invent 2016: Deploying Scalable SAP Hybris Clusters using Docker (CON312)
AWS re:Invent 2016: Deploying Scalable SAP Hybris Clusters using Docker (CON312)AWS re:Invent 2016: Deploying Scalable SAP Hybris Clusters using Docker (CON312)
AWS re:Invent 2016: Deploying Scalable SAP Hybris Clusters using Docker (CON312)
 
ansible_rhel_90.pdf
ansible_rhel_90.pdfansible_rhel_90.pdf
ansible_rhel_90.pdf
 
StrongLoop Overview
StrongLoop OverviewStrongLoop Overview
StrongLoop Overview
 
AI & Machine Learning Pipelines with Knative
AI & Machine Learning Pipelines with KnativeAI & Machine Learning Pipelines with Knative
AI & Machine Learning Pipelines with Knative
 
Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)
Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)
Kubernetes One-Click Deployment: Hands-on Workshop (Mainz)
 
Scientific Computing @ Fred Hutch
Scientific Computing @ Fred HutchScientific Computing @ Fred Hutch
Scientific Computing @ Fred Hutch
 
28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines28March2024-Codeless-Generative-AI-Pipelines
28March2024-Codeless-Generative-AI-Pipelines
 
Oscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby projectOscon 2017: Build your own container-based system with the Moby project
Oscon 2017: Build your own container-based system with the Moby project
 
Running Apache Spark Jobs Using Kubernetes
Running Apache Spark Jobs Using KubernetesRunning Apache Spark Jobs Using Kubernetes
Running Apache Spark Jobs Using Kubernetes
 

More from Animesh Singh

Machine Learning Exchange (MLX)
Machine Learning Exchange (MLX)Machine Learning Exchange (MLX)
Machine Learning Exchange (MLX)Animesh Singh
 
KFServing Payload Logging for Trusted AI
KFServing Payload Logging for Trusted AIKFServing Payload Logging for Trusted AI
KFServing Payload Logging for Trusted AIAnimesh Singh
 
KFServing and Kubeflow Pipelines
KFServing and Kubeflow PipelinesKFServing and Kubeflow Pipelines
KFServing and Kubeflow PipelinesAnimesh Singh
 
Kubeflow Distributed Training and HPO
Kubeflow Distributed Training and HPOKubeflow Distributed Training and HPO
Kubeflow Distributed Training and HPOAnimesh Singh
 
KFServing - Serverless Model Inferencing
KFServing - Serverless Model InferencingKFServing - Serverless Model Inferencing
KFServing - Serverless Model InferencingAnimesh Singh
 
Defend against adversarial AI using Adversarial Robustness Toolbox
Defend against adversarial AI using Adversarial Robustness Toolbox Defend against adversarial AI using Adversarial Robustness Toolbox
Defend against adversarial AI using Adversarial Robustness Toolbox Animesh Singh
 
Trusted, Transparent and Fair AI using Open Source
Trusted, Transparent and Fair AI using Open SourceTrusted, Transparent and Fair AI using Open Source
Trusted, Transparent and Fair AI using Open SourceAnimesh Singh
 
AIF360 - Trusted and Fair AI
AIF360 - Trusted and Fair AIAIF360 - Trusted and Fair AI
AIF360 - Trusted and Fair AIAnimesh Singh
 
Fabric for Deep Learning
Fabric for Deep LearningFabric for Deep Learning
Fabric for Deep LearningAnimesh Singh
 
Microservices, Kubernetes and Istio - A Great Fit!
Microservices, Kubernetes and Istio - A Great Fit!Microservices, Kubernetes and Istio - A Great Fit!
Microservices, Kubernetes and Istio - A Great Fit!Animesh Singh
 
How to build a Distributed Serverless Polyglot Microservices IoT Platform us...
How to build a Distributed Serverless Polyglot Microservices IoT Platform us...How to build a Distributed Serverless Polyglot Microservices IoT Platform us...
How to build a Distributed Serverless Polyglot Microservices IoT Platform us...Animesh Singh
 
How to build an event-driven, polyglot serverless microservices framework on ...
How to build an event-driven, polyglot serverless microservices framework on ...How to build an event-driven, polyglot serverless microservices framework on ...
How to build an event-driven, polyglot serverless microservices framework on ...Animesh Singh
 
As a Service: Cloud Foundry on OpenStack - Lessons Learnt
As a Service: Cloud Foundry on OpenStack - Lessons LearntAs a Service: Cloud Foundry on OpenStack - Lessons Learnt
As a Service: Cloud Foundry on OpenStack - Lessons LearntAnimesh Singh
 
Introducing Cloud Native, Event Driven, Serverless, Micrsoservices Framework ...
Introducing Cloud Native, Event Driven, Serverless, Micrsoservices Framework ...Introducing Cloud Native, Event Driven, Serverless, Micrsoservices Framework ...
Introducing Cloud Native, Event Driven, Serverless, Micrsoservices Framework ...Animesh Singh
 
Finding and-organizing Great Cloud Foundry User Groups
Finding and-organizing Great Cloud Foundry User GroupsFinding and-organizing Great Cloud Foundry User Groups
Finding and-organizing Great Cloud Foundry User GroupsAnimesh Singh
 
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...Animesh Singh
 
Building a PaaS Platform like Bluemix on OpenStack
Building a PaaS Platform like Bluemix on OpenStackBuilding a PaaS Platform like Bluemix on OpenStack
Building a PaaS Platform like Bluemix on OpenStackAnimesh Singh
 
Cloud foundry Docker Openstack - Leading Open Source Triumvirate
Cloud foundry Docker Openstack - Leading Open Source TriumvirateCloud foundry Docker Openstack - Leading Open Source Triumvirate
Cloud foundry Docker Openstack - Leading Open Source TriumvirateAnimesh Singh
 
Build Scalable Internet of Things Apps using Cloud Foundry, Bluemix & Cloudant
Build Scalable Internet of Things Apps using Cloud Foundry, Bluemix & CloudantBuild Scalable Internet of Things Apps using Cloud Foundry, Bluemix & Cloudant
Build Scalable Internet of Things Apps using Cloud Foundry, Bluemix & CloudantAnimesh Singh
 

More from Animesh Singh (20)

Machine Learning Exchange (MLX)
Machine Learning Exchange (MLX)Machine Learning Exchange (MLX)
Machine Learning Exchange (MLX)
 
KFServing Payload Logging for Trusted AI
KFServing Payload Logging for Trusted AIKFServing Payload Logging for Trusted AI
KFServing Payload Logging for Trusted AI
 
KFServing and Kubeflow Pipelines
KFServing and Kubeflow PipelinesKFServing and Kubeflow Pipelines
KFServing and Kubeflow Pipelines
 
KFServing and Feast
KFServing and FeastKFServing and Feast
KFServing and Feast
 
Kubeflow Distributed Training and HPO
Kubeflow Distributed Training and HPOKubeflow Distributed Training and HPO
Kubeflow Distributed Training and HPO
 
KFServing - Serverless Model Inferencing
KFServing - Serverless Model InferencingKFServing - Serverless Model Inferencing
KFServing - Serverless Model Inferencing
 
Defend against adversarial AI using Adversarial Robustness Toolbox
Defend against adversarial AI using Adversarial Robustness Toolbox Defend against adversarial AI using Adversarial Robustness Toolbox
Defend against adversarial AI using Adversarial Robustness Toolbox
 
Trusted, Transparent and Fair AI using Open Source
Trusted, Transparent and Fair AI using Open SourceTrusted, Transparent and Fair AI using Open Source
Trusted, Transparent and Fair AI using Open Source
 
AIF360 - Trusted and Fair AI
AIF360 - Trusted and Fair AIAIF360 - Trusted and Fair AI
AIF360 - Trusted and Fair AI
 
Fabric for Deep Learning
Fabric for Deep LearningFabric for Deep Learning
Fabric for Deep Learning
 
Microservices, Kubernetes and Istio - A Great Fit!
Microservices, Kubernetes and Istio - A Great Fit!Microservices, Kubernetes and Istio - A Great Fit!
Microservices, Kubernetes and Istio - A Great Fit!
 
How to build a Distributed Serverless Polyglot Microservices IoT Platform us...
How to build a Distributed Serverless Polyglot Microservices IoT Platform us...How to build a Distributed Serverless Polyglot Microservices IoT Platform us...
How to build a Distributed Serverless Polyglot Microservices IoT Platform us...
 
How to build an event-driven, polyglot serverless microservices framework on ...
How to build an event-driven, polyglot serverless microservices framework on ...How to build an event-driven, polyglot serverless microservices framework on ...
How to build an event-driven, polyglot serverless microservices framework on ...
 
As a Service: Cloud Foundry on OpenStack - Lessons Learnt
As a Service: Cloud Foundry on OpenStack - Lessons LearntAs a Service: Cloud Foundry on OpenStack - Lessons Learnt
As a Service: Cloud Foundry on OpenStack - Lessons Learnt
 
Introducing Cloud Native, Event Driven, Serverless, Micrsoservices Framework ...
Introducing Cloud Native, Event Driven, Serverless, Micrsoservices Framework ...Introducing Cloud Native, Event Driven, Serverless, Micrsoservices Framework ...
Introducing Cloud Native, Event Driven, Serverless, Micrsoservices Framework ...
 
Finding and-organizing Great Cloud Foundry User Groups
Finding and-organizing Great Cloud Foundry User GroupsFinding and-organizing Great Cloud Foundry User Groups
Finding and-organizing Great Cloud Foundry User Groups
 
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
CAPS: What's best for deploying and managing OpenStack? Chef vs. Ansible vs. ...
 
Building a PaaS Platform like Bluemix on OpenStack
Building a PaaS Platform like Bluemix on OpenStackBuilding a PaaS Platform like Bluemix on OpenStack
Building a PaaS Platform like Bluemix on OpenStack
 
Cloud foundry Docker Openstack - Leading Open Source Triumvirate
Cloud foundry Docker Openstack - Leading Open Source TriumvirateCloud foundry Docker Openstack - Leading Open Source Triumvirate
Cloud foundry Docker Openstack - Leading Open Source Triumvirate
 
Build Scalable Internet of Things Apps using Cloud Foundry, Bluemix & Cloudant
Build Scalable Internet of Things Apps using Cloud Foundry, Bluemix & CloudantBuild Scalable Internet of Things Apps using Cloud Foundry, Bluemix & Cloudant
Build Scalable Internet of Things Apps using Cloud Foundry, Bluemix & Cloudant
 

Recently uploaded

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 

Recently uploaded (20)

What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 

Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX]

  • 1. Hybrid Cloud, Kubeflow and Tensorflow Extended [TFX] Session by:
  • 2. Hybrid Cloud, Kubeflow and TFX Animesh Singh Chief Architect Data and AI Platform CODAIT Tommy Li Software Engineer CODAIT Pete MacKinnon Principal Software Engineer, RedHat AICoE
  • 3. Center for Open Source Data and AI Technologies (CODAIT) Code – Build and improve practical frameworks to enable more developers to realize immediate value. Content – Showcase solutions for complex and real-world AI problems. Community – Bring developers and data scientists to engage with IBM Improving Enterprise AI lifecycle in Open Source •  Team contributes to over 10 open source projects •  Committers in Kubeflow, Spark, Tensorflow, PyTorch, ONNX… •  17 committers and many contributors in Apache projects •  Speakers at over 100 conferences, meetups, unconferences and more CODAIT codait.org ↳ codait.org
  • 4. The Machine Learning Workflow 4
  • 6. And the ML workflow spans teams …
  • 7. …and is much more complex…. Data cleansing Data analysis Data transformation Data validation Data splitting Data prep Building a model Model validation Training at scale Model creation Deploying Serving Monitoring & Logging Finetune & improvements Rollout Training optimization Model Model Data Data Data ingestion EdgeCloud
  • 8. Hybrid Clouds, TFX and Kubeflow Pipelines Hybrid Cloud
  • 9. Hybrid Cloud Definition: A cloud that is: •  Inclusive of on-prem and public •  Multicloud •  Open •  Secure •  Managed To scale enterprise workloads across the globe
  • 11. Compute Network Storage {Infrastructure Cloud / On-Prem {Platform Security Kubernetes DevOps CFApp Services 11
  • 12. Compute Network Storage {Infrastructure Cloud / On-Prem {Platform Databases Analytics Governance {Data Security Kubernetes DevOps CFApp Services 12
  • 13. Compute Network Storage {Infrastructure Cloud / On-Prem {Platform Databases Analytics Governance {Data Watson TensorFlow Machine learning {AI Security Kubernetes DevOps CFApp Services 13
  • 14. Compute Network Storage {Infrastructure Cloud / On-Prem {Platform Databases Analytics Governance TensorFlow Machine learning Security Kubernetes DevOps CFApp Services 14 Watson {Data {AI
  • 15. Compute Network Storage {Infrastructure Cloud / On-Prem {Platform Databases Analytics Governance TensorFlow Machine learning Public Cloud Private Cloud Security Kubernetes DevOps CFApp Services {Infrastructure {Platform 15 Watson {Data {AI
  • 16. Hybrid Clouds, TFX and Kubeflow Pipelines Private Cloud
  • 17. OpenShift is our Private Cloud for ML Workloads EXISTING AUTOMATION TOOLSETS SCM (GIT) CI/CD SERVICE LAYER PERSISTEN T STORAGE REGISTRY RHEL NODE c RHEL NODE RHEL NODE RHEL NODE RHEL NODE RHEL NODE C C C C C C C CC C RED HAT ENTERPRISE LINUX MASTER API/AUTHENTICATION DATA STORE SCHEDULER HEALTH/SCALING PHYSICAL VIRTUAL PRIVATE PUBLIC HYBRID DATA SCIENTIST ML deployed across clouds, data center, and edge ML services, load balanced and scaled ML microservices scheduled and orchestrated on shared resources Best of SDLC ML in Production 17
  • 18. Enterprise App ML Training Data Processing ML Inference CI/CD Pipelines Data Engineer Data Scientist Infra Engineer App Developer Red Hat OpenShift Container Storage Baremetal Red Hat OpenShift Container Platform Persona Key Red Hat OpenStack Platform Multi cloud gateway Analytics RDBMS Discovery Ceph OPEN DATA HUB AI PLATFORM POWERED BY OPEN SOURCE Data Ingest 18 We are working across the Data and AI Lifecycle
  • 19. The Open Data Hub Project ●  OpenDataHub.io ●  Meta-operator that integrates best open source AI/ML/Data projects ●  Blueprint architecture for AI/ML on OpenShift https://opendatahub.io/docs/architecture.html Data Acquisition & Preparation ML Model Selection, Training, Testing ML Model Deployment in App. Dev. Process
  • 20. Open Data Hub Blueprint 20
  • 21. Hybrid Clouds, TFX and Kubeflow Pipelines Public Cloud
  • 22. IBM Cloud Architecture Infrastructure X86, Power CPU, GPU Compute Sleds Programmable Mesh Network Flash & Spinning Storage Container Orchestration & Networking: Kubernetes
  • 23. Red Hat OpenShift on IBM Cloud Armada API Carrier Cluster Workers Master Master Cluster Workers
  • 24. Red Hat OpenShift on IBM Cloud Armada API Carrier Cluster Workers Master Master Cluster Workers RH OpenShift Carrier Cluster Workers Master Master Cluster Workers
  • 25. Red Hat OpenShift on IBM Cloud Ubuntu Linux ARMADA API Containerd Kubelet (Community) Calico Agent Ubuntu Linux CARRIER WORKER Containerd Kubelet (Community) Calico Agent Ubuntu Linux Containerd Kubelet (Community) Calico Agent CLUSTER WORKER
  • 26. Red Hat OpenShift on IBM Cloud Ubuntu Linux ARMADA API Containerd Kubelet (Community) Calico Agent Ubuntu Linux CARRIER WORKER Containerd Kubelet (Community) Calico Agent Ubuntu Linux Containerd Kubelet (Community) Calico Agent CLUSTER WORKER RHEL REDHAT CARRIER CRI-O Kubelet (OpenShift) Calico Agent RHEL CRI-O Kubelet (OpenShift) Calico Agent REDHAT WORKER
  • 27. Red Hat OpenShift on IBM Cloud Ubuntu Linux ARMADA API Containerd Kubelet (Community) Calico Agent Ubuntu Linux CARRIER WORKER Containerd Kubelet (Community) Calico Agent Ubuntu Linux Containerd Kubelet (Community) Calico Agent CLUSTER WORKER RHEL REDHAT CARRIER CRI-O Kubelet (OpenShift) Calico Agent RHEL CRI-O Kubelet (OpenShift) Calico Agent REDHAT WORKER
  • 28. Hybrid Clouds, TFX and Kubeflow Pipelines Kubeflow Pipelines
  • 29. 29
  • 30. Distributed Model Training and HPO (Katib, TFJob, PyTorch Job…) ●  Addresses One of the key goals for model builder persona: Distributed Model Training and Hyper parameter optimization for Tensorflow, PyTorch etc. ●  Common problems in HP optimization ○  Overfitting ○  Wrong metrics ○  Too few hyperparameters ●  Katib: a fully open source, Kubernetes-native hyperparameter tuning service ○  Inspired by Google Vizier ○  Framework agnostic ○  Extensible algorithms
  • 31. Kubernetes Compute cluster GPU, TPU ,CPU Cloud Object Storage Model Assets. 31 Istio Knative KFServing Serving and Management: KFServing Bringing the power of Knative and Istio for serverless Model deployments PRE- PROCESS PREDICT POST- PROCESS EXPLAIN
  • 32. Manages the hosting aspects of your models •  KFService - manages the lifecycle of models •  Configuration - manages history of model deployments. Two configurations for default and canary. •  Revision - A snapshot of your model version •  Config and image •  Route - Endpoint and network traffic management Route Default Configuration Revision 1 Revision M 90 % KFService Canary Configuration Revision 1 Revision N 10 % KFServing: Default and Canary Configurations
  • 33. Kubeflow Pipelines -  Released to Kubeflow in Nov 2018, integrated into KF deployment CLI and 1-click-deploy-app -  Aimed to bring -  Orchestration for complex ML workflows -  Reproducible and reliable experimentation -  Bridging experimentation and operationalization -  Composition and reusable ML components and pipelines Kubeflow Pipelines
  • 34. Experiment Tracking 34 •  Kubeflow offers an easy way to compare different runs of the pipeline. •  You can create the pipeline with model training. Then run it multiple times with different parameter values, and you’ll get accuracy and ROC AUC scores for every run compared. •  Lot more under “Compare runs” view.
  • 35. What constitutes a Kubeflow ML Pipeline §  Containerized implementations of ML Tasks §  Pre-built components: Just provide params or code snippets (e.g. training code) §  Create your own components from code or libraries §  Use any runtime, framework, data types §  Attach k8s objects - volumes, secrets §  Specification of the sequence of steps §  Specified via Python DSL §  Inferred from data dependencies on input/output §  Input Parameters §  A “Run” = Pipeline invoked w/ specific parameters §  Can be cloned with different parameters §  Schedules §  Invoke a single run or create a recurring scheduled pipeline
  • 36. Define Pipeline with Python SDK @dsl.pipeline(name='Taxi Cab Classification Pipeline Example’) def taxi_cab_classification( output_dir, project, Train_data = 'gs://bucket/train.csv', Evaluation_data = 'gs://bucket/eval.csv', Target = 'tips', Learning_rate = 0.1, hidden_layer_size = '100,50’, steps=3000): tfdv = TfdvOp(train_data, evaluation_data, project, output_dir) preprocess = PreprocessOp(train_data, evaluation_data, tfdv.output[“schema”], project, output_dir) training = DnnTrainerOp(preprocess.output, tfdv.schema, learning_rate, hidden_layer_size, steps, target, output_dir) tfma = TfmaOp(training.output, evaluation_data, tfdv.schema, project, output_dir) deploy = TfServingDeployerOp(training.output) Compile and Submit Pipeline Run dsl.compile(taxi_cab_classification, 'tfx.tar.gz') run = client.run_pipeline( 'tfx_run', 'tfx.tar.gz', params={'output': ‘gs://dpa22’, 'project': ‘my-project-33’})
  • 37. Creating your own components -  Ways to build reusable components for pipelines -  Create a container with your code and write either a ContainerOp() or shareable component descriptor -  Turn your python code into a component directly in the notebook (with or without building a container) -  These components can be exported into a shareable format Container Image Execution Code ContainerOp I/O schema Code ContainerOp Container Image I/O schema Container build Container build Component descriptor
  • 38. Watson AI Operations: Kubeflow Pipelines
  • 39. e.g. Watson Speech to Text Operations Kubeflow pipeline
  • 40. e.g. Watson Machine Learning and Watson OpenScale Pipeline
  • 41. Hybrid Clouds, TFX and Kubeflow Pipelines TFX
  • 42. What is TFX? TL;DR ●  TFX is a platform that to deploy Tensorflow models in production ●  TFX pipelines consist of a set of integrated components ●  TFX pipelines are configured using python ●  TFX consists of components, executors, and libraries ●  TFX components are optional (and repeated) ●  TFX can be configured to run in many different ways 42
  • 43. TFX has existed externally as open source libraries. 4343 Open sourced TFX libraries (circa 2018) TensorFlow Data Validation TensorFlow Transform TensorFlow Model Analysis TensorFlow Serving
  • 44. In 2019, the horizontal layers that integrate TFX libraries as one platform were open sourced 4444 Open sourced TFX platform (2019) Data Ingestion TensorFlow Data Validation TensorFlow Transform Estimator or Keras Model TensorFlow Model Analysis TensorFlow Serving Logging Shared Utilities for Garbage Collection, Data Access Controls Pipeline Storage Shared Configuration Framework and Job Orchestration Integrated Frontend for Job Management, Monitoring, Debugging, Data/Model/Evaluation Visualization
  • 45. Anatomy of a Component TFX components consist of three main pieces: ●  Driver ●  Executor ●  Publisher 45
  • 46. Anatomy of a Component TFX includes both libraries and pipeline components. This diagram illustrates the relationships between TFX libraries and pipeline components. TFX provides several Python packages that are the libraries which are used to create pipeline components 46
  • 47. TFX (inside the box) 47 Other runtimes ExampleGen StatisticsGen SchemaGen Example Validator Transform Trainer Evaluator Model Validator Pusher TFX Config Metadata Store Training + Eval Data TensorFlow Serving TensorFlow Hub TensorFlow Lite TensorFlow JS TFX Pipeline
  • 48. TFX uses ml-metadata for artifact management. 48 Trainer Task-Aware Pipelines Input Data Transformed Data Trained Models Serving System Task- and Data-Aware Pipelines Pipeline + Metadata Storage Training Data Transform TrainerTransform
  • 49. Snapshot of a component. 49 Metadata Store Trainer Config Last Validated Model New (Candidate) Model New Model Model Validator Validation Outcome Pusher New (Candidate) Model Validation Outcome
  • 50. What’s in the Metadata Store? 50 Trained Models Type definitions of Artifacts and their Properties E.g., Models, Data, Evaluation Metrics Trainer Execution Records (Runs) of Components E.g., Runtime Configuration, Inputs + Outputs Lineage Tracking Across All Executions E.g., to recurse back to all inputs of a specific artifact
  • 51. Examples of Metadata-Powered Functionality. 51 Find out which data a model was trained on Compare previous model runs Carry-over state from previous models Re-use previously computed outputs
  • 52.
  • 53. Hybrid Clouds, TFX and Kubeflow Pipelines demo https://github.com/kubeflow/kfp-tekton/tree/master/samples/kfp-tfx
  • 54. Center for Open-Source Data & AI Technologies (CODAIT) / June 28, 2019 / © 2019 IBM Corporation Model Goal: Will the customer tip more or less than 20%?
  • 55. Center for Open-Source Data & AI Technologies (CODAIT) / June 28, 2019 / © 2019 IBM Corporation TFX Taxi Pipeline Private Cloud Public Cloud On-prem PVC KubeFlow Pipelines Istio + Kiali Kubeflow Serving Trigger model deployment Object Storage
  • 56.
  • 57. Hybrid Clouds, TFX and Kubeflow Pipelines Lessons Learnt
  • 58. Recommendations for TFX and KFP •  TFX Pipelines shall be made executable without any dependency on public cloud service e.g. GCS. •  Apache Beam is a strong dependency in TFX. Doesn’t support S3 natively •  TFX DSL shall support dynamically creating Persistent Volume Claims. •  Support for mixing and matching KFP ContainerOps components with TFX ones through DSL •  IBM IKS runs Kubernetes with containerd, Openshift uses CRIO APIs. The underlying pipeline platforms (Argo, Airflow, Beam etc) should support them as first class citizens •  Visualizing artifacts on the KubeFlow Pipeline UI shall not be limited to GCS by default •  Don’t assume root privileges on OpenShift and Kube, as well as underlying storage file system.
  • 59. TFX and KFP DSL Center for Open-Source Data & AI Technologies (CODAIT) / June 28, 2019 / © 2019 IBM Corporation DEMO CODE: https://github.com/kubeflow/kfp-tekton/tree/master/samples/kfp-tfx RFC for TFX and KFP DSL Merge https://docs.google.com/document/d/1_n3q0mNOr7gUSM04yaA0e5BO9RrS0Vkh1cNCyrB07WM/edit# Please reach out at @AnimeshSingh for any follow-on discussions
  • 61. Center for Open-Source Data & AI Technologies (CODAIT) / June 28, 2019 / © 2019 IBM Corporation