Webinar video: https://www.youtube.com/watch?v=Y3_fcJBgpMw
Kubeflow and Beyond: Automation of Model Training, Deployment, Testing, Monitoring, and Retraining
Speakers:
Stepan Pushkarev, CTO, Hydrosphere.io, and Ilnur Garifullin, ML Engineer, Hydrosphere.io
Abstract: Very often the workflow of training models and delivering them to the production environment involves a great deal of manual work: building a Docker image and deploying it to a Kubernetes cluster, packaging the model as a Python package and installing it into your Python application, or even editing Java classes with hard-coded weights and recompiling the whole project. Not to mention that all of this should be followed by testing your model's performance. It can hardly be called "continuous delivery" if you do it all manually. Imagine you could run the whole process of assembling/training/deploying/testing/running a model with a single command in your terminal. In this webinar, we present a way to connect data gathering, model training, model deployment, and model testing into a single workflow and run it with a single command.
1. Train and deliver machine learning models to production with a single command
STEPAN PUSHKAREV
ILNUR GARIFULLIN
2. Today’s webinar overview
1. Machine Learning Workflow
2. Tools overview
a. Kubeflow
b. Hydrosphere.io
3. Deep Dive into Automation
a. Steps definition
b. Steps automation
4. ML Workflow
1. Research
2. Data Preparation
3. Model Training
4. Model Cataloguing
5. Model Deployment
6. Model Integration Testing
7. Production Inferencing
8. Model Performance Monitoring
9. Model Maintenance
5. Step 1: Research
● Defining an objective
● Defining requirements
● Defining methods
● Defining data sources
6. Step 2: Data Preparation
● Collecting data
● Preparing data
○ Cleaning
○ Feature engineering
○ Transformation
● Important! The same data preparation code must be reused at inference time (see the sketch below).
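A minimal sketch of what "reused for inferencing" means in practice: the same preprocessing function is applied both to the training data and to incoming production requests, so training and serving cannot drift apart. The function and feature names below are illustrative, not the webinar's code.

import numpy as np

def preprocess(image: np.ndarray) -> np.ndarray:
    """Shared preprocessing: flatten 28x28 grayscale images and scale to [0, 1]."""
    return (image.reshape(-1, 784) / 255.0).astype(np.float32)

# Training time: applied to the whole dataset.
#   x_train = preprocess(raw_train_images)
# Inference time: the very same function is applied to each request,
# so the model sees identically prepared features in production.
#   prediction = model.predict(preprocess(request_image))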
7. Step 3: Model Training
● Building the model
● Training the model
● Evaluating the model
● Tuning hyper-parameters
● Versioning training data
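The least obvious bullet above is versioning training data. A hedged illustration (the paths and tagging scheme are assumptions, not the speakers' implementation): fingerprint the training files and record that hash alongside the model, so every trained artifact can be traced back to the exact data it saw.

import hashlib
from pathlib import Path

def dataset_version(data_dir: str) -> str:
    """Return a short, deterministic fingerprint of every file under data_dir."""
    digest = hashlib.sha256()
    for path in sorted(Path(data_dir).rglob("*")):
        if path.is_file():
            digest.update(path.name.encode())
            digest.update(path.read_bytes())
    return digest.hexdigest()[:12]

# Example: store this tag in the model's metadata next to the trained artifact.
#   version = dataset_version("/data/mnist")   # e.g. "3f1c9a77be02"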
8. Step 4: Model Cataloguing
● Metadata extraction
○ Graph definition
○ Weights
○ Training data version / stats
○ Other dependencies (look-up vocabulary, etc.)
● Indexing model’s binaries
● Versioning a model artifact
● Storing a model in Repository
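A hedged sketch of the DIY flavour of this step (the bucket name, file layout, and metadata fields are assumptions for illustration): write the extracted metadata next to the model, zip both, and push the archive to a repository such as S3.

import json
import shutil
import boto3  # any artifact store would do; S3 via boto3 is just one option

def catalogue_model(model_dir: str, version: str, data_version: str) -> None:
    # Store metadata (the graph definition and weights already live in the model dir).
    metadata = {"name": "mnist", "version": version, "training_data": data_version}
    with open(f"{model_dir}/metadata.json", "w") as f:
        json.dump(metadata, f)
    # Zip model + metadata and index the artifact in the repository.
    archive = shutil.make_archive(f"mnist-{version}", "zip", model_dir)
    boto3.client("s3").upload_file(archive, "my-model-repo", f"mnist/{version}.zip")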
9. Step 5: Model Deployment
● Preparing infrastructure for the model
● Preparing runtime for the model
● Deploying the model server
● Exposing API endpoints to the model
● Model Integration
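For the DIY route detailed later (slides 32-34), "deploying the model server" usually boils down to a small HTTP wrapper like the sketch below, which you then Dockerize and expose through Kubernetes. The route, payload format, export path, and output key are illustrative assumptions; TensorFlow 1.x is assumed.

import numpy as np
import tensorflow as tf
from flask import Flask, jsonify, request

app = Flask(__name__)
# Load the exported SavedModel once at startup (path is an assumption).
predict_fn = tf.contrib.predictor.from_saved_model("models/mnist/export/latest")

@app.route("/predict", methods=["POST"])
def predict():
    image = np.array(request.json["image"], dtype=np.float32)
    result = predict_fn({"image": image.reshape(1, 784)})
    # "classes" assumes the export signature used in the training sketch below.
    return jsonify({"class": int(result["classes"][0])})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=9000)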
10. Step 6: Model Integration Testing
● Performing integration tests
● Replaying a golden data set
● Replaying edge cases
● Replaying recent traffic
● Asserting results
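A hedged sketch of "replay a golden data set and assert results" against a deployed HTTP endpoint. The URL, payload shape, response key, and accuracy threshold are assumptions for illustration, not part of the webinar's tooling.

import json
import requests

ENDPOINT = "http://my-cluster/api/v1/applications/MyPredictionApp"  # assumed URL

def replay_golden_set(path: str, min_accuracy: float = 0.97) -> None:
    correct, total = 0, 0
    with open(path) as f:
        for line in f:  # one JSON record per line: {"image": [...], "label": 3}
            record = json.loads(line)
            response = requests.post(ENDPOINT, json={"image": record["image"]})
            response.raise_for_status()
            correct += int(response.json()["class"] == record["label"])
            total += 1
    accuracy = correct / total
    assert accuracy >= min_accuracy, f"accuracy {accuracy:.3f} below {min_accuracy}"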
11. Step 7: Production Inferencing
● A/B & Canary deployment
● Model scaling
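A toy illustration of the canary idea (not Hydrosphere's or Kubeflow's routing mechanism; the weights and endpoints are assumptions): send a small fraction of production traffic to the challenger model and the rest to the incumbent, then compare their metrics before promoting.

import random
import requests

ROUTES = [
    ("http://my-cluster/apps/mnist-v1", 0.95),  # incumbent model, assumed URL
    ("http://my-cluster/apps/mnist-v2", 0.05),  # canary model, assumed URL
]

def route(payload: dict) -> dict:
    """Pick a backend according to the canary weights and forward the request."""
    urls, weights = zip(*ROUTES)
    url = random.choices(urls, weights=weights, k=1)[0]
    return requests.post(url, json=payload).json()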
12. Step 8: Model Performance Monitoring
● System metrics monitoring
● Model metrics tracking
● Model comparison
● Concept drift monitoring
● Anomaly detection
● Data profiling
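One way to make "concept drift monitoring" concrete (a sketch, not the platform's implementation; the feature arrays and significance level are assumptions): compare the distribution of each production feature against its training distribution with a two-sample Kolmogorov-Smirnov test and alert on the features that moved.

import numpy as np
from scipy import stats

def drift_alerts(train: np.ndarray, prod: np.ndarray, alpha: float = 0.01):
    """Return indices of features whose production distribution drifted
    away from training, according to a two-sample KS test."""
    drifted = []
    for i in range(train.shape[1]):
        statistic, p_value = stats.ks_2samp(train[:, i], prod[:, i])
        if p_value < alpha:
            drifted.append(i)
    return drifted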
15. The Machine Learning Model Management Platform (Hydrosphere.io)
The Machine Learning Toolkit for Kubernetes (Kubeflow)
16. What is Kubeflow?
● Began as a Kubernetes template / blueprint for running TensorFlow
● Evolved into a "toolkit": loosely coupled tools and blueprints for ML on Kubernetes
17. What is Hydrosphere.io?
Hydrosphere.io is a platform for ML model management.
- A focused, value-add "tool" - part of the toolkit
- Open source
- Augments Cataloguing, Deployment, Inferencing, Monitoring, and Maintenance
18. Tools Landscape: a diagram mapping tools (e.g., an orchestrator, ModelDB) onto the workflow stages Research, Data Prep, Training, Cataloguing, Deployment, Integration Testing, Production Inferencing, Performance Monitoring, and Model Maintenance.
21. Step 1: Research
● Objective – given an image of a handwritten digit, predict which digit it is
● Requirements – easy model export
● Tools and Methods – TensorFlow Estimator API
● Data – the MNIST dataset
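To ground these choices, here is a minimal TensorFlow 1.x Estimator sketch of the kind of model the webinar works with; the layer sizes, names, and export call are illustrative, not the speakers' exact code. The Estimator API makes the "easy export" requirement a one-liner producing a SavedModel.

import tensorflow as tf  # TensorFlow 1.x assumed

def model_fn(features, labels, mode):
    # Flatten 28x28 MNIST images and classify with a single dense layer.
    x = tf.reshape(features["image"], [-1, 784])
    logits = tf.layers.dense(x, 10)
    predictions = {"classes": tf.argmax(logits, axis=1),
                   "probabilities": tf.nn.softmax(logits)}
    if mode == tf.estimator.ModeKeys.PREDICT:
        return tf.estimator.EstimatorSpec(mode, predictions=predictions)
    loss = tf.losses.sparse_softmax_cross_entropy(labels=labels, logits=logits)
    train_op = tf.train.AdamOptimizer().minimize(
        loss, global_step=tf.train.get_global_step())
    return tf.estimator.EstimatorSpec(mode, loss=loss, train_op=train_op)

estimator = tf.estimator.Estimator(model_fn, model_dir="models/mnist")
# estimator.train(input_fn=...) / estimator.evaluate(input_fn=...)
# Easy export for serving:
#   estimator.export_savedmodel("models/mnist/export", serving_input_receiver_fn)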
27-29. Step 4: Model Cataloguing
DIY:
- Instrument the training pipeline
- Store metadata
- Zip the model and metadata
- Store in S3, or push to Artifactory, or push to git
ModelDB:
- Python DSL: sync model, sync test data, sync metrics
- Nice UI
Hydrosphere.io:
$ hs upload /models/mnist/
$ hs profile push /data/mnist/
30. Step 4: Model Cataloguing
Hydrosphere.io: version the model, extract metadata, build a model Docker image, store it in the Docker Registry.
$ hs upload /models/mnist/
$ hs profile push /data/mnist/
32-34. Step 5: Model Deployment
DIY:
- Implement a model server (Flask app)
- Look up the model
- Dockerize
- Add Kube configs, tags
- Expose an API (HTTP, gRPC, batch, streaming)
Niche tools:
- TensorFlow Serving
- PyTorch Serving
- Nvidia TensorRT Serving
Hydrosphere.io:
$ hs apply -f - << EOF
kind: Application
name: "MyPredictionApp"
singular:
  model: mnist:1
  runtime: "serving-runtime-python:1.7.0-latest"
EOF
35. Step 5: Model Deployment
The manifest above ties together the application metadata, the runtime (serving-runtime-python), and the model version (mnist:1); applying it launches the model on Kube and exposes HTTP, gRPC, and Kafka APIs.
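Once the application is up, it can be called over plain HTTP. A hedged example: the gateway address and path below are placeholders, since the exact route depends on how the Hydrosphere gateway is exposed in your cluster.

import requests

# Placeholder URL: substitute your cluster's gateway address and application route.
url = "http://<gateway-address>/<application-endpoint>/MyPredictionApp"
payload = {"image": [[0.0] * 784]}  # one flattened 28x28 image

response = requests.post(url, json=payload)
print(response.json())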
36-38. Step 6: Model Integration Testing
DIY:
- Implement a testing script
- Dockerize, add to Kube
- Replay a golden data set
- Replay edge cases
- Replay recent traffic
- Assert results
Hydrosphere Serving (Q2 2019):
$ hs test -f /test/dataset
$ hs test replay anomalies
$ hs test replay <from_date>