Seamless End-to-End Production Machine Learning with Seldon and MLflow

End to end Machine Learning in
kubernetes with Seldon and
MLFlow
Adaptive. Intelligent.
Agile. Intuitive.
Alive. Inspiring.

About me
Adrian Gonzalez-Martin
Machine Learning Engineer
agm@seldon.io
@kaseyo23
github.com/adriangonz

About Seldon
We are hiring!
seldon.io/careers/

Outline
Example use case.
Training models with MLFlow.
Deploying models with Seldon.
Demo!

Wine e-commerce
6.5
2.3
?
https://wine.seldon.io/...

Regression problem where goal is to predict wine quality.
Wine quality dataset
Fixed
Acidity
Volatile
Acidity
Citric
Acid
... Sulphates Alcohol Quality
7 0.27 0.36 0.45 8.8 6
6.3 0.3 0.34 0.49 9.5 7
8.1 0.28 0.4 0.44 10.1 1
7.2 0.23 0.32 0.4 9.9 2
7.2 0.23 0.32 0.4 9.9 5
...

Train two versions of ElasticNet.
Deploy both and do A/B test.
Model A
A/B Test
Router
Model B

ElasticNet
Linear regression with L1 and L2 regularisers.
Two hyperparameters:

Keep track of experiments (e.g. versioning).
Optimise hyperparameters at scale.
Share models with the rest of the team.
Plain notebooks may not be the best solution!!
Training models at scale is hard
secret-project
├── eda.ipynb
├── requirements.txt
├── requirements-tf2.txt
└── experiments
├── attention-embedd.ipynb
├── attn.ipynb
├── hmm.ipynb
├── LSTM-2.ipynb
└── lstm.ipynb

MLtracking
API to track results and hyperparameters.
These can be stored remotely.
MLproject
Deﬁne environment, parameters and model’s interface.
Hyperparameter optimisation can be run Databricks or Kubernetes.
MLmodel
Snapshot/version of the model.
MLFlow can help!

MLFlow can help!
MLtracking
Track both hyperparameters:
Track metrics:
MLproject
Pip dependencies and entry point template.
MLmodel
Snapshot of regression coefﬁcients and hyperparameters:

MLproject file and training
MLproject
name: mlflow-talk
conda_env: conda.yaml
entry_points:
main:
parameters:
alpha: float
l1_ratio: {type: float, default: 0.1}
command: "python train.py {alpha} {l1_ratio}"
$ mlflow run ./training -P alpha=0.5
$ mlflow run ./training -P alpha=1.0

MLmodel snapshot
MLmodel
artifact_path: model
ﬂavors:
python_function:
data: model.pkl
env: conda.yaml
loader_module: mlﬂow.sklearn
python_version: 3.6.9
sklearn:
pickled_model: model.pkl
serialization_format: cloudpickle
sklearn_version: 0.19.1
run_id: 5a6be5a1ef844783a50a6577745dbdc3
utc_time_created: '2019-10-02 14:21:15.783806'

Serving infrastructure.
Scaling up (and down!).
Monitoring models.
Updating models.
Deploying models is hard
Analysis Tools
Serving
Infrastructure
Monitoring
Machine
Resource
Management
Process
Management
Tools
Analysis Tools
Serving
Infrastructure
Monitoring
Machine
Resource
Management
Process
Management Tools
ML
Code
Data Veriﬁcation
Data Collection
Feature Extraction
Conﬁguration
Seldon can help with these...
[1] Hidden Technical Debt in Machine Learning Systems, NIPS 2015

Inference graph
Seldon/3rd party
component
MLFlow model
API
(REST, gRPC)
Model A
A/B Test
Router
Explainer
Model B
Inference graph (k8s pod)

Cloud Native
On-prem
Built on top of Kubernetes Cloud Native APIs.
All major cloud providers are supported.
On-prem providers such as OpenShift are also
supported.
https://docs.seldon.io/projects/seldon-core/en/latest/examples/notebooks.html

Seldon Architecture
Data scientists,
engineers and
managers
Deployment Controller
(kubectl, CI/CD, Seldon Deploy)
Business
Applications REST API or gRPC
Kubernetes clusters
running Seldon Core
Inference
Components
Kubernetes
API
Operator
Ingress
(Ambassador, Istio)
Public docker
registry
Client docker
registry
Seldon docker
registry
MLFlow models
(S3, GCS, etc.)

Inference graph
apiVersion: machinelearning.seldon.io/v1alpha2
kind: SeldonDeployment
metadata:
name: wines-classifier
spec:
predictors:
- graph:
children: []
implementation: MLFLOW_SERVER
modelUri: gs://seldon-models/mlflow/model-a
name: model-a
replicas: 1
traffic: 50
- graph:
children: []
implementation: MLFLOW_SERVER
modelUri: gs://seldon-models/mlflow/model-b
name: model-b
replicas: 1
traffic: 50

Feedback
Kubernetes clusters
running Seldon Core
Inference graphPrediction
Feedback
(reward signal)

Feedback
We can build a rough reward signal using the squared error.

Demo!
https://github.com/adriangonz/mlﬂow-talk

Thanks!
agm@seldon.io
@kaseyo23
github.com/adriangonz
We are hiring!
seldon.io/careers/

Seamless End-to-End Production Machine Learning with Seldon and MLflow

More Related Content

What's hot

Similar to Seamless End-to-End Production Machine Learning with Seldon and MLflow

More from Databricks

Recently uploaded

Seamless End-to-End Production Machine Learning with Seldon and MLflow