Model serving made easy using Kedro pipelines - Mariusz Strzelecki, GetInData

Model serving made easy using
Kedro pipelines
Mariusz Strzelecki
ML Engineer, GetInData
www.ml.dssconf.pl

© Copyright. All rights reserved. Not to be reproduced without prior written consent.
Let’s serve a model!
● Mlﬂow
● Seldon
● …
● Or just a microservice in python
● https://docs.seldon.io/projects/seldon-core/en/v1.6.0/examples/iris.html

Easy, right?
● “Real” models have way more parameters
● “Real” models require some transformations:
○ Values scaling
○ Encoding
● “Real” models should validate input
● “Real” models may depend on data from Feature Store
● “Real” models may require post-processing

Custom code already supported
● MLﬂow offers pyfunc

● Seldon provides “Python Components”

● or even multi-microservice inference graph

Awesome, right?
● Right!
● But:
○ You keep every model in 2 subprojects: training and serving
○ Either you write tailored code for every model as Python class
○ … or you keep reusable components as multiple separate Docker containers
● Basically: a lot of work to do

What if I told you, that:
● There is a way to keep both: training and serving in one
project
● You can test your serving (aka. “inference”) part without
leaving Jupyter
● It glues two existing frameworks

Why Kedro?
● There is no standard on how ML projects should look like
● …, but, there are known good practices!
● Reproducibility is the key
● Fills the gap between Data Science and software
development

Building blocks of Kedro project
Icons made by dDara and Freepik from Flaticon
Node Pipeline Data catalog

Node

Pipeline

Data Catalog

Kedro Pipelines
Data preparation Data science
Load
dataset 1
Load
dataset 2
Join
datasets
Normalize
data
Split normalized data
Train model
Evaluate model
Train
Test
Model
Inference
Load input
Decorate
input
Normalize
data
Evaluate model
Format output

The code

Ups and downs
+ No longer need to keep 2 separate projects
+ Super fast iterating on the model
+ Same docker image enforces identical python version and
all libraries
- Not as fast as it could be (~11ms of overhead)
- Need to install compatible seldon-core package

Summary
● Kedro + Seldon + Mlﬂow make a useful serving toolchain
● Kedro proved (again) to be all-in-one environment for ML
● Great method for fast iterating, but not an option if every
millisecond counts

Model serving made easy using Kedro pipelines - Mariusz Strzelecki, GetInData

More Related Content

What's hot

Similar to Model serving made easy using Kedro pipelines - Mariusz Strzelecki, GetInData

More from GetInData

Recently uploaded

Model serving made easy using Kedro pipelines - Mariusz Strzelecki, GetInData