At OVHcloud, we use Machine Learning models internally to support decision-making, in areas ranging from fraud prevention to improving the maintenance of our infrastructure.
By leveraging standard open-source formats - such as TensorFlow SavedModels - ML Serving lets users deploy their models easily while benefiting from essential features such as instrumentation, scalability and model versioning.
4. @OVHcloud_fr #OVHcloudTechTalks @ChrisRannou
Public Cloud ML Serving
Instead of taking care of deployment in production, simply select your ML models (your own or pre-trained ones), size them and deploy. We provide API endpoints and more!
Our extra value:
✔ We simplify your architecture: we deploy your ML models for you in a few clicks
✔ We simplify your code: everything can be automated (via API / CLI)
✔ We reduce your costs: you cut the time-to-production from weeks to seconds, and pay as you go
✔ We fix your main challenges: we provide Scaling, Monitoring and Versioning
Serving Hub
Model Controller
• Image building:
– Build the image with the model files from storage
– Push the image to the registry
• Model lifecycle:
– ApiStatus: describes the state of the runtime API
– VersionStatus: describes the state of the model image
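The controller's job can be sketched as a reconciliation step that compares the two status fields and decides the next action. This is a minimal illustration only; the status names below are invented for the sketch, not the actual CRD values used by Serving Hub.

```python
from enum import Enum

class VersionStatus(Enum):
    # state of the model image (illustrative values, not the real CRD states)
    PENDING = "pending"
    BUILDING = "building"
    BUILT = "built"
    FAILED = "failed"

class ApiStatus(Enum):
    # state of the runtime API serving the model
    STOPPED = "stopped"
    DEPLOYING = "deploying"
    RUNNING = "running"

def reconcile(version: VersionStatus, api: ApiStatus) -> str:
    """Return the next action for a model, comparing image and API state."""
    if version in (VersionStatus.PENDING, VersionStatus.BUILDING):
        # build image with model files from storage, push it to the registry
        return "build-and-push-image"
    if version is VersionStatus.FAILED:
        return "report-build-error"
    if api is not ApiStatus.RUNNING:
        return "deploy-runtime-api"
    return "noop"
```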
Serving Hub
Metrics
• Ingress controller:
– Count of HTTP requests by method and status code
– Sum of HTTP latencies by method and status code
• k8s:
– Number of pods
– Number of Model CRDs
• Custom model metrics:
– Liveness
– Version
– Version status
– API status
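The custom model metrics above can be exported in the Prometheus text exposition format. A minimal sketch, assuming hypothetical metric and label names (the real names used by Serving Hub are not given in the talk):

```python
def render_metrics(model: str, version: str, live: bool,
                   version_status: str, api_status: str) -> str:
    """Render per-model metrics (liveness, version/API status) in the
    Prometheus text exposition format."""
    labels = f'model="{model}",version="{version}"'
    return "\n".join([
        # liveness as a 0/1 gauge
        f'model_liveness{{{labels}}} {1 if live else 0}',
        # statuses as info-style gauges: the state is carried by a label
        f'model_version_status{{{labels},status="{version_status}"}} 1',
        f'model_api_status{{{labels},status="{api_status}"}} 1',
    ])
```

A scrape of the runtime would then expose one such group of lines per deployed model version.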
Serving Hub
Auto Scaling
• HPA: Horizontal Pod Autoscaler
• Scales on RAM usage above 60%
• Scales on CPU usage above 60%
• Params:
– Max/min threshold
– Scale decision: percentage and which resource
• To come: GPU usage
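The scale decision follows the standard Kubernetes HPA rule: the desired replica count is proportional to the ratio of current to target utilization, clamped to the configured min/max. Reading the slide's 60% as the target utilization, a sketch:

```python
import math

def desired_replicas(current_replicas: int, current_usage: float,
                     target_usage: float, min_replicas: int,
                     max_replicas: int) -> int:
    """Kubernetes HPA rule: desired = ceil(current * usage / target),
    clamped to the min/max thresholds."""
    desired = math.ceil(current_replicas * current_usage / target_usage)
    return max(min_replicas, min(max_replicas, desired))
```

For example, two pods averaging 90% CPU against a 60% target yield ceil(2 * 0.90 / 0.60) = 3 replicas.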
Serving Runtime
Prerequisites:
• Support the ONNX, TensorFlow (TF) and PMML serialization formats
• Able to chain several models of different kinds
• Available through a web service API

Example:
HTTP Query → Preprocessing → Model → Postprocessing → HTTP Response
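That pipeline can be sketched as three composed functions. The feature names and the stand-in model below are illustrative, not the runtime's actual implementation:

```python
def preprocess(query: dict) -> list[float]:
    # turn the raw HTTP payload into model features (illustrative fields)
    return [float(query["x"]), float(query["y"])]

def model(features: list[float]) -> float:
    # stand-in for an ONNX/TF/PMML evaluator: a fixed linear model
    return 2.0 * features[0] + features[1]

def postprocess(score: float) -> dict:
    # wrap the raw score into an HTTP-friendly response body
    return {"prediction": score, "label": "positive" if score > 0 else "negative"}

def serve(query: dict) -> dict:
    # HTTP query -> preprocessing -> model -> postprocessing -> HTTP response
    return postprocess(model(preprocess(query)))
```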
Serving Runtime
Think generic: let's create one Evaluator per supported serialization format.
What are the common inputs & outputs?

Inputs (?) → Evaluator (ONNX / TensorFlow / PMML) → Outputs (?)
Serving Runtime
PMML
Inputs & outputs:
• Tabular data (i.e. a dataset) can be interpreted as a list of named tensors.

Example:

prop_int  prop_bool  prop_string
1         true       "John"
6         true       "Kim"
8         false      "Hugo"

becomes three named tensors of shape (3, 1):

prop_int    (3, 1): [1, 6, 8]
prop_bool   (3, 1): [true, true, false]
prop_string (3, 1): ["John", "Kim", "Hugo"]
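The column-to-tensor mapping above is mechanical: each named column becomes a named tensor of shape (n_rows, 1). A minimal sketch in plain Python (the runtime itself would use real tensor types):

```python
def dataset_to_named_tensors(columns: dict) -> dict:
    """Turn tabular data into named tensors: each column name maps to a
    (values, shape) pair, where values are reshaped to (n_rows, 1)."""
    tensors = {}
    for name, values in columns.items():
        tensors[name] = ([[v] for v in values], (len(values), 1))
    return tensors
```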
Serving Runtime
Think generic: what are the common inputs & outputs?

Inputs (list of named tensors) → Evaluator (ONNX / TensorFlow / PMML) → Outputs (list of named tensors)
Serving Runtime
Web API
How do we convert an HTTP query into named tensors?
How do we convert named tensors into an HTTP response?
Use the Content-Type header to decode/encode the message body.
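Dispatching on the Content-Type header can be sketched as a pair of codec functions. The supported media types and column-oriented JSON layout below are assumptions for illustration, not the runtime's documented wire format:

```python
import csv
import io
import json

def decode_body(content_type: str, body: str) -> dict:
    """Decode an HTTP message body into named tensors,
    dispatching on the Content-Type header."""
    if content_type == "application/json":
        # {"x": [1, 2]} maps directly onto named tensors
        data = json.loads(body)
        return {k: v if isinstance(v, list) else [v] for k, v in data.items()}
    if content_type == "text/csv":
        # each CSV column becomes one named tensor
        rows = list(csv.DictReader(io.StringIO(body)))
        return {name: [row[name] for row in rows] for name in rows[0]}
    raise ValueError(f"unsupported Content-Type: {content_type}")

def encode_body(accept: str, tensors: dict) -> str:
    """Encode named tensors back into an HTTP body (JSON shown here;
    the CSV direction would be symmetric)."""
    if accept == "application/json":
        return json.dumps(tensors)
    raise ValueError(f"unsupported Accept: {accept}")
```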