SlideShare a Scribd company logo
1 of 20
Download to read offline
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Train once, deploy anywhere on the cloud
and at the edge with Amazon SageMaker
Neo
Vebhhav Singh
Sr. Solutions Architect
AWS
A I M 3 0 3
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon AI/ML Stack
M L S E R V I C E S
M L F R A M E W O R K S
A I S E R V I C E S
(ML researchers
and academics)
(ML developers
and data scientists)
A m a z o n
S a g e M a k e r
G R O U N D T R U T H A L G O R I T H M S
N O T E B O O K S
M A R K E T P L A C E
U N S U P E R V I S E D
L E A R N I N G
S U P E R V I S E D
L E A R N I N G
R E I N F O R C E M E N T
L E A R N I N G
O P T I M I Z A T I O N
( N E O )
T R A I N I N G
H O S T I N G
D E P L O Y M E N T
Vision Speech Language Chatbots &
Contact Centers
Verticals
A m a z o n
R e k o g n i t i o n
I m a g e
A m a z o n
R e k o g n i t i o n
V i d e o
A m a z o n
P o l l y
A m a z o n
T r a n s c r i b e
A m a z o n
T r a n s l a t e
A m a z o n
C o m p r e h e n d A m a z o n
L e x
A m a z o n
F o r e c a s t
A m a z o n
T e x t r a c t
A m a z o n
P e r s o n a l i
(Applicaion
Developers)
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Deployment of ML models is complex
Repeat for every model and every change in the model
BYO
AWS
Build with your
own algorithms
Build with built-in algorithms
from AWS
Train with
TensorFlow,
MXNet ,
or PyTorch
Optimize
your models
Deploy to
the cloud
Deploy to
the edge
A B
A/B test
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
The majority of the cost and
complexity of ML in
production is due to Inference
Inference (Prediction)
90%
Training
10%
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Problem: Not all models are skinny
Models that are accurate tend to
be big and slow
Models are chained to the
framework in which they were
trainedS
L
M
Accuracy
Performance
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Problem: Not all targets are easy
Need enormous expertise …
Application Development: Cloud-Native or Embedded System
Machine Learning: Model Training and Parameter Tuning
Performance Tuning: Troubleshooting and Optimization
Frameworks: TensorFlow or MXNet or PyTorch or Chainer
Hardware: Cloud Server or Edge Device
Computer Architecture: x86 or RISC or GPU or FPGA or ASIC
…and endless time
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Problem: Not every path is a catwalk
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
What if you could train your model once and run it
anywhere, in the cloud or at the edge, with twice the
speed and no loss in accuracy?
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon SageMaker Neo
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Increase inference performance by 2x
Reduce runtime footprint by 100x
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon SageMaker Neo
K e y f e a t u r e s
Machine Learning Compiler for CPU or GPU
Compact Open Source Runtime
https://aws.amazon.com/sagemaker/neo
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Neo bridges Amazon SageMaker & AWS IoT Greengrass
Data labeling &
pre-built
notebooks
for common
problems
Model and
algorithm
marketplace &
built-in, high-
performance
algorithms
One-click
training on the
highest
performing
infrastructure
One-click model
optimization
One-click
deployment
Improves performance on
selected hardware
Extends AWS IoT
onto your devices
NeoAmazon SageMaker AWS IoT Greengrass
ML models train once run anywhere
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Neo delivers compilation as a service
Parses
model
Optimizes
tensors
Generates
code
Optimizes
graph
Convert a TensorFlow,
MXNet, PyTorch, or
XGBoost model into a
common format
Detect patterns in the
ML model structure to
reduce the execution
time
Detect patterns in the shape
of input data to allocate
memory efficiently
Use a low-level compiler to
generate machine code for
each target
No additional cost for Amazon SageMaker users
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Neo delivers compilation as a service
Parses
model
Optimizes
tensors
Generates
code
Optimizes
graph
Graph pruning
Layer fusion
Constant folding
Layout transforms
Nested parallelism
Tiling
Tensorization
Auto-tuning
Uses Treelite and Apache TVM for model optimization
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Neo delivers a compact runtime
Dispatches
model
Partitions
graph
Matches model with
execution backend
Sends subgraph to
suitable accelerator
Framework Size
MXNet 450 MB
TensorFlow 660 MB
PyTorch 1000 MB
Neo 1 MB
Open-source software enables device-specific customization
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Steps for sample deployment
- Get an existing model
- Compile model for new hardware
- Copy the artifacts to new hardware
- Deploy the Neo runtime
- Run the inference
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Neo: Open-source project
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Q&A
Thank you!
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Vebhhav Singh
vebhhavs@amazon.com

More Related Content

What's hot

Data modeling with Amazon DynamoDB - ADB301 - New York AWS Summit
Data modeling with Amazon DynamoDB - ADB301 - New York AWS SummitData modeling with Amazon DynamoDB - ADB301 - New York AWS Summit
Data modeling with Amazon DynamoDB - ADB301 - New York AWS SummitAmazon Web Services
 
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...AWS Summits
 
Increase the value of video using ML and AWS media services - SVC301 - Atlant...
Increase the value of video using ML and AWS media services - SVC301 - Atlant...Increase the value of video using ML and AWS media services - SVC301 - Atlant...
Increase the value of video using ML and AWS media services - SVC301 - Atlant...Amazon Web Services
 
Building home security solutions at scale, ft. Comcast - SVC206 - New York AW...
Building home security solutions at scale, ft. Comcast - SVC206 - New York AW...Building home security solutions at scale, ft. Comcast - SVC206 - New York AW...
Building home security solutions at scale, ft. Comcast - SVC206 - New York AW...Amazon Web Services
 
What's new in Amazon EC2 - CMP201 - New York AWS Summit
What's new in Amazon EC2 - CMP201 - New York AWS SummitWhat's new in Amazon EC2 - CMP201 - New York AWS Summit
What's new in Amazon EC2 - CMP201 - New York AWS SummitAmazon Web Services
 
Accelerating-ML-Adoption-with-Our-New-AI-Services
Accelerating-ML-Adoption-with-Our-New-AI-ServicesAccelerating-ML-Adoption-with-Our-New-AI-Services
Accelerating-ML-Adoption-with-Our-New-AI-ServicesAmazon Web Services
 
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...Amazon Web Services
 
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...Amazon Web Services
 
Build a VR experience in 60 minutes - SVC222 - New York AWS Summit
Build a VR experience in 60 minutes - SVC222 - New York AWS SummitBuild a VR experience in 60 minutes - SVC222 - New York AWS Summit
Build a VR experience in 60 minutes - SVC222 - New York AWS SummitAmazon Web Services
 
Databases on AWS - The right tool for the right job - ADB203 - Santa Clara AW...
Databases on AWS - The right tool for the right job - ADB203 - Santa Clara AW...Databases on AWS - The right tool for the right job - ADB203 - Santa Clara AW...
Databases on AWS - The right tool for the right job - ADB203 - Santa Clara AW...Amazon Web Services
 
Do you need a ledger database or a blockchain - SVC208 - Atlanta AWS Summit.pdf
Do you need a ledger database or a blockchain - SVC208 - Atlanta AWS Summit.pdfDo you need a ledger database or a blockchain - SVC208 - Atlanta AWS Summit.pdf
Do you need a ledger database or a blockchain - SVC208 - Atlanta AWS Summit.pdfAmazon Web Services
 
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...Amazon Web Services
 
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...Amazon Web Services
 
Migrating on-premises Apache Spark and Hive to Amazon EMR - ADB304 - New York...
Migrating on-premises Apache Spark and Hive to Amazon EMR - ADB304 - New York...Migrating on-premises Apache Spark and Hive to Amazon EMR - ADB304 - New York...
Migrating on-premises Apache Spark and Hive to Amazon EMR - ADB304 - New York...Amazon Web Services
 
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...Amazon Web Services
 
Architetture per l'analisi di flussi di dati in tempo reale
Architetture per l'analisi di flussi di dati in tempo realeArchitetture per l'analisi di flussi di dati in tempo reale
Architetture per l'analisi di flussi di dati in tempo realeAmazon Web Services
 
Introduction to AWS App Mesh - MAD303 - Atlanta AWS Summit
Introduction to AWS App Mesh - MAD303 - Atlanta AWS SummitIntroduction to AWS App Mesh - MAD303 - Atlanta AWS Summit
Introduction to AWS App Mesh - MAD303 - Atlanta AWS SummitAmazon Web Services
 
Searching for patterns: Log analytics using Amazon ES - ADB205 - New York AWS...
Searching for patterns: Log analytics using Amazon ES - ADB205 - New York AWS...Searching for patterns: Log analytics using Amazon ES - ADB205 - New York AWS...
Searching for patterns: Log analytics using Amazon ES - ADB205 - New York AWS...Amazon Web Services
 
Build intelligent applications quickly with AWS AI services - AIM301 - New Yo...
Build intelligent applications quickly with AWS AI services - AIM301 - New Yo...Build intelligent applications quickly with AWS AI services - AIM301 - New Yo...
Build intelligent applications quickly with AWS AI services - AIM301 - New Yo...Amazon Web Services
 
Building enterprise solutions with blockchain technology - SVC217 - New York ...
Building enterprise solutions with blockchain technology - SVC217 - New York ...Building enterprise solutions with blockchain technology - SVC217 - New York ...
Building enterprise solutions with blockchain technology - SVC217 - New York ...Amazon Web Services
 

What's hot (20)

Data modeling with Amazon DynamoDB - ADB301 - New York AWS Summit
Data modeling with Amazon DynamoDB - ADB301 - New York AWS SummitData modeling with Amazon DynamoDB - ADB301 - New York AWS Summit
Data modeling with Amazon DynamoDB - ADB301 - New York AWS Summit
 
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
Microservices on AWS: Architectural Patterns and Best Practices | AWS Summit ...
 
Increase the value of video using ML and AWS media services - SVC301 - Atlant...
Increase the value of video using ML and AWS media services - SVC301 - Atlant...Increase the value of video using ML and AWS media services - SVC301 - Atlant...
Increase the value of video using ML and AWS media services - SVC301 - Atlant...
 
Building home security solutions at scale, ft. Comcast - SVC206 - New York AW...
Building home security solutions at scale, ft. Comcast - SVC206 - New York AW...Building home security solutions at scale, ft. Comcast - SVC206 - New York AW...
Building home security solutions at scale, ft. Comcast - SVC206 - New York AW...
 
What's new in Amazon EC2 - CMP201 - New York AWS Summit
What's new in Amazon EC2 - CMP201 - New York AWS SummitWhat's new in Amazon EC2 - CMP201 - New York AWS Summit
What's new in Amazon EC2 - CMP201 - New York AWS Summit
 
Accelerating-ML-Adoption-with-Our-New-AI-Services
Accelerating-ML-Adoption-with-Our-New-AI-ServicesAccelerating-ML-Adoption-with-Our-New-AI-Services
Accelerating-ML-Adoption-with-Our-New-AI-Services
 
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
 
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
The Zen of governance - Establish guardrails and empower builders - SVC201 - ...
 
Build a VR experience in 60 minutes - SVC222 - New York AWS Summit
Build a VR experience in 60 minutes - SVC222 - New York AWS SummitBuild a VR experience in 60 minutes - SVC222 - New York AWS Summit
Build a VR experience in 60 minutes - SVC222 - New York AWS Summit
 
Databases on AWS - The right tool for the right job - ADB203 - Santa Clara AW...
Databases on AWS - The right tool for the right job - ADB203 - Santa Clara AW...Databases on AWS - The right tool for the right job - ADB203 - Santa Clara AW...
Databases on AWS - The right tool for the right job - ADB203 - Santa Clara AW...
 
Do you need a ledger database or a blockchain - SVC208 - Atlanta AWS Summit.pdf
Do you need a ledger database or a blockchain - SVC208 - Atlanta AWS Summit.pdfDo you need a ledger database or a blockchain - SVC208 - Atlanta AWS Summit.pdf
Do you need a ledger database or a blockchain - SVC208 - Atlanta AWS Summit.pdf
 
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
 
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
Developing-Effective-Mass-Migration-Strategy-out-of-a-Tool-based-Portfolio-As...
 
Migrating on-premises Apache Spark and Hive to Amazon EMR - ADB304 - New York...
Migrating on-premises Apache Spark and Hive to Amazon EMR - ADB304 - New York...Migrating on-premises Apache Spark and Hive to Amazon EMR - ADB304 - New York...
Migrating on-premises Apache Spark and Hive to Amazon EMR - ADB304 - New York...
 
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
Next generation intelligent data lakes, powered by GraphQL & AWS AppSync - MA...
 
Architetture per l'analisi di flussi di dati in tempo reale
Architetture per l'analisi di flussi di dati in tempo realeArchitetture per l'analisi di flussi di dati in tempo reale
Architetture per l'analisi di flussi di dati in tempo reale
 
Introduction to AWS App Mesh - MAD303 - Atlanta AWS Summit
Introduction to AWS App Mesh - MAD303 - Atlanta AWS SummitIntroduction to AWS App Mesh - MAD303 - Atlanta AWS Summit
Introduction to AWS App Mesh - MAD303 - Atlanta AWS Summit
 
Searching for patterns: Log analytics using Amazon ES - ADB205 - New York AWS...
Searching for patterns: Log analytics using Amazon ES - ADB205 - New York AWS...Searching for patterns: Log analytics using Amazon ES - ADB205 - New York AWS...
Searching for patterns: Log analytics using Amazon ES - ADB205 - New York AWS...
 
Build intelligent applications quickly with AWS AI services - AIM301 - New Yo...
Build intelligent applications quickly with AWS AI services - AIM301 - New Yo...Build intelligent applications quickly with AWS AI services - AIM301 - New Yo...
Build intelligent applications quickly with AWS AI services - AIM301 - New Yo...
 
Building enterprise solutions with blockchain technology - SVC217 - New York ...
Building enterprise solutions with blockchain technology - SVC217 - New York ...Building enterprise solutions with blockchain technology - SVC217 - New York ...
Building enterprise solutions with blockchain technology - SVC217 - New York ...
 

Similar to Train models once with SageMaker Neo, deploy anywhere

Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...Amazon Web Services
 
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)Julien SIMON
 
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...Amazon Web Services
 
Build-Train-Deploy-Machine-Learning-Models-at-Any-Scale
Build-Train-Deploy-Machine-Learning-Models-at-Any-ScaleBuild-Train-Deploy-Machine-Learning-Models-at-Any-Scale
Build-Train-Deploy-Machine-Learning-Models-at-Any-ScaleAmazon Web Services
 
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMakerDeep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMakerAmazon Web Services
 
ML for every developer and data scientist with Amazon SageMaker - AIM201 - At...
ML for every developer and data scientist with Amazon SageMaker - AIM201 - At...ML for every developer and data scientist with Amazon SageMaker - AIM201 - At...
ML for every developer and data scientist with Amazon SageMaker - AIM201 - At...Amazon Web Services
 
AWS Summit Singapore 2019 | Build, Train and Deploy Deep Learning Models on A...
AWS Summit Singapore 2019 | Build, Train and Deploy Deep Learning Models on A...AWS Summit Singapore 2019 | Build, Train and Deploy Deep Learning Models on A...
AWS Summit Singapore 2019 | Build, Train and Deploy Deep Learning Models on A...AWS Summits
 
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon Web Services
 
Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)Julien SIMON
 
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)Julien SIMON
 
Scale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for BuildersScale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for BuildersAmazon Web Services
 
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfSviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfAmazon Web Services
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitAmazon Web Services
 
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...Amazon Web Services
 
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalisere:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon PersonaliseAmazon Web Services
 
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)Julien SIMON
 
Build, train and deploy machine learning models at scale using AWS
Build, train and deploy machine learning models at scale using AWSBuild, train and deploy machine learning models at scale using AWS
Build, train and deploy machine learning models at scale using AWSAmazon Web Services
 
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019AWS Summits
 
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019Amazon Web Services
 

Similar to Train models once with SageMaker Neo, deploy anywhere (20)

Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
 
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
 
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...Machine learning for developers & data scientists with Amazon SageMaker - AIM...
Machine learning for developers & data scientists with Amazon SageMaker - AIM...
 
Build-Train-Deploy-Machine-Learning-Models-at-Any-Scale
Build-Train-Deploy-Machine-Learning-Models-at-Any-ScaleBuild-Train-Deploy-Machine-Learning-Models-at-Any-Scale
Build-Train-Deploy-Machine-Learning-Models-at-Any-Scale
 
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMakerDeep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
 
ML for every developer and data scientist with Amazon SageMaker - AIM201 - At...
ML for every developer and data scientist with Amazon SageMaker - AIM201 - At...ML for every developer and data scientist with Amazon SageMaker - AIM201 - At...
ML for every developer and data scientist with Amazon SageMaker - AIM201 - At...
 
AWS Summit Singapore 2019 | Build, Train and Deploy Deep Learning Models on A...
AWS Summit Singapore 2019 | Build, Train and Deploy Deep Learning Models on A...AWS Summit Singapore 2019 | Build, Train and Deploy Deep Learning Models on A...
AWS Summit Singapore 2019 | Build, Train and Deploy Deep Learning Models on A...
 
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
Amazon SageMaker sviluppa, addestra e distribuisci modelli di Machine Learnin...
 
Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)Build Machine Learning Models with Amazon SageMaker (April 2019)
Build Machine Learning Models with Amazon SageMaker (April 2019)
 
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
 
Scale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for BuildersScale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for Builders
 
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfSviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
 
Amazon SageMaker In Action
Amazon SageMaker In Action Amazon SageMaker In Action
Amazon SageMaker In Action
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
 
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
 
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalisere:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
re:Invent Deep Dive on Amazon SageMaker, Amazon Forecast and Amazon Personalise
 
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
 
Build, train and deploy machine learning models at scale using AWS
Build, train and deploy machine learning models at scale using AWSBuild, train and deploy machine learning models at scale using AWS
Build, train and deploy machine learning models at scale using AWS
 
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
 
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
Unleash the Power of ML with AWS | AWS Summit Tel Aviv 2019
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Train models once with SageMaker Neo, deploy anywhere

  • 1. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Train once, deploy anywhere on the cloud and at the edge with Amazon SageMaker Neo Vebhhav Singh Sr. Solutions Architect AWS A I M 3 0 3
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon AI/ML Stack M L S E R V I C E S M L F R A M E W O R K S A I S E R V I C E S (ML researchers and academics) (ML developers and data scientists) A m a z o n S a g e M a k e r G R O U N D T R U T H A L G O R I T H M S N O T E B O O K S M A R K E T P L A C E U N S U P E R V I S E D L E A R N I N G S U P E R V I S E D L E A R N I N G R E I N F O R C E M E N T L E A R N I N G O P T I M I Z A T I O N ( N E O ) T R A I N I N G H O S T I N G D E P L O Y M E N T Vision Speech Language Chatbots & Contact Centers Verticals A m a z o n R e k o g n i t i o n I m a g e A m a z o n R e k o g n i t i o n V i d e o A m a z o n P o l l y A m a z o n T r a n s c r i b e A m a z o n T r a n s l a t e A m a z o n C o m p r e h e n d A m a z o n L e x A m a z o n F o r e c a s t A m a z o n T e x t r a c t A m a z o n P e r s o n a l i (Applicaion Developers)
  • 3. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Deployment of ML models is complex Repeat for every model and every change in the model BYO AWS Build with your own algorithms Build with built-in algorithms from AWS Train with TensorFlow, MXNet , or PyTorch Optimize your models Deploy to the cloud Deploy to the edge A B A/B test
  • 4. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T The majority of the cost and complexity of ML in production is due to Inference Inference (Prediction) 90% Training 10%
  • 5. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Problem: Not all models are skinny Models that are accurate tend to be big and slow Models are chained to the framework in which they were trainedS L M Accuracy Performance
  • 6. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Problem: Not all targets are easy Need enormous expertise … Application Development: Cloud-Native or Embedded System Machine Learning: Model Training and Parameter Tuning Performance Tuning: Troubleshooting and Optimization Frameworks: TensorFlow or MXNet or PyTorch or Chainer Hardware: Cloud Server or Edge Device Computer Architecture: x86 or RISC or GPU or FPGA or ASIC …and endless time
  • 7. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Problem: Not every path is a catwalk
  • 8. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T What if you could train your model once and run it anywhere, in the cloud or at the edge, with twice the speed and no loss in accuracy?
  • 9. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon SageMaker Neo
  • 10. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Increase inference performance by 2x Reduce runtime footprint by 100x
  • 11. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon SageMaker Neo K e y f e a t u r e s Machine Learning Compiler for CPU or GPU Compact Open Source Runtime https://aws.amazon.com/sagemaker/neo
  • 12. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Neo bridges Amazon SageMaker & AWS IoT Greengrass Data labeling & pre-built notebooks for common problems Model and algorithm marketplace & built-in, high- performance algorithms One-click training on the highest performing infrastructure One-click model optimization One-click deployment Improves performance on selected hardware Extends AWS IoT onto your devices NeoAmazon SageMaker AWS IoT Greengrass ML models train once run anywhere
  • 13. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Neo delivers compilation as a service Parses model Optimizes tensors Generates code Optimizes graph Convert a TensorFlow, MXNet, PyTorch, or XGBoost model into a common format Detect patterns in the ML model structure to reduce the execution time Detect patterns in the shape of input data to allocate memory efficiently Use a low-level compiler to generate machine code for each target No additional cost for Amazon SageMaker users
  • 14. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Neo delivers compilation as a service Parses model Optimizes tensors Generates code Optimizes graph Graph pruning Layer fusion Constant folding Layout transforms Nested parallelism Tiling Tensorization Auto-tuning Uses Treelite and Apache TVM for model optimization
  • 15. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Neo delivers a compact runtime Dispatches model Partitions graph Matches model with execution backend Sends subgraph to suitable accelerator Framework Size MXNet 450 MB TensorFlow 660 MB PyTorch 1000 MB Neo 1 MB Open-source software enables device-specific customization
  • 16. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 17. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Steps for sample deployment - Get an existing model - Compile model for new hardware - Copy the artifacts to new hardware - Deploy the Neo runtime - Run the inference
  • 18. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Neo: Open-source project
  • 19. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Q&A
  • 20. Thank you! S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Vebhhav Singh vebhhavs@amazon.com