SlideShare a Scribd company logo
1 of 52
Download to read offline
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Alex Smola, Amazon Web Services
special thanks to Edo Liberty, Craig Wiley, Tom Faulhaber, Owen
Thomas, Vineet Khare, Bing Xu, Ramesh Nallapati,
BDA301
Machine Learning in Amazon
SageMaker:
Algorithms, Models, and Inference
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon’s fast, scalable algorithms
Distributed Apache MXNet and TensorFlow
Bring your own algorithm
Hyperparameter optimization
Building HostingTraining
Amazon SageMaker Components
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Hyperparameter optimization
Building HostingTraining
Amazon SageMaker Components
Amazon’s fast, scalable algorithms
Distributed Apache MXNet and TensorFlow
Bring your own algorithm
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
... or ...
Apache Spark via
Amazon EMR and
the SageMaker
Spark SDK
Build
Hosted notebook
instances on
SageMaker
... or ...
SageMaker‘s
Console for a point
and click
experience
... or ...
your own device
(EC2, laptop, etc.)
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Hyperparameter optimization
Building HostingTraining
Amazon SageMaker Components
Amazon’s fast, scalable algorithms
Distributed Apache MXNet and TensorFlow
Bring your own algorithm
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Zero setup
Training
Streaming data +
distributed
computing
Docker / Amazon
ECS
Deploy trained models
• Amazon SageMaker
• AWS Greengrass,
• AWS DeepLens
• Locally
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Hyperparameter optimization
Building HostingTraining
Amazon SageMaker Components
Amazon’s fast, scalable algorithms
Distributed Apache MXNet and TensorFlow
Bring your own algorithm
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
One-step
deployment
Low latency
High throughput
High reliability
A/B testing Use your own
model
Hosting
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon’s fast, scalable algorithms
Hyperparameter optimization
Building HostingTraining
Amazon SageMaker Components
Distributed Apache MXNet and TensorFlow
Bring your own algorithm
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
• XGBoost
• Factorization
Machines
• Linear methods
• Autoregressive
models
• K-means
clustering
• PCA
• Word2Vec
embeddings for
clustering and pre-
processing
Image classification
with convolutional
neural networks
(plenty of models in
MXNet ModelZoo)
• Spectral LDA
• Neural Topic Models
• Seq2Seq for
translation and
similar problems
Built-in Algorithms
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Distributed Apache MXNet and TensorFlow
Hyperparameter optimization
Building HostingTraining
Amazon SageMaker Components
Amazon’s fast, scalable algorithms
Bring your own algorithm
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
… explore and
refine models in a
single notebook
instance
TensorFlow & Apache MXNet Containers
… deploy to
production
Sample your
data…
Use the same code
to train on the full
dataset in a cluster
of GPU
instances…
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Bring your own algorithm
Hyperparameter optimization
Building HostingTraining
Amazon SageMaker Components
Amazon’s fast, scalable algorithms
Distributed Apache MXNet and TensorFlow
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Bring Your Own Algorithm
... add algorithm
code to a Docker
container...
Choose your own framework
... publish to
Amazon ECS
Amazon ECS
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Hyperparameter optimization
Building HostingTraining
Amazon SageMaker Components
Amazon’s fast, scalable algorithms
Distributed Apache MXNet and TensorFlow
Bring your own algorithm
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Hyperparameter Optimization
Run a large set of training
jobs with varying
hyperparameters...
... and search the
hyperparameter space for
improved accuracy
Platform Services
Application Services
Frameworks & Interfaces
Caffe2 CNTK
Apache
MXNet
PyTorch TensorFlow Torch Keras Gluon
AWS Deep Learning AMIs
Infrastructure
Amazon Machine Learning Stack
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
EC2
GPUs
EC2
CPUs
IoT Edge
Amazon EC2 Instances for Deep Learning
• Amazon EC2 P3 Instances
The fastest GPU instances in the cloud
• Up to 8 NVIDIA Tesla V100 GPUs
• 1 PetaFLOP of computational performance (14x faster than P2)
• 300 GB/sec GPU-to-GPU communication (NVLink) (9x faster than P2)
• 16-GB GPU memory with 900 GB/sec peak GPU memory bandwidth
• Amazon EC2 C5 Instances
Cost effective CPUs, e.g., for models using INT8
• Powered by 3.0 GHz Intel Xeon (Skylake) platinum processors
• 72 vCPUs and 155-GB RAM (25% price/performance improvement vs. C4)
• Nitro Hypervisor for larger instance sizes
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Architecture and Design Choices
Streaming, GPU/CPU
Distributed Learning with Shared State
Streaming
State
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Streaming
Data size
Memory
Data size
Time/Cost
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Incremental Training
2
3
1
2
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Incremental Training
3
1
2
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Distributed
GPU State
GPU State
GPU State
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
GPU/CPU
GPU State
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Parameter Server – Distributed (k,v) Store
GPU
GPU
GPU Local
state
Shared
state
Local
state
Local
state
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Cost vs. Time vs. Accuracy
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Best alternative
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Cost vs. Time vs. Accuracy
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Best alternative
Amazon SageMaker
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Production Readiness
Data/Model size
Investment Reasonable
investment level
Unusable data
Wasted opportunity
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Production Readiness
Data/Model size
Investment Reasonable
Investment level
No unusable data
No wasted opportunity
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Infinitely Scalable ML Algorithms
Linear Learner
Regression:
Estimate a real valued function
Binary classification:
Predict a 0/1 class
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Linear Learner
Train
Fit thresholds
and select
Select model with best validation performance
More than 8x speed-up over naive parallel training!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Linear Learner
Regression (mean squared error)
SageMaker Other
1.02 1.06
1.09 1.02
0.332 0.183
0.086 0.129
83.3 84.5
Classification (F1 Score)
SageMaker Other
0.980 0.981
0.870 0.930
0.997 0.997
0.978 0.964
0.914 0.859
0.470 0.472
0.903 0.908
0.508 0.508
30-GB datasets for web-spam and web-URL classification
0
0.2
0.4
0.6
0.8
1
1.2
0 5 10 15 20 25 30
CostinDollars
Billable time in Minutes
sagemaker-url sagemaker-spam other-url other-spam
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Factorization Machines
Log_loss F1 Score Seconds
SageMaker 0.494 0.277 820
Other (10 Iter) 0.516 0.190 650
Other (20 Iter) 0.507 0.254 1300
Other (50 Iter) 0.481 0.313 3250
Click Prediction 1-TB advertising dataset,
m4.4xlarge machines, perfect scaling
$-
$20.00
$40.00
$60.00
$80.00
$100.00
$120.00
$140.00
$160.00
$180.00
$200.00
1 2 3 4 5 6 7 8
Costindollars
Billable time in hours
10
machines
20
machines
30
machines
4050
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
K-Means Clustering
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
K-Means Clustering
Method Accurate? Passes Efficient
tuning
Comments
Lloyds [1] Yes* 5–10 No
K-Means ++ [2] Yes k+5 to k+10 No scikit-learn
K-Means|| [3] Yes 7–12 No spark.ml
Online [4] No 1 No
Streaming [5,6] No 1 No Impractical
Webscale [7] No 1 No Spark streaming
Coresets [8] No 1 Yes Impractical
Amazon SageMaker Yes 1 Yes
[1] Lloyd, IEEE TIT, 1982
[2] Arthur et. al. ACM-SIAM, 2007
[3] Bahmani et. al., VLDB, 2012
[4] Liberty et. al., 2015
[5] Shindler et. al, NIPS, 2011
[6] Guha et. al, IEEE Trans. Knowl. Data Eng. 2003
[7] Sculley, WWW, 2010
[8] Feldman et. al.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
0
1
2
3
4
5
6
7
8
10 100 500
BillableTimeinMinutes Number of clusters
sagemaker other
K-Means Clustering
k SageMaker Other
Text
1.2 GB
10 1.18E3 1.18E3
100 1.00E3 9.77E2
500 9.18.E2 9.03E2
Images
9 GB
10 3.29E2 3.28E2
100 2.72E2 2.71E2
500 2.17E2 Failed
Videos
27 GB
10 2.19E2 2.18E2
100 2.03E2 2.02E2
500 1.86E2 1.85E2
Advertising
127 GB
10 1.72E7 Failed
100 1.30E7 Failed
500 1.03E7 Failed
Synthetic
1,100 GB
10 3.81E7 Failed
100 3.51E7 Failed
500 2.81E7 Failed
Running time vs. Number of clusters
~10x faster!
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Principal Component Analysis (PCA)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Principal Component Analysis (PCA)
More than 10x faster
at a fraction the cost!
0.00
20.00
40.00
60.00
80.00
100.00
120.00
8 10 20
Mb/Sec/Machine
Number of machines
other sagemaker-deterministic sagemaker-randomized
Cost vs. time Throughput and scalability
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
0 5 10 15 20 25 30 35 40 45
CostinDollars
Billable time in minutes
other sagemaker-deterministic sagemaker-randomized
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Neural Topic Modeling
• Perplexity vs. number of topics
Encoder: feedforward net
Input term counts vector
Document
posterior
Sampled document
representation
Decoder:
Softmax
Output term counts vector
0
2000
4000
6000
8000
10000
12000
0 50 100 150 200
Perplexity
Number of topics
NTM Other
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Time Series Forecasting
Mean absolute
percentage error
P90 Loss
DeepAR R DeepAR R
traffic
Hourly occupancy rate of 963
bay area freeways
0.14 0.27 0.13 0.24
electricity
Electricity use of 370
homes over time
0.07 0.11 0.08 0.09
pageviews
Page view hits of
websites
10k 0.32 0.32 0.44 0.31
180k 0.32 0.34 0.29 NA
1 hour on p2.xlarge, $1
Input
Network
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Boosted Decision Trees
Throughput vs. number of machinesXGBoost is one of the most
commonly used
implementations of boosted
decision trees in the world.
It is now available in Amazon
SageMaker!
0
200
400
600
800
1000
1200
1400
0 10 20 30 40 50 60 70
ThroughputinMB/Sec
Number of machines (C4.8xLarge)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Using Amazon SageMaker
Input data
From the Command Line
profile=<your_profile>
arn_role=<your_arn_role>
training_image=382416733822.dkr.ecr.us-east-1.amazonaws.com/kmeans:1
training_job_name=clustering_text_documents_`date '+%Y_%m_%d_%H_%M_%S'`
aws --profile $profile 
--region us-east-1 
sagemaker create-training-job 
--training-job-name $training_job_name 
--algorithm-specification TrainingImage=$training_image,TrainingInputMode=File 
--hyper-parameters k=10,feature_dim=1024,mini_batch_size=1000 
--role-arn $arn_role 
--input-data-config '{"ChannelName": "train", "DataSource":
{"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": "s3://kmeans_demo/train",
"S3DataDistributionType": "ShardedByS3Key"}}, "CompressionType": "None",
"RecordWrapperType": "None"}' 
--output-data-config S3OutputPath=s3://training_output/$training_job_name
--resource-config InstanceCount=2,InstanceType=ml.c4.8xlarge,VolumeSizeInGB=50 
--stopping-condition MaxRuntimeInSeconds=3600
Hardware
Algorithm
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
From Amazon EMR
Start training
Parameters
Hardware
Apply model
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
From Amazon SageMaker Notebooks
Parameters
Hardware
Start training
Host model
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
ML compute by the
second starting
at $0.0464/hr
ML storage by the
second
at $0.14
per GB-month
Data processed in
notebooks and hosting
at $0.016 per GB
Free trial to get
started quickly
Pay-as-You-Go (Inexpensive, Too)
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Amazon SageMaker – Try It Free
aws.amazon.com/sagemaker
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Please complete the session
survey in the summit mobile app.
Submit Session Feedback
1. Tap the Schedule icon. 2. Select the session
you attended.
3. Tap Session
Evaluation to submit your
feedback.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!

More Related Content

What's hot

SRV309 AWS Purpose-Built Database Strategy: The Right Tool for the Right Job
 SRV309 AWS Purpose-Built Database Strategy: The Right Tool for the Right Job SRV309 AWS Purpose-Built Database Strategy: The Right Tool for the Right Job
SRV309 AWS Purpose-Built Database Strategy: The Right Tool for the Right JobAmazon Web Services
 
DEM18 How SendBird Built a Serverless Log-Processing Pipeline in a Week
DEM18 How SendBird Built a Serverless Log-Processing Pipeline in a WeekDEM18 How SendBird Built a Serverless Log-Processing Pipeline in a Week
DEM18 How SendBird Built a Serverless Log-Processing Pipeline in a WeekAmazon Web Services
 
A Deep Dive into What's New with Amazon EMR (ANT340-R1) - AWS re:Invent 2018
A Deep Dive into What's New with Amazon EMR (ANT340-R1) - AWS re:Invent 2018A Deep Dive into What's New with Amazon EMR (ANT340-R1) - AWS re:Invent 2018
A Deep Dive into What's New with Amazon EMR (ANT340-R1) - AWS re:Invent 2018Amazon Web Services
 
Well-Architected Workspaces: Enterprise Deployment at Scale (BAP315) - AWS re...
Well-Architected Workspaces: Enterprise Deployment at Scale (BAP315) - AWS re...Well-Architected Workspaces: Enterprise Deployment at Scale (BAP315) - AWS re...
Well-Architected Workspaces: Enterprise Deployment at Scale (BAP315) - AWS re...Amazon Web Services
 
SRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
SRV317 Creating and Publishing AR and VR Apps with Amazon SumerianSRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
SRV317 Creating and Publishing AR and VR Apps with Amazon SumerianAmazon Web Services
 
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)Amazon Web Services
 
SRV205 Architectures and Strategies for Building Modern Applications on AWS
 SRV205 Architectures and Strategies for Building Modern Applications on AWS SRV205 Architectures and Strategies for Building Modern Applications on AWS
SRV205 Architectures and Strategies for Building Modern Applications on AWSAmazon Web Services
 
Work Anywhere with Amazon Workspaces (Level: 200)
Work Anywhere with Amazon Workspaces (Level: 200)Work Anywhere with Amazon Workspaces (Level: 200)
Work Anywhere with Amazon Workspaces (Level: 200)Amazon Web Services
 
Ionic and React Hybrid Web/Native Mobile Applications with Mobile Hub - AWS O...
Ionic and React Hybrid Web/Native Mobile Applications with Mobile Hub - AWS O...Ionic and React Hybrid Web/Native Mobile Applications with Mobile Hub - AWS O...
Ionic and React Hybrid Web/Native Mobile Applications with Mobile Hub - AWS O...Amazon Web Services
 
Workshop: Architecting a Serverless Data Lake
Workshop: Architecting a Serverless Data LakeWorkshop: Architecting a Serverless Data Lake
Workshop: Architecting a Serverless Data LakeAmazon Web Services
 
SRV313 Introduction to Building Web Apps on AWS
 SRV313 Introduction to Building Web Apps on AWS SRV313 Introduction to Building Web Apps on AWS
SRV313 Introduction to Building Web Apps on AWSAmazon Web Services
 
Create and Publish AR and VR Apps with Amazon Sumerian
Create and Publish AR and VR Apps with Amazon SumerianCreate and Publish AR and VR Apps with Amazon Sumerian
Create and Publish AR and VR Apps with Amazon SumerianAmazon Web Services
 
BDA303 Amazon Rekognition: Deep Learning-Based Image and Video Analysis
BDA303 Amazon Rekognition: Deep Learning-Based Image and Video AnalysisBDA303 Amazon Rekognition: Deep Learning-Based Image and Video Analysis
BDA303 Amazon Rekognition: Deep Learning-Based Image and Video AnalysisAmazon Web Services
 
Accelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMakerAccelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMakerAmazon Web Services
 
Hands-On with Amazon ElastiCache for Redis - Workshop (DAT309-R1) - AWS re:In...
Hands-On with Amazon ElastiCache for Redis - Workshop (DAT309-R1) - AWS re:In...Hands-On with Amazon ElastiCache for Redis - Workshop (DAT309-R1) - AWS re:In...
Hands-On with Amazon ElastiCache for Redis - Workshop (DAT309-R1) - AWS re:In...Amazon Web Services
 
How a Major Australian University Brought Backup to the Cloud
 How a Major Australian University Brought Backup to the Cloud How a Major Australian University Brought Backup to the Cloud
How a Major Australian University Brought Backup to the CloudAmazon Web Services
 
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...Amazon Web Services
 

What's hot (20)

SRV309 AWS Purpose-Built Database Strategy: The Right Tool for the Right Job
 SRV309 AWS Purpose-Built Database Strategy: The Right Tool for the Right Job SRV309 AWS Purpose-Built Database Strategy: The Right Tool for the Right Job
SRV309 AWS Purpose-Built Database Strategy: The Right Tool for the Right Job
 
DEM18 How SendBird Built a Serverless Log-Processing Pipeline in a Week
DEM18 How SendBird Built a Serverless Log-Processing Pipeline in a WeekDEM18 How SendBird Built a Serverless Log-Processing Pipeline in a Week
DEM18 How SendBird Built a Serverless Log-Processing Pipeline in a Week
 
A Deep Dive into What's New with Amazon EMR (ANT340-R1) - AWS re:Invent 2018
A Deep Dive into What's New with Amazon EMR (ANT340-R1) - AWS re:Invent 2018A Deep Dive into What's New with Amazon EMR (ANT340-R1) - AWS re:Invent 2018
A Deep Dive into What's New with Amazon EMR (ANT340-R1) - AWS re:Invent 2018
 
Well-Architected Workspaces: Enterprise Deployment at Scale (BAP315) - AWS re...
Well-Architected Workspaces: Enterprise Deployment at Scale (BAP315) - AWS re...Well-Architected Workspaces: Enterprise Deployment at Scale (BAP315) - AWS re...
Well-Architected Workspaces: Enterprise Deployment at Scale (BAP315) - AWS re...
 
SRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
SRV317 Creating and Publishing AR and VR Apps with Amazon SumerianSRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
SRV317 Creating and Publishing AR and VR Apps with Amazon Sumerian
 
SRV319 Amazon EC2 Foundations
SRV319 Amazon EC2 FoundationsSRV319 Amazon EC2 Foundations
SRV319 Amazon EC2 Foundations
 
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
Amazon Redshift 與 Amazon Redshift Spectrum 幫您建立現代化資料倉儲 (Level 300)
 
SRV205 Architectures and Strategies for Building Modern Applications on AWS
 SRV205 Architectures and Strategies for Building Modern Applications on AWS SRV205 Architectures and Strategies for Building Modern Applications on AWS
SRV205 Architectures and Strategies for Building Modern Applications on AWS
 
Work Anywhere with Amazon Workspaces (Level: 200)
Work Anywhere with Amazon Workspaces (Level: 200)Work Anywhere with Amazon Workspaces (Level: 200)
Work Anywhere with Amazon Workspaces (Level: 200)
 
Ionic and React Hybrid Web/Native Mobile Applications with Mobile Hub - AWS O...
Ionic and React Hybrid Web/Native Mobile Applications with Mobile Hub - AWS O...Ionic and React Hybrid Web/Native Mobile Applications with Mobile Hub - AWS O...
Ionic and React Hybrid Web/Native Mobile Applications with Mobile Hub - AWS O...
 
Amazon Aurora_Deep Dive
Amazon Aurora_Deep DiveAmazon Aurora_Deep Dive
Amazon Aurora_Deep Dive
 
Workshop: Architecting a Serverless Data Lake
Workshop: Architecting a Serverless Data LakeWorkshop: Architecting a Serverless Data Lake
Workshop: Architecting a Serverless Data Lake
 
SRV313 Introduction to Building Web Apps on AWS
 SRV313 Introduction to Building Web Apps on AWS SRV313 Introduction to Building Web Apps on AWS
SRV313 Introduction to Building Web Apps on AWS
 
Create and Publish AR and VR Apps with Amazon Sumerian
Create and Publish AR and VR Apps with Amazon SumerianCreate and Publish AR and VR Apps with Amazon Sumerian
Create and Publish AR and VR Apps with Amazon Sumerian
 
BDA310 Transcribe and Translate
BDA310 Transcribe and TranslateBDA310 Transcribe and Translate
BDA310 Transcribe and Translate
 
BDA303 Amazon Rekognition: Deep Learning-Based Image and Video Analysis
BDA303 Amazon Rekognition: Deep Learning-Based Image and Video AnalysisBDA303 Amazon Rekognition: Deep Learning-Based Image and Video Analysis
BDA303 Amazon Rekognition: Deep Learning-Based Image and Video Analysis
 
Accelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMakerAccelerate Machine Learning with Ease using Amazon SageMaker
Accelerate Machine Learning with Ease using Amazon SageMaker
 
Hands-On with Amazon ElastiCache for Redis - Workshop (DAT309-R1) - AWS re:In...
Hands-On with Amazon ElastiCache for Redis - Workshop (DAT309-R1) - AWS re:In...Hands-On with Amazon ElastiCache for Redis - Workshop (DAT309-R1) - AWS re:In...
Hands-On with Amazon ElastiCache for Redis - Workshop (DAT309-R1) - AWS re:In...
 
How a Major Australian University Brought Backup to the Cloud
 How a Major Australian University Brought Backup to the Cloud How a Major Australian University Brought Backup to the Cloud
How a Major Australian University Brought Backup to the Cloud
 
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
Rightsizing Your Silicon Design Environment: Elastic Clusters for EDA Workloa...
 

Similar to BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models, and Inferences

Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências - MCL...
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências -  MCL...Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências -  MCL...
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências - MCL...Amazon Web Services
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitAmazon Web Services
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksAmazon Web Services
 
Amazon AI/ML Overview
Amazon AI/ML OverviewAmazon AI/ML Overview
Amazon AI/ML OverviewBESPIN GLOBAL
 
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...Amazon Web Services
 
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...Amazon Web Services
 
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...Amazon Web Services Korea
 
Save up to 90% on Big Data and Machine Learning Workloads with Spot Instances...
Save up to 90% on Big Data and Machine Learning Workloads with Spot Instances...Save up to 90% on Big Data and Machine Learning Workloads with Spot Instances...
Save up to 90% on Big Data and Machine Learning Workloads with Spot Instances...Amazon Web Services
 
Serverless AI with Scikit-Learn (GPSWS405) - AWS re:Invent 2018
Serverless AI with Scikit-Learn (GPSWS405) - AWS re:Invent 2018Serverless AI with Scikit-Learn (GPSWS405) - AWS re:Invent 2018
Serverless AI with Scikit-Learn (GPSWS405) - AWS re:Invent 2018Amazon Web Services
 
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...Amazon Web Services
 
Demystifying Machine Learning On AWS - AWS Summit Sydney 2018
Demystifying Machine Learning On AWS - AWS Summit Sydney 2018Demystifying Machine Learning On AWS - AWS Summit Sydney 2018
Demystifying Machine Learning On AWS - AWS Summit Sydney 2018Amazon Web Services
 
Quickly and easily build, train, and deploy machine learning models at any scale
Quickly and easily build, train, and deploy machine learning models at any scaleQuickly and easily build, train, and deploy machine learning models at any scale
Quickly and easily build, train, and deploy machine learning models at any scaleAWS Germany
 
Perform Machine Learning at the IoT Edge using AWS Greengrass and Amazon Sage...
Perform Machine Learning at the IoT Edge using AWS Greengrass and Amazon Sage...Perform Machine Learning at the IoT Edge using AWS Greengrass and Amazon Sage...
Perform Machine Learning at the IoT Edge using AWS Greengrass and Amazon Sage...Amazon Web Services
 
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...Amazon Web Services
 
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...Amazon Web Services
 
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...Amazon Web Services
 
Apache MXNet and Gluon
Apache MXNet and GluonApache MXNet and Gluon
Apache MXNet and GluonSoji Adeshina
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS SummitAmazon Web Services
 
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018Amazon Web Services
 

Similar to BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models, and Inferences (20)

Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências - MCL...
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências -  MCL...Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências -  MCL...
Machine Learning e Amazon SageMaker: Algoritmos, Modelos e Inferências - MCL...
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech Talks
 
Amazon AI/ML Overview
Amazon AI/ML OverviewAmazon AI/ML Overview
Amazon AI/ML Overview
 
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
Machine Learning with Amazon SageMaker - Algorithms and Frameworks - BDA304 -...
 
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
Building Deep Learning Applications with TensorFlow and SageMaker on AWS - Te...
 
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
AWS의 새로운 언어, 음성, 텍스트 처리 인공 지능 서비스, Amazon SageMaker::Sunil Mallya::AWS Summit...
 
Save up to 90% on Big Data and Machine Learning Workloads with Spot Instances...
Save up to 90% on Big Data and Machine Learning Workloads with Spot Instances...Save up to 90% on Big Data and Machine Learning Workloads with Spot Instances...
Save up to 90% on Big Data and Machine Learning Workloads with Spot Instances...
 
Serverless AI with Scikit-Learn (GPSWS405) - AWS re:Invent 2018
Serverless AI with Scikit-Learn (GPSWS405) - AWS re:Invent 2018Serverless AI with Scikit-Learn (GPSWS405) - AWS re:Invent 2018
Serverless AI with Scikit-Learn (GPSWS405) - AWS re:Invent 2018
 
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
 
Demystifying Machine Learning On AWS - AWS Summit Sydney 2018
Demystifying Machine Learning On AWS - AWS Summit Sydney 2018Demystifying Machine Learning On AWS - AWS Summit Sydney 2018
Demystifying Machine Learning On AWS - AWS Summit Sydney 2018
 
Quickly and easily build, train, and deploy machine learning models at any scale
Quickly and easily build, train, and deploy machine learning models at any scaleQuickly and easily build, train, and deploy machine learning models at any scale
Quickly and easily build, train, and deploy machine learning models at any scale
 
Perform Machine Learning at the IoT Edge using AWS Greengrass and Amazon Sage...
Perform Machine Learning at the IoT Edge using AWS Greengrass and Amazon Sage...Perform Machine Learning at the IoT Edge using AWS Greengrass and Amazon Sage...
Perform Machine Learning at the IoT Edge using AWS Greengrass and Amazon Sage...
 
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
Build Deep Learning Applications Using Apache MXNet, Featuring Workday (AIM40...
 
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
Build Deep Learning Applications Using Apache MXNet - Featuring Chick-fil-A (...
 
Amazon SageMaker
Amazon SageMakerAmazon SageMaker
Amazon SageMaker
 
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
Accelerate Machine Learning with Ease Using Amazon SageMaker - BDA301 - Chica...
 
Apache MXNet and Gluon
Apache MXNet and GluonApache MXNet and Gluon
Apache MXNet and Gluon
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
 
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
Supercharge Your ML Model with SageMaker - AWS Summit Sydney 2018
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

BDA301 Working with Machine Learning in Amazon SageMaker: Algorithms, Models, and Inferences

  • 1. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Alex Smola, Amazon Web Services special thanks to Edo Liberty, Craig Wiley, Tom Faulhaber, Owen Thomas, Vineet Khare, Bing Xu, Ramesh Nallapati, BDA301 Machine Learning in Amazon SageMaker: Algorithms, Models, and Inference
  • 2. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon’s fast, scalable algorithms Distributed Apache MXNet and TensorFlow Bring your own algorithm Hyperparameter optimization Building HostingTraining Amazon SageMaker Components
  • 3. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Hyperparameter optimization Building HostingTraining Amazon SageMaker Components Amazon’s fast, scalable algorithms Distributed Apache MXNet and TensorFlow Bring your own algorithm
  • 4. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ... or ... Apache Spark via Amazon EMR and the SageMaker Spark SDK Build Hosted notebook instances on SageMaker ... or ... SageMaker‘s Console for a point and click experience ... or ... your own device (EC2, laptop, etc.)
  • 5. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Hyperparameter optimization Building HostingTraining Amazon SageMaker Components Amazon’s fast, scalable algorithms Distributed Apache MXNet and TensorFlow Bring your own algorithm
  • 6. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Zero setup Training Streaming data + distributed computing Docker / Amazon ECS Deploy trained models • Amazon SageMaker • AWS Greengrass, • AWS DeepLens • Locally
  • 7. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Hyperparameter optimization Building HostingTraining Amazon SageMaker Components Amazon’s fast, scalable algorithms Distributed Apache MXNet and TensorFlow Bring your own algorithm
  • 8. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. One-step deployment Low latency High throughput High reliability A/B testing Use your own model Hosting
  • 9. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon’s fast, scalable algorithms Hyperparameter optimization Building HostingTraining Amazon SageMaker Components Distributed Apache MXNet and TensorFlow Bring your own algorithm
  • 10. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. • XGBoost • Factorization Machines • Linear methods • Autoregressive models • K-means clustering • PCA • Word2Vec embeddings for clustering and pre- processing Image classification with convolutional neural networks (plenty of models in MXNet ModelZoo) • Spectral LDA • Neural Topic Models • Seq2Seq for translation and similar problems Built-in Algorithms
  • 11. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Distributed Apache MXNet and TensorFlow Hyperparameter optimization Building HostingTraining Amazon SageMaker Components Amazon’s fast, scalable algorithms Bring your own algorithm
  • 12. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. … explore and refine models in a single notebook instance TensorFlow & Apache MXNet Containers … deploy to production Sample your data… Use the same code to train on the full dataset in a cluster of GPU instances…
  • 13. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Bring your own algorithm Hyperparameter optimization Building HostingTraining Amazon SageMaker Components Amazon’s fast, scalable algorithms Distributed Apache MXNet and TensorFlow
  • 14. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Bring Your Own Algorithm ... add algorithm code to a Docker container... Choose your own framework ... publish to Amazon ECS Amazon ECS
  • 15. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Hyperparameter optimization Building HostingTraining Amazon SageMaker Components Amazon’s fast, scalable algorithms Distributed Apache MXNet and TensorFlow Bring your own algorithm
  • 16. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Hyperparameter Optimization Run a large set of training jobs with varying hyperparameters... ... and search the hyperparameter space for improved accuracy
  • 17. Platform Services Application Services Frameworks & Interfaces Caffe2 CNTK Apache MXNet PyTorch TensorFlow Torch Keras Gluon AWS Deep Learning AMIs Infrastructure Amazon Machine Learning Stack © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. EC2 GPUs EC2 CPUs IoT Edge
  • 18. Amazon EC2 Instances for Deep Learning • Amazon EC2 P3 Instances The fastest GPU instances in the cloud • Up to 8 NVIDIA Tesla V100 GPUs • 1 PetaFLOP of computational performance (14x faster than P2) • 300 GB/sec GPU-to-GPU communication (NVLink) (9x faster than P2) • 16-GB GPU memory with 900 GB/sec peak GPU memory bandwidth • Amazon EC2 C5 Instances Cost effective CPUs, e.g., for models using INT8 • Powered by 3.0 GHz Intel Xeon (Skylake) platinum processors • 72 vCPUs and 155-GB RAM (25% price/performance improvement vs. C4) • Nitro Hypervisor for larger instance sizes © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 19. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Architecture and Design Choices Streaming, GPU/CPU Distributed Learning with Shared State
  • 20. Streaming State © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 21. Streaming Data size Memory Data size Time/Cost © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 22. Incremental Training 2 3 1 2 © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 23. Incremental Training 3 1 2 © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 24. Distributed GPU State GPU State GPU State © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 25. GPU/CPU GPU State © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 26. Parameter Server – Distributed (k,v) Store GPU GPU GPU Local state Shared state Local state Local state © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 27. Cost vs. Time vs. Accuracy $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Best alternative © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 28. Cost vs. Time vs. Accuracy $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Best alternative Amazon SageMaker © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 29. Production Readiness Data/Model size Investment Reasonable investment level Unusable data Wasted opportunity © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 30. Production Readiness Data/Model size Investment Reasonable Investment level No unusable data No wasted opportunity © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 31. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Infinitely Scalable ML Algorithms
  • 32. Linear Learner Regression: Estimate a real valued function Binary classification: Predict a 0/1 class © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 33. Linear Learner Train Fit thresholds and select Select model with best validation performance More than 8x speed-up over naive parallel training! © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 34. Linear Learner Regression (mean squared error) SageMaker Other 1.02 1.06 1.09 1.02 0.332 0.183 0.086 0.129 83.3 84.5 Classification (F1 Score) SageMaker Other 0.980 0.981 0.870 0.930 0.997 0.997 0.978 0.964 0.914 0.859 0.470 0.472 0.903 0.908 0.508 0.508 30-GB datasets for web-spam and web-URL classification 0 0.2 0.4 0.6 0.8 1 1.2 0 5 10 15 20 25 30 CostinDollars Billable time in Minutes sagemaker-url sagemaker-spam other-url other-spam © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 35. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Factorization Machines Log_loss F1 Score Seconds SageMaker 0.494 0.277 820 Other (10 Iter) 0.516 0.190 650 Other (20 Iter) 0.507 0.254 1300 Other (50 Iter) 0.481 0.313 3250 Click Prediction 1-TB advertising dataset, m4.4xlarge machines, perfect scaling $- $20.00 $40.00 $60.00 $80.00 $100.00 $120.00 $140.00 $160.00 $180.00 $200.00 1 2 3 4 5 6 7 8 Costindollars Billable time in hours 10 machines 20 machines 30 machines 4050
  • 36. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. K-Means Clustering
  • 37. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. K-Means Clustering Method Accurate? Passes Efficient tuning Comments Lloyds [1] Yes* 5–10 No K-Means ++ [2] Yes k+5 to k+10 No scikit-learn K-Means|| [3] Yes 7–12 No spark.ml Online [4] No 1 No Streaming [5,6] No 1 No Impractical Webscale [7] No 1 No Spark streaming Coresets [8] No 1 Yes Impractical Amazon SageMaker Yes 1 Yes [1] Lloyd, IEEE TIT, 1982 [2] Arthur et. al. ACM-SIAM, 2007 [3] Bahmani et. al., VLDB, 2012 [4] Liberty et. al., 2015 [5] Shindler et. al, NIPS, 2011 [6] Guha et. al, IEEE Trans. Knowl. Data Eng. 2003 [7] Sculley, WWW, 2010 [8] Feldman et. al.
  • 38. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. 0 1 2 3 4 5 6 7 8 10 100 500 BillableTimeinMinutes Number of clusters sagemaker other K-Means Clustering k SageMaker Other Text 1.2 GB 10 1.18E3 1.18E3 100 1.00E3 9.77E2 500 9.18.E2 9.03E2 Images 9 GB 10 3.29E2 3.28E2 100 2.72E2 2.71E2 500 2.17E2 Failed Videos 27 GB 10 2.19E2 2.18E2 100 2.03E2 2.02E2 500 1.86E2 1.85E2 Advertising 127 GB 10 1.72E7 Failed 100 1.30E7 Failed 500 1.03E7 Failed Synthetic 1,100 GB 10 3.81E7 Failed 100 3.51E7 Failed 500 2.81E7 Failed Running time vs. Number of clusters ~10x faster!
  • 39. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Principal Component Analysis (PCA)
  • 40. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Principal Component Analysis (PCA) More than 10x faster at a fraction the cost! 0.00 20.00 40.00 60.00 80.00 100.00 120.00 8 10 20 Mb/Sec/Machine Number of machines other sagemaker-deterministic sagemaker-randomized Cost vs. time Throughput and scalability 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0 5 10 15 20 25 30 35 40 45 CostinDollars Billable time in minutes other sagemaker-deterministic sagemaker-randomized
  • 41. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Neural Topic Modeling • Perplexity vs. number of topics Encoder: feedforward net Input term counts vector Document posterior Sampled document representation Decoder: Softmax Output term counts vector 0 2000 4000 6000 8000 10000 12000 0 50 100 150 200 Perplexity Number of topics NTM Other
  • 42. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Time Series Forecasting Mean absolute percentage error P90 Loss DeepAR R DeepAR R traffic Hourly occupancy rate of 963 bay area freeways 0.14 0.27 0.13 0.24 electricity Electricity use of 370 homes over time 0.07 0.11 0.08 0.09 pageviews Page view hits of websites 10k 0.32 0.32 0.44 0.31 180k 0.32 0.34 0.29 NA 1 hour on p2.xlarge, $1 Input Network
  • 43. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Boosted Decision Trees Throughput vs. number of machinesXGBoost is one of the most commonly used implementations of boosted decision trees in the world. It is now available in Amazon SageMaker! 0 200 400 600 800 1000 1200 1400 0 10 20 30 40 50 60 70 ThroughputinMB/Sec Number of machines (C4.8xLarge)
  • 44. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Using Amazon SageMaker
  • 45. Input data From the Command Line profile=<your_profile> arn_role=<your_arn_role> training_image=382416733822.dkr.ecr.us-east-1.amazonaws.com/kmeans:1 training_job_name=clustering_text_documents_`date '+%Y_%m_%d_%H_%M_%S'` aws --profile $profile --region us-east-1 sagemaker create-training-job --training-job-name $training_job_name --algorithm-specification TrainingImage=$training_image,TrainingInputMode=File --hyper-parameters k=10,feature_dim=1024,mini_batch_size=1000 --role-arn $arn_role --input-data-config '{"ChannelName": "train", "DataSource": {"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": "s3://kmeans_demo/train", "S3DataDistributionType": "ShardedByS3Key"}}, "CompressionType": "None", "RecordWrapperType": "None"}' --output-data-config S3OutputPath=s3://training_output/$training_job_name --resource-config InstanceCount=2,InstanceType=ml.c4.8xlarge,VolumeSizeInGB=50 --stopping-condition MaxRuntimeInSeconds=3600 Hardware Algorithm © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 46. From Amazon EMR Start training Parameters Hardware Apply model © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 47. From Amazon SageMaker Notebooks Parameters Hardware Start training Host model © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 48. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ML compute by the second starting at $0.0464/hr ML storage by the second at $0.14 per GB-month Data processed in notebooks and hosting at $0.016 per GB Free trial to get started quickly Pay-as-You-Go (Inexpensive, Too)
  • 49. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Amazon SageMaker – Try It Free aws.amazon.com/sagemaker
  • 50. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Please complete the session survey in the summit mobile app.
  • 51. Submit Session Feedback 1. Tap the Schedule icon. 2. Select the session you attended. 3. Tap Session Evaluation to submit your feedback. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 52. © 2018, Amazon Web Services, Inc. or its affiliates. All rights reserved. Thank you!