SlideShare a Scribd company logo
1 of 49
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Pop-up Loft
Amazon SageMaker Algorithms:
Infinitely Scalable Machine Learning
Nick Brandalone
Solutions Architect , AWS Solutions Architecture
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
What is Amazon SageMaker?
Exploration Training
Hosting
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
The Amazon Machine Learning Stack
FRAMEWORKS & INTERFACES
Caffe2 CNTK
Apache
MXNet
PyTorch TensorFlow Chainer Keras Gluon
AWS Deep Learning AMIs
Amazon SageMaker
Rekognition Transcribe Translate Polly Comprehend Lex
AWS
DeepLens
EDUCATION
PLATFORM SERVICES
APPLICATION SERVICES
Amazon Mechanical Turk
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Challenges in Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Large Scale Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Large Scale Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Our Customers use ML at a massive scale
“We collect 160M events
daily in the ML pipeline and
run training over the last
15 days and need it to
complete in one hour.
Effectively there's 100M
features in the model”
Valentino Volonghi, CTO
“We process 3 million ad
requests a second, 100,000
features per request. That’s
250 trillion per day. Not
your run of the mill Data
science problem!”
Bill Simmons, CTO
“Our data warehouse is
100TB and we are
processing 2TB daily. We're
running mostly gradient
boosting (trees), LDA and
K-Means clustering and
collaborative filtering.“
Shahar Cizer Kobrinsky, VP
Architecture
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
Distributed, with
Strong Machines
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
Distributed, with
Strong Machines
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
2
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
Unusable Data /
Wasted opportunity
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Architecture and Design
Streaming, GPU/CPU, Distributed with a Shared State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Streaming
State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Streaming
Data Size
Memory
Data Size
Time/Cost
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
2
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
GPU/CPU
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Distributed
GPU State
GPU State
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Shared State
GPU
GPU
GPU Local
State
Shared
State
Local
State
Local
State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Cost vs. Time vs. Accuracy
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
State Model
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Abstraction and Containerization
def initialize(...)
def update(...)
def finalize(...)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
No unusable Data /
No wasted opportunity
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Amazon SageMaker Algorithms
• DeepAR
• Factorization Machines
• Gradient Boosted Trees (XGBoost)
• Image Classification (ResNet)
• K-Means Clustering
• Latent Dirichlet Allocation (LDA)
• Linear Learner Classification and
Regression
• Neural Topic Modeling (NTM)
• Principal Components Analysis
(PCA)
• Random Cut Forest
• Seq2Seq
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Linear Learner
Regression:
Estimate a real valued function
Binary Classification:
Predict a 0/1 class
˜y = hw, xi + t ˜y =
(
1 if hw, xi > t
0 else
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
w1 = minw
X
i
L1(wT
xi, yi) + ↵1|w|1 + 1|w|2
wk = minw
X
i
Lk(wT
xi, yi) + ↵k|w|1 + k|w|2
Linear Learner
Train
Fit thresholds
and select
Select model with best validation performance
>8x speedup over naïve parallel training!
...
...
...
...
t = min
t0
L(˜y, y) ˜y =
(
1 if wT
i x > t0
0 else
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Linear Learner
Regression (mean squared error)
SageMaker Other
1.02 1.06
1.09 1.02
0.332 0.183
0.086 0.129
83.3 84.5
Classification (F1 Score)
SageMaker Other
0.980 0.981
0.870 0.930
0.997 0.997
0.978 0.964
0.914 0.859
0.470 0.472
0.903 0.908
0.508 0.508
30GB datasets for web-spam and web-url classification
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
K-Means Clustering
xi
1
n
X
i
min
j
kxi µjk2
µj
kxi µjk
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
K-Means Clustering
Method Accurate? Passes Efficient
Tuning
Comments
Lloyds [1] Yes* 5-10 No
K-Means ++ [2] Yes k+5 to k+10 No scikit-learn
K-Means|| [3] Yes 7-12 No spark.ml
Online [4] No 1 No
Streaming [5,6] No 1 No Impractical
Webscale [7] No 1 No spark streaming
Coresets [8] No 1 Yes Impractical
SageMaker Yes 1 Yes
[1] Lloyd, IEEE TIT, 1982
[2] Arthur et. al. ACM-SIAM, 2007
[3] Bahmani et. al., VLDB, 2012
[4] Liberty et. al., 2015
[5] Shindler et. al, NIPS, 2011
[6] Guha et. al, IEEE Trans. Knowl. Data Eng. 2003
[7] Sculley, WWW, 2010
[8] Feldman et. al.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Principal Component Analysis (PCA)
kxi P(xi)k
P(xi)
xi
X
i
kxi P(xi)k2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Principal Component Analysis (PCA)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Neural Topic Modeling
Encoder: feedforward net
Input term counts vector
µ
z
Document
Posterior
Sampled Document
Representation
Decoder:
Softmax
Output term counts vector
Perplexity vs. Number of Topic
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
DeepAR –time series forecasting
Mean absolute
percentage error
P90 Loss
DeepAR R DeepAR R
traffic
Hourly occupancy rate of 963
bay area freeways
0.14 0.27 0.13 0.24
electricity
Electricity use of 370
homes over time
0.07 0.11 0.08 0.09
pageviews
Page view hits
of websites
10k 0.32 0.32 0.44 0.31
180k 0.32 0.34 0.29 NA
One hour on p2.xlarge, $1
zi,t 2, xi,t 1 zi,t 1, xi,t zi,t, xi,t+1
hi,t 1 hi,t hi,t+1
`(zi,t 1|✓i,t 1) `(zi,t|✓i,t) `(zi,t+1|✓i,t+1)
zi,t 1 zi,t zi,t+1
Input
Network
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Using AmazonSageMaker Algorithms
Command Line
SageMaker Notebooks
Amazon EMR
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Input Data
>> aws --profile <profile> 
--region us-west-2 
sm create-training-job 
--training-job-name kmeans-demo 
--algorithm-specification TrainingImage=0123456789.dkr.ecr.us-east-
1.amazonaws.com/kmeanswebscale:latest,TrainingInputMode=File 
--role-arn "arn:aws:iam::0123456789:role/demo" 
--input-data-config '{"ChannelName": "train", "DataSource": 
{"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": 
"s3://kmeans_demo/train", "S3DataDistributionType": 
"FullyReplicated"}}, "CompressionType": "None", "RecordWrapperType": "None"}' 
--output-data-config S3OutputPath=s3://kmeans_demo/output 
--resource-config InstanceCount=2,InstanceType=c4.8xlarge,VolumeSizeInGB=50 
--stopping-condition MaxRuntimeInHours=1
From Command Line
Hardware
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
From Amazon SageMaker Notebooks
Parameters
Hardware
Start Training
Host model
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
From Amazon EMR
Start Training
Parameters
Hardware
Apply Model
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Questions?
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Pop-up Loft
aws.amazon.com/activate
Everything and Anything Startups
Need to Get Started on AWS

More Related Content

What's hot

Enabling Deep Learning in IoT Applications with Apache MXNet
Enabling Deep Learning in IoT Applications with Apache MXNetEnabling Deep Learning in IoT Applications with Apache MXNet
Enabling Deep Learning in IoT Applications with Apache MXNetAmazon Web Services
 
Using Amazon SageMaker to build, train, & deploy your ML Models
Using Amazon SageMaker to build, train, & deploy your ML ModelsUsing Amazon SageMaker to build, train, & deploy your ML Models
Using Amazon SageMaker to build, train, & deploy your ML ModelsAmazon Web Services
 
DataPalooza - A Music Festival themed ML + IoT Workshop
DataPalooza - A Music Festival themed ML + IoT WorkshopDataPalooza - A Music Festival themed ML + IoT Workshop
DataPalooza - A Music Festival themed ML + IoT WorkshopAmazon Web Services
 
Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)
Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)
Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)Julien SIMON
 
Speed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithmsSpeed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithmsJulien SIMON
 
Supercharge your Machine Learning Solutions with Amazon SageMaker
Supercharge your Machine Learning Solutions with Amazon SageMakerSupercharge your Machine Learning Solutions with Amazon SageMaker
Supercharge your Machine Learning Solutions with Amazon SageMakerAmazon Web Services
 
Build Deep Learning Applications with TensorFlow and Amazon SageMaker
Build Deep Learning Applications with TensorFlow and Amazon SageMakerBuild Deep Learning Applications with TensorFlow and Amazon SageMaker
Build Deep Learning Applications with TensorFlow and Amazon SageMakerAmazon Web Services
 
AWS DeepLens - A New Way to Learn Machine Learning
AWS DeepLens - A New Way to Learn Machine LearningAWS DeepLens - A New Way to Learn Machine Learning
AWS DeepLens - A New Way to Learn Machine LearningAmazon Web Services
 
Broadcasting the World's Largest Sporting Events: AWS Media Services When It ...
Broadcasting the World's Largest Sporting Events: AWS Media Services When It ...Broadcasting the World's Largest Sporting Events: AWS Media Services When It ...
Broadcasting the World's Largest Sporting Events: AWS Media Services When It ...Amazon Web Services
 
Machine Learning Models with Apache MXNet and AWS Fargate
Machine Learning Models with Apache MXNet and AWS FargateMachine Learning Models with Apache MXNet and AWS Fargate
Machine Learning Models with Apache MXNet and AWS FargateAmazon Web Services
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingAmazon Web Services
 
Machine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerMachine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerAmazon Web Services
 
Using Amazon SageMaker to Build, Train, and Deploy Your ML Models
Using Amazon SageMaker to Build, Train, and Deploy Your ML ModelsUsing Amazon SageMaker to Build, Train, and Deploy Your ML Models
Using Amazon SageMaker to Build, Train, and Deploy Your ML ModelsAmazon Web Services
 
[NEW LAUNCH!] [REPEAT 1] Amazon FSx for Lustre: How to build and deploy file ...
[NEW LAUNCH!] [REPEAT 1] Amazon FSx for Lustre: How to build and deploy file ...[NEW LAUNCH!] [REPEAT 1] Amazon FSx for Lustre: How to build and deploy file ...
[NEW LAUNCH!] [REPEAT 1] Amazon FSx for Lustre: How to build and deploy file ...Amazon Web Services
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS SummitAmazon Web Services
 
Deep Dive Amazon SageMaker
Deep Dive Amazon SageMakerDeep Dive Amazon SageMaker
Deep Dive Amazon SageMakerCobus Bernard
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksAmazon Web Services
 
AWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAmazon Web Services
 
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...Amazon Web Services
 

What's hot (20)

Enabling Deep Learning in IoT Applications with Apache MXNet
Enabling Deep Learning in IoT Applications with Apache MXNetEnabling Deep Learning in IoT Applications with Apache MXNet
Enabling Deep Learning in IoT Applications with Apache MXNet
 
Using Amazon SageMaker to build, train, & deploy your ML Models
Using Amazon SageMaker to build, train, & deploy your ML ModelsUsing Amazon SageMaker to build, train, & deploy your ML Models
Using Amazon SageMaker to build, train, & deploy your ML Models
 
DataPalooza - A Music Festival themed ML + IoT Workshop
DataPalooza - A Music Festival themed ML + IoT WorkshopDataPalooza - A Music Festival themed ML + IoT Workshop
DataPalooza - A Music Festival themed ML + IoT Workshop
 
Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)
Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)
Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)
 
Speed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithmsSpeed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithms
 
Supercharge your Machine Learning Solutions with Amazon SageMaker
Supercharge your Machine Learning Solutions with Amazon SageMakerSupercharge your Machine Learning Solutions with Amazon SageMaker
Supercharge your Machine Learning Solutions with Amazon SageMaker
 
Build Deep Learning Applications with TensorFlow and Amazon SageMaker
Build Deep Learning Applications with TensorFlow and Amazon SageMakerBuild Deep Learning Applications with TensorFlow and Amazon SageMaker
Build Deep Learning Applications with TensorFlow and Amazon SageMaker
 
AWS DeepLens - A New Way to Learn Machine Learning
AWS DeepLens - A New Way to Learn Machine LearningAWS DeepLens - A New Way to Learn Machine Learning
AWS DeepLens - A New Way to Learn Machine Learning
 
Broadcasting the World's Largest Sporting Events: AWS Media Services When It ...
Broadcasting the World's Largest Sporting Events: AWS Media Services When It ...Broadcasting the World's Largest Sporting Events: AWS Media Services When It ...
Broadcasting the World's Largest Sporting Events: AWS Media Services When It ...
 
Machine Learning Models with Apache MXNet and AWS Fargate
Machine Learning Models with Apache MXNet and AWS FargateMachine Learning Models with Apache MXNet and AWS Fargate
Machine Learning Models with Apache MXNet and AWS Fargate
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
Machine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerMachine Learning & Amazon SageMaker
Machine Learning & Amazon SageMaker
 
Amazon SageMaker In Action
Amazon SageMaker In Action Amazon SageMaker In Action
Amazon SageMaker In Action
 
Using Amazon SageMaker to Build, Train, and Deploy Your ML Models
Using Amazon SageMaker to Build, Train, and Deploy Your ML ModelsUsing Amazon SageMaker to Build, Train, and Deploy Your ML Models
Using Amazon SageMaker to Build, Train, and Deploy Your ML Models
 
[NEW LAUNCH!] [REPEAT 1] Amazon FSx for Lustre: How to build and deploy file ...
[NEW LAUNCH!] [REPEAT 1] Amazon FSx for Lustre: How to build and deploy file ...[NEW LAUNCH!] [REPEAT 1] Amazon FSx for Lustre: How to build and deploy file ...
[NEW LAUNCH!] [REPEAT 1] Amazon FSx for Lustre: How to build and deploy file ...
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Toronto AWS Summit
 
Deep Dive Amazon SageMaker
Deep Dive Amazon SageMakerDeep Dive Amazon SageMaker
Deep Dive Amazon SageMaker
 
Introducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech TalksIntroducing Amazon SageMaker - AWS Online Tech Talks
Introducing Amazon SageMaker - AWS Online Tech Talks
 
AWS Database and Analytics State of the Union
AWS Database and Analytics State of the UnionAWS Database and Analytics State of the Union
AWS Database and Analytics State of the Union
 
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
Alexa for Device Makers: Create Products with Alexa Built-In Using AVS (ALX30...
 

Similar to SageMaker Algorithms Infinitely Scalable Machine Learning

Amazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San FranciscoAmazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San FranciscoAmazon Web Services
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingAmazon Web Services
 
SageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningSageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningAmazon Web Services
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingAmazon Web Services
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingAmazon Web Services
 
Amazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsAmazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsMLconf
 
Debugging and Performance tricks for MXNet Gluon
Debugging and Performance tricks for MXNet GluonDebugging and Performance tricks for MXNet Gluon
Debugging and Performance tricks for MXNet GluonApache MXNet
 
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...Amazon Web Services
 
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...Amazon Web Services
 
MCL310_Building Deep Learning Applications with Apache MXNet and Gluon
MCL310_Building Deep Learning Applications with Apache MXNet and GluonMCL310_Building Deep Learning Applications with Apache MXNet and Gluon
MCL310_Building Deep Learning Applications with Apache MXNet and GluonAmazon Web Services
 
Containers on AWS - re:Invent Comes to London 2.0
Containers on AWS - re:Invent Comes to London 2.0Containers on AWS - re:Invent Comes to London 2.0
Containers on AWS - re:Invent Comes to London 2.0Amazon Web Services
 
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...Amazon Web Services
 
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech TalksAWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech TalksAmazon Web Services
 
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...Amazon Web Services
 
AI & Machine Learning Web Day | Einführung in Amazon SageMaker, eine Werkbank...
AI & Machine Learning Web Day | Einführung in Amazon SageMaker, eine Werkbank...AI & Machine Learning Web Day | Einführung in Amazon SageMaker, eine Werkbank...
AI & Machine Learning Web Day | Einführung in Amazon SageMaker, eine Werkbank...AWS Germany
 
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2Amazon Web Services
 
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Amazon Web Services
 
Building Global Serverless Backends
Building Global Serverless BackendsBuilding Global Serverless Backends
Building Global Serverless BackendsAmazon Web Services
 
re:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Servicesre:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized ServicesCalvin French-Owen
 
CON320_Monitoring, Logging and Debugging Containerized Services
CON320_Monitoring, Logging and Debugging Containerized ServicesCON320_Monitoring, Logging and Debugging Containerized Services
CON320_Monitoring, Logging and Debugging Containerized ServicesAmazon Web Services
 

Similar to SageMaker Algorithms Infinitely Scalable Machine Learning (20)

Amazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San FranciscoAmazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San Francisco
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
SageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningSageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine Learning
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
Amazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsAmazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithms
 
Debugging and Performance tricks for MXNet Gluon
Debugging and Performance tricks for MXNet GluonDebugging and Performance tricks for MXNet Gluon
Debugging and Performance tricks for MXNet Gluon
 
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
 
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
 
MCL310_Building Deep Learning Applications with Apache MXNet and Gluon
MCL310_Building Deep Learning Applications with Apache MXNet and GluonMCL310_Building Deep Learning Applications with Apache MXNet and Gluon
MCL310_Building Deep Learning Applications with Apache MXNet and Gluon
 
Containers on AWS - re:Invent Comes to London 2.0
Containers on AWS - re:Invent Comes to London 2.0Containers on AWS - re:Invent Comes to London 2.0
Containers on AWS - re:Invent Comes to London 2.0
 
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
[NEW LAUNCH!] Introducing Amazon Elastic Inference: Reduce Deep Learning Infe...
 
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech TalksAWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
AWS X-Ray: Debugging Applications at Scale - AWS Online Tech Talks
 
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
DEV325_Application Deployment Techniques for Amazon EC2 Workloads with AWS Co...
 
AI & Machine Learning Web Day | Einführung in Amazon SageMaker, eine Werkbank...
AI & Machine Learning Web Day | Einführung in Amazon SageMaker, eine Werkbank...AI & Machine Learning Web Day | Einführung in Amazon SageMaker, eine Werkbank...
AI & Machine Learning Web Day | Einführung in Amazon SageMaker, eine Werkbank...
 
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
 
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
 
Building Global Serverless Backends
Building Global Serverless BackendsBuilding Global Serverless Backends
Building Global Serverless Backends
 
re:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Servicesre:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Services
 
CON320_Monitoring, Logging and Debugging Containerized Services
CON320_Monitoring, Logging and Debugging Containerized ServicesCON320_Monitoring, Logging and Debugging Containerized Services
CON320_Monitoring, Logging and Debugging Containerized Services
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

SageMaker Algorithms Infinitely Scalable Machine Learning

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Pop-up Loft Amazon SageMaker Algorithms: Infinitely Scalable Machine Learning Nick Brandalone Solutions Architect , AWS Solutions Architecture
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved What is Amazon SageMaker? Exploration Training Hosting
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved The Amazon Machine Learning Stack FRAMEWORKS & INTERFACES Caffe2 CNTK Apache MXNet PyTorch TensorFlow Chainer Keras Gluon AWS Deep Learning AMIs Amazon SageMaker Rekognition Transcribe Translate Polly Comprehend Lex AWS DeepLens EDUCATION PLATFORM SERVICES APPLICATION SERVICES Amazon Mechanical Turk
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Challenges in Machine Learning
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Machine Learning
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Large Scale Machine Learning
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Large Scale Machine Learning
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Our Customers use ML at a massive scale “We collect 160M events daily in the ML pipeline and run training over the last 15 days and need it to complete in one hour. Effectively there's 100M features in the model” Valentino Volonghi, CTO “We process 3 million ad requests a second, 100,000 features per request. That’s 250 trillion per day. Not your run of the mill Data science problem!” Bill Simmons, CTO “Our data warehouse is 100TB and we are processing 2TB daily. We're running mostly gradient boosting (trees), LDA and K-Means clustering and collaborative filtering.“ Shahar Cizer Kobrinsky, VP Architecture
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine Distributed, with Strong Machines
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine Distributed, with Strong Machines
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1 1
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 2 3 1 2
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level Unusable Data / Wasted opportunity
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Architecture and Design Streaming, GPU/CPU, Distributed with a Shared State
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Streaming State
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Streaming Data Size Memory Data Size Time/Cost
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 2 3 1 2
  • 22. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 3 1 2
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved GPU/CPU GPU State
  • 24. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Distributed GPU State GPU State GPU State
  • 25. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Shared State GPU GPU GPU Local State Shared State Local State Local State
  • 26. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Cost vs. Time vs. Accuracy $$$$ $$$ $$ $ Minutes Hours Days Weeks Months
  • 27. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved State Model GPU State
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1 1
  • 29. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1
  • 30. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Abstraction and Containerization def initialize(...) def update(...) def finalize(...)
  • 31. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level No unusable Data / No wasted opportunity
  • 32. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Amazon SageMaker Algorithms • DeepAR • Factorization Machines • Gradient Boosted Trees (XGBoost) • Image Classification (ResNet) • K-Means Clustering • Latent Dirichlet Allocation (LDA) • Linear Learner Classification and Regression • Neural Topic Modeling (NTM) • Principal Components Analysis (PCA) • Random Cut Forest • Seq2Seq
  • 33. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Linear Learner Regression: Estimate a real valued function Binary Classification: Predict a 0/1 class ˜y = hw, xi + t ˜y = ( 1 if hw, xi > t 0 else
  • 34. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved w1 = minw X i L1(wT xi, yi) + ↵1|w|1 + 1|w|2 wk = minw X i Lk(wT xi, yi) + ↵k|w|1 + k|w|2 Linear Learner Train Fit thresholds and select Select model with best validation performance >8x speedup over naïve parallel training! ... ... ... ... t = min t0 L(˜y, y) ˜y = ( 1 if wT i x > t0 0 else
  • 35. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Linear Learner Regression (mean squared error) SageMaker Other 1.02 1.06 1.09 1.02 0.332 0.183 0.086 0.129 83.3 84.5 Classification (F1 Score) SageMaker Other 0.980 0.981 0.870 0.930 0.997 0.997 0.978 0.964 0.914 0.859 0.470 0.472 0.903 0.908 0.508 0.508 30GB datasets for web-spam and web-url classification
  • 36. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
  • 37. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved K-Means Clustering xi 1 n X i min j kxi µjk2 µj kxi µjk
  • 38. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved K-Means Clustering Method Accurate? Passes Efficient Tuning Comments Lloyds [1] Yes* 5-10 No K-Means ++ [2] Yes k+5 to k+10 No scikit-learn K-Means|| [3] Yes 7-12 No spark.ml Online [4] No 1 No Streaming [5,6] No 1 No Impractical Webscale [7] No 1 No spark streaming Coresets [8] No 1 Yes Impractical SageMaker Yes 1 Yes [1] Lloyd, IEEE TIT, 1982 [2] Arthur et. al. ACM-SIAM, 2007 [3] Bahmani et. al., VLDB, 2012 [4] Liberty et. al., 2015 [5] Shindler et. al, NIPS, 2011 [6] Guha et. al, IEEE Trans. Knowl. Data Eng. 2003 [7] Sculley, WWW, 2010 [8] Feldman et. al.
  • 39. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
  • 40. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Principal Component Analysis (PCA) kxi P(xi)k P(xi) xi X i kxi P(xi)k2
  • 41. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Principal Component Analysis (PCA)
  • 42. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Neural Topic Modeling Encoder: feedforward net Input term counts vector µ z Document Posterior Sampled Document Representation Decoder: Softmax Output term counts vector Perplexity vs. Number of Topic
  • 43. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved DeepAR –time series forecasting Mean absolute percentage error P90 Loss DeepAR R DeepAR R traffic Hourly occupancy rate of 963 bay area freeways 0.14 0.27 0.13 0.24 electricity Electricity use of 370 homes over time 0.07 0.11 0.08 0.09 pageviews Page view hits of websites 10k 0.32 0.32 0.44 0.31 180k 0.32 0.34 0.29 NA One hour on p2.xlarge, $1 zi,t 2, xi,t 1 zi,t 1, xi,t zi,t, xi,t+1 hi,t 1 hi,t hi,t+1 `(zi,t 1|✓i,t 1) `(zi,t|✓i,t) `(zi,t+1|✓i,t+1) zi,t 1 zi,t zi,t+1 Input Network
  • 44. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Using AmazonSageMaker Algorithms Command Line SageMaker Notebooks Amazon EMR
  • 45. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Input Data >> aws --profile <profile> --region us-west-2 sm create-training-job --training-job-name kmeans-demo --algorithm-specification TrainingImage=0123456789.dkr.ecr.us-east- 1.amazonaws.com/kmeanswebscale:latest,TrainingInputMode=File --role-arn "arn:aws:iam::0123456789:role/demo" --input-data-config '{"ChannelName": "train", "DataSource": {"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": "s3://kmeans_demo/train", "S3DataDistributionType": "FullyReplicated"}}, "CompressionType": "None", "RecordWrapperType": "None"}' --output-data-config S3OutputPath=s3://kmeans_demo/output --resource-config InstanceCount=2,InstanceType=c4.8xlarge,VolumeSizeInGB=50 --stopping-condition MaxRuntimeInHours=1 From Command Line Hardware
  • 46. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved From Amazon SageMaker Notebooks Parameters Hardware Start Training Host model
  • 47. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved From Amazon EMR Start Training Parameters Hardware Apply Model
  • 48. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Questions?
  • 49. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Pop-up Loft aws.amazon.com/activate Everything and Anything Startups Need to Get Started on AWS