SlideShare a Scribd company logo
1 of 46
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Pop-up Loft
Working with Amazon SageMaker Algorithms for Faster Model
Training
Binoy Das
Partner Solutions Architect, AWS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
What is Amazon SageMaker?
Exploration Training
Hosting
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
The Amazon Machine Learning Stack
FRAMEWORKS & INTERFACES
Caffe2 CNTK
Apache
MXNet
PyTorch
TensorFlo
w
Chainer Keras Gluon
AWS Deep Learning AMIs
Amazon SageMaker
Rekognition Transcribe Translate Polly Comprehend Lex
AWS
DeepLens
EDUCATION
PLATFORM SERVICES
APPLICATION SERVICES
Amazon Mechanical Turk
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Challenges in Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Large Scale Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Large Scale Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Our Customers use ML at a massive scale
“We collect 160M events
daily in the ML pipeline and
run training over the last
15 days and need it to
complete in one hour.
Effectively there's 100M
features in the model”
Valentino Volonghi, CTO
“We process 3 million ad
requests a second, 100,000
features per request. That’s
250 trillion per day. Not
your run of the mill Data
science problem!”
Bill Simmons, CTO
“Our data warehouse is
100TB and we are
processing 2TB daily. We're
running mostly gradient
boosting (trees), LDA and
K-Means clustering and
collaborative filtering.“
Shahar Cizer Kobrinsky, VP
Architecture
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
Distributed, with
Strong Machines
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
Distributed, with
Strong Machines
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
2
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
Unusable Data /
Wasted opportunity
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Architecture and Design
Streaming, GPU/CPU, Distributed with a Shared State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Streaming
State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Streaming
Data Size
Memory
Data Size
Time/Cost
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
2
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
GPU/CPU
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Distributed
GPU State
GPU State
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Shared State
GPU
GPU
GPU
Local
State
Shared
State
Local
State
Local
State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Cost vs. Time vs. Accuracy
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
State Model
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Abstraction and Containerization
def initialize(...)
def update(...)
def finalize(...)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
No unusable Data /
No wasted opportunity
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Amazon SageMaker Algorithms
• DeepAR
• Factorization Machines
• Gradient Boosted Trees (XGBoost)
• Image Classification (ResNet)
• K-Means Clustering
• Latent Dirichlet Allocation (LDA)
• Linear Learner Classification and
Regression
• Neural Topic Modeling (NTM)
• Principal Components Analysis
(PCA)
• Random Cut Forest
• Seq2Seq
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Linear Learner
Regression:
Estimate a real valued function
Binary Classification:
Predict a 0/1 class
˜y = hw, xi + t ˜y =
(
1 if hw, xi > t
0 else
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
w1 = minw
X
i
L1(wT
xi, yi) + ↵1|w|1 + 1|w|2
wk = minw
X
i
Lk(wT
xi, yi) + ↵k|w|1 + k|w|2
Linear Learner
Train
Fit thresholds
and select
Select model with best validation performance
>8x speedup over naïve parallel training!
...
...
...
...
t = min
t0
L(˜y, y) ˜y =
(
1 if wT
i x > t0
0 else
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
K-Means Clustering
xi
1
n
X
i
min
j
kxi µjk2
µj
kxi µjk
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
K-Means Clustering
Method Accurate? Passes Efficient
Tuning
Comments
Lloyds [1] Yes* 5-10 No
K-Means ++ [2] Yes k+5 to k+10 No scikit-learn
K-Means|| [3] Yes 7-12 No spark.ml
Online [4] No 1 No
Streaming [5,6] No 1 No Impractical
Webscale [7] No 1 No spark streaming
Coresets [8] No 1 Yes Impractical
SageMaker Yes 1 Yes
[1] Lloyd, IEEE TIT, 1982
[2] Arthur et. al. ACM-SIAM, 2007
[3] Bahmani et. al., VLDB, 2012
[4] Liberty et. al., 2015
[5] Shindler et. al, NIPS, 2011
[6] Guha et. al, IEEE Trans. Knowl. Data Eng. 2003
[7] Sculley, WWW, 2010
[8] Feldman et. al.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Principal Component Analysis (PCA)
kxi P(xi)k
P(xi)
xi
X
i
kxi P(xi)k2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Principal Component Analysis (PCA)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Neural Topic Modeling
Encoder: feedforward net
Input term counts vector
µ
z
Document
Posterior
Sampled Document
Representation
Decoder:
Softmax
Output term counts vector
Perplexity vs. Number of Topic
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
DeepAR –time series forecasting
Mean absolute
percentage error P90 Loss
DeepAR R DeepAR R
traffic
Hourly occupancy rate of 963
bay area freeways
0.14 0.27 0.13 0.24
electricity
Electricity use of 370
homes over time
0.07 0.11 0.08 0.09
pageviews
Page view hits
of websites
10k 0.32 0.32 0.44 0.31
180k 0.32 0.34 0.29 NA
One hour on p2.xlarge, $1
zi,t 2, xi,t 1 zi,t 1, xi,t zi,t, xi,t+1
hi,t 1 hi,t hi,t+1
`(zi,t 1|✓i,t 1) `(zi,t|✓i,t) `(zi,t+1|✓i,t+1)
zi,t 1 zi,t zi,t+1
Input
Network
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Using AmazonSageMaker Algorithms
Command Line
SageMaker Notebooks
Amazon EMR
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Input Data
>> aws --profile <profile> 
--region us-west-2 
sm create-training-job 
--training-job-name kmeans-demo 
--algorithm-specification TrainingImage=0123456789.dkr.ecr.us-east-
1.amazonaws.com/kmeanswebscale:latest,TrainingInputMode=File 
--role-arn "arn:aws:iam::0123456789:role/demo" 
--input-data-config '{"ChannelName": "train", "DataSource": 
{"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": 
"s3://kmeans_demo/train", "S3DataDistributionType": 
"FullyReplicated"}}, "CompressionType": "None", "RecordWrapperType": "None"}' 
--output-data-config S3OutputPath=s3://kmeans_demo/output 
--resource-config InstanceCount=2,InstanceType=c4.8xlarge,VolumeSizeInGB=50 
--stopping-condition MaxRuntimeInHours=1
From Command Line
Hardware
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
From Amazon SageMaker Notebooks
Parameters
Hardware
Start Training
Host model
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
From Amazon EMR
Start Training
Parameters
Hardware
Apply Model
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Questions?
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Pop-up Loft
aws.amazon.com/activate
Everything and Anything Startups
Need to Get Started on AWS

More Related Content

What's hot

Using Amazon SageMaker to build, train, and deploy your ML Models
Using Amazon SageMaker to build, train, and deploy your ML ModelsUsing Amazon SageMaker to build, train, and deploy your ML Models
Using Amazon SageMaker to build, train, and deploy your ML ModelsAmazon Web Services
 
Amazon SageMaker Deep Dive - Meetup AWS Toulouse at D2SI
Amazon SageMaker Deep Dive - Meetup AWS Toulouse at D2SIAmazon SageMaker Deep Dive - Meetup AWS Toulouse at D2SI
Amazon SageMaker Deep Dive - Meetup AWS Toulouse at D2SIAmazon Web Services
 
Machine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerMachine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerAmazon Web Services
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingAmazon Web Services
 
AWS re:Invent 2018 - AIM302 - Machine Learning at the Edge
AWS re:Invent 2018 - AIM302  - Machine Learning at the Edge AWS re:Invent 2018 - AIM302  - Machine Learning at the Edge
AWS re:Invent 2018 - AIM302 - Machine Learning at the Edge Julien SIMON
 
AWS re:Invent 2018 - Machine Learning recap (December 2018)
AWS re:Invent 2018 - Machine Learning recap (December 2018)AWS re:Invent 2018 - Machine Learning recap (December 2018)
AWS re:Invent 2018 - Machine Learning recap (December 2018)Julien SIMON
 
Speed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithmsSpeed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithmsJulien SIMON
 
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)Julien SIMON
 
An Introduction to Amazon SageMaker (October 2018)
An Introduction to Amazon SageMaker (October 2018)An Introduction to Amazon SageMaker (October 2018)
An Introduction to Amazon SageMaker (October 2018)Julien SIMON
 
Using Amazon SageMaker to Build, Train, and Deploy Your ML Models
Using Amazon SageMaker to Build, Train, and Deploy Your ML ModelsUsing Amazon SageMaker to Build, Train, and Deploy Your ML Models
Using Amazon SageMaker to Build, Train, and Deploy Your ML ModelsAmazon Web Services
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingAmazon Web Services
 
Amazon SageMaker Deep Dive for Builders
Amazon SageMaker Deep Dive for BuildersAmazon SageMaker Deep Dive for Builders
Amazon SageMaker Deep Dive for BuildersAmazon Web Services
 
Accelerate your Machine Learning workflows with Amazon SageMaker
Accelerate your Machine Learning workflows with Amazon SageMakerAccelerate your Machine Learning workflows with Amazon SageMaker
Accelerate your Machine Learning workflows with Amazon SageMakerJulien SIMON
 
AWS re:Invent 2018 - AIM401 - Deep Learning using Tensorflow
AWS re:Invent 2018 - AIM401 - Deep Learning using TensorflowAWS re:Invent 2018 - AIM401 - Deep Learning using Tensorflow
AWS re:Invent 2018 - AIM401 - Deep Learning using TensorflowJulien SIMON
 
Become a Machine Learning developer with AWS services (May 2019)
Become a Machine Learning developer with AWS services (May 2019)Become a Machine Learning developer with AWS services (May 2019)
Become a Machine Learning developer with AWS services (May 2019)Julien SIMON
 
Mcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbangaMcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbangaDan Romuald Mbanga
 
Building Deep Learning Applications with TensorFlow and Amazon SageMaker
Building Deep Learning Applications with TensorFlow and Amazon SageMakerBuilding Deep Learning Applications with TensorFlow and Amazon SageMaker
Building Deep Learning Applications with TensorFlow and Amazon SageMakerAmazon Web Services
 
Integrating Deep Learning Into Your Enterprise
Integrating Deep Learning Into Your EnterpriseIntegrating Deep Learning Into Your Enterprise
Integrating Deep Learning Into Your EnterpriseAmazon Web Services
 
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018Amazon Web Services Korea
 

What's hot (20)

Using Amazon SageMaker to build, train, and deploy your ML Models
Using Amazon SageMaker to build, train, and deploy your ML ModelsUsing Amazon SageMaker to build, train, and deploy your ML Models
Using Amazon SageMaker to build, train, and deploy your ML Models
 
Amazon SageMaker Deep Dive - Meetup AWS Toulouse at D2SI
Amazon SageMaker Deep Dive - Meetup AWS Toulouse at D2SIAmazon SageMaker Deep Dive - Meetup AWS Toulouse at D2SI
Amazon SageMaker Deep Dive - Meetup AWS Toulouse at D2SI
 
Machine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerMachine Learning & Amazon SageMaker
Machine Learning & Amazon SageMaker
 
Intro to SageMaker
Intro to SageMakerIntro to SageMaker
Intro to SageMaker
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
AWS re:Invent 2018 - AIM302 - Machine Learning at the Edge
AWS re:Invent 2018 - AIM302  - Machine Learning at the Edge AWS re:Invent 2018 - AIM302  - Machine Learning at the Edge
AWS re:Invent 2018 - AIM302 - Machine Learning at the Edge
 
AWS re:Invent 2018 - Machine Learning recap (December 2018)
AWS re:Invent 2018 - Machine Learning recap (December 2018)AWS re:Invent 2018 - Machine Learning recap (December 2018)
AWS re:Invent 2018 - Machine Learning recap (December 2018)
 
Speed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithmsSpeed up your Machine Learning workflows with build-in algorithms
Speed up your Machine Learning workflows with build-in algorithms
 
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
Build, train and deploy Machine Learning models on Amazon SageMaker (May 2019)
 
An Introduction to Amazon SageMaker (October 2018)
An Introduction to Amazon SageMaker (October 2018)An Introduction to Amazon SageMaker (October 2018)
An Introduction to Amazon SageMaker (October 2018)
 
Using Amazon SageMaker to Build, Train, and Deploy Your ML Models
Using Amazon SageMaker to Build, Train, and Deploy Your ML ModelsUsing Amazon SageMaker to Build, Train, and Deploy Your ML Models
Using Amazon SageMaker to Build, Train, and Deploy Your ML Models
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
Amazon SageMaker Deep Dive for Builders
Amazon SageMaker Deep Dive for BuildersAmazon SageMaker Deep Dive for Builders
Amazon SageMaker Deep Dive for Builders
 
Accelerate your Machine Learning workflows with Amazon SageMaker
Accelerate your Machine Learning workflows with Amazon SageMakerAccelerate your Machine Learning workflows with Amazon SageMaker
Accelerate your Machine Learning workflows with Amazon SageMaker
 
AWS re:Invent 2018 - AIM401 - Deep Learning using Tensorflow
AWS re:Invent 2018 - AIM401 - Deep Learning using TensorflowAWS re:Invent 2018 - AIM401 - Deep Learning using Tensorflow
AWS re:Invent 2018 - AIM401 - Deep Learning using Tensorflow
 
Become a Machine Learning developer with AWS services (May 2019)
Become a Machine Learning developer with AWS services (May 2019)Become a Machine Learning developer with AWS services (May 2019)
Become a Machine Learning developer with AWS services (May 2019)
 
Mcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbangaMcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbanga
 
Building Deep Learning Applications with TensorFlow and Amazon SageMaker
Building Deep Learning Applications with TensorFlow and Amazon SageMakerBuilding Deep Learning Applications with TensorFlow and Amazon SageMaker
Building Deep Learning Applications with TensorFlow and Amazon SageMaker
 
Integrating Deep Learning Into Your Enterprise
Integrating Deep Learning Into Your EnterpriseIntegrating Deep Learning Into Your Enterprise
Integrating Deep Learning Into Your Enterprise
 
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
 

Similar to Working with Amazon SageMaker Algorithms for Faster Model Training

Amazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San FranciscoAmazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San FranciscoAmazon Web Services
 
SageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningSageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningAmazon Web Services
 
Keith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdf
Keith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdfKeith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdf
Keith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdfAmazon Web Services
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingAmazon Web Services
 
SageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningSageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningAmazon Web Services
 
Amazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsAmazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsMLconf
 
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...Amazon Web Services
 
Debugging and Performance tricks for MXNet Gluon
Debugging and Performance tricks for MXNet GluonDebugging and Performance tricks for MXNet Gluon
Debugging and Performance tricks for MXNet GluonApache MXNet
 
MCL310_Building Deep Learning Applications with Apache MXNet and Gluon
MCL310_Building Deep Learning Applications with Apache MXNet and GluonMCL310_Building Deep Learning Applications with Apache MXNet and Gluon
MCL310_Building Deep Learning Applications with Apache MXNet and GluonAmazon Web Services
 
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Amazon Web Services
 
Containers on AWS - re:Invent Comes to London 2.0
Containers on AWS - re:Invent Comes to London 2.0Containers on AWS - re:Invent Comes to London 2.0
Containers on AWS - re:Invent Comes to London 2.0Amazon Web Services
 
Deep Learning Fundamentals
Deep Learning FundamentalsDeep Learning Fundamentals
Deep Learning FundamentalsThomas Delteil
 
CON320_Monitoring, Logging and Debugging Containerized Services
CON320_Monitoring, Logging and Debugging Containerized ServicesCON320_Monitoring, Logging and Debugging Containerized Services
CON320_Monitoring, Logging and Debugging Containerized ServicesAmazon Web Services
 
re:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Servicesre:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized ServicesCalvin French-Owen
 
FSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine LearningFSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine LearningAmazon Web Services
 
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Amazon Web Services
 
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...Amazon Web Services
 
Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix - CMP31...
Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix - CMP31...Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix - CMP31...
Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix - CMP31...Amazon Web Services
 
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWSAWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWSAmazon Web Services
 
Building Global Serverless Backends
Building Global Serverless BackendsBuilding Global Serverless Backends
Building Global Serverless BackendsAmazon Web Services
 

Similar to Working with Amazon SageMaker Algorithms for Faster Model Training (20)

Amazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San FranciscoAmazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San Francisco
 
SageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningSageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine Learning
 
Keith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdf
Keith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdfKeith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdf
Keith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdf
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
SageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine LearningSageMaker Algorithms Infinitely Scalable Machine Learning
SageMaker Algorithms Infinitely Scalable Machine Learning
 
Amazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsAmazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithms
 
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
 
Debugging and Performance tricks for MXNet Gluon
Debugging and Performance tricks for MXNet GluonDebugging and Performance tricks for MXNet Gluon
Debugging and Performance tricks for MXNet Gluon
 
MCL310_Building Deep Learning Applications with Apache MXNet and Gluon
MCL310_Building Deep Learning Applications with Apache MXNet and GluonMCL310_Building Deep Learning Applications with Apache MXNet and Gluon
MCL310_Building Deep Learning Applications with Apache MXNet and Gluon
 
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
 
Containers on AWS - re:Invent Comes to London 2.0
Containers on AWS - re:Invent Comes to London 2.0Containers on AWS - re:Invent Comes to London 2.0
Containers on AWS - re:Invent Comes to London 2.0
 
Deep Learning Fundamentals
Deep Learning FundamentalsDeep Learning Fundamentals
Deep Learning Fundamentals
 
CON320_Monitoring, Logging and Debugging Containerized Services
CON320_Monitoring, Logging and Debugging Containerized ServicesCON320_Monitoring, Logging and Debugging Containerized Services
CON320_Monitoring, Logging and Debugging Containerized Services
 
re:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Servicesre:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Services
 
FSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine LearningFSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine Learning
 
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
 
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
Advanced Patterns in Microservices Implementation with Amazon ECS - CON402 - ...
 
Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix - CMP31...
Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix - CMP31...Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix - CMP31...
Auto Scaling Prime Time: Target Tracking Hits the Bullseye at Netflix - CMP31...
 
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWSAWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
 
Building Global Serverless Backends
Building Global Serverless BackendsBuilding Global Serverless Backends
Building Global Serverless Backends
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Working with Amazon SageMaker Algorithms for Faster Model Training

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Pop-up Loft Working with Amazon SageMaker Algorithms for Faster Model Training Binoy Das Partner Solutions Architect, AWS
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved What is Amazon SageMaker? Exploration Training Hosting
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved The Amazon Machine Learning Stack FRAMEWORKS & INTERFACES Caffe2 CNTK Apache MXNet PyTorch TensorFlo w Chainer Keras Gluon AWS Deep Learning AMIs Amazon SageMaker Rekognition Transcribe Translate Polly Comprehend Lex AWS DeepLens EDUCATION PLATFORM SERVICES APPLICATION SERVICES Amazon Mechanical Turk
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Challenges in Machine Learning
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Machine Learning
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Large Scale Machine Learning
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Large Scale Machine Learning
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Our Customers use ML at a massive scale “We collect 160M events daily in the ML pipeline and run training over the last 15 days and need it to complete in one hour. Effectively there's 100M features in the model” Valentino Volonghi, CTO “We process 3 million ad requests a second, 100,000 features per request. That’s 250 trillion per day. Not your run of the mill Data science problem!” Bill Simmons, CTO “Our data warehouse is 100TB and we are processing 2TB daily. We're running mostly gradient boosting (trees), LDA and K-Means clustering and collaborative filtering.“ Shahar Cizer Kobrinsky, VP Architecture
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine Distributed, with Strong Machines
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine Distributed, with Strong Machines
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1 1
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 2 3 1 2
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level Unusable Data / Wasted opportunity
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Architecture and Design Streaming, GPU/CPU, Distributed with a Shared State
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Streaming State
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Streaming Data Size Memory Data Size Time/Cost
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 2 3 1 2
  • 22. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 3 1 2
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved GPU/CPU GPU State
  • 24. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Distributed GPU State GPU State GPU State
  • 25. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Shared State GPU GPU GPU Local State Shared State Local State Local State
  • 26. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Cost vs. Time vs. Accuracy $$$$ $$$ $$ $ Minutes Hours Days Weeks Months
  • 27. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved State Model GPU State
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1 1
  • 29. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1
  • 30. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Abstraction and Containerization def initialize(...) def update(...) def finalize(...)
  • 31. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level No unusable Data / No wasted opportunity
  • 32. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Amazon SageMaker Algorithms • DeepAR • Factorization Machines • Gradient Boosted Trees (XGBoost) • Image Classification (ResNet) • K-Means Clustering • Latent Dirichlet Allocation (LDA) • Linear Learner Classification and Regression • Neural Topic Modeling (NTM) • Principal Components Analysis (PCA) • Random Cut Forest • Seq2Seq
  • 33. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Linear Learner Regression: Estimate a real valued function Binary Classification: Predict a 0/1 class ˜y = hw, xi + t ˜y = ( 1 if hw, xi > t 0 else
  • 34. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved w1 = minw X i L1(wT xi, yi) + ↵1|w|1 + 1|w|2 wk = minw X i Lk(wT xi, yi) + ↵k|w|1 + k|w|2 Linear Learner Train Fit thresholds and select Select model with best validation performance >8x speedup over naïve parallel training! ... ... ... ... t = min t0 L(˜y, y) ˜y = ( 1 if wT i x > t0 0 else
  • 35. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved K-Means Clustering xi 1 n X i min j kxi µjk2 µj kxi µjk
  • 36. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved K-Means Clustering Method Accurate? Passes Efficient Tuning Comments Lloyds [1] Yes* 5-10 No K-Means ++ [2] Yes k+5 to k+10 No scikit-learn K-Means|| [3] Yes 7-12 No spark.ml Online [4] No 1 No Streaming [5,6] No 1 No Impractical Webscale [7] No 1 No spark streaming Coresets [8] No 1 Yes Impractical SageMaker Yes 1 Yes [1] Lloyd, IEEE TIT, 1982 [2] Arthur et. al. ACM-SIAM, 2007 [3] Bahmani et. al., VLDB, 2012 [4] Liberty et. al., 2015 [5] Shindler et. al, NIPS, 2011 [6] Guha et. al, IEEE Trans. Knowl. Data Eng. 2003 [7] Sculley, WWW, 2010 [8] Feldman et. al.
  • 37. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Principal Component Analysis (PCA) kxi P(xi)k P(xi) xi X i kxi P(xi)k2
  • 38. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Principal Component Analysis (PCA)
  • 39. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Neural Topic Modeling Encoder: feedforward net Input term counts vector µ z Document Posterior Sampled Document Representation Decoder: Softmax Output term counts vector Perplexity vs. Number of Topic
  • 40. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved DeepAR –time series forecasting Mean absolute percentage error P90 Loss DeepAR R DeepAR R traffic Hourly occupancy rate of 963 bay area freeways 0.14 0.27 0.13 0.24 electricity Electricity use of 370 homes over time 0.07 0.11 0.08 0.09 pageviews Page view hits of websites 10k 0.32 0.32 0.44 0.31 180k 0.32 0.34 0.29 NA One hour on p2.xlarge, $1 zi,t 2, xi,t 1 zi,t 1, xi,t zi,t, xi,t+1 hi,t 1 hi,t hi,t+1 `(zi,t 1|✓i,t 1) `(zi,t|✓i,t) `(zi,t+1|✓i,t+1) zi,t 1 zi,t zi,t+1 Input Network
  • 41. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Using AmazonSageMaker Algorithms Command Line SageMaker Notebooks Amazon EMR
  • 42. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Input Data >> aws --profile <profile> --region us-west-2 sm create-training-job --training-job-name kmeans-demo --algorithm-specification TrainingImage=0123456789.dkr.ecr.us-east- 1.amazonaws.com/kmeanswebscale:latest,TrainingInputMode=File --role-arn "arn:aws:iam::0123456789:role/demo" --input-data-config '{"ChannelName": "train", "DataSource": {"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": "s3://kmeans_demo/train", "S3DataDistributionType": "FullyReplicated"}}, "CompressionType": "None", "RecordWrapperType": "None"}' --output-data-config S3OutputPath=s3://kmeans_demo/output --resource-config InstanceCount=2,InstanceType=c4.8xlarge,VolumeSizeInGB=50 --stopping-condition MaxRuntimeInHours=1 From Command Line Hardware
  • 43. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved From Amazon SageMaker Notebooks Parameters Hardware Start Training Host model
  • 44. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved From Amazon EMR Start Training Parameters Hardware Apply Model
  • 45. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Questions?
  • 46. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Pop-up Loft aws.amazon.com/activate Everything and Anything Startups Need to Get Started on AWS