SlideShare a Scribd company logo
1 of 49
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Amazon SageMaker Algorithms:
Infinitely Scalable Machine Learning
Keith Steward
Sr. Specialist SA
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
What is Amazon SageMaker?
Exploration Training
Hosting
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
The Amazon Machine Learning Stack
FRAMEWORKS & INTERFACES
Caffe2 CNTK
Apache
MXNet
PyTorch
TensorFlo
w
Chainer Keras Gluon
AWS Deep Learning AMIs
Amazon SageMaker
Rekognition Transcribe Translate Polly Comprehend Lex
AWS
DeepLens
EDUCATION
PLATFORM SERVICES
APPLICATION SERVICES
Amazon Mechanical Turk
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Challenges in Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Large Scale Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Large Scale Machine Learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Our Customers use ML at a massive scale
“We collect 160M events
daily in the ML pipeline and
run training over the last
15 days and need it to
complete in one hour.
Effectively there's 100M
features in the model”
Valentino Volonghi, CTO
“We process 3 million ad
requests a second, 100,000
features per request. That’s
250 trillion per day. Not
your run of the mill Data
science problem!”
Bill Simmons, CTO
“Our data warehouse is
100TB and we are
processing 2TB daily. We're
running mostly gradient
boosting (trees), LDA and
K-Means clustering and
collaborative filtering.“
Shahar Cizer Kobrinsky, VP
Architecture
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
Distributed, with
Strong Machines
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cost vs. Time
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
Single
Machine
Distributed, with
Strong Machines
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
2
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
Unusable Data /
Wasted opportunity
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Architecture and Design
Streaming, GPU/CPU, Distributed with a Shared State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Streaming
State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Streaming
Data Size
Memory
Data Size
Time/Cost
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
2
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Incremental Training
3
1
2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
GPU/CPU
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Distributed
GPU State
GPU State
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Shared State
GPU
GPU
GPU Local
State
Shared
State
Local
State
Local
State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Cost vs. Time vs. Accuracy
$$$$
$$$
$$
$
Minutes Hours Days Weeks Months
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
State Model
GPU State
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Model Selection
1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Abstraction and Containerization
def initialize(...)
def update(...)
def finalize(...)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Production Readiness
Data/Model Size
Investment Reasonable
Investment Level
No unusable Data /
No wasted opportunity
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Amazon SageMaker Algorithms
• DeepAR
• Factorization Machines
• Gradient Boosted Trees (XGBoost)
• Image Classification (ResNet)
• K-Means Clustering
• Latent Dirichlet Allocation (LDA)
• Linear Learner Classification and
Regression
• Neural Topic Modeling (NTM)
• Principal Components Analysis
(PCA)
• Random Cut Forest
• Seq2Seq
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Linear Learner
Regression:
Estimate a real valued function
Binary Classification:
Predict a 0/1 class
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Linear Learner
Train
Fit thresholds
and select
Select model with best validation performance
>8x speedup over naïve parallel training!
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
30GB datasets for web-spam and web-url classification
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
K-Means Clustering
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Principal Component Analysis (PCA)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Principal Component Analysis (PCA)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Neural Topic Modeling
Encoder: feedforward net
Input term counts vector
Document
Posterior
Sampled Document
Representation
Decoder:
Softmax
Output term counts vector
Perplexity vs. Number of Topic
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
DeepAR –time series forecasting
Mean absolute
percentage error
P90 Loss
DeepAR R DeepAR R
traffic
Hourly occupancy rate of 963
bay area freeways
0.14 0.27 0.13 0.24
electricity
Electricity use of 370
homes over time
0.07 0.11 0.08 0.09
pageviews
Page view hits
of websites
10k 0.32 0.32 0.44 0.31
180k 0.32 0.34 0.29 NA
One hour on p2.xlarge, $1
Input
Network
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Using AmazonSageMaker Algorithms
Command Line
SageMaker Notebooks
Amazon EMR
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Input Data
>> aws --profile <profile> 
--region us-west-2 
sm create-training-job 
--training-job-name kmeans-demo 
--algorithm-specification TrainingImage=0123456789.dkr.ecr.us-east-
1.amazonaws.com/kmeanswebscale:latest,TrainingInputMode=File 
--role-arn "arn:aws:iam::0123456789:role/demo" 
--input-data-config '{"ChannelName": "train", "DataSource": 
{"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": 
"s3://kmeans_demo/train", "S3DataDistributionType": 
"FullyReplicated"}}, "CompressionType": "None", "RecordWrapperType": "None"}' 
--output-data-config S3OutputPath=s3://kmeans_demo/output 
--resource-config InstanceCount=2,InstanceType=c4.8xlarge,VolumeSizeInGB=50 
--stopping-condition MaxRuntimeInHours=1
From Command Line
Hardware
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
From Amazon SageMaker Notebooks
Parameters
Hardware
Start Training
Host model
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
From Amazon EMR
Start Training
Parameters
Hardware
Apply Model
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
Questions?
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
aws.amazon.com/activate
Everything and Anything Startups
Need to Get Started on AWS

More Related Content

What's hot

DataPalooza - A Music Festival themed ML + IoT Workshop
DataPalooza - A Music Festival themed ML + IoT WorkshopDataPalooza - A Music Festival themed ML + IoT Workshop
DataPalooza - A Music Festival themed ML + IoT Workshop
Amazon Web Services
 
Machine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerMachine Learning & Amazon SageMaker
Machine Learning & Amazon SageMaker
Amazon Web Services
 

What's hot (20)

DataPalooza - A Music Festival themed ML + IoT Workshop
DataPalooza - A Music Festival themed ML + IoT WorkshopDataPalooza - A Music Festival themed ML + IoT Workshop
DataPalooza - A Music Festival themed ML + IoT Workshop
 
Build, Train, & Deploy ML Models Using SageMaker
Build, Train, & Deploy ML Models Using SageMakerBuild, Train, & Deploy ML Models Using SageMaker
Build, Train, & Deploy ML Models Using SageMaker
 
Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)
Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)
Machine Learning: From Notebook to Production with Amazon Sagemaker (April 2018)
 
AWS Machine Learning Week SF: Build, Train & Deploy ML Models Using SageMaker
AWS Machine Learning Week SF: Build, Train & Deploy ML Models Using SageMakerAWS Machine Learning Week SF: Build, Train & Deploy ML Models Using SageMaker
AWS Machine Learning Week SF: Build, Train & Deploy ML Models Using SageMaker
 
Build Deep Learning Applications with TensorFlow & SageMaker
Build Deep Learning Applications with TensorFlow & SageMakerBuild Deep Learning Applications with TensorFlow & SageMaker
Build Deep Learning Applications with TensorFlow & SageMaker
 
AWS で構築するコンピュータビジョンアプリケーション
AWS で構築するコンピュータビジョンアプリケーションAWS で構築するコンピュータビジョンアプリケーション
AWS で構築するコンピュータビジョンアプリケーション
 
Serverless Analytics on AWS
Serverless Analytics on AWSServerless Analytics on AWS
Serverless Analytics on AWS
 
Introducing aws deep lens
Introducing aws deep lensIntroducing aws deep lens
Introducing aws deep lens
 
Big Data, Analytics and Machine Learning on AWS Lambda - SRV402 - re:Invent 2017
Big Data, Analytics and Machine Learning on AWS Lambda - SRV402 - re:Invent 2017Big Data, Analytics and Machine Learning on AWS Lambda - SRV402 - re:Invent 2017
Big Data, Analytics and Machine Learning on AWS Lambda - SRV402 - re:Invent 2017
 
Machine Learning & Amazon SageMaker
Machine Learning & Amazon SageMakerMachine Learning & Amazon SageMaker
Machine Learning & Amazon SageMaker
 
Amazon SageMaker 紹介 & ハンズオン(2018/07/03 実施)
Amazon SageMaker 紹介 & ハンズオン(2018/07/03 実施)Amazon SageMaker 紹介 & ハンズオン(2018/07/03 実施)
Amazon SageMaker 紹介 & ハンズオン(2018/07/03 実施)
 
Apache MXNet and Gluon
Apache MXNet and GluonApache MXNet and Gluon
Apache MXNet and Gluon
 
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
CMP314_Bringing Deep Learning to the Cloud with Amazon EC2
 
Use Amazon Comprehend and Amazon SageMaker to Gain Insight from Text
Use Amazon Comprehend and Amazon SageMaker to Gain Insight from TextUse Amazon Comprehend and Amazon SageMaker to Gain Insight from Text
Use Amazon Comprehend and Amazon SageMaker to Gain Insight from Text
 
NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017
NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017
NEW LAUNCH! Introducing Amazon SageMaker - MCL365 - re:Invent 2017
 
Amazon SageMaker workshop
Amazon SageMaker workshopAmazon SageMaker workshop
Amazon SageMaker workshop
 
Accelerating Apache MXNet Models on Apple Platforms Using Core ML - MCL311 - ...
Accelerating Apache MXNet Models on Apple Platforms Using Core ML - MCL311 - ...Accelerating Apache MXNet Models on Apple Platforms Using Core ML - MCL311 - ...
Accelerating Apache MXNet Models on Apple Platforms Using Core ML - MCL311 - ...
 
Breaking the Monolith road to containers.pdf
Breaking the Monolith road to containers.pdfBreaking the Monolith road to containers.pdf
Breaking the Monolith road to containers.pdf
 
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWSAWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
AWS reInvent 2017 recap - Optimizing Costs as You Scale on AWS
 
Firecracker, 서버리스 컴퓨팅을 위한 오픈소스 microVM 기술 :: 류한진 - AWS ...
Firecracker, 서버리스 컴퓨팅을 위한 오픈소스 microVM 기술 :: 류한진 - AWS ...Firecracker, 서버리스 컴퓨팅을 위한 오픈소스 microVM 기술 :: 류한진 - AWS ...
Firecracker, 서버리스 컴퓨팅을 위한 오픈소스 microVM 기술 :: 류한진 - AWS ...
 

Similar to Keith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdf

NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
Amazon Web Services
 

Similar to Keith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdf (20)

Amazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San FranciscoAmazon SageMaker Algorithms: Machine Learning Week San Francisco
Amazon SageMaker Algorithms: Machine Learning Week San Francisco
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
FSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine LearningFSV305-Optimizing Payments Collections with Containers and Machine Learning
FSV305-Optimizing Payments Collections with Containers and Machine Learning
 
DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...
DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...
DEV209 A Field Guide to Monitoring in the Cloud: From Lift and Shift to AWS L...
 
NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...
NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...
NEW LAUNCH! Introducing Amazon Sumerian – Build VR/AR and 3D Applications - M...
 
在遊戲上應用AI (包括現場展示)
在遊戲上應用AI (包括現場展示)在遊戲上應用AI (包括現場展示)
在遊戲上應用AI (包括現場展示)
 
Amazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithmsAmazon sage maker infinitely scalable machine learning algorithms
Amazon sage maker infinitely scalable machine learning algorithms
 
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
NEW LAUNCH! Infinitely Scalable Machine Learning Algorithms with Amazon AI - ...
 
ENT212-An Overview of Best Practices for Large-Scale Migrations
ENT212-An Overview of Best Practices for Large-Scale MigrationsENT212-An Overview of Best Practices for Large-Scale Migrations
ENT212-An Overview of Best Practices for Large-Scale Migrations
 
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
Cost Optimizing Your Architecture: Practical Design Steps for Developer Savin...
 
AI / ML Services - re:Invent Comes to London 2.0
AI / ML Services - re:Invent Comes to London 2.0AI / ML Services - re:Invent Comes to London 2.0
AI / ML Services - re:Invent Comes to London 2.0
 
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
Case Study: Ola Cabs Uses Amazon EBS and Elastic Volumes to Maximize MySQL De...
 
DataXDay - Machine learning models at scale with Amazon SageMaker
DataXDay - Machine learning models at scale with Amazon SageMaker DataXDay - Machine learning models at scale with Amazon SageMaker
DataXDay - Machine learning models at scale with Amazon SageMaker
 
ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...
ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...
ARC207 Monitoring Performance of Enterprise Applications on AWS: Understandin...
 
Working with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model TrainingWorking with Amazon SageMaker Algorithms for Faster Model Training
Working with Amazon SageMaker Algorithms for Faster Model Training
 
Building a Multi-Region, Active-Active Serverless Backends.
Building a Multi-Region, Active-Active Serverless Backends.Building a Multi-Region, Active-Active Serverless Backends.
Building a Multi-Region, Active-Active Serverless Backends.
 
re:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Servicesre:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Services
 
透過 AWS 上的終端使用者計算解決方案提升工作效率
透過 AWS 上的終端使用者計算解決方案提升工作效率 透過 AWS 上的終端使用者計算解決方案提升工作效率
透過 AWS 上的終端使用者計算解決方案提升工作效率
 
Enterprise Innovation? Yes, with AWS Cloud, AI, and IoT - WIN201 - re:Invent ...
Enterprise Innovation? Yes, with AWS Cloud, AI, and IoT - WIN201 - re:Invent ...Enterprise Innovation? Yes, with AWS Cloud, AI, and IoT - WIN201 - re:Invent ...
Enterprise Innovation? Yes, with AWS Cloud, AI, and IoT - WIN201 - re:Invent ...
 
Chaos Engineering: Why Breaking Things Should Be Practised.
Chaos Engineering: Why Breaking Things Should Be Practised.Chaos Engineering: Why Breaking Things Should Be Practised.
Chaos Engineering: Why Breaking Things Should Be Practised.
 

More from Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Keith Steward - SageMaker Algorithms Infinitely Scalable Machine Learning_VK.pdf

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Amazon SageMaker Algorithms: Infinitely Scalable Machine Learning Keith Steward Sr. Specialist SA
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved What is Amazon SageMaker? Exploration Training Hosting
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved The Amazon Machine Learning Stack FRAMEWORKS & INTERFACES Caffe2 CNTK Apache MXNet PyTorch TensorFlo w Chainer Keras Gluon AWS Deep Learning AMIs Amazon SageMaker Rekognition Transcribe Translate Polly Comprehend Lex AWS DeepLens EDUCATION PLATFORM SERVICES APPLICATION SERVICES Amazon Mechanical Turk
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Challenges in Machine Learning
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Machine Learning
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Large Scale Machine Learning
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Large Scale Machine Learning
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Our Customers use ML at a massive scale “We collect 160M events daily in the ML pipeline and run training over the last 15 days and need it to complete in one hour. Effectively there's 100M features in the model” Valentino Volonghi, CTO “We process 3 million ad requests a second, 100,000 features per request. That’s 250 trillion per day. Not your run of the mill Data science problem!” Bill Simmons, CTO “Our data warehouse is 100TB and we are processing 2TB daily. We're running mostly gradient boosting (trees), LDA and K-Means clustering and collaborative filtering.“ Shahar Cizer Kobrinsky, VP Architecture
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine Distributed, with Strong Machines
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cost vs. Time $$$$ $$$ $$ $ Minutes Hours Days Weeks Months Single Machine Distributed, with Strong Machines
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1 1
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 2 3 1 2
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level Unusable Data / Wasted opportunity
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Architecture and Design Streaming, GPU/CPU, Distributed with a Shared State
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Streaming State
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Streaming Data Size Memory Data Size Time/Cost
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 2 3 1 2
  • 22. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Incremental Training 3 1 2
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved GPU/CPU GPU State
  • 24. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Distributed GPU State GPU State GPU State
  • 25. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Shared State GPU GPU GPU Local State Shared State Local State Local State
  • 26. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Cost vs. Time vs. Accuracy $$$$ $$$ $$ $ Minutes Hours Days Weeks Months
  • 27. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved State Model GPU State
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1 1
  • 29. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Model Selection 1
  • 30. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Abstraction and Containerization def initialize(...) def update(...) def finalize(...)
  • 31. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Production Readiness Data/Model Size Investment Reasonable Investment Level No unusable Data / No wasted opportunity
  • 32. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Amazon SageMaker Algorithms • DeepAR • Factorization Machines • Gradient Boosted Trees (XGBoost) • Image Classification (ResNet) • K-Means Clustering • Latent Dirichlet Allocation (LDA) • Linear Learner Classification and Regression • Neural Topic Modeling (NTM) • Principal Components Analysis (PCA) • Random Cut Forest • Seq2Seq
  • 33. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Linear Learner Regression: Estimate a real valued function Binary Classification: Predict a 0/1 class
  • 34. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Linear Learner Train Fit thresholds and select Select model with best validation performance >8x speedup over naïve parallel training!
  • 35. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved 30GB datasets for web-spam and web-url classification
  • 36. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
  • 37. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved K-Means Clustering
  • 38. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
  • 39. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved
  • 40. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Principal Component Analysis (PCA)
  • 41. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Principal Component Analysis (PCA)
  • 42. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Neural Topic Modeling Encoder: feedforward net Input term counts vector Document Posterior Sampled Document Representation Decoder: Softmax Output term counts vector Perplexity vs. Number of Topic
  • 43. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved DeepAR –time series forecasting Mean absolute percentage error P90 Loss DeepAR R DeepAR R traffic Hourly occupancy rate of 963 bay area freeways 0.14 0.27 0.13 0.24 electricity Electricity use of 370 homes over time 0.07 0.11 0.08 0.09 pageviews Page view hits of websites 10k 0.32 0.32 0.44 0.31 180k 0.32 0.34 0.29 NA One hour on p2.xlarge, $1 Input Network
  • 44. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Using AmazonSageMaker Algorithms Command Line SageMaker Notebooks Amazon EMR
  • 45. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Input Data >> aws --profile <profile> --region us-west-2 sm create-training-job --training-job-name kmeans-demo --algorithm-specification TrainingImage=0123456789.dkr.ecr.us-east- 1.amazonaws.com/kmeanswebscale:latest,TrainingInputMode=File --role-arn "arn:aws:iam::0123456789:role/demo" --input-data-config '{"ChannelName": "train", "DataSource": {"S3DataSource":{"S3DataType": "S3Prefix", "S3Uri": "s3://kmeans_demo/train", "S3DataDistributionType": "FullyReplicated"}}, "CompressionType": "None", "RecordWrapperType": "None"}' --output-data-config S3OutputPath=s3://kmeans_demo/output --resource-config InstanceCount=2,InstanceType=c4.8xlarge,VolumeSizeInGB=50 --stopping-condition MaxRuntimeInHours=1 From Command Line Hardware
  • 46. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved From Amazon SageMaker Notebooks Parameters Hardware Start Training Host model
  • 47. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved From Amazon EMR Start Training Parameters Hardware Apply Model
  • 48. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved Questions?
  • 49. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved aws.amazon.com/activate Everything and Anything Startups Need to Get Started on AWS