SlideShare a Scribd company logo
1 of 48
Download to read offline
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Supercharge your Machine Learning
Model with Amazon Sagemaker
Giuseppe A. Porcelli
AWS Solutions Architecture EMEA
A LONG HISTORY OF ML AT AMAZON
THOUSANDS OF ENGINEERS ACROSS THE COMPANY FOCUSED ON AI
Personalized
recommendations
Inventing
entirely new
customer
experiences
Fulfillment
automation and
inventory
management
Drones Voice-driven
interactions
ML @ AWS
OUR MISSION
Put Machine Learning in the
hands of every developer and
data scientist
APPLICATION SERVICES
R E K O G N I T I O N R E K O G N I T I O N
V I D E O
P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D L E X
PLATFORMS Amazon SageMaker Amazon Mechanical Turk Spark on Amazon EMR
FRAMEWORKS
& INFRASTRUCTURE
K E R A S
F r a m e w o r k s I n t e r f a c e s
NVIDIA
Tesla V100 GPUs
(14x faster than P2)
P3
Machine Learning
AMIs
5,120 Tensor cores
128GB of memory
1 Petaflop of compute
NVLink 2.0
THE AWS MACHINE LEARNING STACK
Tens of thousands of customers running ML on AWS
LET’S REVIEW THE ML PROCESS
THE MACHINE LEARNING PROCESS
Business Problem -
ML problem
framing
Set Business Goals
• Domain knowledge
• Help formulate the right questions
THE MACHINE LEARNING PROCESS
Business Problem -
ML problem
framing
Data Collection
Data Integration
Data Preparation &
Cleaning
Build the Data Platform
• Amazon S3
• AWS Glue
• Amazon Athena
• Amazon EMR
• Amazon Redshift / Redshift Spectrum
• Amazon Kinesis
• AWS IoT Core
THE MACHINE LEARNING PROCESS
Data Visualization
& Analysis
Business Problem -
ML problem
framing
Data Collection
Data Integration
Data Preparation &
Cleaning
Feature
Engineering
Model Training &
Parameter Tuning
Model Evaluation
Experiment, Train, Tune and Evaluate
• Setup and manage Notebook
Environments
• Setup and manage Training Clusters
• Write Data Connectors
• Scale ML algorithms to large datasets
• Distribute ML training algorithm to
multiple machines
• Secure Model artifacts
THE MACHINE LEARNING PROCESS
Data Visualization
& Analysis
Business Problem -
ML problem
framing
Data Collection
Data Integration
Data Preparation &
Cleaning
Feature
Engineering
Are
Business
goals met?
Monitoring &
Debugging
- Predictions
Yes
Re-training
Model Training &
Parameter Tuning
Model Evaluation Model Deployment
Deploy, Monitor and Debug
• Setup and manage Model Inference
Clusters
• Manage and Auto-Scale Model
Inference APIs
• Monitor and Debug Model Predictions
• Models versioning and performance
tracking
• Automate New Model version
promotion to production (A/B testing)
THE MACHINE LEARNING PROCESS
Data Visualization
& Analysis
Business Problem -
ML problem
framing
Data Collection
Data Integration
Data Preparation &
Cleaning
Feature
Engineering
Are
Business
goals met?
Monitoring &
Debugging
- Predictions
YesNo
DataAugmentation
Feature
Augmentation
Re-training
Model Training &
Parameter Tuning
Model Evaluation Model Deployment
Enhance and re-train
• Add/Remove features
• Augment Data
WHERE SHOULD YOU SPEND YOUR TIME?
BUILD
DESIGN TRAINING EXPERIMENTS
BUILD TRAIN
RUN SCALABLE TRAINING
BUILD TRAIN
TUNE
TUNE HYPERPARAMETERS
BUILD TRAIN DEPLOY
TUNE
DEPLOY AND OPERATE
Amazon SageMaker
A managed service that provides the quickest and
easiest way for data scientists and developers to get
ML models from idea to production
AMAZON SAGEMAKER
Pre-built
notebooks for
common
problems
Built-in, high
performance
algorithms
BUILD
TRAIN & TUNE DEPLOY
BUILD, TRAIN, TUNE AND HOST YOUR OWN MODELS
AMAZON SAGEMAKER
Pre-built
notebooks for
common
problems
Built-in, high
performance
algorithms
One-click
training
BUILD TRAIN & TUNE
DEPLOY
Hyperparameter
optimization
BUILD, TRAIN, TUNE AND HOST YOUR OWN MODELS
AMAZON SAGEMAKER
Fully managed
hosting with auto-
scaling
One-click
deployment
Pre-built
notebooks for
common
problems
Built-in, high
performance
algorithms
One-click
training
BUILD TRAIN & TUNE DEPLOY
Hyperparameter
optimization
BUILD, TRAIN, TUNE AND HOST YOUR OWN MODELS
AMAZON SAGEMAKER
Fully managed
hosting with auto-
scaling
One-click
deployment
Pre-built
notebooks for
common
problems
Built-in, high
performance
algorithms
One-click
training
BUILD TRAIN & TUNE DEPLOY
End-to-end encryption with KMS
End-to-end VPC support
Compliance and audit capabilities
Metadata and experiment management capabilities
Pay as you go
Hyperparameter
optimization
BUILD, TRAIN, TUNE AND HOST YOUR OWN MODELS
AMAZON SAGEMAKER CUSTOMERS
“
- Ashok Srivastava, Chief Data Officer, Intuit
With Amazon SageMaker, we can accelerate our Artificial
Intelligence initiatives at scale by building and deploying our
algorithms on the platform. We will create novel large-scale
machine learning and AI algorithms and deploy them on this
platform to solve complex problems that can power
prosperity for our customers.
"
AMAZON SAGEMAKER @ INTUIT
Ad-hoc setup and management of
notebook environments
Limited choices for model
deployment
Competing for compute resources
across teams
Easy data exploration
in SageMaker notebooks
Building around virtualization for
flexibility
Auto-scalable model hosting
environment
From To
AMAZON SAGEMAKER CUSTOMERS
“
- Dr. Walter Scott, CTO of Maxar Technologies
and founder of DigitalGlobe
"
As the world’s leading provider of high-resolution Earth imagery, data and
analysis, DigitalGlobe works with enormous amounts of data every day.
DigitalGlobe is making it easier for people to find, access, and run compute
against our entire 100PB image library, which is stored in AWS’s cloud, to apply
deep learning to satellite imagery. We plan to use Amazon SageMaker to train
models against petabytes of Earth observation imagery datasets using hosted
Jupyter notebooks, so DigitalGlobe's Geospatial Big Data Platform (GBDX) users
can just push a button, create a model, and deploy it all within one scalable
distributed environment at scale.
AMAZON SAGEMAKER DEEP DIVE
AMAZON SAGEMAKER COMPONENTS
BUILT-IN ALGORITHMS
BRING YOUR OWN SCRIPT
BRING YOUR OWN ALGORITHM
NOTEBOOK INSTANCES
SDKs & LOCAL MODE
AWS CONSOLE
USER EXPERIENCE
ML TRAINING &
TUNING SERVICE
ML HOSTING
SERVICE
AMAZON SAGEMAKER COMPONENTS
BUILT-IN ALGORITHMS
BRING YOUR OWN SCRIPT
BRING YOUR OWN ALGORITHM
NOTEBOOK INSTANCES
SDKs & LOCAL MODE
AWS CONSOLE
USER EXPERIENCE
ML TRAINING &
TUNING SERVICE
ML HOSTING
SERVICE
NOTEBOOK INSTANCES
ZERO SETUP FOR EXPLORATORY DATA ANALYSIS
Authoring &
Notebooks
ETL Access to AWS
Database services
Access to S3 Data
Lake
VPC • Fully managed Jupyter notebook instances
• Choice of CPU and GPU ml instances
• Sample notebooks and «just add data»
• Recommendations/Personalization
• Fraud Detection
• Forecasting
• Image Classification
• Churn Prediction
• Marketing Email/Campaign Targeting
• Log processing and anomaly detection
• Speech to Text
• More…
• VPC Integration
• Lifecycle Configurations
SDKs AND LOCAL MODE
T r a i n w i t h
l o c a l n o t e b o o k s
Train on notebook
instances
PetaFLOP
training on p3.16xl
Go distributed
with one line of code
Same containers
Amazon SageMaker Python SDK
https://github.com/aws/sagemaker-python-sdk
Amazon SageMaker Spark SDK
https://github.com/aws/sagemaker-spark
LOCAL MODE
AWS CONSOLE
AMAZON SAGEMAKER COMPONENTS
BUILT-IN ALGORITHMS
BRING YOUR OWN SCRIPT
BRING YOUR OWN ALGORITHM
NOTEBOOK INSTANCES
SDKs & LOCAL MODE
AWS CONSOLE
USER EXPERIENCE
ML TRAINING &
TUNING SERVICE
ML HOSTING
SERVICE
MANAGED DISTRIBUTED TRAINING
Fully
managed –
VPC–
Training Code
Training Data Model Artifacts
CPU ML INSTANCES GPU ML INSTANCES HYPERPARAMETER TUNING
BUILT-IN ALGORITHMS BRING YOUR OWN SCRIPT BRING YOUR OWN ALGORITHM
Amazon ECR
BUILT-IN ALGORITHMS
Data Model
NEW DATA
PREDICTION
Algorithm
K-Means
k-nearest neighbors (k-NN)
PCA
LDA
Factorization Machines
Linear Learner
NTM
RandomCutForest
Sequence to Sequence
XGBoost
Image Classification
Object Detection
DeepAR Forecasting
BlazingText
BUILT-IN ALGORITHMS
Algorithms for
“infinite scale”
Distributed by
default
Train on a
data stream
Checkpoint
for re-training
Single pass
training
Not memory
bound
BRING YOUR OWN SCRIPT
Data Model
NEW DATA
PREDICTION
Your Own
Script
+
BRING YOUR OWN ALGORITHM
Data Model
NEW DATA
PREDICTION
Your algorithm and libraries
in your own Docker Container
HYPERPARAMETER TUNING
Run a large set of training jobs
with varying hyperparameters...
... and search the
hyperparameter space for
improved accuracy.
HOSTING
Amazon ECR
Model Artifacts
Inference Image
Model
Create a Model
EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
HOSTING
Amazon ECR
Model Artifacts
Inference Image
Model versions
Create versions of a Model
EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
HOSTING
Amazon ECR
30 50
10 10
Model Artifacts
Inference Image
Model versions
InstanceType: ml.c5.4xlarge
InitialInstanceCount: 3
maxInstanceCount: 10
ModelName: prod
VariantName: primary
InitialVariantWeight: 50
Create weighted
ProductionVariant(s)
ProductionVariant
EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
HOSTING
Amazon ECR
30 50
10 10
Model Artifacts
Inference Image
Model versions
EndpointConfiguration
InstanceType: ml.c5.4xlarge
InitialInstanceCount: 3
maxInstanceCount: 10
ModelName: prod
VariantName: primary
InitialVariantWeight: 50
Create and
EndpointConfiguration from
one or many
ProductionVariant(s)
ProductionVariant
EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
HOSTING
Amazon ECR
30 50
10 10
Model Artifacts
Inference Image
Model versions
EndpointConfiguration
Inference Endpoint
InstanceType: ml.c5.4xlarge
InitialInstanceCount: 3
maxInstanceCount: 10
ModelName: prod
VariantName: primary
InitialVariantWeight: 50
Create and Endpoint from
one EndpointConfiguration
ProductionVariant
EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
HOSTING
Amazon ECR
30 50
10 10
Model Artifacts
Inference Image
Model versions
EndpointConfiguration
Inference Endpoint
InstanceType: ml.c5.4xlarge
InitialInstanceCount: 3
maxInstanceCount: 10
ModelName: prod
VariantName: primary
InitialVariantWeight: 50
One-click deployment for
built-in algorithms and
containers
ProductionVariant
EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
AUTO SCALING
BATCH TRANSFORM
Dataset in
S3 bucket
AGENT
MODEL
Instance Node 1 Instance Node n
Assembled Data
Record Batch
Request Data
Transformed
Data
…
Cluster
SAGEMAKER SAMPLE END-TO-END ARCHITECTURE
SageMaker
Notebooks
Training
Algorithm
SageMaker
Training
Amazon ECR
Code Commit
Code Pipeline
SageMaker
Hosting
Coco dataset
AWS
Lambda
API
Gateway
Build
Train
Deploy
static website hosted on S3
Inference requests
Amazon S3
Amazon
Cloudfront
Web assets on
Cloudfront
STYLE TRANSFER
IT’S NOT JUST ABOUT ML
Data Lake Storage
Amazon S3
Security
Access Control
Encryption
VPC
KMS
Auditing
Compliance
Roles
Fine Grained Access Controls
Compute
Powerful GPU & CPU Instances
AWS Lambda
Analytics
Amazon Athena
Amazon EMR
Amazon Redshift & Redshift Spectrum
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank You!

More Related Content

What's hot

Leadership Session: AWS IoT (IOT218-L) - AWS re:Invent 2018
Leadership Session: AWS IoT (IOT218-L) - AWS re:Invent 2018Leadership Session: AWS IoT (IOT218-L) - AWS re:Invent 2018
Leadership Session: AWS IoT (IOT218-L) - AWS re:Invent 2018Amazon Web Services
 
BDA308 Deep Dive: Log Analytics with Amazon Elasticsearch Service
BDA308 Deep Dive: Log Analytics with Amazon Elasticsearch ServiceBDA308 Deep Dive: Log Analytics with Amazon Elasticsearch Service
BDA308 Deep Dive: Log Analytics with Amazon Elasticsearch ServiceAmazon Web Services
 
Amazon on Amazon: How Amazon Designs Chips on AWS (MFG305) - AWS re:Invent 2018
Amazon on Amazon: How Amazon Designs Chips on AWS (MFG305) - AWS re:Invent 2018Amazon on Amazon: How Amazon Designs Chips on AWS (MFG305) - AWS re:Invent 2018
Amazon on Amazon: How Amazon Designs Chips on AWS (MFG305) - AWS re:Invent 2018Amazon Web Services
 
Why customers run SAP on AWS for Industry 4.0::Douglas Bellin::제조업 이노베이션 데이 S...
Why customers run SAP on AWS for Industry 4.0::Douglas Bellin::제조업 이노베이션 데이 S...Why customers run SAP on AWS for Industry 4.0::Douglas Bellin::제조업 이노베이션 데이 S...
Why customers run SAP on AWS for Industry 4.0::Douglas Bellin::제조업 이노베이션 데이 S...Amazon Web Services Korea
 
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...Amazon Web Services
 
Big Data on AWS - To infinity and beyond! - Tel Aviv Summit 2018
Big Data on AWS - To infinity and beyond! - Tel Aviv Summit 2018Big Data on AWS - To infinity and beyond! - Tel Aviv Summit 2018
Big Data on AWS - To infinity and beyond! - Tel Aviv Summit 2018Amazon Web Services
 
Scale Your SAP HANA In-Memory Database on Amazon EC2 High Memory Instances wi...
Scale Your SAP HANA In-Memory Database on Amazon EC2 High Memory Instances wi...Scale Your SAP HANA In-Memory Database on Amazon EC2 High Memory Instances wi...
Scale Your SAP HANA In-Memory Database on Amazon EC2 High Memory Instances wi...Amazon Web Services
 
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)Amazon Web Services
 
Work Anywhere with Amazon Workspaces (Level: 200)
Work Anywhere with Amazon Workspaces (Level: 200)Work Anywhere with Amazon Workspaces (Level: 200)
Work Anywhere with Amazon Workspaces (Level: 200)Amazon Web Services
 
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud Amazon Web Services
 
Easy Rider: How ML, Serverless, and IoT Drive Mobility as a Service (AMT302) ...
Easy Rider: How ML, Serverless, and IoT Drive Mobility as a Service (AMT302) ...Easy Rider: How ML, Serverless, and IoT Drive Mobility as a Service (AMT302) ...
Easy Rider: How ML, Serverless, and IoT Drive Mobility as a Service (AMT302) ...Amazon Web Services
 
物聯網創新應用:車聯網解決方案 IoT Story of Connected Vehicle Solution(Level 300)
物聯網創新應用:車聯網解決方案 IoT Story of Connected Vehicle Solution(Level 300)物聯網創新應用:車聯網解決方案 IoT Story of Connected Vehicle Solution(Level 300)
物聯網創新應用:車聯網解決方案 IoT Story of Connected Vehicle Solution(Level 300)Amazon Web Services
 
AWSome Day Geneva Main Track: Infrastructure Part 1.pdf
AWSome Day Geneva Main Track: Infrastructure Part 1.pdfAWSome Day Geneva Main Track: Infrastructure Part 1.pdf
AWSome Day Geneva Main Track: Infrastructure Part 1.pdfAmazon Web Services
 
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...Amazon Web Services
 
Managed Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDSManaged Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDSAmazon Web Services
 
How can your business benefit from going Serverless
How can your business benefit from going ServerlessHow can your business benefit from going Serverless
How can your business benefit from going ServerlessAmazon Web Services
 
AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
 AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
AWS Live Panel Discussion: Simplify Large-Scale IoT Device ManagementAmazon Web Services
 

What's hot (20)

Leadership Session: AWS IoT (IOT218-L) - AWS re:Invent 2018
Leadership Session: AWS IoT (IOT218-L) - AWS re:Invent 2018Leadership Session: AWS IoT (IOT218-L) - AWS re:Invent 2018
Leadership Session: AWS IoT (IOT218-L) - AWS re:Invent 2018
 
AWS 資料湖服務
AWS 資料湖服務AWS 資料湖服務
AWS 資料湖服務
 
BDA308 Deep Dive: Log Analytics with Amazon Elasticsearch Service
BDA308 Deep Dive: Log Analytics with Amazon Elasticsearch ServiceBDA308 Deep Dive: Log Analytics with Amazon Elasticsearch Service
BDA308 Deep Dive: Log Analytics with Amazon Elasticsearch Service
 
Amazon on Amazon: How Amazon Designs Chips on AWS (MFG305) - AWS re:Invent 2018
Amazon on Amazon: How Amazon Designs Chips on AWS (MFG305) - AWS re:Invent 2018Amazon on Amazon: How Amazon Designs Chips on AWS (MFG305) - AWS re:Invent 2018
Amazon on Amazon: How Amazon Designs Chips on AWS (MFG305) - AWS re:Invent 2018
 
Why customers run SAP on AWS for Industry 4.0::Douglas Bellin::제조업 이노베이션 데이 S...
Why customers run SAP on AWS for Industry 4.0::Douglas Bellin::제조업 이노베이션 데이 S...Why customers run SAP on AWS for Industry 4.0::Douglas Bellin::제조업 이노베이션 데이 S...
Why customers run SAP on AWS for Industry 4.0::Douglas Bellin::제조업 이노베이션 데이 S...
 
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
Running Lean Architectures: How to Optimize for Cost Efficiency (ARC202-R2) -...
 
Big Data on AWS - To infinity and beyond! - Tel Aviv Summit 2018
Big Data on AWS - To infinity and beyond! - Tel Aviv Summit 2018Big Data on AWS - To infinity and beyond! - Tel Aviv Summit 2018
Big Data on AWS - To infinity and beyond! - Tel Aviv Summit 2018
 
Scale Your SAP HANA In-Memory Database on Amazon EC2 High Memory Instances wi...
Scale Your SAP HANA In-Memory Database on Amazon EC2 High Memory Instances wi...Scale Your SAP HANA In-Memory Database on Amazon EC2 High Memory Instances wi...
Scale Your SAP HANA In-Memory Database on Amazon EC2 High Memory Instances wi...
 
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
雲上打造資料湖 (Data Lake):智能化駕馭商機 (Level 300)
 
Work Anywhere with Amazon Workspaces (Level: 200)
Work Anywhere with Amazon Workspaces (Level: 200)Work Anywhere with Amazon Workspaces (Level: 200)
Work Anywhere with Amazon Workspaces (Level: 200)
 
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
SRV304 IoT Building Blocks From Edge Devices to Analytics in the Cloud
 
GDPR x AWS 導覽 (Level 200)
GDPR x AWS 導覽 (Level 200)GDPR x AWS 導覽 (Level 200)
GDPR x AWS 導覽 (Level 200)
 
Easy Rider: How ML, Serverless, and IoT Drive Mobility as a Service (AMT302) ...
Easy Rider: How ML, Serverless, and IoT Drive Mobility as a Service (AMT302) ...Easy Rider: How ML, Serverless, and IoT Drive Mobility as a Service (AMT302) ...
Easy Rider: How ML, Serverless, and IoT Drive Mobility as a Service (AMT302) ...
 
物聯網創新應用:車聯網解決方案 IoT Story of Connected Vehicle Solution(Level 300)
物聯網創新應用:車聯網解決方案 IoT Story of Connected Vehicle Solution(Level 300)物聯網創新應用:車聯網解決方案 IoT Story of Connected Vehicle Solution(Level 300)
物聯網創新應用:車聯網解決方案 IoT Story of Connected Vehicle Solution(Level 300)
 
AWSome Day Geneva Main Track: Infrastructure Part 1.pdf
AWSome Day Geneva Main Track: Infrastructure Part 1.pdfAWSome Day Geneva Main Track: Infrastructure Part 1.pdf
AWSome Day Geneva Main Track: Infrastructure Part 1.pdf
 
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
Move Your Desktops and Applications to AWS with Amazon WorkSpaces and AppStre...
 
Managed Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDSManaged Relational Databases - Amazon RDS
Managed Relational Databases - Amazon RDS
 
Migrating database to cloud
Migrating database to cloudMigrating database to cloud
Migrating database to cloud
 
How can your business benefit from going Serverless
How can your business benefit from going ServerlessHow can your business benefit from going Serverless
How can your business benefit from going Serverless
 
AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
 AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
AWS Live Panel Discussion: Simplify Large-Scale IoT Device Management
 

Similar to Machine Learning in azione con Amazon SageMaker

Data Summer Conf 2018, “Build, train, and deploy machine learning models at s...
Data Summer Conf 2018, “Build, train, and deploy machine learning models at s...Data Summer Conf 2018, “Build, train, and deploy machine learning models at s...
Data Summer Conf 2018, “Build, train, and deploy machine learning models at s...Provectus
 
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scalaSviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scalaAmazon Web Services
 
Integrating Amazon SageMaker into your Enterprise - AWS Online Tech Talks
Integrating Amazon SageMaker into your Enterprise - AWS Online Tech TalksIntegrating Amazon SageMaker into your Enterprise - AWS Online Tech Talks
Integrating Amazon SageMaker into your Enterprise - AWS Online Tech TalksAmazon Web Services
 
Supercharge your Machine Learning Solutions with Amazon SageMaker
Supercharge your Machine Learning Solutions with Amazon SageMakerSupercharge your Machine Learning Solutions with Amazon SageMaker
Supercharge your Machine Learning Solutions with Amazon SageMakerAmazon Web Services
 
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018Amazon Web Services Korea
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitAmazon Web Services
 
From notebook to production with Amazon Sagemaker
From notebook to production with Amazon SagemakerFrom notebook to production with Amazon Sagemaker
From notebook to production with Amazon SagemakerAmazon Web Services
 
Building, Training and Deploying Custom Algorithms with Amazon SageMaker
Building, Training and Deploying Custom Algorithms with Amazon SageMakerBuilding, Training and Deploying Custom Algorithms with Amazon SageMaker
Building, Training and Deploying Custom Algorithms with Amazon SageMakerAmazon Web Services
 
Train ML Models Using Amazon SageMaker with TensorFlow - SRV336 - Chicago AWS...
Train ML Models Using Amazon SageMaker with TensorFlow - SRV336 - Chicago AWS...Train ML Models Using Amazon SageMaker with TensorFlow - SRV336 - Chicago AWS...
Train ML Models Using Amazon SageMaker with TensorFlow - SRV336 - Chicago AWS...Amazon Web Services
 
AWS reinvent 2019 recap - Riyadh - AI And ML - Ahmed Raafat
AWS reinvent 2019 recap - Riyadh - AI And ML - Ahmed RaafatAWS reinvent 2019 recap - Riyadh - AI And ML - Ahmed Raafat
AWS reinvent 2019 recap - Riyadh - AI And ML - Ahmed RaafatAWS Riyadh User Group
 
Amazon SageMaker workshop
Amazon SageMaker workshopAmazon SageMaker workshop
Amazon SageMaker workshopJulien SIMON
 
Supercharge Your Machine Learning Solutions with Amazon SageMaker
Supercharge Your Machine Learning Solutions with Amazon SageMakerSupercharge Your Machine Learning Solutions with Amazon SageMaker
Supercharge Your Machine Learning Solutions with Amazon SageMakerAmazon Web Services
 
Machine Learning with Amazon SageMaker
Machine Learning with Amazon SageMakerMachine Learning with Amazon SageMaker
Machine Learning with Amazon SageMakerVladimir Simek
 
Train & Deploy ML Models with Amazon Sagemaker: Collision 2018
Train & Deploy ML Models with Amazon Sagemaker: Collision 2018Train & Deploy ML Models with Amazon Sagemaker: Collision 2018
Train & Deploy ML Models with Amazon Sagemaker: Collision 2018Amazon Web Services
 
Integrating Deep Learning In the Enterprise
Integrating Deep Learning In the EnterpriseIntegrating Deep Learning In the Enterprise
Integrating Deep Learning In the EnterpriseAmazon Web Services
 
Integrating Deep Learning into your Enterprise
Integrating Deep Learning into your EnterpriseIntegrating Deep Learning into your Enterprise
Integrating Deep Learning into your EnterpriseAmazon Web Services
 
Setting up custom machine learning environments on AWS - AIM204 - Chicago AWS...
Setting up custom machine learning environments on AWS - AIM204 - Chicago AWS...Setting up custom machine learning environments on AWS - AIM204 - Chicago AWS...
Setting up custom machine learning environments on AWS - AIM204 - Chicago AWS...Amazon Web Services
 
Integrating Deep Learning Into Your Enterprise
Integrating Deep Learning Into Your EnterpriseIntegrating Deep Learning Into Your Enterprise
Integrating Deep Learning Into Your EnterpriseAmazon Web Services
 

Similar to Machine Learning in azione con Amazon SageMaker (20)

Data Summer Conf 2018, “Build, train, and deploy machine learning models at s...
Data Summer Conf 2018, “Build, train, and deploy machine learning models at s...Data Summer Conf 2018, “Build, train, and deploy machine learning models at s...
Data Summer Conf 2018, “Build, train, and deploy machine learning models at s...
 
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scalaSviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
Sviluppa, addestra e distribuisci modelli di Machine learning su qualsiasi scala
 
Integrating Amazon SageMaker into your Enterprise - AWS Online Tech Talks
Integrating Amazon SageMaker into your Enterprise - AWS Online Tech TalksIntegrating Amazon SageMaker into your Enterprise - AWS Online Tech Talks
Integrating Amazon SageMaker into your Enterprise - AWS Online Tech Talks
 
Supercharge your Machine Learning Solutions with Amazon SageMaker
Supercharge your Machine Learning Solutions with Amazon SageMakerSupercharge your Machine Learning Solutions with Amazon SageMaker
Supercharge your Machine Learning Solutions with Amazon SageMaker
 
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
엔터프라이즈를 위한 머신러닝 그리고 AWS (김일호 솔루션즈 아키텍트, AWS) :: AWS Techforum 2018
 
Amazon SageMaker
Amazon SageMakerAmazon SageMaker
Amazon SageMaker
 
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS SummitWork with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
Work with Machine Learning in Amazon SageMaker - BDA203 - Atlanta AWS Summit
 
Introducing Amazon SageMaker
Introducing Amazon SageMakerIntroducing Amazon SageMaker
Introducing Amazon SageMaker
 
From notebook to production with Amazon Sagemaker
From notebook to production with Amazon SagemakerFrom notebook to production with Amazon Sagemaker
From notebook to production with Amazon Sagemaker
 
Building, Training and Deploying Custom Algorithms with Amazon SageMaker
Building, Training and Deploying Custom Algorithms with Amazon SageMakerBuilding, Training and Deploying Custom Algorithms with Amazon SageMaker
Building, Training and Deploying Custom Algorithms with Amazon SageMaker
 
Train ML Models Using Amazon SageMaker with TensorFlow - SRV336 - Chicago AWS...
Train ML Models Using Amazon SageMaker with TensorFlow - SRV336 - Chicago AWS...Train ML Models Using Amazon SageMaker with TensorFlow - SRV336 - Chicago AWS...
Train ML Models Using Amazon SageMaker with TensorFlow - SRV336 - Chicago AWS...
 
AWS reinvent 2019 recap - Riyadh - AI And ML - Ahmed Raafat
AWS reinvent 2019 recap - Riyadh - AI And ML - Ahmed RaafatAWS reinvent 2019 recap - Riyadh - AI And ML - Ahmed Raafat
AWS reinvent 2019 recap - Riyadh - AI And ML - Ahmed Raafat
 
Amazon SageMaker workshop
Amazon SageMaker workshopAmazon SageMaker workshop
Amazon SageMaker workshop
 
Supercharge Your Machine Learning Solutions with Amazon SageMaker
Supercharge Your Machine Learning Solutions with Amazon SageMakerSupercharge Your Machine Learning Solutions with Amazon SageMaker
Supercharge Your Machine Learning Solutions with Amazon SageMaker
 
Machine Learning with Amazon SageMaker
Machine Learning with Amazon SageMakerMachine Learning with Amazon SageMaker
Machine Learning with Amazon SageMaker
 
Train & Deploy ML Models with Amazon Sagemaker: Collision 2018
Train & Deploy ML Models with Amazon Sagemaker: Collision 2018Train & Deploy ML Models with Amazon Sagemaker: Collision 2018
Train & Deploy ML Models with Amazon Sagemaker: Collision 2018
 
Integrating Deep Learning In the Enterprise
Integrating Deep Learning In the EnterpriseIntegrating Deep Learning In the Enterprise
Integrating Deep Learning In the Enterprise
 
Integrating Deep Learning into your Enterprise
Integrating Deep Learning into your EnterpriseIntegrating Deep Learning into your Enterprise
Integrating Deep Learning into your Enterprise
 
Setting up custom machine learning environments on AWS - AIM204 - Chicago AWS...
Setting up custom machine learning environments on AWS - AIM204 - Chicago AWS...Setting up custom machine learning environments on AWS - AIM204 - Chicago AWS...
Setting up custom machine learning environments on AWS - AIM204 - Chicago AWS...
 
Integrating Deep Learning Into Your Enterprise
Integrating Deep Learning Into Your EnterpriseIntegrating Deep Learning Into Your Enterprise
Integrating Deep Learning Into Your Enterprise
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Machine Learning in azione con Amazon SageMaker

  • 1. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Supercharge your Machine Learning Model with Amazon Sagemaker Giuseppe A. Porcelli AWS Solutions Architecture EMEA
  • 2. A LONG HISTORY OF ML AT AMAZON THOUSANDS OF ENGINEERS ACROSS THE COMPANY FOCUSED ON AI Personalized recommendations Inventing entirely new customer experiences Fulfillment automation and inventory management Drones Voice-driven interactions
  • 3. ML @ AWS OUR MISSION Put Machine Learning in the hands of every developer and data scientist
  • 4. APPLICATION SERVICES R E K O G N I T I O N R E K O G N I T I O N V I D E O P O L L Y T R A N S C R I B E T R A N S L A T E C O M P R E H E N D L E X PLATFORMS Amazon SageMaker Amazon Mechanical Turk Spark on Amazon EMR FRAMEWORKS & INFRASTRUCTURE K E R A S F r a m e w o r k s I n t e r f a c e s NVIDIA Tesla V100 GPUs (14x faster than P2) P3 Machine Learning AMIs 5,120 Tensor cores 128GB of memory 1 Petaflop of compute NVLink 2.0 THE AWS MACHINE LEARNING STACK
  • 5. Tens of thousands of customers running ML on AWS
  • 6. LET’S REVIEW THE ML PROCESS
  • 7. THE MACHINE LEARNING PROCESS Business Problem - ML problem framing Set Business Goals • Domain knowledge • Help formulate the right questions
  • 8. THE MACHINE LEARNING PROCESS Business Problem - ML problem framing Data Collection Data Integration Data Preparation & Cleaning Build the Data Platform • Amazon S3 • AWS Glue • Amazon Athena • Amazon EMR • Amazon Redshift / Redshift Spectrum • Amazon Kinesis • AWS IoT Core
  • 9. THE MACHINE LEARNING PROCESS Data Visualization & Analysis Business Problem - ML problem framing Data Collection Data Integration Data Preparation & Cleaning Feature Engineering Model Training & Parameter Tuning Model Evaluation Experiment, Train, Tune and Evaluate • Setup and manage Notebook Environments • Setup and manage Training Clusters • Write Data Connectors • Scale ML algorithms to large datasets • Distribute ML training algorithm to multiple machines • Secure Model artifacts
  • 10. THE MACHINE LEARNING PROCESS Data Visualization & Analysis Business Problem - ML problem framing Data Collection Data Integration Data Preparation & Cleaning Feature Engineering Are Business goals met? Monitoring & Debugging - Predictions Yes Re-training Model Training & Parameter Tuning Model Evaluation Model Deployment Deploy, Monitor and Debug • Setup and manage Model Inference Clusters • Manage and Auto-Scale Model Inference APIs • Monitor and Debug Model Predictions • Models versioning and performance tracking • Automate New Model version promotion to production (A/B testing)
  • 11. THE MACHINE LEARNING PROCESS Data Visualization & Analysis Business Problem - ML problem framing Data Collection Data Integration Data Preparation & Cleaning Feature Engineering Are Business goals met? Monitoring & Debugging - Predictions YesNo DataAugmentation Feature Augmentation Re-training Model Training & Parameter Tuning Model Evaluation Model Deployment Enhance and re-train • Add/Remove features • Augment Data
  • 12. WHERE SHOULD YOU SPEND YOUR TIME?
  • 17. Amazon SageMaker A managed service that provides the quickest and easiest way for data scientists and developers to get ML models from idea to production
  • 18. AMAZON SAGEMAKER Pre-built notebooks for common problems Built-in, high performance algorithms BUILD TRAIN & TUNE DEPLOY BUILD, TRAIN, TUNE AND HOST YOUR OWN MODELS
  • 19. AMAZON SAGEMAKER Pre-built notebooks for common problems Built-in, high performance algorithms One-click training BUILD TRAIN & TUNE DEPLOY Hyperparameter optimization BUILD, TRAIN, TUNE AND HOST YOUR OWN MODELS
  • 20. AMAZON SAGEMAKER Fully managed hosting with auto- scaling One-click deployment Pre-built notebooks for common problems Built-in, high performance algorithms One-click training BUILD TRAIN & TUNE DEPLOY Hyperparameter optimization BUILD, TRAIN, TUNE AND HOST YOUR OWN MODELS
  • 21. AMAZON SAGEMAKER Fully managed hosting with auto- scaling One-click deployment Pre-built notebooks for common problems Built-in, high performance algorithms One-click training BUILD TRAIN & TUNE DEPLOY End-to-end encryption with KMS End-to-end VPC support Compliance and audit capabilities Metadata and experiment management capabilities Pay as you go Hyperparameter optimization BUILD, TRAIN, TUNE AND HOST YOUR OWN MODELS
  • 22. AMAZON SAGEMAKER CUSTOMERS “ - Ashok Srivastava, Chief Data Officer, Intuit With Amazon SageMaker, we can accelerate our Artificial Intelligence initiatives at scale by building and deploying our algorithms on the platform. We will create novel large-scale machine learning and AI algorithms and deploy them on this platform to solve complex problems that can power prosperity for our customers. "
  • 23. AMAZON SAGEMAKER @ INTUIT Ad-hoc setup and management of notebook environments Limited choices for model deployment Competing for compute resources across teams Easy data exploration in SageMaker notebooks Building around virtualization for flexibility Auto-scalable model hosting environment From To
  • 24. AMAZON SAGEMAKER CUSTOMERS “ - Dr. Walter Scott, CTO of Maxar Technologies and founder of DigitalGlobe " As the world’s leading provider of high-resolution Earth imagery, data and analysis, DigitalGlobe works with enormous amounts of data every day. DigitalGlobe is making it easier for people to find, access, and run compute against our entire 100PB image library, which is stored in AWS’s cloud, to apply deep learning to satellite imagery. We plan to use Amazon SageMaker to train models against petabytes of Earth observation imagery datasets using hosted Jupyter notebooks, so DigitalGlobe's Geospatial Big Data Platform (GBDX) users can just push a button, create a model, and deploy it all within one scalable distributed environment at scale.
  • 26. AMAZON SAGEMAKER COMPONENTS BUILT-IN ALGORITHMS BRING YOUR OWN SCRIPT BRING YOUR OWN ALGORITHM NOTEBOOK INSTANCES SDKs & LOCAL MODE AWS CONSOLE USER EXPERIENCE ML TRAINING & TUNING SERVICE ML HOSTING SERVICE
  • 27. AMAZON SAGEMAKER COMPONENTS BUILT-IN ALGORITHMS BRING YOUR OWN SCRIPT BRING YOUR OWN ALGORITHM NOTEBOOK INSTANCES SDKs & LOCAL MODE AWS CONSOLE USER EXPERIENCE ML TRAINING & TUNING SERVICE ML HOSTING SERVICE
  • 28. NOTEBOOK INSTANCES ZERO SETUP FOR EXPLORATORY DATA ANALYSIS Authoring & Notebooks ETL Access to AWS Database services Access to S3 Data Lake VPC • Fully managed Jupyter notebook instances • Choice of CPU and GPU ml instances • Sample notebooks and «just add data» • Recommendations/Personalization • Fraud Detection • Forecasting • Image Classification • Churn Prediction • Marketing Email/Campaign Targeting • Log processing and anomaly detection • Speech to Text • More… • VPC Integration • Lifecycle Configurations
  • 29. SDKs AND LOCAL MODE T r a i n w i t h l o c a l n o t e b o o k s Train on notebook instances PetaFLOP training on p3.16xl Go distributed with one line of code Same containers Amazon SageMaker Python SDK https://github.com/aws/sagemaker-python-sdk Amazon SageMaker Spark SDK https://github.com/aws/sagemaker-spark LOCAL MODE
  • 31. AMAZON SAGEMAKER COMPONENTS BUILT-IN ALGORITHMS BRING YOUR OWN SCRIPT BRING YOUR OWN ALGORITHM NOTEBOOK INSTANCES SDKs & LOCAL MODE AWS CONSOLE USER EXPERIENCE ML TRAINING & TUNING SERVICE ML HOSTING SERVICE
  • 32. MANAGED DISTRIBUTED TRAINING Fully managed – VPC– Training Code Training Data Model Artifacts CPU ML INSTANCES GPU ML INSTANCES HYPERPARAMETER TUNING BUILT-IN ALGORITHMS BRING YOUR OWN SCRIPT BRING YOUR OWN ALGORITHM Amazon ECR
  • 33. BUILT-IN ALGORITHMS Data Model NEW DATA PREDICTION Algorithm K-Means k-nearest neighbors (k-NN) PCA LDA Factorization Machines Linear Learner NTM RandomCutForest Sequence to Sequence XGBoost Image Classification Object Detection DeepAR Forecasting BlazingText
  • 34. BUILT-IN ALGORITHMS Algorithms for “infinite scale” Distributed by default Train on a data stream Checkpoint for re-training Single pass training Not memory bound
  • 35. BRING YOUR OWN SCRIPT Data Model NEW DATA PREDICTION Your Own Script +
  • 36. BRING YOUR OWN ALGORITHM Data Model NEW DATA PREDICTION Your algorithm and libraries in your own Docker Container
  • 37. HYPERPARAMETER TUNING Run a large set of training jobs with varying hyperparameters... ... and search the hyperparameter space for improved accuracy.
  • 38. HOSTING Amazon ECR Model Artifacts Inference Image Model Create a Model EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
  • 39. HOSTING Amazon ECR Model Artifacts Inference Image Model versions Create versions of a Model EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
  • 40. HOSTING Amazon ECR 30 50 10 10 Model Artifacts Inference Image Model versions InstanceType: ml.c5.4xlarge InitialInstanceCount: 3 maxInstanceCount: 10 ModelName: prod VariantName: primary InitialVariantWeight: 50 Create weighted ProductionVariant(s) ProductionVariant EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
  • 41. HOSTING Amazon ECR 30 50 10 10 Model Artifacts Inference Image Model versions EndpointConfiguration InstanceType: ml.c5.4xlarge InitialInstanceCount: 3 maxInstanceCount: 10 ModelName: prod VariantName: primary InitialVariantWeight: 50 Create and EndpointConfiguration from one or many ProductionVariant(s) ProductionVariant EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
  • 42. HOSTING Amazon ECR 30 50 10 10 Model Artifacts Inference Image Model versions EndpointConfiguration Inference Endpoint InstanceType: ml.c5.4xlarge InitialInstanceCount: 3 maxInstanceCount: 10 ModelName: prod VariantName: primary InitialVariantWeight: 50 Create and Endpoint from one EndpointConfiguration ProductionVariant EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
  • 43. HOSTING Amazon ECR 30 50 10 10 Model Artifacts Inference Image Model versions EndpointConfiguration Inference Endpoint InstanceType: ml.c5.4xlarge InitialInstanceCount: 3 maxInstanceCount: 10 ModelName: prod VariantName: primary InitialVariantWeight: 50 One-click deployment for built-in algorithms and containers ProductionVariant EASY MODEL DEPLOYMENT TO AMAZON SAGEMAKER
  • 45. BATCH TRANSFORM Dataset in S3 bucket AGENT MODEL Instance Node 1 Instance Node n Assembled Data Record Batch Request Data Transformed Data … Cluster
  • 46. SAGEMAKER SAMPLE END-TO-END ARCHITECTURE SageMaker Notebooks Training Algorithm SageMaker Training Amazon ECR Code Commit Code Pipeline SageMaker Hosting Coco dataset AWS Lambda API Gateway Build Train Deploy static website hosted on S3 Inference requests Amazon S3 Amazon Cloudfront Web assets on Cloudfront STYLE TRANSFER
  • 47. IT’S NOT JUST ABOUT ML Data Lake Storage Amazon S3 Security Access Control Encryption VPC KMS Auditing Compliance Roles Fine Grained Access Controls Compute Powerful GPU & CPU Instances AWS Lambda Analytics Amazon Athena Amazon EMR Amazon Redshift & Redshift Spectrum
  • 48. © 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Thank You!