SlideShare a Scribd company logo
1 of 43
S U M M I T
SYDNEY
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Automatic Labelling and Model
Tuning with Amazon SageMaker
Eden Duthie
Specialist Solutions Architect
Amazon Web Services
Adam Lynch
Partner Solutions Architect
Amazon Web Services
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Agenda
• Introduction – 15 min
• Overview of Amazon SageMaker GroundTruth – 15 min
• Lab: Automatic Labelling with Amazon SageMaker GroundTruth – 45 min
• Working Lunch – 15 min
• Lab: Private Workforce Labelling – 30 min
• Overview of Model Tuning using Bayesian Optimisation – 15 min
• Lab: Automatic Model Tuning with Amazon SageMaker – 45 min
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS account for the labs
In order to complete this workshop you'll need an AWS Account with
admin access.
There are resources required by this workshop that are eligible for the
AWS free tier if your account is less than 12 months old.
We will supply some credits for other tasks.
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
What data science tasks can be automated?
?
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Data science tasks
• Problem formulation
• Data collection and selection
• Feature engineering
• Model selection
• Algorithm selection
• Neural network architecture
• Deployment
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AutoML – definition and goals
AutoML aims to maximise the performace of machine learning programs
without human assistance and subject to a computational budget.
Core goals:
a) Good performance: good generalisation performance across various
input data and learning tasks can be achieved
b) No assistance from humans: configurations can be automatically done
for machine learning tools
c) High computational efficiency: the program can return a reasonable
output within a limited budget
Taking the Human out of Learning Applications:
A Survey on Automated Machine Learning
Quanming Yao et al. arXiv:
1810.13306v3 [cs.AI] 17 Jan 2019
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
The challenges of AutoML adoption
Deep learning Human design Computational budget
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Some tough challenges in machine learning
Availability of labelled data The road not travelled Drift
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Promising approaches
Learning to learn Simulation Reinforcement learning
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
KERAS
AI Services
ML Services
ML Frameworks +
Infrastructure
Vision Speech Languages Chatbots Vertical
Amazon
SageMaker
Amazon SageMaker
Ground Truth
Amazon
Sagemaker RL
AWS Marketplace
for ML
Amazon
SageMaker Neo
AWS
Deepracer
AWS
Deeplens
Amazon
Personalize
Amazon
Forecast
Amazon
Textract
Amazon
Rekognition
Amazon
Lex
Amazon
Polly
Amazon
Transcribe
Amazon
Comprehend
Amazon
Translate
N E W
N E W
N E W
C5 C5n
P3 P3dn
Amazon Elastic Inference
AWS Inferentia
AWS Greengrass
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Data labeling and machine learning
Labelled data
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Data labelling is hard
Need to label large datasets
Requires humans to perform labelling
Becomes time consuming and costly
Difficult to achieve high accuracy for labels
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon SageMaker: Build, train, and deploy ML
1
2
3
1
2
3
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Key features
Automatic labeling via
machine learning
Ready-made and
custom workflows
Label
management
Private and public
human workforce
Amazon SageMaker Ground Truth
Label machine learning training data easily and accurately
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon SageMaker Ground Truth
How it works
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon SageMaker Ground Truth
How it works
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon SageMaker Ground Truth
How it works
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon SageMaker Ground Truth
How it works
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Amazon SageMaker Ground Truth
How it works
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 1: Create a Ground Truth labelling job
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 2: Provide details for a labeling job
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 2: Provide details for a labeling job
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Step 2: Provide details for a labeling
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Labelling job is now running
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Labelling app: Human workers label the images
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS Management Console: View labels for images
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
AWS Management Console: View labels
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Solving the following problem:
𝑚𝑎𝑥 𝑥 ∈ 𝐴 𝑓(𝑥)
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Defining ‘A’ using Amazon SageMaker
"ParameterRanges": {
"CategoricalParameterRanges":
[
{ "Name": "tree_method",
"Values": ["auto", "exact", "approx", "hist"]}
],
"ContinuousParameterRanges":
[
{ "Name": "eta", "MaxValue" : "0.5", "MinValue": "0" }
],
"IntegerParameterRanges":
[
{ "Name": "max_depth", "MaxValue": "10", "MinValue": "1", }
]
}
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Gaussian process regression
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Gaussian process posterior on the objective function
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Acquisition function
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
On Amazon Sagemaker – configure the tuning job
tuning_job_config = {
"ParameterRanges": {…}
"ResourceLimits": { "MaxNumberOfTrainingJobs": 20, "MaxParallelTrainingJobs": 3 },
"Strategy": "Bayesian",
"HyperParameterTuningJobObjective": {
"MetricName": "validation:auc",
"Type": "Maximize"
}
}
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Launch the tuning job
tuning_job = smclient.create_hyper_parameter_tuning_job(
HyperParameterTuningJobName = "MyTuningJob“,
HyperParameterTuningJobConfig = tuning_job_config,
TrainingJobDefinition = training_job_definition)
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Thank you!
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Eden Duthie
duthiee@amazon.com
Adam Lynch
atlynch@amazon.com

More Related Content

What's hot

From notebook to production with Amazon Sagemaker
From notebook to production with Amazon SagemakerFrom notebook to production with Amazon Sagemaker
From notebook to production with Amazon SagemakerAmazon Web Services
 
Unsupervised Learning with Amazon SageMaker (AIM333) - AWS re:Invent 2018
Unsupervised Learning with Amazon SageMaker (AIM333) - AWS re:Invent 2018Unsupervised Learning with Amazon SageMaker (AIM333) - AWS re:Invent 2018
Unsupervised Learning with Amazon SageMaker (AIM333) - AWS re:Invent 2018Amazon Web Services
 
Deep Learning on Amazon SageMaker (October 2018)
Deep Learning on Amazon SageMaker (October 2018)Deep Learning on Amazon SageMaker (October 2018)
Deep Learning on Amazon SageMaker (October 2018)Julien SIMON
 
Unlock the Full Potential of Your Media Assets, ft. Fox Entertainment Group (...
Unlock the Full Potential of Your Media Assets, ft. Fox Entertainment Group (...Unlock the Full Potential of Your Media Assets, ft. Fox Entertainment Group (...
Unlock the Full Potential of Your Media Assets, ft. Fox Entertainment Group (...Amazon Web Services
 
Build Deep Learning Applications Using TensorFlow and Amazon SageMaker (AIM42...
Build Deep Learning Applications Using TensorFlow and Amazon SageMaker (AIM42...Build Deep Learning Applications Using TensorFlow and Amazon SageMaker (AIM42...
Build Deep Learning Applications Using TensorFlow and Amazon SageMaker (AIM42...Amazon Web Services
 
The Future of AI (September 2019)
The Future of AI (September 2019)The Future of AI (September 2019)
The Future of AI (September 2019)Julien SIMON
 
Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)Julien SIMON
 
Human-in-the-Loop for Machine Learning (AIM358-R1) - AWS re:Invent 2018
Human-in-the-Loop for Machine Learning (AIM358-R1) - AWS re:Invent 2018Human-in-the-Loop for Machine Learning (AIM358-R1) - AWS re:Invent 2018
Human-in-the-Loop for Machine Learning (AIM358-R1) - AWS re:Invent 2018Amazon Web Services
 
Machine Learning: From Inception to Inference - AWS Summit Sydney
Machine Learning: From Inception to Inference - AWS Summit SydneyMachine Learning: From Inception to Inference - AWS Summit Sydney
Machine Learning: From Inception to Inference - AWS Summit SydneyAmazon Web Services
 
Create a Serverless Searchable Media Library (AIM342-R1) - AWS re:Invent 2018
Create a Serverless Searchable Media Library (AIM342-R1) - AWS re:Invent 2018Create a Serverless Searchable Media Library (AIM342-R1) - AWS re:Invent 2018
Create a Serverless Searchable Media Library (AIM342-R1) - AWS re:Invent 2018Amazon Web Services
 
Starting your AI/ML project right (May 2020)
Starting your AI/ML project right (May 2020)Starting your AI/ML project right (May 2020)
Starting your AI/ML project right (May 2020)Julien SIMON
 
Detecting Financial Market Manipulation Using Machine Learning (AIM347) - AWS...
Detecting Financial Market Manipulation Using Machine Learning (AIM347) - AWS...Detecting Financial Market Manipulation Using Machine Learning (AIM347) - AWS...
Detecting Financial Market Manipulation Using Machine Learning (AIM347) - AWS...Amazon Web Services
 
Building and deploying AI/ML models on AWS for Biosciences professionals
Building and deploying AI/ML models on AWS for Biosciences professionalsBuilding and deploying AI/ML models on AWS for Biosciences professionals
Building and deploying AI/ML models on AWS for Biosciences professionalsjavier ramirez
 
Customer-Obsessed Digital User Engagement
Customer-Obsessed Digital User EngagementCustomer-Obsessed Digital User Engagement
Customer-Obsessed Digital User EngagementAmazon Web Services
 
Crafting a Conversational Platform Strategy (AIM338) - AWS re:Invent 2018
Crafting a Conversational Platform Strategy (AIM338) - AWS re:Invent 2018Crafting a Conversational Platform Strategy (AIM338) - AWS re:Invent 2018
Crafting a Conversational Platform Strategy (AIM338) - AWS re:Invent 2018Amazon Web Services
 
Mcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbangaMcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbangaDan Romuald Mbanga
 
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)Julien SIMON
 
Business Process Automation Using Crowdsourcing (AIM352) - AWS re:Invent 2018
Business Process Automation Using Crowdsourcing (AIM352) - AWS re:Invent 2018Business Process Automation Using Crowdsourcing (AIM352) - AWS re:Invent 2018
Business Process Automation Using Crowdsourcing (AIM352) - AWS re:Invent 2018Amazon Web Services
 
Fraud Prevention and Detection on AWS
Fraud Prevention and Detection on AWSFraud Prevention and Detection on AWS
Fraud Prevention and Detection on AWSAmazon Web Services
 

What's hot (20)

From notebook to production with Amazon Sagemaker
From notebook to production with Amazon SagemakerFrom notebook to production with Amazon Sagemaker
From notebook to production with Amazon Sagemaker
 
Unsupervised Learning with Amazon SageMaker (AIM333) - AWS re:Invent 2018
Unsupervised Learning with Amazon SageMaker (AIM333) - AWS re:Invent 2018Unsupervised Learning with Amazon SageMaker (AIM333) - AWS re:Invent 2018
Unsupervised Learning with Amazon SageMaker (AIM333) - AWS re:Invent 2018
 
Deep Learning on Amazon SageMaker (October 2018)
Deep Learning on Amazon SageMaker (October 2018)Deep Learning on Amazon SageMaker (October 2018)
Deep Learning on Amazon SageMaker (October 2018)
 
Unlock the Full Potential of Your Media Assets, ft. Fox Entertainment Group (...
Unlock the Full Potential of Your Media Assets, ft. Fox Entertainment Group (...Unlock the Full Potential of Your Media Assets, ft. Fox Entertainment Group (...
Unlock the Full Potential of Your Media Assets, ft. Fox Entertainment Group (...
 
Build Deep Learning Applications Using TensorFlow and Amazon SageMaker (AIM42...
Build Deep Learning Applications Using TensorFlow and Amazon SageMaker (AIM42...Build Deep Learning Applications Using TensorFlow and Amazon SageMaker (AIM42...
Build Deep Learning Applications Using TensorFlow and Amazon SageMaker (AIM42...
 
The Future of AI (September 2019)
The Future of AI (September 2019)The Future of AI (September 2019)
The Future of AI (September 2019)
 
Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)Build, Train and Deploy Machine Learning Models at Scale (April 2019)
Build, Train and Deploy Machine Learning Models at Scale (April 2019)
 
Human-in-the-Loop for Machine Learning (AIM358-R1) - AWS re:Invent 2018
Human-in-the-Loop for Machine Learning (AIM358-R1) - AWS re:Invent 2018Human-in-the-Loop for Machine Learning (AIM358-R1) - AWS re:Invent 2018
Human-in-the-Loop for Machine Learning (AIM358-R1) - AWS re:Invent 2018
 
Machine Learning: From Inception to Inference - AWS Summit Sydney
Machine Learning: From Inception to Inference - AWS Summit SydneyMachine Learning: From Inception to Inference - AWS Summit Sydney
Machine Learning: From Inception to Inference - AWS Summit Sydney
 
Create a Serverless Searchable Media Library (AIM342-R1) - AWS re:Invent 2018
Create a Serverless Searchable Media Library (AIM342-R1) - AWS re:Invent 2018Create a Serverless Searchable Media Library (AIM342-R1) - AWS re:Invent 2018
Create a Serverless Searchable Media Library (AIM342-R1) - AWS re:Invent 2018
 
Starting your AI/ML project right (May 2020)
Starting your AI/ML project right (May 2020)Starting your AI/ML project right (May 2020)
Starting your AI/ML project right (May 2020)
 
Detecting Financial Market Manipulation Using Machine Learning (AIM347) - AWS...
Detecting Financial Market Manipulation Using Machine Learning (AIM347) - AWS...Detecting Financial Market Manipulation Using Machine Learning (AIM347) - AWS...
Detecting Financial Market Manipulation Using Machine Learning (AIM347) - AWS...
 
Amazon SageMaker
Amazon SageMakerAmazon SageMaker
Amazon SageMaker
 
Building and deploying AI/ML models on AWS for Biosciences professionals
Building and deploying AI/ML models on AWS for Biosciences professionalsBuilding and deploying AI/ML models on AWS for Biosciences professionals
Building and deploying AI/ML models on AWS for Biosciences professionals
 
Customer-Obsessed Digital User Engagement
Customer-Obsessed Digital User EngagementCustomer-Obsessed Digital User Engagement
Customer-Obsessed Digital User Engagement
 
Crafting a Conversational Platform Strategy (AIM338) - AWS re:Invent 2018
Crafting a Conversational Platform Strategy (AIM338) - AWS re:Invent 2018Crafting a Conversational Platform Strategy (AIM338) - AWS re:Invent 2018
Crafting a Conversational Platform Strategy (AIM338) - AWS re:Invent 2018
 
Mcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbangaMcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbanga
 
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
 
Business Process Automation Using Crowdsourcing (AIM352) - AWS re:Invent 2018
Business Process Automation Using Crowdsourcing (AIM352) - AWS re:Invent 2018Business Process Automation Using Crowdsourcing (AIM352) - AWS re:Invent 2018
Business Process Automation Using Crowdsourcing (AIM352) - AWS re:Invent 2018
 
Fraud Prevention and Detection on AWS
Fraud Prevention and Detection on AWSFraud Prevention and Detection on AWS
Fraud Prevention and Detection on AWS
 

Similar to Automatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit Sydney

Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMakerAutomatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMakerAmazon Web Services
 
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM205 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM205 ...Build accurate training datasets with Amazon SageMaker Ground Truth - AIM205 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM205 ...Amazon Web Services
 
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM305 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM305 ...Build accurate training datasets with Amazon SageMaker Ground Truth - AIM305 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM305 ...Amazon Web Services
 
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...Amazon Web Services
 
Build accurate training data sets with Amazon SageMaker Ground Truth - AIM302...
Build accurate training data sets with Amazon SageMaker Ground Truth - AIM302...Build accurate training data sets with Amazon SageMaker Ground Truth - AIM302...
Build accurate training data sets with Amazon SageMaker Ground Truth - AIM302...Amazon Web Services
 
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...Amazon Web Services
 
Rendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWSRendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWSAmazon Web Services
 
Optimize deep learning training and inferencing using GPU and Amazon SageMake...
Optimize deep learning training and inferencing using GPU and Amazon SageMake...Optimize deep learning training and inferencing using GPU and Amazon SageMake...
Optimize deep learning training and inferencing using GPU and Amazon SageMake...Amazon Web Services
 
Drive digital transformation with AI
Drive digital transformation with AIDrive digital transformation with AI
Drive digital transformation with AIAmazon Web Services
 
Build Highly Accurate Training Datasets Using Amazon SageMaker Ground Truth
Build Highly Accurate Training Datasets Using Amazon SageMaker Ground TruthBuild Highly Accurate Training Datasets Using Amazon SageMaker Ground Truth
Build Highly Accurate Training Datasets Using Amazon SageMaker Ground TruthAmazon Web Services
 
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit SydneyPredicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit SydneyAmazon Web Services
 
AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019
AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019
AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019Amazon Web Services
 
Building intelligent applications using AI services
Building intelligent applications using AI servicesBuilding intelligent applications using AI services
Building intelligent applications using AI servicesAmazon Web Services
 
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...Amazon Web Services
 
Become a Machine Learning Developer with AWS Services
Become a Machine Learning Developer with AWS ServicesBecome a Machine Learning Developer with AWS Services
Become a Machine Learning Developer with AWS ServicesAmazon Web Services
 
Become a Machine Learning developer with AWS (Avril 2019)
Become a Machine Learning developer with AWS (Avril 2019)Become a Machine Learning developer with AWS (Avril 2019)
Become a Machine Learning developer with AWS (Avril 2019)Julien SIMON
 
利用 AWS Step Functions 建構穩定的資料處理流程.pdf
利用 AWS Step Functions 建構穩定的資料處理流程.pdf利用 AWS Step Functions 建構穩定的資料處理流程.pdf
利用 AWS Step Functions 建構穩定的資料處理流程.pdfAmazon Web Services
 
利用 AWS Step Functions 建構穩健的資料處理流程
利用 AWS Step Functions 建構穩健的資料處理流程利用 AWS Step Functions 建構穩健的資料處理流程
利用 AWS Step Functions 建構穩健的資料處理流程Amazon Web Services
 
Machine Learning Analytics for the rest of us
Machine Learning Analytics for the rest of usMachine Learning Analytics for the rest of us
Machine Learning Analytics for the rest of usAmazon Web Services
 
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...Amazon Web Services
 

Similar to Automatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit Sydney (20)

Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMakerAutomatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
Automatic-Labelling-and-Model-Tuning-with-Amazon-SageMaker
 
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM205 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM205 ...Build accurate training datasets with Amazon SageMaker Ground Truth - AIM205 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM205 ...
 
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM305 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM305 ...Build accurate training datasets with Amazon SageMaker Ground Truth - AIM305 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM305 ...
 
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
Build Accurate Training Datasets with Amazon SageMaker Ground Truth - AIM302 ...
 
Build accurate training data sets with Amazon SageMaker Ground Truth - AIM302...
Build accurate training data sets with Amazon SageMaker Ground Truth - AIM302...Build accurate training data sets with Amazon SageMaker Ground Truth - AIM302...
Build accurate training data sets with Amazon SageMaker Ground Truth - AIM302...
 
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
 
Rendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWSRendi le tue app più smart con i servizi AI di AWS
Rendi le tue app più smart con i servizi AI di AWS
 
Optimize deep learning training and inferencing using GPU and Amazon SageMake...
Optimize deep learning training and inferencing using GPU and Amazon SageMake...Optimize deep learning training and inferencing using GPU and Amazon SageMake...
Optimize deep learning training and inferencing using GPU and Amazon SageMake...
 
Drive digital transformation with AI
Drive digital transformation with AIDrive digital transformation with AI
Drive digital transformation with AI
 
Build Highly Accurate Training Datasets Using Amazon SageMaker Ground Truth
Build Highly Accurate Training Datasets Using Amazon SageMaker Ground TruthBuild Highly Accurate Training Datasets Using Amazon SageMaker Ground Truth
Build Highly Accurate Training Datasets Using Amazon SageMaker Ground Truth
 
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit SydneyPredicting Demand In A Diverse Retail Environment - AWS Summit Sydney
Predicting Demand In A Diverse Retail Environment - AWS Summit Sydney
 
AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019
AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019
AIOps - Steps Towards Autonomous Operations - AWS Summit Sydney 2019
 
Building intelligent applications using AI services
Building intelligent applications using AI servicesBuilding intelligent applications using AI services
Building intelligent applications using AI services
 
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
Amazon SageMaker - ML for every developer & data scientist ft. Workday - AIM2...
 
Become a Machine Learning Developer with AWS Services
Become a Machine Learning Developer with AWS ServicesBecome a Machine Learning Developer with AWS Services
Become a Machine Learning Developer with AWS Services
 
Become a Machine Learning developer with AWS (Avril 2019)
Become a Machine Learning developer with AWS (Avril 2019)Become a Machine Learning developer with AWS (Avril 2019)
Become a Machine Learning developer with AWS (Avril 2019)
 
利用 AWS Step Functions 建構穩定的資料處理流程.pdf
利用 AWS Step Functions 建構穩定的資料處理流程.pdf利用 AWS Step Functions 建構穩定的資料處理流程.pdf
利用 AWS Step Functions 建構穩定的資料處理流程.pdf
 
利用 AWS Step Functions 建構穩健的資料處理流程
利用 AWS Step Functions 建構穩健的資料處理流程利用 AWS Step Functions 建構穩健的資料處理流程
利用 AWS Step Functions 建構穩健的資料處理流程
 
Machine Learning Analytics for the rest of us
Machine Learning Analytics for the rest of usMachine Learning Analytics for the rest of us
Machine Learning Analytics for the rest of us
 
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
Automate Security Event Management Using Trust-Based Decision Models - AWS Su...
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Automatic Labelling and Model Tuning with Amazon SageMaker - AWS Summit Sydney

  • 1. S U M M I T SYDNEY
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Automatic Labelling and Model Tuning with Amazon SageMaker Eden Duthie Specialist Solutions Architect Amazon Web Services Adam Lynch Partner Solutions Architect Amazon Web Services
  • 3. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Agenda • Introduction – 15 min • Overview of Amazon SageMaker GroundTruth – 15 min • Lab: Automatic Labelling with Amazon SageMaker GroundTruth – 45 min • Working Lunch – 15 min • Lab: Private Workforce Labelling – 30 min • Overview of Model Tuning using Bayesian Optimisation – 15 min • Lab: Automatic Model Tuning with Amazon SageMaker – 45 min
  • 4. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS account for the labs In order to complete this workshop you'll need an AWS Account with admin access. There are resources required by this workshop that are eligible for the AWS free tier if your account is less than 12 months old. We will supply some credits for other tasks.
  • 5. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 6. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T What data science tasks can be automated? ?
  • 7. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Data science tasks • Problem formulation • Data collection and selection • Feature engineering • Model selection • Algorithm selection • Neural network architecture • Deployment
  • 8. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AutoML – definition and goals AutoML aims to maximise the performace of machine learning programs without human assistance and subject to a computational budget. Core goals: a) Good performance: good generalisation performance across various input data and learning tasks can be achieved b) No assistance from humans: configurations can be automatically done for machine learning tools c) High computational efficiency: the program can return a reasonable output within a limited budget Taking the Human out of Learning Applications: A Survey on Automated Machine Learning Quanming Yao et al. arXiv: 1810.13306v3 [cs.AI] 17 Jan 2019
  • 9. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T The challenges of AutoML adoption Deep learning Human design Computational budget
  • 10. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Some tough challenges in machine learning Availability of labelled data The road not travelled Drift
  • 11. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Promising approaches Learning to learn Simulation Reinforcement learning
  • 12. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T KERAS AI Services ML Services ML Frameworks + Infrastructure Vision Speech Languages Chatbots Vertical Amazon SageMaker Amazon SageMaker Ground Truth Amazon Sagemaker RL AWS Marketplace for ML Amazon SageMaker Neo AWS Deepracer AWS Deeplens Amazon Personalize Amazon Forecast Amazon Textract Amazon Rekognition Amazon Lex Amazon Polly Amazon Transcribe Amazon Comprehend Amazon Translate N E W N E W N E W C5 C5n P3 P3dn Amazon Elastic Inference AWS Inferentia AWS Greengrass
  • 13. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 14. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Data labeling and machine learning Labelled data
  • 15. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Data labelling is hard Need to label large datasets Requires humans to perform labelling Becomes time consuming and costly Difficult to achieve high accuracy for labels
  • 16. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon SageMaker: Build, train, and deploy ML 1 2 3 1 2 3
  • 17. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Key features Automatic labeling via machine learning Ready-made and custom workflows Label management Private and public human workforce Amazon SageMaker Ground Truth Label machine learning training data easily and accurately
  • 18. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon SageMaker Ground Truth How it works
  • 19. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon SageMaker Ground Truth How it works
  • 20. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon SageMaker Ground Truth How it works
  • 21. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon SageMaker Ground Truth How it works
  • 22. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Amazon SageMaker Ground Truth How it works
  • 23. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 1: Create a Ground Truth labelling job
  • 24. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 2: Provide details for a labeling job
  • 25. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 2: Provide details for a labeling job
  • 26. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Step 2: Provide details for a labeling
  • 27. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Labelling job is now running
  • 28. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Labelling app: Human workers label the images
  • 29. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS Management Console: View labels for images
  • 30. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T AWS Management Console: View labels
  • 31. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 32. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 33. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 34. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 35. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Solving the following problem: 𝑚𝑎𝑥 𝑥 ∈ 𝐴 𝑓(𝑥)
  • 36. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Defining ‘A’ using Amazon SageMaker "ParameterRanges": { "CategoricalParameterRanges": [ { "Name": "tree_method", "Values": ["auto", "exact", "approx", "hist"]} ], "ContinuousParameterRanges": [ { "Name": "eta", "MaxValue" : "0.5", "MinValue": "0" } ], "IntegerParameterRanges": [ { "Name": "max_depth", "MaxValue": "10", "MinValue": "1", } ] }
  • 37. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Gaussian process regression
  • 38. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Gaussian process posterior on the objective function
  • 39. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Acquisition function
  • 40. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. On Amazon Sagemaker – configure the tuning job tuning_job_config = { "ParameterRanges": {…} "ResourceLimits": { "MaxNumberOfTrainingJobs": 20, "MaxParallelTrainingJobs": 3 }, "Strategy": "Bayesian", "HyperParameterTuningJobObjective": { "MetricName": "validation:auc", "Type": "Maximize" } }
  • 41. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Launch the tuning job tuning_job = smclient.create_hyper_parameter_tuning_job( HyperParameterTuningJobName = "MyTuningJob“, HyperParameterTuningJobConfig = tuning_job_config, TrainingJobDefinition = training_job_definition)
  • 42. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 43. Thank you! S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Eden Duthie duthiee@amazon.com Adam Lynch atlynch@amazon.com