© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
Soji Adeshina, Amazon AI
An Introduction to SageMaker
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
Roadmap
1. What can we do with Machine Learning?
2. What are the pain points?
3. How does SageMaker mitigate those pain
points and provide leverage?
4. Walk-throughs with SageMaker
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
1. What can we do with
Machine Learning
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
We can classify things
https://www.slideshare.net/butest/machine-learning-3860079
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
We can try to identify things in pictures
http://www.robots.ox.ac.uk/~vgg/projects/seebibyte/demo.html
https://towardsdatascience.com/understanding-ssd-multibox-real-time-object-detection-in-deep-
learning-495ef744fab
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
We can try to find correlations between things
http://upibi.org/penn-team-uses-machine-learning-and-twitter-to-
predict-heart-disease-risk/
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
We can assign topics to articles
https://medium.com/@connectwithghosh/topic-modelling-with-latent-dirichlet-allocation-lda-in-pyspark-
2cb3ebd5678e
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
Put machine learning in the hands of every developer and data
scientist
ML @ Amazon: Our Mission
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
2. What are the pain points of
ML?
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
Let’s Review: Machine Learning to Production
DeployTrain &Tune
Data +
Data Processing
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
Machine Learning Pain Points
• Manually setting up the training workflow and managing
is a project in itself.
• Keeping track of training experiments and tuning
hyperparameters is hard
• Dealing with distributed training and managing your
training cluster is often cumbersome
• Deploying to production means hosting a web service,
writing software to handle incoming requests
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
3. How does AWS and
SageMaker mitigate those pain
points and provide leverage?
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
FRAMEWORKS AND INTERFACES
AW S DEEP LEARNING AMI
Apache MXNet TensorFlowCaffe2 Torch KerasCNTK PyTorch GluonTheano
PLATFORM SERVICES
VISI ON
AWS DeepLensAmazon SageMaker
LANGUA G E
A P P L I C A T I O N S E R V I C E S
Amazon
Rekognition
Amazon
Polly
Amazon
Lex
Amazon
Rekognition
Video
Amazon Transcribe Amazon Translate
Amazon
Comprehend
Alexa for Business
VR/IR Amazon Sumerian
Amazon Kinesis
Video Streams
AWS ML Stack
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
SageMaker Overview
A fully managed servicethat enables data scientists and developers to quickly and easily build
machine-learning based models into production smart applications.
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
SageMaker Overview
BUILD
Collect & prepare training data
Data labeling & pre-built notebooks
for common problems
Choose & optimize your ML
algorithm
Model & algorithm marketplace &
built-in, high-performance
algorithms
TRAIN
Setup & manage environments for
training
One-click training on the highest
performing infrastructure
Train & tune model
Train once, run anywhere & model
optimization
DEPLOY
Deploy model in production
One-click deployment
Scale & manage the production
environment
Fully managed with auto-scaling for
75% less
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
SageMaker Python SDK
Training
Environment
Estimator
Prediction
Environment
Predictor
Training Data
Trained Model
Car?
.fit( )
.deploy( )
.predict( )
train.py
endpoint
File by Iconstock from the Noun Project
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
• Linear Learner
• Image Classification Algorithm, Object Detection Algorithm
• Factorization Machines, Principal Component Analysis (PCA)
• K-Means Algorithm
• K-Nearest Neighbors
• Latent Dirichlet Allocation (LDA), Neural Topic Model (NTM)
• Random Cut Forest, XGBoost Algorithm
• BlazingText, Sequence2Sequence
• DeepAR Forecasting
SageMaker Built-in Algorithms
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
4. Walk-throughs with
SageMaker
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
Setting up Sagemaker Notebook Instance
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
Setting up Sagemaker Notebook Instance
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
Setting up Sagemaker Notebook Instance
© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark
Thanks!
And don’t forget to check out:
https://medium.com/apache-mxnet
https://github.com/awslabs/amazon-sagemaker-examples

Intro to SageMaker

  • 1.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark Soji Adeshina, Amazon AI An Introduction to SageMaker
  • 2.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark Roadmap 1. What can we do with Machine Learning? 2. What are the pain points? 3. How does SageMaker mitigate those pain points and provide leverage? 4. Walk-throughs with SageMaker
  • 3.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark 1. What can we do with Machine Learning
  • 4.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark We can classify things https://www.slideshare.net/butest/machine-learning-3860079
  • 5.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark We can try to identify things in pictures http://www.robots.ox.ac.uk/~vgg/projects/seebibyte/demo.html https://towardsdatascience.com/understanding-ssd-multibox-real-time-object-detection-in-deep- learning-495ef744fab
  • 6.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark We can try to find correlations between things http://upibi.org/penn-team-uses-machine-learning-and-twitter-to- predict-heart-disease-risk/
  • 7.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark We can assign topics to articles https://medium.com/@connectwithghosh/topic-modelling-with-latent-dirichlet-allocation-lda-in-pyspark- 2cb3ebd5678e
  • 8.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark Put machine learning in the hands of every developer and data scientist ML @ Amazon: Our Mission
  • 9.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark 2. What are the pain points of ML?
  • 10.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark Let’s Review: Machine Learning to Production DeployTrain &Tune Data + Data Processing
  • 11.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark Machine Learning Pain Points • Manually setting up the training workflow and managing is a project in itself. • Keeping track of training experiments and tuning hyperparameters is hard • Dealing with distributed training and managing your training cluster is often cumbersome • Deploying to production means hosting a web service, writing software to handle incoming requests
  • 12.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark 3. How does AWS and SageMaker mitigate those pain points and provide leverage?
  • 13.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark FRAMEWORKS AND INTERFACES AW S DEEP LEARNING AMI Apache MXNet TensorFlowCaffe2 Torch KerasCNTK PyTorch GluonTheano PLATFORM SERVICES VISI ON AWS DeepLensAmazon SageMaker LANGUA G E A P P L I C A T I O N S E R V I C E S Amazon Rekognition Amazon Polly Amazon Lex Amazon Rekognition Video Amazon Transcribe Amazon Translate Amazon Comprehend Alexa for Business VR/IR Amazon Sumerian Amazon Kinesis Video Streams AWS ML Stack
  • 14.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark SageMaker Overview A fully managed servicethat enables data scientists and developers to quickly and easily build machine-learning based models into production smart applications.
  • 15.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark SageMaker Overview BUILD Collect & prepare training data Data labeling & pre-built notebooks for common problems Choose & optimize your ML algorithm Model & algorithm marketplace & built-in, high-performance algorithms TRAIN Setup & manage environments for training One-click training on the highest performing infrastructure Train & tune model Train once, run anywhere & model optimization DEPLOY Deploy model in production One-click deployment Scale & manage the production environment Fully managed with auto-scaling for 75% less
  • 16.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark SageMaker Python SDK Training Environment Estimator Prediction Environment Predictor Training Data Trained Model Car? .fit( ) .deploy( ) .predict( ) train.py endpoint File by Iconstock from the Noun Project
  • 17.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark • Linear Learner • Image Classification Algorithm, Object Detection Algorithm • Factorization Machines, Principal Component Analysis (PCA) • K-Means Algorithm • K-Nearest Neighbors • Latent Dirichlet Allocation (LDA), Neural Topic Model (NTM) • Random Cut Forest, XGBoost Algorithm • BlazingText, Sequence2Sequence • DeepAR Forecasting SageMaker Built-in Algorithms
  • 18.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark 4. Walk-throughs with SageMaker
  • 19.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark Setting up Sagemaker Notebook Instance
  • 20.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark Setting up Sagemaker Notebook Instance
  • 21.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark Setting up Sagemaker Notebook Instance
  • 22.
    © 2018, AmazonWeb Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark© 2018, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon Confidential and Trademark Thanks! And don’t forget to check out: https://medium.com/apache-mxnet https://github.com/awslabs/amazon-sagemaker-examples

Editor's Notes

  • #5 Natural Language Processing Sentiment analysis/mood
  • #6 ML was created to identify images of cats.
  • #7 Sentiment analysis -> Heart Disease Natural Language Processing Sentiment analysis/mood
  • #10 What things do we have to solve to make it easy for every developer and data scientist to use ML
  • #11 Need data: tweets, mortality rates, images of animals Clean up data: supplement or remove incomplete data, resize and crop images Train your statistical model Deploy your model in the cloud so that it can be used to infer Pain points: Manually setting up the training workflow and managing it is a project in itself. Given an example. Dealing with distributed training Turning on and off the machines Deploying to production means hosting a web service, writing software to handle incoming requests
  • #20 Underlying true relationship is hidden. Cost time and money to evaluate. Must sample.
  • #21 Underlying true relationship is hidden. Cost time and money to evaluate. Must sample.
  • #22 Underlying true relationship is hidden. Cost time and money to evaluate. Must sample.