SlideShare a Scribd company logo
1 of 32
Download to read offline
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Accelerate machine learning
workloads using Amazon EC2 P3
instances
Brad Kenstler
Sr. Data Scientist
Amazon ML Solutions Lab
C M P 2 0 4
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Agenda
Machine learning at AWS
Amazon EC2 P3 overview
Whiteboard
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Machine learning at Amazon: A long heritage
Personalized
recommendations
Fulfillment automation
& inventory management
Drones Voice-driven
interactions
Inventing entirely new
customer experiences
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
ML @ AWS
OUR MISSION
Put machine learning in the
hands of every developer and
data scientist
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
M L F R A M E W O R K S &
I N F R A S T R U C T U R E
The Amazon ML stack: Broadest & deepest set of capabilities
A I S E R V I C E S
A M A Z O N
R E K O G N I T I O N
I M A G E
A M A Z O N
P O L L Y
T R A N S C R I B E T R A N S L A T E A M A Z O N
L E X
A M A Z O N
R E K O G N I T I O N
V I D E O
Vision Speech Chatbots
M L S E R V I C E S
F r a m e w o r k s I n t e r f a c e s I n f r a s t r u c t u r e
E C 2 P 3
& P 3 d n
E C 2 C 5 F P G A s G R E E N G R A S S E L A S T I C
I N F E R E N C E
Language Forecasting Recommendations
A M A Z O N
T E X T R A C T
C O M P R E H E N D
C O M P R E H E N D
M E D I C A L
A M A Z O N
F O R E C A S T
A M A Z O N
P E R S O N A L I Z E
A M A Z O N
S A G E M A K E R
G R O U N D T R U T H
N O T E B O O K S M A R K E T P L A C E
A L G O R I T H M S
R E I N F O R C E M E N T
L E A R N I N G
New
T R A I N I N G
O P T I M I Z A T I O N
( N E O )
D E P L O Y M E N T
H O S T I N G
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon EC2 P3 Instances (October 2017)
• Up to eight NVIDIA Tesla V100 GPUs
• 1 PetaFLOPs of computational performance – Up
to 14x better than P2
• 300 GB/s GPU-to-GPU communication (NVLink)
– 9x better than P2
• 16 GB GPU memory with 900 GB/sec peak GPU
memory bandwidth
One of the fastest, most powerful GPU instances in the cloud
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Airbnb
“At Airbnb, we’re using machine learning to optimize search
recommendations and improve dynamic pricing guidance for hosts,
both of which translate to increased booking conversions. These
use cases are highly specific to our industry and require machine
learning models that use several different types of data sources,
such as guest and host preferences, listing location and condition,
seasonality, and price.
With Amazon EC2 P3 instances, we have the ability to run training
workloads faster, enabling us to iterate more, build better machine
learning models and reduce cost.”
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Toyota Research Institute's (TRI) mission is to improve
the quality of human life through advances in artificial
intelligence, automated driving, and robotics. The core
compute capability needed by TRI to accelerate the
training of their machine learning models is powered by
multiple Amazon EC2 P3 instances.
With P3 instances, TRI is seeing a 4x faster time-to-train
than the P2 instances they had used previously. This
gives them significant agility to optimize and retrain their
models quickly and deploy them in their test cars or
simulations environment for further testing.
Furthermore, the significant performance improvement
in P3s over P2s, coupled with pay-as-you-go model
translates to lower operating costs.
Toyota Research Institute
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Machine learning 101
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Training a machine learning model: Supervised learning
Training Data Training
Input Data
Predictions
Model
Examples of
correct data
How good are
your predictions?
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Training machine learning models
AlexNet, 2012
• A large, deep convolutional neural network with 5 convolutional layer, 60
million parameters, and 650,000 neurons
• Created by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton
• Won the 2012 ILSVRC (ImageNet Large-Scale Visual Recognition
Challenge)
• Used two NVIDIA GTX 580 GPUs
• Took nearly a week to train!
Source - https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
• 10s–100s of processing cores
• Pre-defined instruction set &
datapath widths
• Optimized for general-purpose
computing
CPU
CPUs vs. GPUs vs. FPGA for compute
• 1,000s of processing cores
• Pre-defined instruction set and
datapath widths
• Highly effective at parallel
execution
GPU
• Millions of programmable
digital logic cells
• No predefined instruction set
or datapath widths
• Hardware timed execution
FPGA
DRAM
Control
ALU
ALU
Cache
DRAM
ALU
ALU
Control
ALU
ALU
Cache
DRAM
ALU
ALU
Control
ALU
ALU
Cache
DRAM
ALU
ALU
Control
ALU
ALU
Cache
DRAM
ALU
ALU
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
GPUs for machine learning training
Matrix A
b11 b12 b13
b21 b22 b23
b31 b32 b33
Matrix B
a11.b11 + a12.b21 +
a13.b31
a11.b12 + a12.b22 +
a13.b32
a11.b13 + a12.b23 +
a13.b33
a21.b11 + a22.b21 +
a23.b31
a21.b12 + a22.b22 +
a23.b32
a21.b13 + a22.b23 +
a23.b33
a31.b11 + a32.b21 +
a33.b31
a31.b12 + a32.b22 +
a33.b32
a31.b13 + a32.b23 +
a33.b33
Matrix C
These multiply and accumulate
operations can be parallelized across
1,000s of core available in a typical
GPU
a11 a12 a13
a21 a22 a23
a31 a32 a33
• Routines for training machine learning models fundamentally map to matrix multiplications
• This coupled with extremely high memory bandwidth makes GPUs ideal for training
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
b11 b12 b13
b21 b22 b23
b31 b32 b33
GPUs for machine learning training
Numeric precision:
• Half precision (FP16) – 16 bit Floating point
• Single precision (FP32) – 32 bit Floating point
• Double precision (FP64) – 64 bit Floating point
a11 a12 a13
a21 a22 a23
a31 a32 a33
Matrix A Matrix B
a11.b11 + a12.b21 +
a13.b31
a11.b12 + a12.b22 +
a13.b32
a11.b13 + a12.b23 +
a13.b33
a21.b11 + a22.b21 +
a23.b31
a21.b12 + a22.b22 +
a23.b32
a21.b13 + a22.b23 +
a23.b33
a31.b11 + a32.b21 +
a33.b31
a31.b12 + a32.b22 +
a33.b32
a31.b13 + a32.b23 +
a33.b33
Matrix C
• Routines for training machine learning models fundamentally map to matrix multiplications
• This coupled with extremely high memory bandwidth makes GPUs ideal for training
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
New Tensor Cores in NVIDIA V100 GPUs
• In addition to 5,120 CUDA Cores, each Tesla V100 has 640 Tensor Cores with
125 TFLOPs of mixed-precision performance per GPU
• Each Tensor Core provides a 4x4x4 matrix processing array, which performs
the operation D = A X B + C.
Tesla V100 GPU
a11 a12 a13 a14
a21 a22 a23 a24
a31 a32 a33 a34
a41 a42 a43 a44
A (FP16)
b11 b12 b13 b14
b21 b22 b23 b24
b31 b32 b33 b34
b41 b42 b43 b44
B (FP16)
c11 c12 c13 c14
c21 c22 c23 c24
c31 c32 c33 c34
c41 c42 c43 c44
C (FP16 or FP32)
D
(FP16 or FP32)
Source:NVIDIA
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
P3 instance details
Instance size GPUs
GPU Peer
to Peer
vCPUs
Memory
(GB)
Network
Bandwidth
EBS
Bandwidth
On-Demand
Price/hr*
1-yr RI
Effective
Hourly*
3-yr RI
Effective
Hourly*
P3.2xlarge 1 No 8 61
Up to 10
Gbps
1.7 Gbps $3.06
$1.99
(35% Disc.)
$1.23
(60% Disc.)
P3.8xlarge 4 NVLink 32 244 10 Gbps 7 Gbps $12.24
$7.96
(35% Disc.)
$4.93
(60% Disc.)
P3.16xlarge 8 NVLink 64 488 25 Gbps 14 Gbps $24.48
$15.91
(35% Disc.)
$9.87
(60% Disc.)
Note: P3 instances are also supported via Spot Instances with up to 70% discount
Regional Availability
P3 instances are generally available in AWS US
East (Northern Virginia), US East (Ohio), US West
(Oregon), EU (Ireland), Asia Pacific (Seoul), Asia
Pacific (Tokyo), AWS GovCloud (US), and China
(Beijing) Regions.
Framework Support
P3 instances and their V100 GPUs supported
across all major frameworks (such as
TensorFlow, MXNet, PyTorch, Caffe2, and
CNTK)
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
EC2 pricing and usage options
On-Demand
Pay for compute capacity per second. No
longer-term commitments or upfront
payments are needed. You can increase
or decrease your compute capacity
depending on the demands of your
application and only pay the hourly rate.
Recommended for:
• Users that prefer the low cost and
flexibility of Amazon EC2 without any
upfront payment or long-term
commitment
• Applications with short-term, spiky, or
unpredictable workloads that cannot
be interrupted
• Applications being developed or tested
on Amazon EC2 for the first time
Reserved Instances
Reserved Instances provide you with a
significant discount (up to 60%)
compared to On-Demand Instance
pricing. In addition, when Reserved
Instances are assigned to a specific
Availability Zone, they provide a
capacity reservation, giving you
additional confidence in your ability to
launch instances when you need them.
Recommended for:
• Applications with steady state usage
• Applications that may require
reserved capacity
• Customers that can commit to using
EC2 over a 1 or 3 year term to reduce
their total computing costs
Spot Instances
Spot Instances allow you to request
spare Amazon EC2 computing capacity
for up to 70% off the On-Demand
price. The only difference between On-
Demand Instances and Spot Instances is
that Spot Instances can be interrupted
by EC2 with two minutes of notification
when EC2 needs the capacity back.
Recommended for:
• Applications that have flexible start
and end times
• Applications that are only feasible at
very low compute prices
• Users with urgent computing needs
for large amounts of additional
capacity
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
P3 instances details
Instance Size GPUs
GPU Peer
to Peer
vCPUs
Memory
(GB)
Network
Bandwidth
EBS
Bandwidth
On-Demand
Price/hr*
1-yr RI
Effective
Hourly*
3-yr RI
Effective
Hourly*
P3.2xlarge 1 No 8 61
Up to 10
Gbps
1.7 Gbps $3.06
$1.99
(35% Disc.)
$1.23
(60% Disc.)
P3.8xlarge 4 NVLink 32 244 10 Gbps 7 Gbps $12.24
$7.96
(35% Disc.)
$4.93
(60% Disc.)
P3.16xlarge 8 NVLink 64 488 25 Gbps 14 Gbps $24.48
$15.91
(35% Disc.)
$9.87
(60% Disc.)
• P3 instances provide GPU-to-GPU data
transfer over NVLink
• P2 instanced provided GPU-to-GPU data
transfer over PCI Express
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Description P3.16xlarge P2.16xlarge
P3 GPU Performance
Improvement
Number of GPUs 8 16 -
Number of Accelerators 8 (V100) 8 (K80)
GPU – Peer to Peer NVLink – 300 GB/s PCI-Express - 32 GB/s 9.4x
CPU to GPU Throughput
PCIe throughput per GPUs
8 GB/s 1 GB/s 8x
CPU to GPU Throughput
Total instance PCIe throughput
64 GB/s
(Four x16 Gen3)
16 GB/s
(One x16 Gen3)
4x
P3 vs. P2 peer-to-peer configurations
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
P3 PCIe and NVLink configurations
CPU0
GPU0
GPU1
GPU2
GPU3
PCIe Switches
CPU1
GPU4
GPU5
GPU6
GPU7
PCIe Switches
QPI
NVLink
PCIExpress
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
P3 PCIe and NVLink configurations
CPU0
GPU0
GPU1
GPU2
GPU3
PCIe Switches
CPU1
GPU4
GPU5
GPU6
GPU7
PCIe Switches
QPI
NVLink
PCIExpress
0xFF
0xFF
0xFF
0xFF
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Lowering training times with
Amazon EC2 P3 Instances
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS P3 vs. P2 Instance
GPU perfo rma nce co mpa riso n
• P2 Instances use K80 Accelerator (Kepler architecture)
• P3 Instances use V100 Accelerator (Volta architecture)
0
2
4
6
8
10
12
14
16
K80 P100 V100
FP32 Perf (TFLOPS)
1.7X
0
1
2
3
4
5
6
7
8
K80 P100 V100
FP64 Perf (TFLOPS)
2.6X
0
20
40
60
80
100
120
140
K80 P100 V100
Mixed/FP16 Perf (TFLOPS)
14X
over K80’s
max perf.
FP32
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
112
430
846870
3440
6850
0
1000
2000
3000
4000
5000
6000
7000
8000
1 Accelerator 4 Accelerator 8 Accelerator
Images/Second ResNet-50 training performance
(using synthetic data, MXNet)
P2 (1 Accelerator = 2 GPUs) Images/S P3 (1 Accelerator = 1 GPU) Images/S
8.1X
8X
7.8X
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Scaling performance using distributed training
-
5,000
10,000
15,000
20,000
25,000
30,000
35,000
40,000
45,000
50,000
1 2 4 8 16 32 64
Images/Second
Number of GPUs
Training using P3 instances
(ResNet-50, ImageNet Images/Second)
• Using single P3 instances, with Volta GPUs,
customers can cut down training times of
their machine learning models from days to
a few hours.
• Using distributed training via multiple P3
instances, high performance networking and
storage solutions, customers can further cut
down their time-to-train from hours in to
minutes.
• Example – We were to train ResNet-50 to
Top1 validation accuracy of 76% in 15 mins
using 32 P3.16xlarge instances (256 GPUs).
https://aws.amazon.com/blogs/machine-learning/scalable-multi-node-training-with-tensorflow/
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
OptimizedfordistributedMLtraining
• Oneof the mostpowerful GPUinstancesavailablein thecloud
• 100Gbpsof networkingthroughput
• 96vCPUusingAWScustomerSkylakeCPUs,and768GBof systemmemory
• Basedon NVIDIA’slatest GPUTeslaV100with 32GBofmemory
Instancesize GPUs GPU memory
GPU
peer topeer
vCPUs CPUtype
Memor
y
(GB)
Network
bandwidt
h
AmazonEBS
bandwidth
Localinstance
storage
P3.2xlarge 1 xV100 16GB/GPU No 8 Broadwell 61 Upto 10Gbps 1.7 Gbps NA
P3.8xlarge 4 xV100 16GB/GPU NVLink 32 Broadwell 244 10 Gbps 7 Gbps NA
P3.16xlarge 8 xV100 16GB/GPU NVLink 64 Broadwell 488 25Gbps 14Gbps NA
P3dn.24xlarge 8 xV100 32 GB/GPU NVLink 96 Skylake 768 100 Gbps 14Gbps 2 TBNVMe
LatestNVIDAV100GPUwith32GB
memoryforlargemodelsandhigher
batchsizes
96SkylakevCPUswith supportfor
AVX-512instructionsforpre-
processingof trainingdata
100Gbps of networkingthroughput for
large-scaledistributedtraining& fast
dataaccess
New larger P3 size – P3dn.24xlarge
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Getting started
© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Deep Learning AMI
• Get started quickly with easy-to-launch tutorials
• Hassle-free setup and configuration
• Pay only for what you use – no additional charge for the
AMI
• Accelerate your model training and deployment
• Support for popular deep learning frameworks
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
© 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T
Thank you!
S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
Brad Kenstler
kenstler@amazon.com

More Related Content

What's hot

Smart-Energy-Connect-Accelerating-Innovation-at-CLP
Smart-Energy-Connect-Accelerating-Innovation-at-CLPSmart-Energy-Connect-Accelerating-Innovation-at-CLP
Smart-Energy-Connect-Accelerating-Innovation-at-CLPAmazon Web Services
 
Analyze customer sentiment using AI - AIM307 - New York AWS Summit
Analyze customer sentiment using AI - AIM307 - New York AWS SummitAnalyze customer sentiment using AI - AIM307 - New York AWS Summit
Analyze customer sentiment using AI - AIM307 - New York AWS SummitAmazon Web Services
 
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...Amazon Web Services
 
Migliora la disponibilità e le prestazioni delle tue applicazioni con Amazon ...
Migliora la disponibilità e le prestazioni delle tue applicazioni con Amazon ...Migliora la disponibilità e le prestazioni delle tue applicazioni con Amazon ...
Migliora la disponibilità e le prestazioni delle tue applicazioni con Amazon ...Amazon Web Services
 
Frontend and Mobile with AWS Amplify | AWS Summit Tel Aviv 2019
Frontend and Mobile with AWS Amplify | AWS Summit Tel Aviv 2019Frontend and Mobile with AWS Amplify | AWS Summit Tel Aviv 2019
Frontend and Mobile with AWS Amplify | AWS Summit Tel Aviv 2019AWS Summits
 
[REPEAT] Get hands on with AWS DeepRacer & compete in the AWS DeepRacer Leagu...
[REPEAT] Get hands on with AWS DeepRacer & compete in the AWS DeepRacer Leagu...[REPEAT] Get hands on with AWS DeepRacer & compete in the AWS DeepRacer Leagu...
[REPEAT] Get hands on with AWS DeepRacer & compete in the AWS DeepRacer Leagu...Amazon Web Services
 
Alexa + IoT - SVC203 - New York AWS Summit
Alexa + IoT - SVC203 - New York AWS SummitAlexa + IoT - SVC203 - New York AWS Summit
Alexa + IoT - SVC203 - New York AWS SummitAmazon Web Services
 
Building intelligent applications using AI services
Building intelligent applications using AI servicesBuilding intelligent applications using AI services
Building intelligent applications using AI servicesAmazon Web Services
 
Managing microservices using AWS App Mesh - MAD302 - Chicago AWS Summit
Managing microservices using AWS App Mesh - MAD302 - Chicago AWS SummitManaging microservices using AWS App Mesh - MAD302 - Chicago AWS Summit
Managing microservices using AWS App Mesh - MAD302 - Chicago AWS SummitAmazon Web Services
 
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...Amazon Web Services
 
Pensi di essere pronto per i microservizi?
Pensi di essere pronto per i microservizi?Pensi di essere pronto per i microservizi?
Pensi di essere pronto per i microservizi?Amazon Web Services
 
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...Amazon Web Services
 
Add intelligence to applications - AIM205 - Santa Clara AWS Summit.pdf
Add intelligence to applications - AIM205 - Santa Clara AWS Summit.pdfAdd intelligence to applications - AIM205 - Santa Clara AWS Summit.pdf
Add intelligence to applications - AIM205 - Santa Clara AWS Summit.pdfAmazon Web Services
 
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...Amazon Web Services
 
Build sophisticated forecasting & recommendation models - AIM204 - Santa Clar...
Build sophisticated forecasting & recommendation models - AIM204 - Santa Clar...Build sophisticated forecasting & recommendation models - AIM204 - Santa Clar...
Build sophisticated forecasting & recommendation models - AIM204 - Santa Clar...Amazon Web Services
 
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...Amazon Web Services
 
[REPEAT] Optimize your workloads with Amazon EC2 & AMD EPYC - DEM01-R - Santa...
[REPEAT] Optimize your workloads with Amazon EC2 & AMD EPYC - DEM01-R - Santa...[REPEAT] Optimize your workloads with Amazon EC2 & AMD EPYC - DEM01-R - Santa...
[REPEAT] Optimize your workloads with Amazon EC2 & AMD EPYC - DEM01-R - Santa...Amazon Web Services
 
Build AR/VR apps with AWS - SVC202 - Atlanta AWS Summit
Build AR/VR apps with AWS - SVC202 - Atlanta AWS SummitBuild AR/VR apps with AWS - SVC202 - Atlanta AWS Summit
Build AR/VR apps with AWS - SVC202 - Atlanta AWS SummitAmazon Web Services
 
AWS identity services: Enabling and securing your cloud journey - SEC203 - Ne...
AWS identity services: Enabling and securing your cloud journey - SEC203 - Ne...AWS identity services: Enabling and securing your cloud journey - SEC203 - Ne...
AWS identity services: Enabling and securing your cloud journey - SEC203 - Ne...Amazon Web Services
 
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...Amazon Web Services
 

What's hot (20)

Smart-Energy-Connect-Accelerating-Innovation-at-CLP
Smart-Energy-Connect-Accelerating-Innovation-at-CLPSmart-Energy-Connect-Accelerating-Innovation-at-CLP
Smart-Energy-Connect-Accelerating-Innovation-at-CLP
 
Analyze customer sentiment using AI - AIM307 - New York AWS Summit
Analyze customer sentiment using AI - AIM307 - New York AWS SummitAnalyze customer sentiment using AI - AIM307 - New York AWS Summit
Analyze customer sentiment using AI - AIM307 - New York AWS Summit
 
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
 
Migliora la disponibilità e le prestazioni delle tue applicazioni con Amazon ...
Migliora la disponibilità e le prestazioni delle tue applicazioni con Amazon ...Migliora la disponibilità e le prestazioni delle tue applicazioni con Amazon ...
Migliora la disponibilità e le prestazioni delle tue applicazioni con Amazon ...
 
Frontend and Mobile with AWS Amplify | AWS Summit Tel Aviv 2019
Frontend and Mobile with AWS Amplify | AWS Summit Tel Aviv 2019Frontend and Mobile with AWS Amplify | AWS Summit Tel Aviv 2019
Frontend and Mobile with AWS Amplify | AWS Summit Tel Aviv 2019
 
[REPEAT] Get hands on with AWS DeepRacer & compete in the AWS DeepRacer Leagu...
[REPEAT] Get hands on with AWS DeepRacer & compete in the AWS DeepRacer Leagu...[REPEAT] Get hands on with AWS DeepRacer & compete in the AWS DeepRacer Leagu...
[REPEAT] Get hands on with AWS DeepRacer & compete in the AWS DeepRacer Leagu...
 
Alexa + IoT - SVC203 - New York AWS Summit
Alexa + IoT - SVC203 - New York AWS SummitAlexa + IoT - SVC203 - New York AWS Summit
Alexa + IoT - SVC203 - New York AWS Summit
 
Building intelligent applications using AI services
Building intelligent applications using AI servicesBuilding intelligent applications using AI services
Building intelligent applications using AI services
 
Managing microservices using AWS App Mesh - MAD302 - Chicago AWS Summit
Managing microservices using AWS App Mesh - MAD302 - Chicago AWS SummitManaging microservices using AWS App Mesh - MAD302 - Chicago AWS Summit
Managing microservices using AWS App Mesh - MAD302 - Chicago AWS Summit
 
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
Migration to AWS: The foundation for enterprise transformation - SVC210 - New...
 
Pensi di essere pronto per i microservizi?
Pensi di essere pronto per i microservizi?Pensi di essere pronto per i microservizi?
Pensi di essere pronto per i microservizi?
 
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
Introduction to the Well-Architected Framework and Tool - SVC212 - Chicago AW...
 
Add intelligence to applications - AIM205 - Santa Clara AWS Summit.pdf
Add intelligence to applications - AIM205 - Santa Clara AWS Summit.pdfAdd intelligence to applications - AIM205 - Santa Clara AWS Summit.pdf
Add intelligence to applications - AIM205 - Santa Clara AWS Summit.pdf
 
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
Build accurate training datasets with Amazon SageMaker Ground Truth - AIM301 ...
 
Build sophisticated forecasting & recommendation models - AIM204 - Santa Clar...
Build sophisticated forecasting & recommendation models - AIM204 - Santa Clar...Build sophisticated forecasting & recommendation models - AIM204 - Santa Clar...
Build sophisticated forecasting & recommendation models - AIM204 - Santa Clar...
 
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
AWS IoT services - Extract value for industrial applications - SVC205 - Santa...
 
[REPEAT] Optimize your workloads with Amazon EC2 & AMD EPYC - DEM01-R - Santa...
[REPEAT] Optimize your workloads with Amazon EC2 & AMD EPYC - DEM01-R - Santa...[REPEAT] Optimize your workloads with Amazon EC2 & AMD EPYC - DEM01-R - Santa...
[REPEAT] Optimize your workloads with Amazon EC2 & AMD EPYC - DEM01-R - Santa...
 
Build AR/VR apps with AWS - SVC202 - Atlanta AWS Summit
Build AR/VR apps with AWS - SVC202 - Atlanta AWS SummitBuild AR/VR apps with AWS - SVC202 - Atlanta AWS Summit
Build AR/VR apps with AWS - SVC202 - Atlanta AWS Summit
 
AWS identity services: Enabling and securing your cloud journey - SEC203 - Ne...
AWS identity services: Enabling and securing your cloud journey - SEC203 - Ne...AWS identity services: Enabling and securing your cloud journey - SEC203 - Ne...
AWS identity services: Enabling and securing your cloud journey - SEC203 - Ne...
 
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
Get hands-on with AWS DeepRacer and compete in the AWS DeepRacer League - AIM...
 

Similar to Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 - Chicago AWS Summit

Enabling Research Using Cloud Computing
Enabling Research Using Cloud ComputingEnabling Research Using Cloud Computing
Enabling Research Using Cloud ComputingAmazon Web Services
 
Machine Learning with Kubernetes- AWS Container Day 2019 Barcelona
Machine Learning with Kubernetes- AWS Container Day 2019 BarcelonaMachine Learning with Kubernetes- AWS Container Day 2019 Barcelona
Machine Learning with Kubernetes- AWS Container Day 2019 BarcelonaAmazon Web Services
 
Introducing Amazon EC2 P3 Instance - Featuring the Most Powerful GPU for Mach...
Introducing Amazon EC2 P3 Instance - Featuring the Most Powerful GPU for Mach...Introducing Amazon EC2 P3 Instance - Featuring the Most Powerful GPU for Mach...
Introducing Amazon EC2 P3 Instance - Featuring the Most Powerful GPU for Mach...Amazon Web Services
 
Introduction to AI services for Developers - Builders Day Israel
Introduction to AI services for Developers - Builders Day IsraelIntroduction to AI services for Developers - Builders Day Israel
Introduction to AI services for Developers - Builders Day IsraelAmazon Web Services
 
Introduction to AI services for Developers - Builders Day Israel
Introduction to AI services for Developers - Builders Day IsraelIntroduction to AI services for Developers - Builders Day Israel
Introduction to AI services for Developers - Builders Day IsraelAmazon Web Services
 
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...Amazon Web Services
 
Scale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for BuildersScale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for BuildersAmazon Web Services
 
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)Julien SIMON
 
Accelerate ML workloads using EC2 accelerated computing - CMP202 - Santa Clar...
Accelerate ML workloads using EC2 accelerated computing - CMP202 - Santa Clar...Accelerate ML workloads using EC2 accelerated computing - CMP202 - Santa Clar...
Accelerate ML workloads using EC2 accelerated computing - CMP202 - Santa Clar...Amazon Web Services
 
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMakerDeep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMakerAmazon Web Services
 
Machine learning using Kubernetes
Machine learning using KubernetesMachine learning using Kubernetes
Machine learning using KubernetesArun Gupta
 
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)Julien SIMON
 
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfSviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfAmazon Web Services
 
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...Amazon Web Services
 
Amazon AI/ML Overview
Amazon AI/ML OverviewAmazon AI/ML Overview
Amazon AI/ML OverviewBESPIN GLOBAL
 
Amazon EC2 Foundations - CMP203 - re:Invent 2017
Amazon EC2 Foundations - CMP203 - re:Invent 2017Amazon EC2 Foundations - CMP203 - re:Invent 2017
Amazon EC2 Foundations - CMP203 - re:Invent 2017Amazon Web Services
 

Similar to Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 - Chicago AWS Summit (20)

Enabling Research Using Cloud Computing
Enabling Research Using Cloud ComputingEnabling Research Using Cloud Computing
Enabling Research Using Cloud Computing
 
Machine Learning with Kubernetes- AWS Container Day 2019 Barcelona
Machine Learning with Kubernetes- AWS Container Day 2019 BarcelonaMachine Learning with Kubernetes- AWS Container Day 2019 Barcelona
Machine Learning with Kubernetes- AWS Container Day 2019 Barcelona
 
Introducing Amazon EC2 P3 Instance - Featuring the Most Powerful GPU for Mach...
Introducing Amazon EC2 P3 Instance - Featuring the Most Powerful GPU for Mach...Introducing Amazon EC2 P3 Instance - Featuring the Most Powerful GPU for Mach...
Introducing Amazon EC2 P3 Instance - Featuring the Most Powerful GPU for Mach...
 
Introduction to AI services for Developers - Builders Day Israel
Introduction to AI services for Developers - Builders Day IsraelIntroduction to AI services for Developers - Builders Day Israel
Introduction to AI services for Developers - Builders Day Israel
 
Introduction to AI services for Developers - Builders Day Israel
Introduction to AI services for Developers - Builders Day IsraelIntroduction to AI services for Developers - Builders Day Israel
Introduction to AI services for Developers - Builders Day Israel
 
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
Accelerate Machine Learning Workloads using Amazon EC2 P3 Instances - SRV201 ...
 
Scale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for BuildersScale - Amazon SageMaker Deep Dive for Builders
Scale - Amazon SageMaker Deep Dive for Builders
 
Webinar: Ask the Experts - AIML (Español)
Webinar: Ask the Experts - AIML (Español)Webinar: Ask the Experts - AIML (Español)
Webinar: Ask the Experts - AIML (Español)
 
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
Deep Learning with Tensorflow and Apache MXNet on AWS (April 2019)
 
Accelerate ML workloads using EC2 accelerated computing - CMP202 - Santa Clar...
Accelerate ML workloads using EC2 accelerated computing - CMP202 - Santa Clar...Accelerate ML workloads using EC2 accelerated computing - CMP202 - Santa Clar...
Accelerate ML workloads using EC2 accelerated computing - CMP202 - Santa Clar...
 
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMakerDeep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
Deep Learning con TensorFlow and Apache MXNet su Amazon SageMaker
 
Amazon EC2 Foundations
Amazon EC2 FoundationsAmazon EC2 Foundations
Amazon EC2 Foundations
 
Machine learning using Kubernetes
Machine learning using KubernetesMachine learning using Kubernetes
Machine learning using Kubernetes
 
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
Deep Learning with TensorFlow and Apache MXNet on Amazon SageMaker (March 2019)
 
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdfSviluppa, addestra e distribuisci modelli di machine learning.pdf
Sviluppa, addestra e distribuisci modelli di machine learning.pdf
 
re:Invent 2018: AI/ML Services
re:Invent 2018: AI/ML Servicesre:Invent 2018: AI/ML Services
re:Invent 2018: AI/ML Services
 
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
Train once, deploy anywhere on the cloud and at the edge with Neo - AIM301 - ...
 
Amazon AI/ML Overview
Amazon AI/ML OverviewAmazon AI/ML Overview
Amazon AI/ML Overview
 
SRV319 Amazon EC2 Foundations
SRV319 Amazon EC2 FoundationsSRV319 Amazon EC2 Foundations
SRV319 Amazon EC2 Foundations
 
Amazon EC2 Foundations - CMP203 - re:Invent 2017
Amazon EC2 Foundations - CMP203 - re:Invent 2017Amazon EC2 Foundations - CMP203 - re:Invent 2017
Amazon EC2 Foundations - CMP203 - re:Invent 2017
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Accelerate machine-learning workloads using Amazon EC2 P3 instances - CMP204 - Chicago AWS Summit

  • 1. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Accelerate machine learning workloads using Amazon EC2 P3 instances Brad Kenstler Sr. Data Scientist Amazon ML Solutions Lab C M P 2 0 4
  • 2. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Agenda Machine learning at AWS Amazon EC2 P3 overview Whiteboard
  • 3. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 4. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Machine learning at Amazon: A long heritage Personalized recommendations Fulfillment automation & inventory management Drones Voice-driven interactions Inventing entirely new customer experiences
  • 5. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. ML @ AWS OUR MISSION Put machine learning in the hands of every developer and data scientist
  • 6. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. M L F R A M E W O R K S & I N F R A S T R U C T U R E The Amazon ML stack: Broadest & deepest set of capabilities A I S E R V I C E S A M A Z O N R E K O G N I T I O N I M A G E A M A Z O N P O L L Y T R A N S C R I B E T R A N S L A T E A M A Z O N L E X A M A Z O N R E K O G N I T I O N V I D E O Vision Speech Chatbots M L S E R V I C E S F r a m e w o r k s I n t e r f a c e s I n f r a s t r u c t u r e E C 2 P 3 & P 3 d n E C 2 C 5 F P G A s G R E E N G R A S S E L A S T I C I N F E R E N C E Language Forecasting Recommendations A M A Z O N T E X T R A C T C O M P R E H E N D C O M P R E H E N D M E D I C A L A M A Z O N F O R E C A S T A M A Z O N P E R S O N A L I Z E A M A Z O N S A G E M A K E R G R O U N D T R U T H N O T E B O O K S M A R K E T P L A C E A L G O R I T H M S R E I N F O R C E M E N T L E A R N I N G New T R A I N I N G O P T I M I Z A T I O N ( N E O ) D E P L O Y M E N T H O S T I N G
  • 7. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 8. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon EC2 P3 Instances (October 2017) • Up to eight NVIDIA Tesla V100 GPUs • 1 PetaFLOPs of computational performance – Up to 14x better than P2 • 300 GB/s GPU-to-GPU communication (NVLink) – 9x better than P2 • 16 GB GPU memory with 900 GB/sec peak GPU memory bandwidth One of the fastest, most powerful GPU instances in the cloud
  • 9. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Airbnb “At Airbnb, we’re using machine learning to optimize search recommendations and improve dynamic pricing guidance for hosts, both of which translate to increased booking conversions. These use cases are highly specific to our industry and require machine learning models that use several different types of data sources, such as guest and host preferences, listing location and condition, seasonality, and price. With Amazon EC2 P3 instances, we have the ability to run training workloads faster, enabling us to iterate more, build better machine learning models and reduce cost.”
  • 10. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Toyota Research Institute's (TRI) mission is to improve the quality of human life through advances in artificial intelligence, automated driving, and robotics. The core compute capability needed by TRI to accelerate the training of their machine learning models is powered by multiple Amazon EC2 P3 instances. With P3 instances, TRI is seeing a 4x faster time-to-train than the P2 instances they had used previously. This gives them significant agility to optimize and retrain their models quickly and deploy them in their test cars or simulations environment for further testing. Furthermore, the significant performance improvement in P3s over P2s, coupled with pay-as-you-go model translates to lower operating costs. Toyota Research Institute
  • 11. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Machine learning 101
  • 12. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Training a machine learning model: Supervised learning Training Data Training Input Data Predictions Model Examples of correct data How good are your predictions?
  • 13. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Training machine learning models AlexNet, 2012 • A large, deep convolutional neural network with 5 convolutional layer, 60 million parameters, and 650,000 neurons • Created by Alex Krizhevsky, Ilya Sutskever, and Geoffrey Hinton • Won the 2012 ILSVRC (ImageNet Large-Scale Visual Recognition Challenge) • Used two NVIDIA GTX 580 GPUs • Took nearly a week to train! Source - https://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf
  • 14. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. • 10s–100s of processing cores • Pre-defined instruction set & datapath widths • Optimized for general-purpose computing CPU CPUs vs. GPUs vs. FPGA for compute • 1,000s of processing cores • Pre-defined instruction set and datapath widths • Highly effective at parallel execution GPU • Millions of programmable digital logic cells • No predefined instruction set or datapath widths • Hardware timed execution FPGA DRAM Control ALU ALU Cache DRAM ALU ALU Control ALU ALU Cache DRAM ALU ALU Control ALU ALU Cache DRAM ALU ALU Control ALU ALU Cache DRAM ALU ALU
  • 15. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. GPUs for machine learning training Matrix A b11 b12 b13 b21 b22 b23 b31 b32 b33 Matrix B a11.b11 + a12.b21 + a13.b31 a11.b12 + a12.b22 + a13.b32 a11.b13 + a12.b23 + a13.b33 a21.b11 + a22.b21 + a23.b31 a21.b12 + a22.b22 + a23.b32 a21.b13 + a22.b23 + a23.b33 a31.b11 + a32.b21 + a33.b31 a31.b12 + a32.b22 + a33.b32 a31.b13 + a32.b23 + a33.b33 Matrix C These multiply and accumulate operations can be parallelized across 1,000s of core available in a typical GPU a11 a12 a13 a21 a22 a23 a31 a32 a33 • Routines for training machine learning models fundamentally map to matrix multiplications • This coupled with extremely high memory bandwidth makes GPUs ideal for training
  • 16. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. b11 b12 b13 b21 b22 b23 b31 b32 b33 GPUs for machine learning training Numeric precision: • Half precision (FP16) – 16 bit Floating point • Single precision (FP32) – 32 bit Floating point • Double precision (FP64) – 64 bit Floating point a11 a12 a13 a21 a22 a23 a31 a32 a33 Matrix A Matrix B a11.b11 + a12.b21 + a13.b31 a11.b12 + a12.b22 + a13.b32 a11.b13 + a12.b23 + a13.b33 a21.b11 + a22.b21 + a23.b31 a21.b12 + a22.b22 + a23.b32 a21.b13 + a22.b23 + a23.b33 a31.b11 + a32.b21 + a33.b31 a31.b12 + a32.b22 + a33.b32 a31.b13 + a32.b23 + a33.b33 Matrix C • Routines for training machine learning models fundamentally map to matrix multiplications • This coupled with extremely high memory bandwidth makes GPUs ideal for training
  • 17. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. New Tensor Cores in NVIDIA V100 GPUs • In addition to 5,120 CUDA Cores, each Tesla V100 has 640 Tensor Cores with 125 TFLOPs of mixed-precision performance per GPU • Each Tensor Core provides a 4x4x4 matrix processing array, which performs the operation D = A X B + C. Tesla V100 GPU a11 a12 a13 a14 a21 a22 a23 a24 a31 a32 a33 a34 a41 a42 a43 a44 A (FP16) b11 b12 b13 b14 b21 b22 b23 b24 b31 b32 b33 b34 b41 b42 b43 b44 B (FP16) c11 c12 c13 c14 c21 c22 c23 c24 c31 c32 c33 c34 c41 c42 c43 c44 C (FP16 or FP32) D (FP16 or FP32) Source:NVIDIA
  • 18. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. P3 instance details Instance size GPUs GPU Peer to Peer vCPUs Memory (GB) Network Bandwidth EBS Bandwidth On-Demand Price/hr* 1-yr RI Effective Hourly* 3-yr RI Effective Hourly* P3.2xlarge 1 No 8 61 Up to 10 Gbps 1.7 Gbps $3.06 $1.99 (35% Disc.) $1.23 (60% Disc.) P3.8xlarge 4 NVLink 32 244 10 Gbps 7 Gbps $12.24 $7.96 (35% Disc.) $4.93 (60% Disc.) P3.16xlarge 8 NVLink 64 488 25 Gbps 14 Gbps $24.48 $15.91 (35% Disc.) $9.87 (60% Disc.) Note: P3 instances are also supported via Spot Instances with up to 70% discount Regional Availability P3 instances are generally available in AWS US East (Northern Virginia), US East (Ohio), US West (Oregon), EU (Ireland), Asia Pacific (Seoul), Asia Pacific (Tokyo), AWS GovCloud (US), and China (Beijing) Regions. Framework Support P3 instances and their V100 GPUs supported across all major frameworks (such as TensorFlow, MXNet, PyTorch, Caffe2, and CNTK)
  • 19. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. EC2 pricing and usage options On-Demand Pay for compute capacity per second. No longer-term commitments or upfront payments are needed. You can increase or decrease your compute capacity depending on the demands of your application and only pay the hourly rate. Recommended for: • Users that prefer the low cost and flexibility of Amazon EC2 without any upfront payment or long-term commitment • Applications with short-term, spiky, or unpredictable workloads that cannot be interrupted • Applications being developed or tested on Amazon EC2 for the first time Reserved Instances Reserved Instances provide you with a significant discount (up to 60%) compared to On-Demand Instance pricing. In addition, when Reserved Instances are assigned to a specific Availability Zone, they provide a capacity reservation, giving you additional confidence in your ability to launch instances when you need them. Recommended for: • Applications with steady state usage • Applications that may require reserved capacity • Customers that can commit to using EC2 over a 1 or 3 year term to reduce their total computing costs Spot Instances Spot Instances allow you to request spare Amazon EC2 computing capacity for up to 70% off the On-Demand price. The only difference between On- Demand Instances and Spot Instances is that Spot Instances can be interrupted by EC2 with two minutes of notification when EC2 needs the capacity back. Recommended for: • Applications that have flexible start and end times • Applications that are only feasible at very low compute prices • Users with urgent computing needs for large amounts of additional capacity
  • 20. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. P3 instances details Instance Size GPUs GPU Peer to Peer vCPUs Memory (GB) Network Bandwidth EBS Bandwidth On-Demand Price/hr* 1-yr RI Effective Hourly* 3-yr RI Effective Hourly* P3.2xlarge 1 No 8 61 Up to 10 Gbps 1.7 Gbps $3.06 $1.99 (35% Disc.) $1.23 (60% Disc.) P3.8xlarge 4 NVLink 32 244 10 Gbps 7 Gbps $12.24 $7.96 (35% Disc.) $4.93 (60% Disc.) P3.16xlarge 8 NVLink 64 488 25 Gbps 14 Gbps $24.48 $15.91 (35% Disc.) $9.87 (60% Disc.) • P3 instances provide GPU-to-GPU data transfer over NVLink • P2 instanced provided GPU-to-GPU data transfer over PCI Express
  • 21. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Description P3.16xlarge P2.16xlarge P3 GPU Performance Improvement Number of GPUs 8 16 - Number of Accelerators 8 (V100) 8 (K80) GPU – Peer to Peer NVLink – 300 GB/s PCI-Express - 32 GB/s 9.4x CPU to GPU Throughput PCIe throughput per GPUs 8 GB/s 1 GB/s 8x CPU to GPU Throughput Total instance PCIe throughput 64 GB/s (Four x16 Gen3) 16 GB/s (One x16 Gen3) 4x P3 vs. P2 peer-to-peer configurations
  • 22. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. P3 PCIe and NVLink configurations CPU0 GPU0 GPU1 GPU2 GPU3 PCIe Switches CPU1 GPU4 GPU5 GPU6 GPU7 PCIe Switches QPI NVLink PCIExpress
  • 23. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. P3 PCIe and NVLink configurations CPU0 GPU0 GPU1 GPU2 GPU3 PCIe Switches CPU1 GPU4 GPU5 GPU6 GPU7 PCIe Switches QPI NVLink PCIExpress 0xFF 0xFF 0xFF 0xFF
  • 24. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Lowering training times with Amazon EC2 P3 Instances
  • 25. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS P3 vs. P2 Instance GPU perfo rma nce co mpa riso n • P2 Instances use K80 Accelerator (Kepler architecture) • P3 Instances use V100 Accelerator (Volta architecture) 0 2 4 6 8 10 12 14 16 K80 P100 V100 FP32 Perf (TFLOPS) 1.7X 0 1 2 3 4 5 6 7 8 K80 P100 V100 FP64 Perf (TFLOPS) 2.6X 0 20 40 60 80 100 120 140 K80 P100 V100 Mixed/FP16 Perf (TFLOPS) 14X over K80’s max perf. FP32
  • 26. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 112 430 846870 3440 6850 0 1000 2000 3000 4000 5000 6000 7000 8000 1 Accelerator 4 Accelerator 8 Accelerator Images/Second ResNet-50 training performance (using synthetic data, MXNet) P2 (1 Accelerator = 2 GPUs) Images/S P3 (1 Accelerator = 1 GPU) Images/S 8.1X 8X 7.8X
  • 27. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Scaling performance using distributed training - 5,000 10,000 15,000 20,000 25,000 30,000 35,000 40,000 45,000 50,000 1 2 4 8 16 32 64 Images/Second Number of GPUs Training using P3 instances (ResNet-50, ImageNet Images/Second) • Using single P3 instances, with Volta GPUs, customers can cut down training times of their machine learning models from days to a few hours. • Using distributed training via multiple P3 instances, high performance networking and storage solutions, customers can further cut down their time-to-train from hours in to minutes. • Example – We were to train ResNet-50 to Top1 validation accuracy of 76% in 15 mins using 32 P3.16xlarge instances (256 GPUs). https://aws.amazon.com/blogs/machine-learning/scalable-multi-node-training-with-tensorflow/
  • 28. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. OptimizedfordistributedMLtraining • Oneof the mostpowerful GPUinstancesavailablein thecloud • 100Gbpsof networkingthroughput • 96vCPUusingAWScustomerSkylakeCPUs,and768GBof systemmemory • Basedon NVIDIA’slatest GPUTeslaV100with 32GBofmemory Instancesize GPUs GPU memory GPU peer topeer vCPUs CPUtype Memor y (GB) Network bandwidt h AmazonEBS bandwidth Localinstance storage P3.2xlarge 1 xV100 16GB/GPU No 8 Broadwell 61 Upto 10Gbps 1.7 Gbps NA P3.8xlarge 4 xV100 16GB/GPU NVLink 32 Broadwell 244 10 Gbps 7 Gbps NA P3.16xlarge 8 xV100 16GB/GPU NVLink 64 Broadwell 488 25Gbps 14Gbps NA P3dn.24xlarge 8 xV100 32 GB/GPU NVLink 96 Skylake 768 100 Gbps 14Gbps 2 TBNVMe LatestNVIDAV100GPUwith32GB memoryforlargemodelsandhigher batchsizes 96SkylakevCPUswith supportfor AVX-512instructionsforpre- processingof trainingdata 100Gbps of networkingthroughput for large-scaledistributedtraining& fast dataaccess New larger P3 size – P3dn.24xlarge
  • 29. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved.© 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Getting started
  • 30. © 2019, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Deep Learning AMI • Get started quickly with easy-to-launch tutorials • Hassle-free setup and configuration • Pay only for what you use – no additional charge for the AMI • Accelerate your model training and deployment • Support for popular deep learning frameworks
  • 31. S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.
  • 32. © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved.S U M M I T Thank you! S U M M I T © 2019, Amazon Web Services, Inc. or its affiliates. All rights reserved. Brad Kenstler kenstler@amazon.com