SlideShare a Scribd company logo
1 of 48
Download to read offline
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS re:INVENT
High Performance Computing on AWS
D a v i d P e l l e r i n
H e a d o f W W B u s i n e s s D e v e l o p m e n t , I n f o t e c h , A W S
C M P 2 0 7
N o v e m b e r 2 7 , 2 0 1 7
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Agenda
 What is HPC? Survey of applications
 Grid computing, cluster computing, and “grids of clusters”
 Performance best-practices for HPC
 Accelerated computing using GPUs and FPGAs
 Automation and batch processing
 Graphics for HPC pre- and post-processing
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Defining HPC—example use-cases
Data Light
Minimal
requirements for
high performance
storage
Data Heavy
Benefits from
access to high
performance
storage
Fluid dynamics
Weather forecasting
Materials simulations
Crash simulations
Risk simulations
Molecular modeling
Contextual search
Logistics simulations
Animation and VFX
Semiconductor verification
Image processing/GIS
Genomics
Seismic processing
Metagenomics
Astrophysics
Deep learning
Clustered (Tightly Coupled)
Distributed/Grid (Loosely Coupled)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Important enablers in HPC
 Compute performance—CPUs, GPUs, FPGAs
 Memory performance—high RAM requirements in many applications
 Network performance—throughput, latency, and consistency
 Storage performance—including shared filesystems
 Automation and cluster/job management
 Graphics for pre- and post-processing
…and SCALE
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cluster HPC
Tightly coupled, latency-
sensitive applications
Use larger Amazon EC2
compute instances,
placement groups, enhanced
networking, HPC job
schedulers
Grid HPC
Loosely coupled, pleasingly
parallel
Use a variety of EC2
instances, multiple AZs, Spot,
Auto Scaling, Amazon SQS,
AWS Batch
Grids of clusters
Running parallel cluster
jobs, parameter studies
Use a grid strategy on
the cloud to run a group
of parallel, individually
clustered HPC jobs
Cluster and grid HPC in the cloud
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Grid computing examples
(Scale -o u t)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Global-scale grids for research
Large Hadron Collider (LHC)
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Global-scale grids for research
Best-practices using Spot: diversify computing with many instance types,
multiple AZs, multiple regions, and with stateless architectures
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
1.1M vCPUs for machine learning
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
HPC grids in financial services
U s i n g G P U A c c e l e r a t i o n The challenge
Spinning up large numbers of GPUs quickly and
inexpensively to meet ABSI’s customers financial
modeling and reporting needs
ABSI uses proprietary algorithms (Monte Carlo
simulations) running millions of times
The solution
ABSI moved its infrastructure to AWS and deprecated its
co-located data center
ABSI built a front end on AWS for its processing
solution, automatically running GPU instances on
Amazon EC2 using Amazon EBS in an Amazon VPC for
security.
The result
Can be as much as 500 times more efficient in terms of
“Using AWS helps us reduce a 10-
day process to 10 minutes. That’s
transformative: it broadens our
ability to discover.”
–Peter Phillips
Managing Director, Aon Benfield Securities
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
HPC in design and manufacturing
Applications for engineering:
 Molecular dynamics, CAD, CAE, EDA
 Collaboration tools for engineering
 Big data for manufacturing yield analysis
Running drive-head
simulations at scale:
Millions of parallel parameter
sweeps, running months of
simulations in just hours
Over 85,000 Intel cores
running at peak, using Spot
Instances
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Cluster computing examples
(Scale -u p )
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Tightly coupled HPC—weather
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Structural simulation
0
20
40
60
80
100
120
140
160
0
2000
4000
6000
8000
10000
12000
14000
16000
18000
20000
0 50 100 150 200 250 300 350
ScaleUp
Time(s)
Cores
c4.8xlarge Time c4.8xlarge Scaleup
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Fluid dynamics—Ansys Fluent
 C4.8xlarge instance type
 140M cell model
 F1 car CFD benchmark
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
HPC in aerospace
B o o m l e v e r a g e s r e s c a l e a n d A W S t o
e n a b l e s u p e r s o n i c t r a v e l
• Simulated vortex lift with 200M cell models on 512+ cores
• Increased simulation throughput: 100 jobs in parallel with 6x speedup per job → 600x speedup
• Eliminated IT overhead, including server capital costs, and in-house IT and software teams
• Elastic HPC capacity and pay-as-you-go AWS clusters allow business agility and ability to scale
“Rescale’s ScaleX cloud platform
is a game-changer for
engineering. It gives Boom
computing resources comparable
to building a large on-premise
HPC center. Rescale lets us move
fast with minimal capital
spending and resources
overhead.”
–Josh Krall, CTO & cofounder
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Performance considerations
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Performance considerations
f o r t i g h t l y c o u p l e d c l u s t e r w o r k l o a d s
Test using real-world examples
 Use large cases for testing: do
not benchmark scalability using
only small examples
Domain decomposition
 Choose number of cells per core
for either per-core efficiency or
for faster results
MPI libraries
 Test with Intel MPI and
OpenMPI 3.0, and make use of
available tunings
Network
 Use a placement group
 Enable enhanced networking
0
0.5
1
1.5
2
2.5
3
3.5
4
4.5
5
0.0
500.0
1000.0
1500.0
2000.0
2500.0
0 500 1000 1500 2000 2500 3000 3500 4000
Time(S)
Scale-Up
Cores
WRF 2.5 km CONUS Benchmark
Scale-Up time
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Performance considerations
f o r a l l H P C w o r k l o a d s
OS version
 Use Amazon Linux or an
updated 3.10+ kernel–4.0+ if
using NVME on F1 or I3
Processor states
 Use P-states to reduce
processor variability
Instance types
 C5, C4, M4, R4 are the best
choices today—but always test
with the latest EC2 instances
Hyper-threading and affinity
 Test with Hyper-Threading (HT)
on and off—usually off is best,
but not always
 Use CPU affinity to pin threads
to CPU cores when HT is off
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Choice of AWS instances for HPC
M4
General
purpose
Compute
optimized
Storage and IO
optimized
GPU, FPGA
accelerated
Memory
optimized
X1 F1
P3
I3 D2
R4
C5
C4
P2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Instance sizes: R4 example
R4 instances are optimized for
memory-intensive applications
 Xeon E5-2686 v4 processors
 DDR4 Memory
 Enhanced Networking, up to
25 Gb throughput
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Elastic Network Adaptor (ENI)
 Latest generation of enhanced
networking
 Hardware checksums
 Multi-queue support
 Receive side steering
 25 Gbps in a placement group
 Open source amazon network driver
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AWS Storage is a platform
Amazon EFS Amazon EBS
Amazon EC2
Instance Store
Amazon
S3/S3-IA
Amazon Glacier
Object
Data Transfer
AWS Direct
Connect
ISV
Connectors
Amazon
Kinesis
Firehose
Storage
Gateway
S3 Transfer
Acceleration
AWS
Snowball
Amazon
CloudFront
Internet/
VPN
BlockFile
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Amazon S3
Secure, durable, highly
scalable object
storage. Fast access,
low cost.
For long-term durable
storage of data, in a
readily accessible
get/put access format.
Primary durable and
scalable storage for
critical data
Amazon Glacier
Secure, durable, long
term, highly cost-
effective object
storage.
For long-term storage
and archival of data
that is infrequently
accessed.
Use for long-term,
lower-cost archival of
critical data
EBS+EC2
Create a single-AZ,
shared file system
using EC2 and EBS,
with third-party or
open source software
(ZFS, Weka.io, Avere,
Intel Lustre, etc.).
For near-line storage
of files optimized for
high IOPS.
Use for high-IOPS,
temporary working
storage
Optimize HPC storage
Amazon EFS
Highly available,
multi-AZ, fully
managed network-
attached elastic file
system.
For near-line, highly-
available storage of
files in a traditional
NFS format (NFSv4).
Use for read-often,
temporary working
storage
HPC storage dataflow
Corporate
data center Amazon
Glacier
AWS Direct
Connect
ISV
Connectors
Storage
Gateway
AWS
Snowball
Internet/VP
N
Ingress
Egress
Lifecycle
policies
EC2 Instance
EFS EBS+EC2
Instance
Store
Data Transfer Object, Block, File Storage
Amazon
Kinesis
Firehose
S3 Transfer
Acceleration
Amazon
CloudFront
Or other mounted
shared file system
Amazon
S3/S3-IA
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
High-performance NFS on AWS
 EC2+EBS is the most performant method of
creating scale-up file servers on AWS
 Build your own NFS or CIFS implementation
or use a partner solution
 EC2 instances as fileservers, using EBS for
block storage—tuned for application needs
 Single fileserver performance up to 25 Gb/s
over the network
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Accelerated computing
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Accelerated computing on AWS
P a r a l l e l i s m i n c r e a s e s t h r o u g h o u t
CPU: High speed, highly flexible GPU/FPGA: High throughput, high efficiency
GPUs and FPGAs can provide massive parallelism and higher efficiency than
CPUs for many categories of applications
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
NVIDIA GPU
P2: GPU-accelerated computing
 Enabling a high degree of parallelism—each
GPU has thousands of cores
 Consistent, well documented set of APIs
(CUDA, OpenACC, OpenCL)
 Supported by a wide variety of ISVs and
open source frameworks
Xilinx
UltraScale+
FPGA
F1: FPGA-accelerated computing
 Massively parallel—each FPGA includes
millions of parallel system logic cells
 Flexible—no fixed instruction set, can
implement wide or narrow datapaths
 Programmable using available, cloud-based
FPGA development tools
Accelerated computing on AWS
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Parallel processing: GPU and FPGA
A GPU is effective at processing the same instruction in
parallel, for example, calculating pixel values in parallel
for graphics shading, or running many parallel financial
computations. A GPU has a well-defined instruction set,
and fixed word sizes.
An FPGA is effective at processing the same or
different instructions in parallel, for example, creating a
complex pipeline of parallel, multistage operations on a
video stream, or performing a sequence of dependent
calculations and data manipulations for genomics
processing. An FPGA does not have a predefined
instruction set, or a fixed data width.
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
4D medical imaging on GPU
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Deep learning on GPU
MXNet training on EC2 P2
instances:
 Training of a popular image
analysis algorithm, Inception v3,
using MXNet and running on P2
instances
 Scaling efficiency of 85%
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Genomics processing on FPGA
F1
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
 Financial computing
 Genomics sequencing
 Test and measurement
 Image and video processing
 Big data and machine learning
 Security, compression
 …and more
FPGA use-cases and F1 partners
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
HPC deployment and automation
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Deploying HPC on AWS
Shared File Storage
Cloud-based, auto-scaling HPC cluster
License managers and cluster
head nodes with job schedulers
3D graphics virtual workstation
AWS Direct Connect
On-Premises HPC
Resources
Thin or Zero Client
- No local data -
Storage CacheAmazon S3
and Amazon
Glacier
Corporate datacenter
On AWS, secure
and well-
optimized HPC
clusters can be
automatically
created, operated,
and torn down in
just minutes AWS Snowball
AWS Batch
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
HPC automation with CfnCluster
CfnCluster simplifies
deployment of HPC in the cloud,
including integrating with
popular HPC schedulers
Built on AWS CloudFormation,
easy to modify to meet specific
application or project
requirements
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
CfnCluster HPC stack
Shared File Storage
Cloud-based, auto scaling HPC cluster
on EC2
Cluster head node with
job scheduler
Amazon S3
and
Amazon
Glacier
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Fully managed
No software to install or
servers to manage. AWS
Batch provisions, manages,
and scales your
infrastructure.
Integrated with AWS
Natively integrated with the
AWS Platform, AWS Batch
jobs can easily and securely
interact with services such as
Amazon S3, DynamoDB, and
Amazon Rekognition.
Cost-optimized
resource provisioning
AWS Batch
automatically provisions
compute resources
tailored to the needs of
your jobs using Amazon
EC2 and EC2 Spot.
AWS Batch for HPC workloads
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Hybrid HPC
U s i n g A m a z o n
E C 2 S y s t e m s M a n a g e r
Capabilities
Run
Command
State
Manager
Inventory Maintenance
Window
Patch
Manager
Automation Parameter
Store
Documents
AWS cloud
corporate data
center
IT admin, DevOps engineer
Role-based access control
 Manage thousands of Windows and Linux nodes
running on Amazon EC2 or on premises
 Control user actions and scope with secure,
granular access control
 Safely execute changes with rate control to
reduce blast radius
 Audit every user action with change tracking
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Graphics for HPC applications
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Secure graphics and collaboration
Cloud can be used for pre-and post processing as well as HPC
 Use GPUs in the cloud for remote rendering and remote desktops
Cloud is more secure
for collaboration
 Encrypt the data in flight
and at rest
 Manage your own keys
and credentials
 Deliver pixels to your
collaborators, not the
actual data
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
HPC workstations in the cloud
E C 2 + E l a s t i c G P U
Small GPU
:
:
Large GPU
Attach Elastic GPU to an instance at launch, similar to attaching
an EBS volume
M4
X1
T2
R4
C5
C4
F1
P2
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Elastic GPU for HPC applications
Wide range of available instance types depending
on application needs, for example, high-memory
Graphics attachment in any of a range of sizes
depending on size of the project or model
More cost-effective, higher performance than a
one-size-fits-all technical workstation
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Desktop application streaming
wi th Amazon AppStream 2.0
Stream desktop applications securely
to any web browser
Pay as you go Scale globally
Secure apps and dataRun desktop apps
in a web browser
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
AppStream 2.0 graphics support
 Multiple Instance types—including General Purpose,
Compute Optimized, Memory Optimized, Graphics
Design, Graphics Pro, and Graphics Desktop
 Always-On or On-Demand pricing models
 Support for OpenGL, DirectX, OpenCL and CUDA
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Summary
 HPC encompasses a wide range of applications—if you
need more computing to solve a problem than one
desktop or server, you need HPC
 HPC on AWS provides flexibility, speed of deployment,
automation, and scale
 Graphics is key for delivering and viewing HPC results—
HPC on AWS is not just about batch processing
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Thank you!
d p e l l e r i @ a m a z o n . c o m
C M P 2 0 7

More Related Content

What's hot

CTD403_Supercharge Your Websites with the Power of Lambda@Edge
CTD403_Supercharge Your Websites with the Power of Lambda@EdgeCTD403_Supercharge Your Websites with the Power of Lambda@Edge
CTD403_Supercharge Your Websites with the Power of Lambda@EdgeAmazon Web Services
 
MCL302_Maximizing the Customer Experience with AI on AWS
MCL302_Maximizing the Customer Experience with AI on AWSMCL302_Maximizing the Customer Experience with AI on AWS
MCL302_Maximizing the Customer Experience with AI on AWSAmazon Web Services
 
Build your case for the cloud and engage your business stakeholders
Build your case for the cloud and engage your business stakeholdersBuild your case for the cloud and engage your business stakeholders
Build your case for the cloud and engage your business stakeholdersAmazon Web Services
 
ARC311_Serverless Encoding at Scale with Content Moderation via Deep Learning...
ARC311_Serverless Encoding at Scale with Content Moderation via Deep Learning...ARC311_Serverless Encoding at Scale with Content Moderation via Deep Learning...
ARC311_Serverless Encoding at Scale with Content Moderation via Deep Learning...Amazon Web Services
 
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...Amazon Web Services
 
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...Amazon Web Services
 
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingGPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingAmazon Web Services
 
Oracle Enterprise Solutions on AWS - ENT326 - re:Invent 2017
Oracle Enterprise Solutions on AWS - ENT326 - re:Invent 2017Oracle Enterprise Solutions on AWS - ENT326 - re:Invent 2017
Oracle Enterprise Solutions on AWS - ENT326 - re:Invent 2017Amazon Web Services
 
DVC304_Compliance and Top Security Threats in the Cloud—Are You Protected
DVC304_Compliance and Top Security Threats in the Cloud—Are You ProtectedDVC304_Compliance and Top Security Threats in the Cloud—Are You Protected
DVC304_Compliance and Top Security Threats in the Cloud—Are You ProtectedAmazon Web Services
 
GPSTEC305-Machine Learning in Capital Markets
GPSTEC305-Machine Learning in Capital MarketsGPSTEC305-Machine Learning in Capital Markets
GPSTEC305-Machine Learning in Capital MarketsAmazon Web Services
 
GPSWKS406-Migrating a Microsoft ASP.NET Application to AWS
GPSWKS406-Migrating a Microsoft ASP.NET Application to AWSGPSWKS406-Migrating a Microsoft ASP.NET Application to AWS
GPSWKS406-Migrating a Microsoft ASP.NET Application to AWSAmazon Web Services
 
WPT202_Bridging the Racial Digital Divide
WPT202_Bridging the Racial Digital DivideWPT202_Bridging the Racial Digital Divide
WPT202_Bridging the Racial Digital DivideAmazon Web Services
 
CMP323_AWS Batch Easy & Efficient Batch Computing on Amazon Web Services
CMP323_AWS Batch Easy & Efficient Batch Computing on Amazon Web ServicesCMP323_AWS Batch Easy & Efficient Batch Computing on Amazon Web Services
CMP323_AWS Batch Easy & Efficient Batch Computing on Amazon Web ServicesAmazon Web Services
 
ARC201_Scaling Up to Your First 10 Million Users
ARC201_Scaling Up to Your First 10 Million UsersARC201_Scaling Up to Your First 10 Million Users
ARC201_Scaling Up to Your First 10 Million UsersAmazon Web Services
 
CMP213_GPU(G3) Applications in Media and Entertainment Workloads
CMP213_GPU(G3) Applications in Media and Entertainment WorkloadsCMP213_GPU(G3) Applications in Media and Entertainment Workloads
CMP213_GPU(G3) Applications in Media and Entertainment WorkloadsAmazon Web Services
 
ARC214_Addressing Your Business Needs with AWS
ARC214_Addressing Your Business Needs with AWSARC214_Addressing Your Business Needs with AWS
ARC214_Addressing Your Business Needs with AWSAmazon Web Services
 
DEV207_Deploying and Managing Ruby Applications on AWS
DEV207_Deploying and Managing Ruby Applications on AWSDEV207_Deploying and Managing Ruby Applications on AWS
DEV207_Deploying and Managing Ruby Applications on AWSAmazon Web Services
 
CON203_Driving Innovation with Containers
CON203_Driving Innovation with ContainersCON203_Driving Innovation with Containers
CON203_Driving Innovation with ContainersAmazon Web Services
 
DEV206_Life of a Code Change to a Tier 1 Service
DEV206_Life of a Code Change to a Tier 1 ServiceDEV206_Life of a Code Change to a Tier 1 Service
DEV206_Life of a Code Change to a Tier 1 ServiceAmazon Web Services
 

What's hot (20)

CTD403_Supercharge Your Websites with the Power of Lambda@Edge
CTD403_Supercharge Your Websites with the Power of Lambda@EdgeCTD403_Supercharge Your Websites with the Power of Lambda@Edge
CTD403_Supercharge Your Websites with the Power of Lambda@Edge
 
MCL302_Maximizing the Customer Experience with AI on AWS
MCL302_Maximizing the Customer Experience with AI on AWSMCL302_Maximizing the Customer Experience with AI on AWS
MCL302_Maximizing the Customer Experience with AI on AWS
 
Build your case for the cloud and engage your business stakeholders
Build your case for the cloud and engage your business stakeholdersBuild your case for the cloud and engage your business stakeholders
Build your case for the cloud and engage your business stakeholders
 
ARC311_Serverless Encoding at Scale with Content Moderation via Deep Learning...
ARC311_Serverless Encoding at Scale with Content Moderation via Deep Learning...ARC311_Serverless Encoding at Scale with Content Moderation via Deep Learning...
ARC311_Serverless Encoding at Scale with Content Moderation via Deep Learning...
 
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
MBL309_User Engagement, Messaging, and Analytics Using Amazon Pinpoint from A...
 
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
RET303_Drive Warehouse Efficiencies with the Same AWS IoT Technology that Pow...
 
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of ManufacturingGPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
GPSTEC326-GPS Industry 4.0 AI and the Future of Manufacturing
 
Oracle Enterprise Solutions on AWS - ENT326 - re:Invent 2017
Oracle Enterprise Solutions on AWS - ENT326 - re:Invent 2017Oracle Enterprise Solutions on AWS - ENT326 - re:Invent 2017
Oracle Enterprise Solutions on AWS - ENT326 - re:Invent 2017
 
DVC304_Compliance and Top Security Threats in the Cloud—Are You Protected
DVC304_Compliance and Top Security Threats in the Cloud—Are You ProtectedDVC304_Compliance and Top Security Threats in the Cloud—Are You Protected
DVC304_Compliance and Top Security Threats in the Cloud—Are You Protected
 
GPSTEC305-Machine Learning in Capital Markets
GPSTEC305-Machine Learning in Capital MarketsGPSTEC305-Machine Learning in Capital Markets
GPSTEC305-Machine Learning in Capital Markets
 
GPSWKS406-Migrating a Microsoft ASP.NET Application to AWS
GPSWKS406-Migrating a Microsoft ASP.NET Application to AWSGPSWKS406-Migrating a Microsoft ASP.NET Application to AWS
GPSWKS406-Migrating a Microsoft ASP.NET Application to AWS
 
WPT202_Bridging the Racial Digital Divide
WPT202_Bridging the Racial Digital DivideWPT202_Bridging the Racial Digital Divide
WPT202_Bridging the Racial Digital Divide
 
CMP323_AWS Batch Easy & Efficient Batch Computing on Amazon Web Services
CMP323_AWS Batch Easy & Efficient Batch Computing on Amazon Web ServicesCMP323_AWS Batch Easy & Efficient Batch Computing on Amazon Web Services
CMP323_AWS Batch Easy & Efficient Batch Computing on Amazon Web Services
 
ARC201_Scaling Up to Your First 10 Million Users
ARC201_Scaling Up to Your First 10 Million UsersARC201_Scaling Up to Your First 10 Million Users
ARC201_Scaling Up to Your First 10 Million Users
 
CMP213_GPU(G3) Applications in Media and Entertainment Workloads
CMP213_GPU(G3) Applications in Media and Entertainment WorkloadsCMP213_GPU(G3) Applications in Media and Entertainment Workloads
CMP213_GPU(G3) Applications in Media and Entertainment Workloads
 
ARC214_Addressing Your Business Needs with AWS
ARC214_Addressing Your Business Needs with AWSARC214_Addressing Your Business Needs with AWS
ARC214_Addressing Your Business Needs with AWS
 
DEV207_Deploying and Managing Ruby Applications on AWS
DEV207_Deploying and Managing Ruby Applications on AWSDEV207_Deploying and Managing Ruby Applications on AWS
DEV207_Deploying and Managing Ruby Applications on AWS
 
CON203_Driving Innovation with Containers
CON203_Driving Innovation with ContainersCON203_Driving Innovation with Containers
CON203_Driving Innovation with Containers
 
DVC202_The Open Guide to AWS
DVC202_The Open Guide to AWSDVC202_The Open Guide to AWS
DVC202_The Open Guide to AWS
 
DEV206_Life of a Code Change to a Tier 1 Service
DEV206_Life of a Code Change to a Tier 1 ServiceDEV206_Life of a Code Change to a Tier 1 Service
DEV206_Life of a Code Change to a Tier 1 Service
 

Similar to CMP207_High Performance Computing on AWS

成本節約之道:加速設計週期 x 大規模運行高效能運算 (HPC) 工作負載 (Level: 300)
成本節約之道:加速設計週期 x 大規模運行高效能運算 (HPC) 工作負載 (Level: 300)成本節約之道:加速設計週期 x 大規模運行高效能運算 (HPC) 工作負載 (Level: 300)
成本節約之道:加速設計週期 x 大規模運行高效能運算 (HPC) 工作負載 (Level: 300)Amazon Web Services
 
High Performance Computing on AWS
High Performance Computing on AWSHigh Performance Computing on AWS
High Performance Computing on AWSAmazon Web Services
 
AWS Compute Evolved Week: High Performance Computing on AWS
AWS Compute Evolved Week: High Performance Computing on AWSAWS Compute Evolved Week: High Performance Computing on AWS
AWS Compute Evolved Week: High Performance Computing on AWSAmazon Web Services
 
High Performance Computing on AWS
High Performance Computing on AWSHigh Performance Computing on AWS
High Performance Computing on AWSAmazon Web Services
 
Accelerating Life Sciences with HPC on AWS - AWS Online Tech Talks
Accelerating Life Sciences with HPC on AWS - AWS Online Tech TalksAccelerating Life Sciences with HPC on AWS - AWS Online Tech Talks
Accelerating Life Sciences with HPC on AWS - AWS Online Tech TalksAmazon Web Services
 
Computação de Alta Performance (HPC) na AWS - CMP201 - Sao Paulo Summit
Computação de Alta Performance (HPC) na AWS -  CMP201 - Sao Paulo SummitComputação de Alta Performance (HPC) na AWS -  CMP201 - Sao Paulo Summit
Computação de Alta Performance (HPC) na AWS - CMP201 - Sao Paulo SummitAmazon Web Services
 
透過最新的 AWS 服務在 2019 年為您的業務轉型 (Level 200)
透過最新的 AWS 服務在 2019 年為您的業務轉型 (Level 200)透過最新的 AWS 服務在 2019 年為您的業務轉型 (Level 200)
透過最新的 AWS 服務在 2019 年為您的業務轉型 (Level 200)Amazon Web Services
 
The Future of Research Computing on AWS - AWS Public Sector Summit Singapore ...
The Future of Research Computing on AWS - AWS Public Sector Summit Singapore ...The Future of Research Computing on AWS - AWS Public Sector Summit Singapore ...
The Future of Research Computing on AWS - AWS Public Sector Summit Singapore ...Amazon Web Services
 
Amazon EC2 Foundations - CMP203 - re:Invent 2017
Amazon EC2 Foundations - CMP203 - re:Invent 2017Amazon EC2 Foundations - CMP203 - re:Invent 2017
Amazon EC2 Foundations - CMP203 - re:Invent 2017Amazon Web Services
 
High Performance Computing on AWS
High Performance Computing on AWSHigh Performance Computing on AWS
High Performance Computing on AWSAmazon Web Services
 
Real world High Performance & High Throughput Computing on AWS
Real world High Performance & High Throughput Computing on AWSReal world High Performance & High Throughput Computing on AWS
Real world High Performance & High Throughput Computing on AWSAmazon Web Services
 
SRV317_Unlocking High Performance Computing for Financial Services with Serve...
SRV317_Unlocking High Performance Computing for Financial Services with Serve...SRV317_Unlocking High Performance Computing for Financial Services with Serve...
SRV317_Unlocking High Performance Computing for Financial Services with Serve...Amazon Web Services
 
Architectures for HPC/HTC Workloads on AWS - CMP306 - re:Invent 2017
Architectures for HPC/HTC Workloads on AWS - CMP306 - re:Invent 2017Architectures for HPC/HTC Workloads on AWS - CMP306 - re:Invent 2017
Architectures for HPC/HTC Workloads on AWS - CMP306 - re:Invent 2017Amazon Web Services
 
Use HPC on AWS for Physics-Based Simulation, ML, and Statistics in CAE (CMP32...
Use HPC on AWS for Physics-Based Simulation, ML, and Statistics in CAE (CMP32...Use HPC on AWS for Physics-Based Simulation, ML, and Statistics in CAE (CMP32...
Use HPC on AWS for Physics-Based Simulation, ML, and Statistics in CAE (CMP32...Amazon Web Services
 
What would You do with a Million cores? HPC on AWS
What would You do with a Million cores? HPC on AWSWhat would You do with a Million cores? HPC on AWS
What would You do with a Million cores? HPC on AWSAmazon Web Services
 
Stack Mastery: Create and Optimize Advanced AWS CloudFormation Templates - DE...
Stack Mastery: Create and Optimize Advanced AWS CloudFormation Templates - DE...Stack Mastery: Create and Optimize Advanced AWS CloudFormation Templates - DE...
Stack Mastery: Create and Optimize Advanced AWS CloudFormation Templates - DE...Amazon Web Services
 
High Performance Computing on AWS: Driving Innovation without Infrastructure ...
High Performance Computing on AWS: Driving Innovation without Infrastructure ...High Performance Computing on AWS: Driving Innovation without Infrastructure ...
High Performance Computing on AWS: Driving Innovation without Infrastructure ...Amazon Web Services
 
CMP217_Scale In-Memory Workloads on Amazon EC2 X1 and X1e Instances with up t...
CMP217_Scale In-Memory Workloads on Amazon EC2 X1 and X1e Instances with up t...CMP217_Scale In-Memory Workloads on Amazon EC2 X1 and X1e Instances with up t...
CMP217_Scale In-Memory Workloads on Amazon EC2 X1 and X1e Instances with up t...Amazon Web Services
 
What would you do with a million cores - HPC on AWS
What would you do with a million cores - HPC on AWSWhat would you do with a million cores - HPC on AWS
What would you do with a million cores - HPC on AWSAmazon Web Services
 
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...Amazon Web Services
 

Similar to CMP207_High Performance Computing on AWS (20)

成本節約之道:加速設計週期 x 大規模運行高效能運算 (HPC) 工作負載 (Level: 300)
成本節約之道:加速設計週期 x 大規模運行高效能運算 (HPC) 工作負載 (Level: 300)成本節約之道:加速設計週期 x 大規模運行高效能運算 (HPC) 工作負載 (Level: 300)
成本節約之道:加速設計週期 x 大規模運行高效能運算 (HPC) 工作負載 (Level: 300)
 
High Performance Computing on AWS
High Performance Computing on AWSHigh Performance Computing on AWS
High Performance Computing on AWS
 
AWS Compute Evolved Week: High Performance Computing on AWS
AWS Compute Evolved Week: High Performance Computing on AWSAWS Compute Evolved Week: High Performance Computing on AWS
AWS Compute Evolved Week: High Performance Computing on AWS
 
High Performance Computing on AWS
High Performance Computing on AWSHigh Performance Computing on AWS
High Performance Computing on AWS
 
Accelerating Life Sciences with HPC on AWS - AWS Online Tech Talks
Accelerating Life Sciences with HPC on AWS - AWS Online Tech TalksAccelerating Life Sciences with HPC on AWS - AWS Online Tech Talks
Accelerating Life Sciences with HPC on AWS - AWS Online Tech Talks
 
Computação de Alta Performance (HPC) na AWS - CMP201 - Sao Paulo Summit
Computação de Alta Performance (HPC) na AWS -  CMP201 - Sao Paulo SummitComputação de Alta Performance (HPC) na AWS -  CMP201 - Sao Paulo Summit
Computação de Alta Performance (HPC) na AWS - CMP201 - Sao Paulo Summit
 
透過最新的 AWS 服務在 2019 年為您的業務轉型 (Level 200)
透過最新的 AWS 服務在 2019 年為您的業務轉型 (Level 200)透過最新的 AWS 服務在 2019 年為您的業務轉型 (Level 200)
透過最新的 AWS 服務在 2019 年為您的業務轉型 (Level 200)
 
The Future of Research Computing on AWS - AWS Public Sector Summit Singapore ...
The Future of Research Computing on AWS - AWS Public Sector Summit Singapore ...The Future of Research Computing on AWS - AWS Public Sector Summit Singapore ...
The Future of Research Computing on AWS - AWS Public Sector Summit Singapore ...
 
Amazon EC2 Foundations - CMP203 - re:Invent 2017
Amazon EC2 Foundations - CMP203 - re:Invent 2017Amazon EC2 Foundations - CMP203 - re:Invent 2017
Amazon EC2 Foundations - CMP203 - re:Invent 2017
 
High Performance Computing on AWS
High Performance Computing on AWSHigh Performance Computing on AWS
High Performance Computing on AWS
 
Real world High Performance & High Throughput Computing on AWS
Real world High Performance & High Throughput Computing on AWSReal world High Performance & High Throughput Computing on AWS
Real world High Performance & High Throughput Computing on AWS
 
SRV317_Unlocking High Performance Computing for Financial Services with Serve...
SRV317_Unlocking High Performance Computing for Financial Services with Serve...SRV317_Unlocking High Performance Computing for Financial Services with Serve...
SRV317_Unlocking High Performance Computing for Financial Services with Serve...
 
Architectures for HPC/HTC Workloads on AWS - CMP306 - re:Invent 2017
Architectures for HPC/HTC Workloads on AWS - CMP306 - re:Invent 2017Architectures for HPC/HTC Workloads on AWS - CMP306 - re:Invent 2017
Architectures for HPC/HTC Workloads on AWS - CMP306 - re:Invent 2017
 
Use HPC on AWS for Physics-Based Simulation, ML, and Statistics in CAE (CMP32...
Use HPC on AWS for Physics-Based Simulation, ML, and Statistics in CAE (CMP32...Use HPC on AWS for Physics-Based Simulation, ML, and Statistics in CAE (CMP32...
Use HPC on AWS for Physics-Based Simulation, ML, and Statistics in CAE (CMP32...
 
What would You do with a Million cores? HPC on AWS
What would You do with a Million cores? HPC on AWSWhat would You do with a Million cores? HPC on AWS
What would You do with a Million cores? HPC on AWS
 
Stack Mastery: Create and Optimize Advanced AWS CloudFormation Templates - DE...
Stack Mastery: Create and Optimize Advanced AWS CloudFormation Templates - DE...Stack Mastery: Create and Optimize Advanced AWS CloudFormation Templates - DE...
Stack Mastery: Create and Optimize Advanced AWS CloudFormation Templates - DE...
 
High Performance Computing on AWS: Driving Innovation without Infrastructure ...
High Performance Computing on AWS: Driving Innovation without Infrastructure ...High Performance Computing on AWS: Driving Innovation without Infrastructure ...
High Performance Computing on AWS: Driving Innovation without Infrastructure ...
 
CMP217_Scale In-Memory Workloads on Amazon EC2 X1 and X1e Instances with up t...
CMP217_Scale In-Memory Workloads on Amazon EC2 X1 and X1e Instances with up t...CMP217_Scale In-Memory Workloads on Amazon EC2 X1 and X1e Instances with up t...
CMP217_Scale In-Memory Workloads on Amazon EC2 X1 and X1e Instances with up t...
 
What would you do with a million cores - HPC on AWS
What would you do with a million cores - HPC on AWSWhat would you do with a million cores - HPC on AWS
What would you do with a million cores - HPC on AWS
 
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
Grid computing in the cloud for Financial Services industry - CMP205-I - New ...
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

CMP207_High Performance Computing on AWS

  • 1. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS re:INVENT High Performance Computing on AWS D a v i d P e l l e r i n H e a d o f W W B u s i n e s s D e v e l o p m e n t , I n f o t e c h , A W S C M P 2 0 7 N o v e m b e r 2 7 , 2 0 1 7
  • 2. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Agenda  What is HPC? Survey of applications  Grid computing, cluster computing, and “grids of clusters”  Performance best-practices for HPC  Accelerated computing using GPUs and FPGAs  Automation and batch processing  Graphics for HPC pre- and post-processing
  • 3. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Defining HPC—example use-cases Data Light Minimal requirements for high performance storage Data Heavy Benefits from access to high performance storage Fluid dynamics Weather forecasting Materials simulations Crash simulations Risk simulations Molecular modeling Contextual search Logistics simulations Animation and VFX Semiconductor verification Image processing/GIS Genomics Seismic processing Metagenomics Astrophysics Deep learning Clustered (Tightly Coupled) Distributed/Grid (Loosely Coupled)
  • 4. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Important enablers in HPC  Compute performance—CPUs, GPUs, FPGAs  Memory performance—high RAM requirements in many applications  Network performance—throughput, latency, and consistency  Storage performance—including shared filesystems  Automation and cluster/job management  Graphics for pre- and post-processing …and SCALE
  • 5. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cluster HPC Tightly coupled, latency- sensitive applications Use larger Amazon EC2 compute instances, placement groups, enhanced networking, HPC job schedulers Grid HPC Loosely coupled, pleasingly parallel Use a variety of EC2 instances, multiple AZs, Spot, Auto Scaling, Amazon SQS, AWS Batch Grids of clusters Running parallel cluster jobs, parameter studies Use a grid strategy on the cloud to run a group of parallel, individually clustered HPC jobs Cluster and grid HPC in the cloud
  • 6. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Grid computing examples (Scale -o u t)
  • 7. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Global-scale grids for research Large Hadron Collider (LHC)
  • 8. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Global-scale grids for research Best-practices using Spot: diversify computing with many instance types, multiple AZs, multiple regions, and with stateless architectures
  • 9. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 1.1M vCPUs for machine learning
  • 10. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. HPC grids in financial services U s i n g G P U A c c e l e r a t i o n The challenge Spinning up large numbers of GPUs quickly and inexpensively to meet ABSI’s customers financial modeling and reporting needs ABSI uses proprietary algorithms (Monte Carlo simulations) running millions of times The solution ABSI moved its infrastructure to AWS and deprecated its co-located data center ABSI built a front end on AWS for its processing solution, automatically running GPU instances on Amazon EC2 using Amazon EBS in an Amazon VPC for security. The result Can be as much as 500 times more efficient in terms of “Using AWS helps us reduce a 10- day process to 10 minutes. That’s transformative: it broadens our ability to discover.” –Peter Phillips Managing Director, Aon Benfield Securities
  • 11. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. HPC in design and manufacturing Applications for engineering:  Molecular dynamics, CAD, CAE, EDA  Collaboration tools for engineering  Big data for manufacturing yield analysis Running drive-head simulations at scale: Millions of parallel parameter sweeps, running months of simulations in just hours Over 85,000 Intel cores running at peak, using Spot Instances
  • 12. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Cluster computing examples (Scale -u p )
  • 13. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Tightly coupled HPC—weather
  • 14. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Structural simulation 0 20 40 60 80 100 120 140 160 0 2000 4000 6000 8000 10000 12000 14000 16000 18000 20000 0 50 100 150 200 250 300 350 ScaleUp Time(s) Cores c4.8xlarge Time c4.8xlarge Scaleup
  • 15. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Fluid dynamics—Ansys Fluent  C4.8xlarge instance type  140M cell model  F1 car CFD benchmark
  • 16. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. HPC in aerospace B o o m l e v e r a g e s r e s c a l e a n d A W S t o e n a b l e s u p e r s o n i c t r a v e l • Simulated vortex lift with 200M cell models on 512+ cores • Increased simulation throughput: 100 jobs in parallel with 6x speedup per job → 600x speedup • Eliminated IT overhead, including server capital costs, and in-house IT and software teams • Elastic HPC capacity and pay-as-you-go AWS clusters allow business agility and ability to scale “Rescale’s ScaleX cloud platform is a game-changer for engineering. It gives Boom computing resources comparable to building a large on-premise HPC center. Rescale lets us move fast with minimal capital spending and resources overhead.” –Josh Krall, CTO & cofounder
  • 17. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Performance considerations
  • 18. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Performance considerations f o r t i g h t l y c o u p l e d c l u s t e r w o r k l o a d s Test using real-world examples  Use large cases for testing: do not benchmark scalability using only small examples Domain decomposition  Choose number of cells per core for either per-core efficiency or for faster results MPI libraries  Test with Intel MPI and OpenMPI 3.0, and make use of available tunings Network  Use a placement group  Enable enhanced networking 0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5 0.0 500.0 1000.0 1500.0 2000.0 2500.0 0 500 1000 1500 2000 2500 3000 3500 4000 Time(S) Scale-Up Cores WRF 2.5 km CONUS Benchmark Scale-Up time
  • 19. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Performance considerations f o r a l l H P C w o r k l o a d s OS version  Use Amazon Linux or an updated 3.10+ kernel–4.0+ if using NVME on F1 or I3 Processor states  Use P-states to reduce processor variability Instance types  C5, C4, M4, R4 are the best choices today—but always test with the latest EC2 instances Hyper-threading and affinity  Test with Hyper-Threading (HT) on and off—usually off is best, but not always  Use CPU affinity to pin threads to CPU cores when HT is off
  • 20. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Choice of AWS instances for HPC M4 General purpose Compute optimized Storage and IO optimized GPU, FPGA accelerated Memory optimized X1 F1 P3 I3 D2 R4 C5 C4 P2
  • 21. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Instance sizes: R4 example R4 instances are optimized for memory-intensive applications  Xeon E5-2686 v4 processors  DDR4 Memory  Enhanced Networking, up to 25 Gb throughput
  • 22. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Elastic Network Adaptor (ENI)  Latest generation of enhanced networking  Hardware checksums  Multi-queue support  Receive side steering  25 Gbps in a placement group  Open source amazon network driver
  • 23. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AWS Storage is a platform Amazon EFS Amazon EBS Amazon EC2 Instance Store Amazon S3/S3-IA Amazon Glacier Object Data Transfer AWS Direct Connect ISV Connectors Amazon Kinesis Firehose Storage Gateway S3 Transfer Acceleration AWS Snowball Amazon CloudFront Internet/ VPN BlockFile
  • 24. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Amazon S3 Secure, durable, highly scalable object storage. Fast access, low cost. For long-term durable storage of data, in a readily accessible get/put access format. Primary durable and scalable storage for critical data Amazon Glacier Secure, durable, long term, highly cost- effective object storage. For long-term storage and archival of data that is infrequently accessed. Use for long-term, lower-cost archival of critical data EBS+EC2 Create a single-AZ, shared file system using EC2 and EBS, with third-party or open source software (ZFS, Weka.io, Avere, Intel Lustre, etc.). For near-line storage of files optimized for high IOPS. Use for high-IOPS, temporary working storage Optimize HPC storage Amazon EFS Highly available, multi-AZ, fully managed network- attached elastic file system. For near-line, highly- available storage of files in a traditional NFS format (NFSv4). Use for read-often, temporary working storage
  • 25. HPC storage dataflow Corporate data center Amazon Glacier AWS Direct Connect ISV Connectors Storage Gateway AWS Snowball Internet/VP N Ingress Egress Lifecycle policies EC2 Instance EFS EBS+EC2 Instance Store Data Transfer Object, Block, File Storage Amazon Kinesis Firehose S3 Transfer Acceleration Amazon CloudFront Or other mounted shared file system Amazon S3/S3-IA
  • 26. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. High-performance NFS on AWS  EC2+EBS is the most performant method of creating scale-up file servers on AWS  Build your own NFS or CIFS implementation or use a partner solution  EC2 instances as fileservers, using EBS for block storage—tuned for application needs  Single fileserver performance up to 25 Gb/s over the network
  • 27. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Accelerated computing
  • 28. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Accelerated computing on AWS P a r a l l e l i s m i n c r e a s e s t h r o u g h o u t CPU: High speed, highly flexible GPU/FPGA: High throughput, high efficiency GPUs and FPGAs can provide massive parallelism and higher efficiency than CPUs for many categories of applications
  • 29. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. NVIDIA GPU P2: GPU-accelerated computing  Enabling a high degree of parallelism—each GPU has thousands of cores  Consistent, well documented set of APIs (CUDA, OpenACC, OpenCL)  Supported by a wide variety of ISVs and open source frameworks Xilinx UltraScale+ FPGA F1: FPGA-accelerated computing  Massively parallel—each FPGA includes millions of parallel system logic cells  Flexible—no fixed instruction set, can implement wide or narrow datapaths  Programmable using available, cloud-based FPGA development tools Accelerated computing on AWS
  • 30. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Parallel processing: GPU and FPGA A GPU is effective at processing the same instruction in parallel, for example, calculating pixel values in parallel for graphics shading, or running many parallel financial computations. A GPU has a well-defined instruction set, and fixed word sizes. An FPGA is effective at processing the same or different instructions in parallel, for example, creating a complex pipeline of parallel, multistage operations on a video stream, or performing a sequence of dependent calculations and data manipulations for genomics processing. An FPGA does not have a predefined instruction set, or a fixed data width.
  • 31. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. 4D medical imaging on GPU
  • 32. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Deep learning on GPU MXNet training on EC2 P2 instances:  Training of a popular image analysis algorithm, Inception v3, using MXNet and running on P2 instances  Scaling efficiency of 85%
  • 33. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Genomics processing on FPGA F1
  • 34. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.  Financial computing  Genomics sequencing  Test and measurement  Image and video processing  Big data and machine learning  Security, compression  …and more FPGA use-cases and F1 partners
  • 35. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. HPC deployment and automation
  • 36. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Deploying HPC on AWS Shared File Storage Cloud-based, auto-scaling HPC cluster License managers and cluster head nodes with job schedulers 3D graphics virtual workstation AWS Direct Connect On-Premises HPC Resources Thin or Zero Client - No local data - Storage CacheAmazon S3 and Amazon Glacier Corporate datacenter On AWS, secure and well- optimized HPC clusters can be automatically created, operated, and torn down in just minutes AWS Snowball AWS Batch
  • 37. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. HPC automation with CfnCluster CfnCluster simplifies deployment of HPC in the cloud, including integrating with popular HPC schedulers Built on AWS CloudFormation, easy to modify to meet specific application or project requirements
  • 38. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. CfnCluster HPC stack Shared File Storage Cloud-based, auto scaling HPC cluster on EC2 Cluster head node with job scheduler Amazon S3 and Amazon Glacier
  • 39. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Fully managed No software to install or servers to manage. AWS Batch provisions, manages, and scales your infrastructure. Integrated with AWS Natively integrated with the AWS Platform, AWS Batch jobs can easily and securely interact with services such as Amazon S3, DynamoDB, and Amazon Rekognition. Cost-optimized resource provisioning AWS Batch automatically provisions compute resources tailored to the needs of your jobs using Amazon EC2 and EC2 Spot. AWS Batch for HPC workloads
  • 40. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Hybrid HPC U s i n g A m a z o n E C 2 S y s t e m s M a n a g e r Capabilities Run Command State Manager Inventory Maintenance Window Patch Manager Automation Parameter Store Documents AWS cloud corporate data center IT admin, DevOps engineer Role-based access control  Manage thousands of Windows and Linux nodes running on Amazon EC2 or on premises  Control user actions and scope with secure, granular access control  Safely execute changes with rate control to reduce blast radius  Audit every user action with change tracking
  • 41. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Graphics for HPC applications
  • 42. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Secure graphics and collaboration Cloud can be used for pre-and post processing as well as HPC  Use GPUs in the cloud for remote rendering and remote desktops Cloud is more secure for collaboration  Encrypt the data in flight and at rest  Manage your own keys and credentials  Deliver pixels to your collaborators, not the actual data
  • 43. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. HPC workstations in the cloud E C 2 + E l a s t i c G P U Small GPU : : Large GPU Attach Elastic GPU to an instance at launch, similar to attaching an EBS volume M4 X1 T2 R4 C5 C4 F1 P2
  • 44. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Elastic GPU for HPC applications Wide range of available instance types depending on application needs, for example, high-memory Graphics attachment in any of a range of sizes depending on size of the project or model More cost-effective, higher performance than a one-size-fits-all technical workstation
  • 45. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Desktop application streaming wi th Amazon AppStream 2.0 Stream desktop applications securely to any web browser Pay as you go Scale globally Secure apps and dataRun desktop apps in a web browser
  • 46. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. AppStream 2.0 graphics support  Multiple Instance types—including General Purpose, Compute Optimized, Memory Optimized, Graphics Design, Graphics Pro, and Graphics Desktop  Always-On or On-Demand pricing models  Support for OpenGL, DirectX, OpenCL and CUDA
  • 47. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Summary  HPC encompasses a wide range of applications—if you need more computing to solve a problem than one desktop or server, you need HPC  HPC on AWS provides flexibility, speed of deployment, automation, and scale  Graphics is key for delivering and viewing HPC results— HPC on AWS is not just about batch processing
  • 48. © 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Thank you! d p e l l e r i @ a m a z o n . c o m C M P 2 0 7