SlideShare a Scribd company logo
1 of 49
Accelerating Time to Science:
Transforming Research in the Cloud
Jamie Kinney - @jamiekinney
Director of Scientific Computing, a.k.a. “SciCo” – Amazon Web Services
Why Do Researchers Love AWS?
Time to Science
Access research
infrastructure in minutes
Low Cost
Pay-as-you-go pricing
Elastic
Easily add or remove capacity
Globally Accessible
Easily Collaborate with
researchers around the world
Secure
A collection of tools to
protect data and privacy
Scalable
Access to effectively
limitless capacity
Why does Amazon care about Scientific Computing?
• In order to fundamentally accelerate the pace of scientific discovery
• It is a great application of AWS with a broad customer base
• The scientific community helps us innovate on behalf of all customers
– Streaming data processing & analytics
– Exabyte scale data management solutions and exaflop scale compute
– Collaborative research tools and techniques
– New AWS regions
– Significant advances in low-power compute, storage and data centers
– Identify efficiencies which will lower our costs and therefore reduce pricing
for all AWS customers
How is AWS Used for Scientific Computing?
• High Throughput Computing (HTC) for Data-Intensive Analytics
• High Performance Computing (HPC) for Engineering and Simulation
• Collaborative Research Environments
• Hybrid Supercomputing Centers
• Science-as-a-Service
• Citizen Science
Research Grants
AWS provides free usage
credits to help researchers:
• Teach advanced courses
• Explore new projects
• Create resources for the
scientific community
aws.amazon.com/grants
Amazon Public Data Sets
AWS hosts “gold standard” reference
data at our expense in order to
catalyze rapid innovation and
increased AWS adoption
A few examples:
1,000 Genomes ~250 TB
Common Crawl
OpenStreetMap
Actively Developing…
Cancer Genomics Data Sets ~2-6 PB
SKA Precursor Data 1PB+
Public Data Sets
Nepal
Earthquake
Individuals
around the world
are analyzing
before/after
imagery of
Kathmandu
in order to more-
effectively direct
emergency
response and
recovery efforts
Peering with all global research networks
Image courtesy John Hover - Brookhaven National Lab
AWS Egress Waiver for Research & Education
Timeline:
• 2013: Initial trial in Australia for users connecting via AARNET and AAPT
• 2014: Extended the waiver to include ESNET and Internet2
• 2015: Extending support to other major NRENs
Terms:
• AWS waives egress fees up to 15% of total AWS bill, customers are responsible for
anything above this amount
• Majority of traffic must transit via NREN with no transit costs
• 15% waiver applies to aggregate usage when consolidated billing is used
• Does not apply to workloads for which egress is the service we are providing (e.g. live video
streaming, MOOCs, Web Hosting, etc…)
• Available regardless of AWS procurement method (i.e. direct purchase or Internet2 Net+)
Contact us if you would like to sign up!
Breaking news! Restricted-access genomics on
AWS
aws.amazon.com/genomics
Data-Intensive Computing
The Square Kilometer Array will link 250,000 radio telescopes
together, creating the world’s most sensitive telescope. The SKA will
generate zettabytes of raw data, publishing exabytes annually over
30-40 years.
Researchers are using AWS to develop and test:
• Data processing pipelines
• Image visualization tools
• Exabyte-scale research data management
• Collaborative research environments
www.skatelescope.org/ska-aws-astrocompute-call-for-proposals/
Astrocompute in the Cloud Program
• AWS is adding 1PB of SKA pre-cursor data to the
Amazon Public Data Sets program
• We are also providing $500K in AWS Research
Grants for the SKA to direct towards projects
focused on:
– High-throughput data analysis
– Image analysis algorithms
– Data mining discoveries (i.e. ML, CV and
data compression)
– Exascale data management techniques
– Collaborative research enablement
https://www.skatelescope.org/ska-aws-astrocompute-call-for-proposals/
Schrodinger & Cycle Computing:
Computational Chemistry for Better Solar Power
Simulation by Mark Thompson of the
University of Southern California to see
which of 205,000 organic compounds
could be used for photovoltaic cells for
solar panel material.
Estimated computation time 264 years
completed in 18 hours.
• 156,314 core cluster, 8 regions
• 1.21 petaFLOPS (Rpeak)
• $33,000 or 16¢ per molecule
Loosely
Coupled
Some Core AWS Concepts
Region
• Geographic area where
AWS services are
available
• Customers choose
region(s) for their AWS
resources
• Eleven regions worldwide
AZ
AZ
AZ AZ AZ
Transit
Transit
Availability Zone (AZ)
• Each region has multiple,
isolated locations known as
Availability Zones
• Low-latency links between AZs
in a region <2ms, usually <1ms
• When launching an EC2
instance, a customer chooses
an AZ
• Private AWS fiber links
interconnect all major regions
AVAILABILITY ZONE 3
EC2
AVAILABILITY ZONE 2
AVAILABILITY ZONE 1
EC2
EC2
EC2
REGION
Example AWS Availability Zone
AZ
AZ
AZ AZ AZ
Transit
Transit
Example AWS Data Center
Virtual Private Cloud (VPC)
• Logically isolated section of
the AWS cloud, virtual
network defined by the
customer
• When launching instances
and other resources,
customers place them in a
VPC
• All new customers have a
default VPC
AVAILABILITY ZONE 1
REGION
AVAILABILITY ZONE 2
AVAILABILITY ZONE 3
VPC
EC2
EC2
EC2
EC2
Spot Fleet
What is Spot?
• Name your own price for EC2 Compute
– A market where price of compute changes
based upon Supply and Demand
– When Bid Price exceeds Spot Market Price,
instance is launched
– Instance is terminated (with 2 minute
warning) if market price exceeds bid price
• Where does capacity come from?
– Unused EC2 Instances
m1.small
On Demand
Price:
$0.044/hr
c3.8xlarge
On Demand
Price:
$1.68/hr
cc2.8xlarge
On Demand
Price:
$2.00/hr
Spot allows customers to run workloads
that they would likely not run anywhere
else..
But today, to be successful in Spot
requires a little bit of additional effort
UNDIFFERENTIATED
HEAVY LIFTING
The Spot Experience today
• Build stateless, distributed, scalable applications
• Choose which instance types fit your workload the best
• Ingest price feed data for AZs and regions
• Make run time decisions on which Spot pools to launch in
based on price and volatility
• Manage interruptions
• Monitor and manage market prices across Azs and
instance types
• Manage the capacity footprint in the fleet
• And all of this while you don’t know where the capacity is
• Serve your customers
Making Spot Fleet Requests
• Simply specify:
– Target Capacity – The number of EC2 instances that
you want in your fleet.
– Maximum Bid Price – The maximum bid price that
you are willing to pay.
– Launch Specifications – # of and types of
instances, AMI id, VPC, subnets or AZs, etc.
– IAM Fleet Role – The name of an IAM role. It must
allow EC2 to terminate instances on your behalf.
Spot Fleet
• Will attempt to reach the desired target capacity
given the choices that were given
• Manage the capacity even as Spot prices
change
• Launch using launch specifications provided
Using Spot Fleet
• Create EC2 Spot Fleet IAM Role
• Requesting a fleet:
– aws ec2 request-spot-fleet --spot-fleet-request-config
file://mySmallFleet.json
• Describe fleet:
– aws ec2 describe-spot-fleet-requests
– aws ec2 describe-spot-fleet-requests --spot-fleet-request-ids <sfr-
………..>
• Describe instances within the fleet
– aws ec2 describe-spot-fleet-instances --spot-fleet-request-id <sfr-
…………>
• Cancel Spot Fleet (with termination):
– aws ec2 cancel-spot-fleet-requests --spot-fleet-request-ids <sfr-
…………..> -terminate-instances
http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/spot-fleet.html
mySpotFleet.json
{
"SpotPrice": "0.50",
"TargetCapacity": 20,
"IamFleetRole": "arn:aws:iam::123456789012:role/my-spot-fleet-role",
"LaunchSpecifications": [
{
"ImageId": "ami-1a2b3c4d",
"InstanceType": "cc2.8xlarge",
"SubnetId": "subnet-a61dafcf"
},
{
"ImageId": "ami-1a2b3c4d",
"InstanceType": "r3.8xlarge",
"SubnetId": "subnet-a61dafcf"
}
]
}
Elastic File System
The AWS storage portfolio
Amazon S3
• Object storage: data presented as buckets of objects
• Data access via APIs over the Internet
Amazon
EFS
• File storage (analogous to NAS): data presented as a file system
• Shared low-latency access from multiple EC2 instances
Amazon
Elastic Block
Store
• Block storage (analogous to SAN): data presented as disk volumes
• Lowest-latency access from single Amazon EC2 instances
Amazon
Glacier
• Archival storage: data presented as vaults/archives of objects
• Lowest-cost storage, infrequent access via APIs over the Internet
Amazon Elastic File System
• Fully managed file system for EC2 instances
• Provides standard file system semantics
• Works with standard operating system APIs
• Sharable across thousands of instances
• Elastically grows to petabyte scale
• Delivers performance for a wide variety of workloads
• Highly available and durable
• NFS v4–based
EFS is designed for a broad range of use cases,
such as…
• Content repositories
• Development environments
• Home directories
• Big data
Amazon Elastic Container Service
+
Key Components
Docker Daemon
Task Definitions
Containers
Clusters
Container Instances
Typical User Workflow
I have a Docker
image, and I want to
run the image on a
cluster
Typical User Workflow
Push Image(s)
Typical User Workflow
Create Task Definition Amazon ECS
Declare resource
requirements
Typical User Workflow
Run Instances EC2
Use custom AMI with
Docker support and
ECS Agent. Instances
will register with
default cluster.
Typical User Workflow
Describe Cluster Amazon ECS
Get information about
cluster state and
available resources
Typical User Workflow
Run Task Amazon ECS
Using the task definition
created above
Typical User Workflow
Amazon ECS
Describe Cluster
Get information about
cluster state and
running containers
Thank you!
Jamie Kinney
jkinney@amazon.com
@jamiekinney
Additional resources…
• aws.amazon.com/big-data
• aws.amazon.com/compliance
• aws.amazon.com/datasets
• aws.amazon.com/grants
• aws.amazon.com/genomics
• aws.amazon.com/hpc
• aws.amazon.com/security

More Related Content

What's hot

Introduction to amazon web services for developers
Introduction to amazon web services for developersIntroduction to amazon web services for developers
Introduction to amazon web services for developersCiklum Ukraine
 
AWS Architecting for the Cloud - matt tavis
AWS Architecting for the Cloud - matt tavisAWS Architecting for the Cloud - matt tavis
AWS Architecting for the Cloud - matt tavisAmazon Web Services
 
Introduction to Amazon Web Services
Introduction to Amazon Web ServicesIntroduction to Amazon Web Services
Introduction to Amazon Web ServicesJames Armes
 
AWS basics
AWS basicsAWS basics
AWS basicsmbaric
 
Webinar aws 101 a walk through the aws cloud- introduction to cloud computi...
Webinar aws 101   a walk through the aws cloud- introduction to cloud computi...Webinar aws 101   a walk through the aws cloud- introduction to cloud computi...
Webinar aws 101 a walk through the aws cloud- introduction to cloud computi...Amazon Web Services
 
Aws 101 A walk-through the aws cloud (2013)
Aws 101  A walk-through the aws cloud (2013)Aws 101  A walk-through the aws cloud (2013)
Aws 101 A walk-through the aws cloud (2013)Martin Yan
 
AWS 101, London - September 2014
AWS 101, London - September 2014AWS 101, London - September 2014
AWS 101, London - September 2014Ian Massingham
 
Introduction to Amazon Web Services
Introduction to Amazon Web ServicesIntroduction to Amazon Web Services
Introduction to Amazon Web ServicesAmazon Web Services
 
Overview of AWS by Andy Jassy - SVP, AWS
Overview of AWS by Andy Jassy - SVP, AWSOverview of AWS by Andy Jassy - SVP, AWS
Overview of AWS by Andy Jassy - SVP, AWSAmazon Web Services
 
gkkAwscloudpractitioneressentialstraining
gkkAwscloudpractitioneressentialstraininggkkAwscloudpractitioneressentialstraining
gkkAwscloudpractitioneressentialstrainingAnne Starr
 
AWS 101 and the benefits of Migrating to the Cloud
AWS 101 and the benefits of Migrating to the CloudAWS 101 and the benefits of Migrating to the Cloud
AWS 101 and the benefits of Migrating to the CloudCloudHesive
 
Aws seminar report
Aws seminar report Aws seminar report
Aws seminar report Rahul Kumar
 
Cloud Architecture Tutorial - Why and What (1of 3)
Cloud Architecture Tutorial - Why and What (1of 3) Cloud Architecture Tutorial - Why and What (1of 3)
Cloud Architecture Tutorial - Why and What (1of 3) Adrian Cockcroft
 
Aberdeen Oil & Gas Event - Introduction to the AWS Cloud
Aberdeen Oil & Gas Event - Introduction to the AWS CloudAberdeen Oil & Gas Event - Introduction to the AWS Cloud
Aberdeen Oil & Gas Event - Introduction to the AWS CloudAmazon Web Services
 

What's hot (20)

Introduction to amazon web services for developers
Introduction to amazon web services for developersIntroduction to amazon web services for developers
Introduction to amazon web services for developers
 
AWS Architecting for the Cloud - matt tavis
AWS Architecting for the Cloud - matt tavisAWS Architecting for the Cloud - matt tavis
AWS Architecting for the Cloud - matt tavis
 
Aws ppt
Aws pptAws ppt
Aws ppt
 
Introduction to Amazon Web Services
Introduction to Amazon Web ServicesIntroduction to Amazon Web Services
Introduction to Amazon Web Services
 
AWS basics
AWS basicsAWS basics
AWS basics
 
Webinar aws 101 a walk through the aws cloud- introduction to cloud computi...
Webinar aws 101   a walk through the aws cloud- introduction to cloud computi...Webinar aws 101   a walk through the aws cloud- introduction to cloud computi...
Webinar aws 101 a walk through the aws cloud- introduction to cloud computi...
 
Aws 101 A walk-through the aws cloud (2013)
Aws 101  A walk-through the aws cloud (2013)Aws 101  A walk-through the aws cloud (2013)
Aws 101 A walk-through the aws cloud (2013)
 
AWS 101, London - September 2014
AWS 101, London - September 2014AWS 101, London - September 2014
AWS 101, London - September 2014
 
Aws overview
Aws overviewAws overview
Aws overview
 
AWS Overview
AWS Overview AWS Overview
AWS Overview
 
Introduction to Amazon Web Services
Introduction to Amazon Web ServicesIntroduction to Amazon Web Services
Introduction to Amazon Web Services
 
Overview of AWS by Andy Jassy - SVP, AWS
Overview of AWS by Andy Jassy - SVP, AWSOverview of AWS by Andy Jassy - SVP, AWS
Overview of AWS by Andy Jassy - SVP, AWS
 
Getting Started on AWS
Getting Started on AWSGetting Started on AWS
Getting Started on AWS
 
AWS
AWSAWS
AWS
 
gkkAwscloudpractitioneressentialstraining
gkkAwscloudpractitioneressentialstraininggkkAwscloudpractitioneressentialstraining
gkkAwscloudpractitioneressentialstraining
 
AWS 101 and the benefits of Migrating to the Cloud
AWS 101 and the benefits of Migrating to the CloudAWS 101 and the benefits of Migrating to the Cloud
AWS 101 and the benefits of Migrating to the Cloud
 
Aws seminar report
Aws seminar report Aws seminar report
Aws seminar report
 
Aws overview
Aws overviewAws overview
Aws overview
 
Cloud Architecture Tutorial - Why and What (1of 3)
Cloud Architecture Tutorial - Why and What (1of 3) Cloud Architecture Tutorial - Why and What (1of 3)
Cloud Architecture Tutorial - Why and What (1of 3)
 
Aberdeen Oil & Gas Event - Introduction to the AWS Cloud
Aberdeen Oil & Gas Event - Introduction to the AWS CloudAberdeen Oil & Gas Event - Introduction to the AWS Cloud
Aberdeen Oil & Gas Event - Introduction to the AWS Cloud
 

Similar to Kinney j aws

Building a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for AnalystsBuilding a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for AnalystsAvere Systems
 
Aws Architecture Fundamentals
Aws Architecture FundamentalsAws Architecture Fundamentals
Aws Architecture Fundamentals2nd Watch
 
Scaling on AWS for the First 10 Million Users
Scaling on AWS for the First 10 Million UsersScaling on AWS for the First 10 Million Users
Scaling on AWS for the First 10 Million UsersAmazon Web Services
 
Backup and archiving in the aws cloud
Backup and archiving in the aws cloudBackup and archiving in the aws cloud
Backup and archiving in the aws cloudAmazon Web Services
 
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...Amazon Web Services
 
Aws webcast - Scaling on AWS 13 08-20
Aws webcast - Scaling on AWS 13 08-20Aws webcast - Scaling on AWS 13 08-20
Aws webcast - Scaling on AWS 13 08-20Amazon Web Services
 
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...Amazon Web Services
 
AWS Webcast - Library Systems on the AWS Cloud
AWS Webcast - Library Systems on the AWS CloudAWS Webcast - Library Systems on the AWS Cloud
AWS Webcast - Library Systems on the AWS CloudAmazon Web Services
 
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...Amazon Web Services
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computingiasaglobal
 
Cloud Overview
Cloud OverviewCloud Overview
Cloud Overviewiasaglobal
 
Migrating enterprise workloads to AWS
Migrating enterprise workloads to AWSMigrating enterprise workloads to AWS
Migrating enterprise workloads to AWSTom Laszewski
 
High Performance Computing with AWS
High Performance Computing with AWSHigh Performance Computing with AWS
High Performance Computing with AWSAmazon Web Services
 
#lspe Q1 2013 dynamically scaling netflix in the cloud
#lspe Q1 2013   dynamically scaling netflix in the cloud#lspe Q1 2013   dynamically scaling netflix in the cloud
#lspe Q1 2013 dynamically scaling netflix in the cloudCoburn Watson
 
T1 – Architecting highly available applications on aws
T1 – Architecting highly available applications on awsT1 – Architecting highly available applications on aws
T1 – Architecting highly available applications on awsAmazon Web Services
 
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...Amazon Web Services
 
Cloud computing aws -key services
Cloud computing  aws -key servicesCloud computing  aws -key services
Cloud computing aws -key servicesSelvaraj Kesavan
 

Similar to Kinney j aws (20)

Building a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for AnalystsBuilding a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for Analysts
 
Aws Architecture Fundamentals
Aws Architecture FundamentalsAws Architecture Fundamentals
Aws Architecture Fundamentals
 
Scaling on AWS for the First 10 Million Users
Scaling on AWS for the First 10 Million UsersScaling on AWS for the First 10 Million Users
Scaling on AWS for the First 10 Million Users
 
Backup and archiving in the aws cloud
Backup and archiving in the aws cloudBackup and archiving in the aws cloud
Backup and archiving in the aws cloud
 
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
 
Aws webcast - Scaling on AWS 13 08-20
Aws webcast - Scaling on AWS 13 08-20Aws webcast - Scaling on AWS 13 08-20
Aws webcast - Scaling on AWS 13 08-20
 
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
AWS re:Invent 2016: Building HPC Clusters as Code in the (Almost) Infinite Cl...
 
AWS Webcast - Library Systems on the AWS Cloud
AWS Webcast - Library Systems on the AWS CloudAWS Webcast - Library Systems on the AWS Cloud
AWS Webcast - Library Systems on the AWS Cloud
 
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
 
AWS_Manivannan.pptx
AWS_Manivannan.pptxAWS_Manivannan.pptx
AWS_Manivannan.pptx
 
Cloud Computing
Cloud ComputingCloud Computing
Cloud Computing
 
Cloud Overview
Cloud OverviewCloud Overview
Cloud Overview
 
AWS Storage and Edge Processing
AWS Storage and Edge ProcessingAWS Storage and Edge Processing
AWS Storage and Edge Processing
 
Migrating enterprise workloads to AWS
Migrating enterprise workloads to AWSMigrating enterprise workloads to AWS
Migrating enterprise workloads to AWS
 
AWS-services.pdf
AWS-services.pdfAWS-services.pdf
AWS-services.pdf
 
High Performance Computing with AWS
High Performance Computing with AWSHigh Performance Computing with AWS
High Performance Computing with AWS
 
#lspe Q1 2013 dynamically scaling netflix in the cloud
#lspe Q1 2013   dynamically scaling netflix in the cloud#lspe Q1 2013   dynamically scaling netflix in the cloud
#lspe Q1 2013 dynamically scaling netflix in the cloud
 
T1 – Architecting highly available applications on aws
T1 – Architecting highly available applications on awsT1 – Architecting highly available applications on aws
T1 – Architecting highly available applications on aws
 
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
Overview of AWS Services for Data Storage and Migration - SRV205 - Anaheim AW...
 
Cloud computing aws -key services
Cloud computing  aws -key servicesCloud computing  aws -key services
Cloud computing aws -key services
 

Recently uploaded

Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AIabhishek36461
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxJoão Esperancinha
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024hassan khalil
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx959SahilShah
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSCAESB
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptSAURABHKUMAR892774
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...Chandu841456
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvLewisJB
 
EduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AIEduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AIkoyaldeepu123
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)Dr SOUNDIRARAJ N
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfROCENODodongVILLACER
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxPoojaBan
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .Satyam Kumar
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEroselinkalist12
 

Recently uploaded (20)

Past, Present and Future of Generative AI
Past, Present and Future of Generative AIPast, Present and Future of Generative AI
Past, Present and Future of Generative AI
 
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptxDecoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
Decoding Kotlin - Your guide to solving the mysterious in Kotlin.pptx
 
Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024Architect Hassan Khalil Portfolio for 2024
Architect Hassan Khalil Portfolio for 2024
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Application of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptxApplication of Residue Theorem to evaluate real integrations.pptx
Application of Residue Theorem to evaluate real integrations.pptx
 
GDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentationGDSC ASEB Gen AI study jams presentation
GDSC ASEB Gen AI study jams presentation
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Arduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.pptArduino_CSE ece ppt for working and principal of arduino.ppt
Arduino_CSE ece ppt for working and principal of arduino.ppt
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...An experimental study in using natural admixture as an alternative for chemic...
An experimental study in using natural admixture as an alternative for chemic...
 
POWER SYSTEMS-1 Complete notes examples
POWER SYSTEMS-1 Complete notes  examplesPOWER SYSTEMS-1 Complete notes  examples
POWER SYSTEMS-1 Complete notes examples
 
Work Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvvWork Experience-Dalton Park.pptxfvvvvvvv
Work Experience-Dalton Park.pptxfvvvvvvv
 
EduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AIEduAI - E learning Platform integrated with AI
EduAI - E learning Platform integrated with AI
 
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
UNIT III ANALOG ELECTRONICS (BASIC ELECTRONICS)
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Risk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdfRisk Assessment For Installation of Drainage Pipes.pdf
Risk Assessment For Installation of Drainage Pipes.pdf
 
Heart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptxHeart Disease Prediction using machine learning.pptx
Heart Disease Prediction using machine learning.pptx
 
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
VICTOR MAESTRE RAMIREZ - Planetary Defender on NASA's Double Asteroid Redirec...
 
Churning of Butter, Factors affecting .
Churning of Butter, Factors affecting  .Churning of Butter, Factors affecting  .
Churning of Butter, Factors affecting .
 
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETEINFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
INFLUENCE OF NANOSILICA ON THE PROPERTIES OF CONCRETE
 

Kinney j aws

  • 1. Accelerating Time to Science: Transforming Research in the Cloud Jamie Kinney - @jamiekinney Director of Scientific Computing, a.k.a. “SciCo” – Amazon Web Services
  • 2. Why Do Researchers Love AWS? Time to Science Access research infrastructure in minutes Low Cost Pay-as-you-go pricing Elastic Easily add or remove capacity Globally Accessible Easily Collaborate with researchers around the world Secure A collection of tools to protect data and privacy Scalable Access to effectively limitless capacity
  • 3. Why does Amazon care about Scientific Computing? • In order to fundamentally accelerate the pace of scientific discovery • It is a great application of AWS with a broad customer base • The scientific community helps us innovate on behalf of all customers – Streaming data processing & analytics – Exabyte scale data management solutions and exaflop scale compute – Collaborative research tools and techniques – New AWS regions – Significant advances in low-power compute, storage and data centers – Identify efficiencies which will lower our costs and therefore reduce pricing for all AWS customers
  • 4. How is AWS Used for Scientific Computing? • High Throughput Computing (HTC) for Data-Intensive Analytics • High Performance Computing (HPC) for Engineering and Simulation • Collaborative Research Environments • Hybrid Supercomputing Centers • Science-as-a-Service • Citizen Science
  • 5. Research Grants AWS provides free usage credits to help researchers: • Teach advanced courses • Explore new projects • Create resources for the scientific community aws.amazon.com/grants
  • 6.
  • 7.
  • 9. AWS hosts “gold standard” reference data at our expense in order to catalyze rapid innovation and increased AWS adoption A few examples: 1,000 Genomes ~250 TB Common Crawl OpenStreetMap Actively Developing… Cancer Genomics Data Sets ~2-6 PB SKA Precursor Data 1PB+ Public Data Sets
  • 10. Nepal Earthquake Individuals around the world are analyzing before/after imagery of Kathmandu in order to more- effectively direct emergency response and recovery efforts
  • 11. Peering with all global research networks Image courtesy John Hover - Brookhaven National Lab
  • 12. AWS Egress Waiver for Research & Education Timeline: • 2013: Initial trial in Australia for users connecting via AARNET and AAPT • 2014: Extended the waiver to include ESNET and Internet2 • 2015: Extending support to other major NRENs Terms: • AWS waives egress fees up to 15% of total AWS bill, customers are responsible for anything above this amount • Majority of traffic must transit via NREN with no transit costs • 15% waiver applies to aggregate usage when consolidated billing is used • Does not apply to workloads for which egress is the service we are providing (e.g. live video streaming, MOOCs, Web Hosting, etc…) • Available regardless of AWS procurement method (i.e. direct purchase or Internet2 Net+) Contact us if you would like to sign up!
  • 13. Breaking news! Restricted-access genomics on AWS aws.amazon.com/genomics
  • 14. Data-Intensive Computing The Square Kilometer Array will link 250,000 radio telescopes together, creating the world’s most sensitive telescope. The SKA will generate zettabytes of raw data, publishing exabytes annually over 30-40 years. Researchers are using AWS to develop and test: • Data processing pipelines • Image visualization tools • Exabyte-scale research data management • Collaborative research environments www.skatelescope.org/ska-aws-astrocompute-call-for-proposals/
  • 15. Astrocompute in the Cloud Program • AWS is adding 1PB of SKA pre-cursor data to the Amazon Public Data Sets program • We are also providing $500K in AWS Research Grants for the SKA to direct towards projects focused on: – High-throughput data analysis – Image analysis algorithms – Data mining discoveries (i.e. ML, CV and data compression) – Exascale data management techniques – Collaborative research enablement https://www.skatelescope.org/ska-aws-astrocompute-call-for-proposals/
  • 16. Schrodinger & Cycle Computing: Computational Chemistry for Better Solar Power Simulation by Mark Thompson of the University of Southern California to see which of 205,000 organic compounds could be used for photovoltaic cells for solar panel material. Estimated computation time 264 years completed in 18 hours. • 156,314 core cluster, 8 regions • 1.21 petaFLOPS (Rpeak) • $33,000 or 16¢ per molecule Loosely Coupled
  • 17. Some Core AWS Concepts
  • 18. Region • Geographic area where AWS services are available • Customers choose region(s) for their AWS resources • Eleven regions worldwide AZ AZ AZ AZ AZ Transit Transit
  • 19. Availability Zone (AZ) • Each region has multiple, isolated locations known as Availability Zones • Low-latency links between AZs in a region <2ms, usually <1ms • When launching an EC2 instance, a customer chooses an AZ • Private AWS fiber links interconnect all major regions AVAILABILITY ZONE 3 EC2 AVAILABILITY ZONE 2 AVAILABILITY ZONE 1 EC2 EC2 EC2 REGION
  • 20. Example AWS Availability Zone AZ AZ AZ AZ AZ Transit Transit
  • 22. Virtual Private Cloud (VPC) • Logically isolated section of the AWS cloud, virtual network defined by the customer • When launching instances and other resources, customers place them in a VPC • All new customers have a default VPC AVAILABILITY ZONE 1 REGION AVAILABILITY ZONE 2 AVAILABILITY ZONE 3 VPC EC2 EC2 EC2 EC2
  • 24. What is Spot? • Name your own price for EC2 Compute – A market where price of compute changes based upon Supply and Demand – When Bid Price exceeds Spot Market Price, instance is launched – Instance is terminated (with 2 minute warning) if market price exceeds bid price • Where does capacity come from? – Unused EC2 Instances
  • 28.
  • 29. Spot allows customers to run workloads that they would likely not run anywhere else.. But today, to be successful in Spot requires a little bit of additional effort
  • 30. UNDIFFERENTIATED HEAVY LIFTING The Spot Experience today • Build stateless, distributed, scalable applications • Choose which instance types fit your workload the best • Ingest price feed data for AZs and regions • Make run time decisions on which Spot pools to launch in based on price and volatility • Manage interruptions • Monitor and manage market prices across Azs and instance types • Manage the capacity footprint in the fleet • And all of this while you don’t know where the capacity is • Serve your customers
  • 31. Making Spot Fleet Requests • Simply specify: – Target Capacity – The number of EC2 instances that you want in your fleet. – Maximum Bid Price – The maximum bid price that you are willing to pay. – Launch Specifications – # of and types of instances, AMI id, VPC, subnets or AZs, etc. – IAM Fleet Role – The name of an IAM role. It must allow EC2 to terminate instances on your behalf.
  • 32. Spot Fleet • Will attempt to reach the desired target capacity given the choices that were given • Manage the capacity even as Spot prices change • Launch using launch specifications provided
  • 33. Using Spot Fleet • Create EC2 Spot Fleet IAM Role • Requesting a fleet: – aws ec2 request-spot-fleet --spot-fleet-request-config file://mySmallFleet.json • Describe fleet: – aws ec2 describe-spot-fleet-requests – aws ec2 describe-spot-fleet-requests --spot-fleet-request-ids <sfr- ………..> • Describe instances within the fleet – aws ec2 describe-spot-fleet-instances --spot-fleet-request-id <sfr- …………> • Cancel Spot Fleet (with termination): – aws ec2 cancel-spot-fleet-requests --spot-fleet-request-ids <sfr- …………..> -terminate-instances http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/spot-fleet.html
  • 34. mySpotFleet.json { "SpotPrice": "0.50", "TargetCapacity": 20, "IamFleetRole": "arn:aws:iam::123456789012:role/my-spot-fleet-role", "LaunchSpecifications": [ { "ImageId": "ami-1a2b3c4d", "InstanceType": "cc2.8xlarge", "SubnetId": "subnet-a61dafcf" }, { "ImageId": "ami-1a2b3c4d", "InstanceType": "r3.8xlarge", "SubnetId": "subnet-a61dafcf" } ] }
  • 36. The AWS storage portfolio Amazon S3 • Object storage: data presented as buckets of objects • Data access via APIs over the Internet Amazon EFS • File storage (analogous to NAS): data presented as a file system • Shared low-latency access from multiple EC2 instances Amazon Elastic Block Store • Block storage (analogous to SAN): data presented as disk volumes • Lowest-latency access from single Amazon EC2 instances Amazon Glacier • Archival storage: data presented as vaults/archives of objects • Lowest-cost storage, infrequent access via APIs over the Internet
  • 37. Amazon Elastic File System • Fully managed file system for EC2 instances • Provides standard file system semantics • Works with standard operating system APIs • Sharable across thousands of instances • Elastically grows to petabyte scale • Delivers performance for a wide variety of workloads • Highly available and durable • NFS v4–based
  • 38. EFS is designed for a broad range of use cases, such as… • Content repositories • Development environments • Home directories • Big data
  • 40. Key Components Docker Daemon Task Definitions Containers Clusters Container Instances
  • 41. Typical User Workflow I have a Docker image, and I want to run the image on a cluster
  • 43. Typical User Workflow Create Task Definition Amazon ECS Declare resource requirements
  • 44. Typical User Workflow Run Instances EC2 Use custom AMI with Docker support and ECS Agent. Instances will register with default cluster.
  • 45. Typical User Workflow Describe Cluster Amazon ECS Get information about cluster state and available resources
  • 46. Typical User Workflow Run Task Amazon ECS Using the task definition created above
  • 47. Typical User Workflow Amazon ECS Describe Cluster Get information about cluster state and running containers
  • 49. Additional resources… • aws.amazon.com/big-data • aws.amazon.com/compliance • aws.amazon.com/datasets • aws.amazon.com/grants • aws.amazon.com/genomics • aws.amazon.com/hpc • aws.amazon.com/security

Editor's Notes

  1. Four main reasons why Amazon EMR
  2. http://news.cnet.com/8301-1001_3-57611919-92/supercomputing-simulation-employs-156000-amazon-processor-cores http://blog.cyclecomputing.com/2013/11/back-to-the-future-121-petaflopsrpeak-156000-core-cyclecloud-hpc-runs-264-years-of-materials-science.html Computational compound analysis Solar panel material Estimated computation time 264 years
  3. [After first bullet point]: Each region designed to be completely isolated from other regions – this achieves the greatest possible fault tolerance and stability [After second bullet point]: So when you launch an EC2 instance, you select the region it should be in. Customers can select a region based on, e.g., latency requirements or legal requirements
  4. [After first bullet point]: Really a logical group of data centers [After third bullet point]: Note that some customers distribute their instances across multiple AZs to have extremely high fault tolerance. We have 28 availability zones worldwide
  5. [At end]: Can launch resources in default VPC or another one that you’ve created
  6. Points to touch upon:- - Different auction pools - around 50 instance types - 28 availability zones - Linux/Windows Operating System Lots of capacity to pick from Be flexible to be successful with Spot Look across instance types. E.g., *.8xlarge may be cheaper than on-demand *.4xlarge Look across instance families e.g. r3.8xlarge is the same vCPUs as c3.8xlarge but with more memory Look across AZs – may not be possible for all workloads Look across regions – may not be possible for all workloads - Why’s price higher than on-demand
  7. Points to touch upon:- - Different auction pools - around 50 instance types - 28 availability zones - Linux/Windows Operating System Lots of capacity to pick from Be flexible to be successful with Spot Look across instance types. E.g., *.8xlarge may be cheaper than on-demand *.4xlarge Look across instance families e.g. r3.8xlarge is the same vCPUs as c3.8xlarge but with more memory Look across AZs – may not be possible for all workloads Look across regions – may not be possible for all workloads - Why’s price higher than on-demand
  8. And building upon the last slide.. twice the memory than r3.2xlarge Way more cores Still cheaper than on-demand price of r3.2xlarge And pretty stable market conditions Why aren’t customers leveraging this pool instead?
  9. Customers have to ingest this data and make their decisions at launch time and then throughout the time that their workload is running - which region/AZ to launch in Which instance(s) to launch Manage price throughout the workload is running Manage interruptions Build a lot of code to do all of this
  10. Launch a fleet of 5 instances.
  11. Let me start by talking about our current storage offerings, and how EFS enhances our set of offerings S3: Object storage An object is a piece of data, like a document/image/video that is stored with some metadata in a flat structure. Provides that data to applications via APIs over the Internet Super-simple to build for example a web application that delivers content to users by making API calls over the Internet. EBS: Block storage for EC2 instances Data presented to your instance as a disk volume Provides low-single-digit latency access to single EC2 instances Very popular for example for boot volumes and DBs Glacier Archival storage Our lowest-cost storage offering, intended for data that’s infrequently accessed And now we have EFS File storage, data presented via a file system to instances When attached to an instance it acts just like a local file system Can provide shared access to data to multiple EC2 instances, with low latencies
  12. So what are the features of EFS? [Read each bullet point, then add to each] Managed service – no hardware to maintain, so file software layer to maintain Standard file system semantics – you get what you’d expect from a file system. Read-after-write consistency, ability to have a hierarchical directory structure, file operations like appends, atomic renames, the ability to write to a particular block in the middle of a file Standard OS APIs – EFS appears like any other file system to your operating system. So applications that leverage standard OS APIs to work with files will work with EFS. Sharable – a common data source for applications and workloads across EC2 instances Elastically grows to PB scale – don’t specify a provisioned size upfront. You just create a file system, and it grows and shrinks automatically as you add and remove data. Performance for a wide variety of workloads – SSD-based. Low latencies, high throughput, high IOPS. Available/durable – Replicate data across AZs within a region. Means that your files are highly available, accessible from multiple AZs, and also well-protected from data loss. NFSv4 based – NFS is a network protocol for interacting with file systems. In the background, when you connect an instance to EFS, the data is sent over the network via NFS. Open standard, widely adopted. Included on virtually all Linux distributions.
  13. EFS is designed for a broad range of use cases – let me walk you through a few examples Content management systems store and serve information for a wide range of applications E.g., online publications, reference libraries. EFS: durable, high throughput backing store for these applications. Development and build environments often have hundreds of nodes accessing a common set of source files, binaries, and other resources. EFS: serve as the storage layer in these environments, providing a high level of IOPS to serve demanding development and test needs. Many organizations provide storage for individual and team use. E.g., research scientists commonly share data sets that each scientist may perform ad hoc analyses on, and these scientists also store files in their own home directories. EFS: an administrator can create a file system where parts of it are accessible to groups in an organization and parts are accessible to individuals. Big Data workloads often require file system operations and read-after-write consistency, as well as the ability to scale to large amounts of storage and throughput. EFS: file storage that EC2 clusters can work with directly for Big Data workloads
  14. The Docker daemon is a process that starts the Docker container on the instance. It is called by our ECSAgent to start, stop, and describe containers A Task Definition describes the components of your application such as the Docker containers you want to run, what resources they will use, and how they are linked together. A Container is the actual virtualized process that encapsulates your application you are running. A Service references how many Task Definitions you want to run and if you want to use an Elastic Load Balancer to route traffic to the containers. The Service then instantiates the requested number of Task Definitions into Tasks that consist of one or more Containers that run on Container Instances. A the Cluster is a logical grouping of Container Instances used to run the Tasks A Container Instance is an Amazon EC2 instance.
  15. The user has a Docker image that they want to deploy onto an ECS cluster
  16. The user first
  17. The user creates a Task Definition that specifies one or more containers required for the task, including the Docker repository and image, memory and CPU requirements, shared data volumes, and how the containers are linked to each other.
  18. The user spins up EC2 instances that is running the ECS agent and registers it with ECS
  19. User runs Describe Cluster command to get information about the cluster state and the available resources
  20. The user uses the Task Definition created earlier and runs the Run Task command to deploy the containers to the ECS cluster
  21. User calls Describe Cluster again and gets information about the cluster state and the running containers