SlideShare a Scribd company logo
1 of 47
AWS Government, Education, &
Nonprofits Symposium
Canberra, Australia | May 6, 2015
Time to Science, Time to Results. Accelerating
Scientific research in the Cloud
Brendan Bouffler (“boof”)
Scientific Computing Group, Amazon Web Services
AWS Global Impact Initiatives for Science
AWS Research Grants AWS Hosted Public Datasets
• Dedicated team focusing on
Scientific Computing &
Research workloads
• Globally focussed and engaged
in Big Science projects like the
SKA.
• Leveraging AWS resources all
over the world.
• Ensuring the cloud is able to
make a disruptive impact on
science.
AWS SciCo Team
• Grants to initiate & support development
of cloud-enabled technologies.
• Typically one-off grants of AWS
resources like EC2 (compute) or S3 &
EBS (storage) or more exotic like
Kinesis & twitter feeds.
• Frequently results in reusable
resources, like AMIs or open data,
which we strongly encourage.
• Lowers the risk to try the cloud.
• Large and globally significant datasets hosted
and paid for by AWS for community use.
• Data can be quickly and easily processed
with elastic computing resources in the
surrounding cloud.
• AWS hopes to enable more innovation, more
quickly.
• Provided in partnership with content owners,
who curate the data.
We are providing a grants pool of AWS credits and up to one
petabyte of storage for an AWS Public Data Set.
The data set will be initially provided by several of the SKA’s
precursor telescopes including CSIRO’s ASKAP, ICRAR’s MWA
in Australia, and KAT-7 (pathfinder to the SKA precursor
telescope Meerkat) in South Africa.
The grants are open to anyone who is making use of radio
astronomical telescopes or radio astronomical data resources
around the world.
The grants will be administered by the SKA. They will be looking
for innovative, cloud-based algorithms and tools that will be able
to handle and process this never ending data stream.
https://aws.amazon.com/blogs/aws/new-astrocompute-in-the-cloud-grants-program/
What the AWS is doing with SKA
$7B retail business
10,000 employees
A whole lot of servers
2006 2014
Every day, AWS adds enough
server capacity to power this
$7B enterprise
AWS Regions
Cray Supercomputer
Beowulf Cluster
HPC became an optimisation problem
A top 500 supercomputer
For less than $100/hr
Ready in 100 seconds
# CPUs
time
In theory …
(the spherical model of owning a cluster)
# CPUs
time
Empirical data …
You’re still paying for
this, but not using it.
Actual CPU usage
Meeeelions of uncoupled workloads
0
2
3
5
6
# CPUs
time
Spot Market
0.00
1.50
3.00
4.50
6.00
# CPUs
time
Spot Market
Our ultimate space
filler.
Spot Instances allow you
to name your own price for
spare AWS computing
capacity.
Great for workloads that
aren’t time sensitive, and
especially popular in
research (hint: it’s really
cheap).
Cloud Growth
0.00
1.50
3.00
4.50
6.00
# CPUs
time
Predictable growth
All of this makes it much
easier for AWS to predict
growth in aggregate
demand, and hence to
invest more to grow the
cloud.
As a result, we’re
expanding the cloud all
the time, ready for more
workload.
Time traveling workloads
# CPUs
time
# CPUs
time
Wall clock time: 1 hour Wall clock time: 1 week
Cost: equal
The Solution
When you only pay for what you use …
• If you’re only able to use your compute, say, 30%
of the time, you only pay for that time.
1Pocket the savings
• Buy chocolate
• Buy a spectrometer
• Hire a research assistant.
2 Go faster
• Use 3x the cores to
run your jobs at 3x
the speed.
3 Go Large
• Do 3x the science,
or consume 3x the
data.
… you have options.
Why do researchers love using AWS?
Time to Science
Access research
infrastructure in minutes
Low Cost
Pay-as-you-go pricing
Elastic
Easily add or remove
capacity
Globally Accessible
Easily Collaborate with
researchers around the world
Secure
A collection of tools to
protect data and privacy
Scalable
Access to effectively
limitless capacity
Collaboration is easier in the cloud
More time spent computing the data than moving the data.
Public Datasets
Cloud resources for Scientific Workloads
Instances
http://aws.amazon.com/ec2/instance-types/
There’s a couple
dozen EC2
compute instance
types alone, each
of which is
optimised for
different things.
One size does not
fit all.
C4
Intel Xeon E5-2666 v3, custom built for AWS.
Intel Haswell, 16 FLOPS/tick
2.9 GHz, turbo to 3.5 GHz
http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/c4-instances.html
Feature Specification
Processor Number E5-2666 v3
Intel® Smart Cache 25 MiB
Instruction Set 64-bit
Instruction Set Extensions AVX 2.0
Lithography 22 nm
Processor Base Frequency 2.9 GHz
Max All Core Turbo Frequency 3.2 GHz
Max Turbo Frequency 3.5 GHz (available on c4.2xLarge)
Intel® Turbo Boost Technology 2.0
Intel® vPro Technology Yes
Intel® Hyper-Threading Technology Yes
Intel® Virtualization Technology (VT-x) Yes
Intel® Virtualization Technology for Directed I/O (VT-d) Yes
Intel® VT-x with Extended Page Tables (EPT) Yes
Intel® 64 Yes
boto – the AWS API
Java
Python
Ruby
PHP
Perl
Shell
…
… and may other languages.
http://boto.readthedocs.org/en/latest/
Anything you can do in the GUI, you can do on the command line.
AWS CLI – command line interface
http://aws.amazon.com/cli/
Anything you can do in the GUI, you can do on the command line.
as-create-launch-config spotlc-5cents

--image-id ami-e565ba8c

--instance-type m1.small

--spot-price “0.05”
. . .
as-create-auto-scaling-group spotasg
--launch-configuration spotlc-5cents
--availability-zones “us-east-1a,us-east-1b”
--max-size 16
--min-size 1
--desiredcapacity 3
Bright Cluster Manager
http://www.brightcomputing.com
Bright cluster manager is an established very popular HPC
cluster management platform that can simultaneously
manage both on-premises clusters as well as infrastructure
in the cloud - all using the same system images.
Bright has offices in the UK, Netherlands (HQ) and US.
Bright Cluster Manager
1. User submits job to queue
2. Bright creates “data-transfer”
job
3. Bright runs compute job when
data-transfer job is complete
4. Bright transfers output data
back after completion
cfnCluster – provision an HPC cluster in minutes
#cfncluster
https://github.com/awslabs/cfncluster
cfncluster is a sample code framework that deploys and maintains
clusters on AWS. It is reasonably agnostic to what the cluster is
for and can easily be extended to support different frameworks.
The CLI is stateless, everything is done using CloudFormation or
resources within AWS.
10 minutes
Configuration is simple ….
There’s not a great deal involved
getting a cluster up and running.
This config file is enough to do it.
There are more options available,
but this is the minimum set.
[global]
cluster_template = default
update_check = true
sanity_check = true
[aws]
aws_region_name = ap-southeast-2
[cluster default]
key_location = /Users/bouffler/.ssh
key_name = boof-cluster
compute_instance_type = c3.2xLarge
scheduler = sge
vpc_settings = public
[vpc public]
vpc_id = vpc-c48a4fa1
master_subnet_id = subnet-3108f146
10 minutes
Infrastructure as code
#cfncluster
The creation process might take a few minutes (maybe up to
5 mins or so, depending on how you configured it.
Because the API to Cloud Formation (the service that does all
the orchestration) is asynchronous, we can kill the terminal
session if we wanted to and watch the whole show from the
AWS console (where you’ll find it all under the “Cloud
Formation”dashboard in the events tab for this stack.
$ cfnCluster create boof-cluster
Starting: boof-cluster
Status: cfncluster-boof-cluster - CREATE_COMPLETE
Output:"MasterPrivateIP"="10.0.0.17"
Output:"MasterPublicIP"="54.66.174.113"
Output:"GangliaPrivateURL"="http://10.0.0.17/ganglia/"
Output:"GangliaPublicURL"="http://54.66.174.113/ganglia/"
Familiar HPC architecture
Head
node
Instance
Compute
node
Instance
Compute
node
Instance
Compute
node
Instance
Compute
node
Instance
10G Network
Auto-scaling group
Virtual Private Cloud
/shared
Head Instance
2 or more cores (as needed)
CentOS 6.x
OpenMPI, gcc etc…
Choice of scheduler: Torque, SGE,
OpenLava
Slurm (coming soon)
Compute Instances
2 or more cores (as needed)
CentOS 6.x
Auto Scaling group driven by scheduler queue
length.
Can start with 0 (zero) nodes and only scale
when there are jobs.
Yes, it’s a real HPC cluster
#cfncluster
Now you have a cluster, probably running CentOS 6.x, with Sun Grid Engine as a default scheduler, and openMPI and a bunch of other great utilities installed that you’re
already familiar with. You also have a shared filesystem in /shared and an autoscaling group ready to expand the number of compute nodes in the cluster when the existing
ones get busy.
arthur ~ [26] $ cfnCluster create boof-cluster
Starting: boof-cluster
Status: cfncluster-boof-cluster - CREATE_COMPLETE
Output:"MasterPrivateIP"="10.0.0.17"
Output:"MasterPublicIP"="54.66.174.113"
Output:"GangliaPrivateURL"="http://10.0.0.17/ganglia/"
Output:"GangliaPublicURL"="http://54.66.174.113/ganglia/"
arthur ~ [27] $ ssh ec2-user@54.66.174.113
The authenticity of host '54.66.174.113 (54.66.174.113)' can't be established.
RSA key fingerprint is 45:3e:17:76:1d:01:13:d8:d4:40:1a:74:91:77:73:31.
Are you sure you want to continue connecting (yes/no)? yes
Warning: Permanently added '54.66.174.113' (RSA) to the list of known hosts.
[ec2-user@ip-10-0-0-17 ~]$ df
Filesystem 1K-blocks Used Available Use% Mounted on
/dev/xvda1 10185764 7022736 2639040 73% /
tmpfs 509312 0 509312 0% /dev/shm
/dev/xvdf 20961280 32928 20928352 1% /shared
[ec2-user@ip-10-0-0-17 ~]$ qhost
HOSTNAME ARCH NCPU NSOC NCOR NTHR LOAD MEMTOT MEMUSE SWAPTO SWAPUS
----------------------------------------------------------------------------------------------
global - - - - - - - - - -
ip-10-0-0-136 lx-amd64 8 1 4 8 - 14.6G - 1024.0M -
ip-10-0-0-154 lx-amd64 8 1 4 8 - 14.6G - 1024.0M -
[ec2-user@ip-10-0-0-17 ~]$ qstat
[ec2-user@ip-10-0-0-17 ~]$
[ec2-user@ip-10-0-0-17 ~]$ ed hw.qsub
hw.qsub: No such file or directory
a
#!/bin/bash
#
#$ -cwd
#$ -j y
#$ -pe mpi 2
#$ -S /bin/bash
#
module load openmpi-x86_64
mpirun -np 2 hostname
.
w
110
q
[ec2-user@ip-10-0-0-17 ~]$ ll
total 4
-rw-rw-r-- 1 ec2-user ec2-user 110 Feb 1 05:57 hw.qsub
[ec2-user@ip-10-0-0-17 ~]$ qsub hw.qsub
Your job 1 ("hw.qsub") has been submitted
[ec2-user@ip-10-0-0-17 ~]$
[ec2-user@ip-10-0-0-17 ~]$ qstat
job-ID prior name user state submit/start at queue
slots ja-task-ID
---------------------------------------------------------------------------
---------------------
1 0.55500 hw.qsub ec2-user r 02/01/2015 05:57:25
all.q@ip-10-0-0-44.ap-southeas 2
[ec2-user@ip-10-0-0-17 ~]$ qstat
[ec2-user@ip-10-0-0-17 ~]$ ls -l
total 8
-rw-rw-r-- 1 ec2-user ec2-user 110 Feb 1 05:57 hw.qsub
-rw-r--r-- 1 ec2-user ec2-user 26 Feb 1 05:57 hw.qsub.o1
[ec2-user@ip-10-0-0-17 ~]$ cat hw.qsub.o1
ip-10-0-0-136
ip-10-0-0-154
[ec2-user@ip-10-0-0-17 ~]$
Upgrade from Ivy Bridge to Haswell
#cfncluster
Yes, really :-)
You can upgrade your whole cluster in a keystroke or two. It’s an easy way to test which CPUs or instance
properties are important to your code’s performance. For example, you may find that Haswell doesn’t
impact your code’s performance sufficiently to make the additional cost worthwhile, so you can just as
easily downgrade the CPUs you’re using.
Config options to explore …
#cfncluster
Many options, but the
most interesting ones
immediately are:
# (defaults to t2.micro for default template)
compute_instance_type = t2.micro
# Master Server EC2 instance type
# (defaults to t2.micro for default template
#master_instance_type = t2.micro
# Inital number of EC2 instances to launch as compute nodes in the cluster.
# (defaults to 2 for default template)
#initial_queue_size = 1
# Maximum number of EC2 instances that can be launched in the cluster.
# (defaults to 10 for the default template)
#max_queue_size = 10
# Boolean flag to set autoscaling group to maintain initial size and scale back
# (defaults to false for the default template)
#maintain_initial_size = true
# Cluster scheduler
# (defaults to sge for the default template)
scheduler = sge
# Type of cluster to launch i.e. ondemand or spot
# (defaults to ondemand for the default template)
#cluster_type = ondemand
# Spot price for the ComputeFleet
#spot_price = 0.00
# Cluster placement group. This placement group must already exist.
# (defaults to NONE for the default template)
#placement_group = NONE
How is AWS Used for Scientific Computing?
• High Performance Computing (HPC) for Engineering and Simulation
• High Throughput Computing (HTC) for Data-Intensive Analytics
• Hybrid Supercomputing centres
• Collaborative Research Environments
• Citizen Science
• Science-as-a-Service
• Science where the workload changes (hint: almost all science)
Mission-Critical Computing
Astronomy in the Cloud
CHILES will produce the first neutral hydrogen deep field, to be carried out with
the VLA in B array and covering a redshift range from z=0 to z=0.45. The field
is centered at the COSMOS field. It will produce neutral hydrogen images of at
least 300 galaxies spread over the entire redshift range.
Working with AWS’s SciCo team to exploit the SPOT market in the cloud, the
team at ICRAR in Australia have been able to implement the entire processing
pipeline in the cloud for around $2,000 per month, which means the $1.75M
they otherwise needed to spend on an HPC cluster can be spent on way cooler
things that impact their research … like astronomers.
High Throughput Computing at Scale
The Large Hadron Collider @
CERN includes 6,000+
researchers from over 40
countries and produces
approximately 25PB of data each
year.
The ATLAS and CMS
experiments are using AWS for
monte carlo simulations and
analysis of LHC data.
Zooniverse
“The Zooniverse is heavily reliant on Amazon
Web Services (AWS), particularly Elastic
Compute Cloud (EC2) virtual private servers and
Simple Storage Service (S3) data storage. AWS
is the most cost-effective solution for the dynamic
needs of Zooniverse’s infrastructure …”
http://wwwconference.org/proceedings/www2014/companion/p1049.pdf
The World’s Largest Citizen Science Platform
… cost is a factor – running a central API means that when the Zooniverse is
quiet and there aren’t many people about we can scale back the number of
servers we’re running (automagically on Amazon Web Services) to a minimal
level.
Novartis
39 years of computational chemistry in 9 hours
Novartis ran a project that involved virtually screening 10 million
compounds against a common cancer target in less than a week.
They calculated that it would take 50,000 cores and close to a $40
million investment if they wanted to run the experiment internally.
Partnering with Cycle Computing and Amazon Web
Services (AWS), Novartis built a platform leveraging
Amazon Simple Storage Service (Amazon S3),
Amazon Elastic Block Store (Amazon EBS), and
four Availability Zones. The project ran across
10,600 Spot Instances (approximately 87,000
compute cores) and allowed Novartis to conduct 39
years of computational chemistry in 9 hours for a
cost of $4,232. Out of the 10 million compounds
screened, three were successfully identified.
Globus Genomics
Globus Genomics is an indispensible
platform for Core Labs (bioinformatics,
se- quencing, HPC) to meet their
customers’ needs for cost-effective,
large-scale NGS analysis. Globus
Genomics provides a flexible,
extensible solution to ad- dress the
varying analysis and resource
requirements of bioscience
researchers, through powerful data
management tools, customized
workflow environments, and cloud-
based elastic computational
infrastructure.
www.globus.org/genomics
Aquaria – 3D Protein Visualization
Aquaria is a publicly-available web tool, designed for
biologists, for visualizing and working with the 3D structure of
proteins. It has radically simplified the process of analyzing
more than 500,000 proteins from the protein data bank.
Being able to visualize the three-dimensional structure of
proteins has been of great interest to scientists since long
before the genomic age.
The project was led by Dr Sean O’Donoghue from the
CSIRO in Australia along with a team from the Garvan
Institute in Sydney, and a key collaboration with Dr Andrea
Schafferhans from the Technical University of Munich.

Aquaria is fast & it comes with an easy-to-use interface and
contains twice as many models as all other similar resources
combined. It also allows users to view additional information,
like the genetic differences between individuals, mapped
onto 3D structures.
http://aquaria.ws/
What the SKA is saying about AWS
“No one’s ever built anything this big
before, and we really don’t understand
the ins and outs of operating it…Cloud
systems — which provide on-demand,
‘elastic’ access to shared, remote
computing resources — would provide
an amount of flexibility for the project
that buying dedicated hardware might
not.”
- SKA architect Tim Cornwell, Nature
May 27, 2014
We are providing a grants pool of AWS credits and up to one
petabyte of storage for an AWS Public Data Set.
The data set will be initially provided by several of the SKA’s
precursor telescopes including CSIRO’s ASKAP, ICRAR’s MWA
in Australia, and KAT-7 (pathfinder to the SKA precursor
telescope Meerkat) in South Africa.
The grants are open to anyone who is making use of radio
astronomical telescopes or radio astronomical data resources
around the world.
The grants will be administered by the SKA. They will be looking
for innovative, cloud-based algorithms and tools that will be able
to handle and process this never ending data stream.
https://aws.amazon.com/blogs/aws/new-astrocompute-in-the-cloud-grants-program/
What the AWS is doing with SKA
We Feel Emotion Explorer
We Feel is a project that explores whether social media can
provide an accurate, real-time signal of the world’s emotional
state.
A joint collaboration between CSIRO, mental health researchers
at The Black Dog Institute, Amazon Web Services and GNIP.
The We Feel is built on Amazon’s Big Data technologies and
currently analyzes approximately 27 million tweets/day
The outcomes?
1. We can now monitor, in real-time, the emotional health of the
world
2. Seamlessly scale infrastructure up or down in direct relation to
social activity
3. Amazon’s Big Data platform enables real-time trend analysis,
queries of historical data and geospatial analytics
http://wefeel.csiro.au/
“The AWS model works when we have the greatest variety of
uncoupled workloads all using the cloud. When it works, it drives the
cost of computation down to trivial levels so people can concentrate
more on their data, their science and their ideas, rather than bothering
to worry about infrastructure.
Science is one of the greatest areas of computation and also happens
to be the one that can most benefit from that democratisation in cost
and global accessibility and where we think Amazon can make a huge,
really disruptive, impact on the world by participating - which is, at the
most basic level, what we are about as a company.”
Thank You

More Related Content

What's hot

Re invent announcements_2016_hcls_use_cases_mchampion
Re invent announcements_2016_hcls_use_cases_mchampionRe invent announcements_2016_hcls_use_cases_mchampion
Re invent announcements_2016_hcls_use_cases_mchampionMia D Champion
 
Introducing SciaaS @ Sanger
Introducing SciaaS @ SangerIntroducing SciaaS @ Sanger
Introducing SciaaS @ SangerPeter Clapham
 
HPC on Azure for Reserach
HPC on Azure for ReserachHPC on Azure for Reserach
HPC on Azure for ReserachJürgen Ambrosi
 
分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とは
分散DB Apache KuduのアーキテクチャDBの性能と一貫性を両立させる仕組み「HybridTime」とは分散DB Apache KuduのアーキテクチャDBの性能と一貫性を両立させる仕組み「HybridTime」とは
分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とはCloudera Japan
 
Lab Manual Combaring Redis with Relational
Lab Manual Combaring Redis with RelationalLab Manual Combaring Redis with Relational
Lab Manual Combaring Redis with RelationalAmazon Web Services
 
(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...
(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...
(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...Amazon Web Services
 
Speeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudSpeeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudRevolution Analytics
 
AutoScaling and Drupal
AutoScaling and DrupalAutoScaling and Drupal
AutoScaling and DrupalPromet Source
 
Sanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansPeter Clapham
 
Cassandra CLuster Management by Japan Cassandra Community
Cassandra CLuster Management by Japan Cassandra CommunityCassandra CLuster Management by Japan Cassandra Community
Cassandra CLuster Management by Japan Cassandra CommunityHiromitsu Komatsu
 
Deep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instancesDeep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instancesAmazon Web Services
 
Lab Manual Managed Database Basics
Lab Manual Managed Database BasicsLab Manual Managed Database Basics
Lab Manual Managed Database BasicsAmazon Web Services
 
One Grid to rule them all: Building a Multi-tenant Data Cloud with YARN
One Grid to rule them all: Building a Multi-tenant Data Cloud with YARNOne Grid to rule them all: Building a Multi-tenant Data Cloud with YARN
One Grid to rule them all: Building a Multi-tenant Data Cloud with YARNDataWorks Summit
 
Running BSD on AWS
Running BSD on AWSRunning BSD on AWS
Running BSD on AWSJulien SIMON
 
AWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best Practices
AWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best PracticesAWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best Practices
AWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best PracticesAmazon Web Services
 
Hosting Drupal on Amazon EC2
Hosting Drupal on Amazon EC2Hosting Drupal on Amazon EC2
Hosting Drupal on Amazon EC2Kornel Lugosi
 
Amazon EMR Facebook Presto Meetup
Amazon EMR Facebook Presto MeetupAmazon EMR Facebook Presto Meetup
Amazon EMR Facebook Presto Meetupstevemcpherson
 
Heat up your stack
Heat up your stackHeat up your stack
Heat up your stackRico Lin
 

What's hot (20)

Re invent announcements_2016_hcls_use_cases_mchampion
Re invent announcements_2016_hcls_use_cases_mchampionRe invent announcements_2016_hcls_use_cases_mchampion
Re invent announcements_2016_hcls_use_cases_mchampion
 
Introducing SciaaS @ Sanger
Introducing SciaaS @ SangerIntroducing SciaaS @ Sanger
Introducing SciaaS @ Sanger
 
HPC on Azure for Reserach
HPC on Azure for ReserachHPC on Azure for Reserach
HPC on Azure for Reserach
 
分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とは
分散DB Apache KuduのアーキテクチャDBの性能と一貫性を両立させる仕組み「HybridTime」とは分散DB Apache KuduのアーキテクチャDBの性能と一貫性を両立させる仕組み「HybridTime」とは
分散DB Apache Kuduのアーキテクチャ DBの性能と一貫性を両立させる仕組み 「HybridTime」とは
 
Lab Manual Combaring Redis with Relational
Lab Manual Combaring Redis with RelationalLab Manual Combaring Redis with Relational
Lab Manual Combaring Redis with Relational
 
(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...
(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...
(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...
 
Speeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the CloudSpeeding up R with Parallel Programming in the Cloud
Speeding up R with Parallel Programming in the Cloud
 
AutoScaling and Drupal
AutoScaling and DrupalAutoScaling and Drupal
AutoScaling and Drupal
 
Sanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticiansSanger, upcoming Openstack for Bio-informaticians
Sanger, upcoming Openstack for Bio-informaticians
 
Flexible compute
Flexible computeFlexible compute
Flexible compute
 
Cassandra CLuster Management by Japan Cassandra Community
Cassandra CLuster Management by Japan Cassandra CommunityCassandra CLuster Management by Japan Cassandra Community
Cassandra CLuster Management by Japan Cassandra Community
 
Deep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instancesDeep Dive on Amazon EC2 instances
Deep Dive on Amazon EC2 instances
 
Lab Manual Managed Database Basics
Lab Manual Managed Database BasicsLab Manual Managed Database Basics
Lab Manual Managed Database Basics
 
One Grid to rule them all: Building a Multi-tenant Data Cloud with YARN
One Grid to rule them all: Building a Multi-tenant Data Cloud with YARNOne Grid to rule them all: Building a Multi-tenant Data Cloud with YARN
One Grid to rule them all: Building a Multi-tenant Data Cloud with YARN
 
Running BSD on AWS
Running BSD on AWSRunning BSD on AWS
Running BSD on AWS
 
AWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best Practices
AWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best PracticesAWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best Practices
AWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best Practices
 
Hosting Drupal on Amazon EC2
Hosting Drupal on Amazon EC2Hosting Drupal on Amazon EC2
Hosting Drupal on Amazon EC2
 
Amazon EMR Facebook Presto Meetup
Amazon EMR Facebook Presto MeetupAmazon EMR Facebook Presto Meetup
Amazon EMR Facebook Presto Meetup
 
Heat up your stack
Heat up your stackHeat up your stack
Heat up your stack
 
Brisk hadoop june2011
Brisk hadoop june2011Brisk hadoop june2011
Brisk hadoop june2011
 

Viewers also liked

EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...ChemAxon
 
(SEC202) Best Practices for Securely Leveraging the Cloud
(SEC202) Best Practices for Securely Leveraging the Cloud(SEC202) Best Practices for Securely Leveraging the Cloud
(SEC202) Best Practices for Securely Leveraging the CloudAmazon Web Services
 
(SEC203) Journey to Securing Time Inc's Move to the Cloud
(SEC203) Journey to Securing Time Inc's Move to the Cloud(SEC203) Journey to Securing Time Inc's Move to the Cloud
(SEC203) Journey to Securing Time Inc's Move to the CloudAmazon Web Services
 
(SEC201) How Should We All Think About Security?
(SEC201) How Should We All Think About Security?(SEC201) How Should We All Think About Security?
(SEC201) How Should We All Think About Security?Amazon Web Services
 
深入淺出 AWS 大數據工具
深入淺出 AWS 大數據工具深入淺出 AWS 大數據工具
深入淺出 AWS 大數據工具Amazon Web Services
 

Viewers also liked (7)

EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
EUGM15 - Michael J. Bodkin (Evotec): Algorithms, Evolution and Network-Based ...
 
(SEC202) Best Practices for Securely Leveraging the Cloud
(SEC202) Best Practices for Securely Leveraging the Cloud(SEC202) Best Practices for Securely Leveraging the Cloud
(SEC202) Best Practices for Securely Leveraging the Cloud
 
AWS and Scientific Computing
AWS and Scientific ComputingAWS and Scientific Computing
AWS and Scientific Computing
 
(SEC203) Journey to Securing Time Inc's Move to the Cloud
(SEC203) Journey to Securing Time Inc's Move to the Cloud(SEC203) Journey to Securing Time Inc's Move to the Cloud
(SEC203) Journey to Securing Time Inc's Move to the Cloud
 
(SEC201) How Should We All Think About Security?
(SEC201) How Should We All Think About Security?(SEC201) How Should We All Think About Security?
(SEC201) How Should We All Think About Security?
 
深入淺出 AWS 大數據工具
深入淺出 AWS 大數據工具深入淺出 AWS 大數據工具
深入淺出 AWS 大數據工具
 
Towards Full Stack Security
Towards Full Stack Security Towards Full Stack Security
Towards Full Stack Security
 

Similar to Time to Science, Time to Results. Accelerating Scientific research in the Cloud

Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016Jisc
 
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...Amazon Web Services
 
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...Amazon Web Services
 
AWS Webcast - An Introduction to High Performance Computing on AWS
AWS Webcast - An Introduction to High Performance Computing on AWSAWS Webcast - An Introduction to High Performance Computing on AWS
AWS Webcast - An Introduction to High Performance Computing on AWSAmazon Web Services
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyPeter Clapham
 
High Performance Computing in AWS, Immersion Day Huntsville 2019
High Performance Computing in AWS, Immersion Day Huntsville 2019High Performance Computing in AWS, Immersion Day Huntsville 2019
High Performance Computing in AWS, Immersion Day Huntsville 2019Amazon Web Services
 
Building a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for AnalystsBuilding a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for AnalystsAvere Systems
 
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017Amazon Web Services
 
Cisco: Cassandra adoption on Cisco UCS & OpenStack
Cisco: Cassandra adoption on Cisco UCS & OpenStackCisco: Cassandra adoption on Cisco UCS & OpenStack
Cisco: Cassandra adoption on Cisco UCS & OpenStackDataStax Academy
 
Migrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureMigrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureRevolution Analytics
 
How to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutesHow to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutesVladimir Simek
 
AWS Summit 2018 Summary
AWS Summit 2018 SummaryAWS Summit 2018 Summary
AWS Summit 2018 SummaryAshish Mrig
 
Running Cassandra on Amazon EC2
Running Cassandra on Amazon EC2Running Cassandra on Amazon EC2
Running Cassandra on Amazon EC2Dave Gardner
 
(BDT208) A Technical Introduction to Amazon Elastic MapReduce
(BDT208) A Technical Introduction to Amazon Elastic MapReduce(BDT208) A Technical Introduction to Amazon Elastic MapReduce
(BDT208) A Technical Introduction to Amazon Elastic MapReduceAmazon Web Services
 
Aws container webinar day 1
Aws container webinar day 1Aws container webinar day 1
Aws container webinar day 1HoseokSeo7
 
Sanger OpenStack presentation March 2017
Sanger OpenStack presentation March 2017Sanger OpenStack presentation March 2017
Sanger OpenStack presentation March 2017Dave Holland
 
Reusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modulesReusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modulesYevgeniy Brikman
 
Migrating existing open source machine learning to azure
Migrating existing open source machine learning to azureMigrating existing open source machine learning to azure
Migrating existing open source machine learning to azureMicrosoft Tech Community
 

Similar to Time to Science, Time to Results. Accelerating Scientific research in the Cloud (20)

Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
Cloud present, future and trajectory (Amazon Web Services) - JIsc Digifest 2016
 
HPC in the Cloud
HPC in the CloudHPC in the Cloud
HPC in the Cloud
 
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
Building HPC Clusters as Code in the (Almost) Infinite Cloud | AWS Public Sec...
 
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
Risk Management and Particle Accelerators: Innovating with New Compute Platfo...
 
Self-Service Supercomputing
Self-Service SupercomputingSelf-Service Supercomputing
Self-Service Supercomputing
 
AWS Webcast - An Introduction to High Performance Computing on AWS
AWS Webcast - An Introduction to High Performance Computing on AWSAWS Webcast - An Introduction to High Performance Computing on AWS
AWS Webcast - An Introduction to High Performance Computing on AWS
 
HPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journeyHPC and cloud distributed computing, as a journey
HPC and cloud distributed computing, as a journey
 
High Performance Computing in AWS, Immersion Day Huntsville 2019
High Performance Computing in AWS, Immersion Day Huntsville 2019High Performance Computing in AWS, Immersion Day Huntsville 2019
High Performance Computing in AWS, Immersion Day Huntsville 2019
 
Building a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for AnalystsBuilding a Just-in-Time Application Stack for Analysts
Building a Just-in-Time Application Stack for Analysts
 
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
Architectures for HPC and HTC Workloads on AWS | AWS Public Sector Summit 2017
 
Cisco: Cassandra adoption on Cisco UCS & OpenStack
Cisco: Cassandra adoption on Cisco UCS & OpenStackCisco: Cassandra adoption on Cisco UCS & OpenStack
Cisco: Cassandra adoption on Cisco UCS & OpenStack
 
Migrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to AzureMigrating Existing Open Source Machine Learning to Azure
Migrating Existing Open Source Machine Learning to Azure
 
How to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutesHow to run your Hadoop Cluster in 10 minutes
How to run your Hadoop Cluster in 10 minutes
 
AWS Summit 2018 Summary
AWS Summit 2018 SummaryAWS Summit 2018 Summary
AWS Summit 2018 Summary
 
Running Cassandra on Amazon EC2
Running Cassandra on Amazon EC2Running Cassandra on Amazon EC2
Running Cassandra on Amazon EC2
 
(BDT208) A Technical Introduction to Amazon Elastic MapReduce
(BDT208) A Technical Introduction to Amazon Elastic MapReduce(BDT208) A Technical Introduction to Amazon Elastic MapReduce
(BDT208) A Technical Introduction to Amazon Elastic MapReduce
 
Aws container webinar day 1
Aws container webinar day 1Aws container webinar day 1
Aws container webinar day 1
 
Sanger OpenStack presentation March 2017
Sanger OpenStack presentation March 2017Sanger OpenStack presentation March 2017
Sanger OpenStack presentation March 2017
 
Reusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modulesReusable, composable, battle-tested Terraform modules
Reusable, composable, battle-tested Terraform modules
 
Migrating existing open source machine learning to azure
Migrating existing open source machine learning to azureMigrating existing open source machine learning to azure
Migrating existing open source machine learning to azure
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Recently uploaded

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitecturePixlogix Infotech
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhisoniya singh
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Hyundai Motor Group
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsHyundai Motor Group
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machinePadma Pradeep
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking MenDelhi Call girls
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphNeo4j
 

Recently uploaded (20)

Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Understanding the Laravel MVC Architecture
Understanding the Laravel MVC ArchitectureUnderstanding the Laravel MVC Architecture
Understanding the Laravel MVC Architecture
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | DelhiFULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
FULL ENJOY 🔝 8264348440 🔝 Call Girls in Diplomatic Enclave | Delhi
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2Next-generation AAM aircraft unveiled by Supernal, S-A2
Next-generation AAM aircraft unveiled by Supernal, S-A2
 
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter RoadsSnow Chain-Integrated Tire for a Safe Drive on Winter Roads
Snow Chain-Integrated Tire for a Safe Drive on Winter Roads
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365
 
Install Stable Diffusion in windows machine
Install Stable Diffusion in windows machineInstall Stable Diffusion in windows machine
Install Stable Diffusion in windows machine
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptxVulnerability_Management_GRC_by Sohang Sengupta.pptx
Vulnerability_Management_GRC_by Sohang Sengupta.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge GraphSIEMENS: RAPUNZEL – A Tale About Knowledge Graph
SIEMENS: RAPUNZEL – A Tale About Knowledge Graph
 

Time to Science, Time to Results. Accelerating Scientific research in the Cloud

  • 1. AWS Government, Education, & Nonprofits Symposium Canberra, Australia | May 6, 2015 Time to Science, Time to Results. Accelerating Scientific research in the Cloud Brendan Bouffler (“boof”) Scientific Computing Group, Amazon Web Services
  • 2. AWS Global Impact Initiatives for Science AWS Research Grants AWS Hosted Public Datasets • Dedicated team focusing on Scientific Computing & Research workloads • Globally focussed and engaged in Big Science projects like the SKA. • Leveraging AWS resources all over the world. • Ensuring the cloud is able to make a disruptive impact on science. AWS SciCo Team • Grants to initiate & support development of cloud-enabled technologies. • Typically one-off grants of AWS resources like EC2 (compute) or S3 & EBS (storage) or more exotic like Kinesis & twitter feeds. • Frequently results in reusable resources, like AMIs or open data, which we strongly encourage. • Lowers the risk to try the cloud. • Large and globally significant datasets hosted and paid for by AWS for community use. • Data can be quickly and easily processed with elastic computing resources in the surrounding cloud. • AWS hopes to enable more innovation, more quickly. • Provided in partnership with content owners, who curate the data.
  • 3. We are providing a grants pool of AWS credits and up to one petabyte of storage for an AWS Public Data Set. The data set will be initially provided by several of the SKA’s precursor telescopes including CSIRO’s ASKAP, ICRAR’s MWA in Australia, and KAT-7 (pathfinder to the SKA precursor telescope Meerkat) in South Africa. The grants are open to anyone who is making use of radio astronomical telescopes or radio astronomical data resources around the world. The grants will be administered by the SKA. They will be looking for innovative, cloud-based algorithms and tools that will be able to handle and process this never ending data stream. https://aws.amazon.com/blogs/aws/new-astrocompute-in-the-cloud-grants-program/ What the AWS is doing with SKA
  • 4. $7B retail business 10,000 employees A whole lot of servers 2006 2014 Every day, AWS adds enough server capacity to power this $7B enterprise
  • 6.
  • 9. HPC became an optimisation problem
  • 10. A top 500 supercomputer For less than $100/hr Ready in 100 seconds
  • 11. # CPUs time In theory … (the spherical model of owning a cluster)
  • 12. # CPUs time Empirical data … You’re still paying for this, but not using it. Actual CPU usage
  • 13. Meeeelions of uncoupled workloads 0 2 3 5 6 # CPUs time
  • 14. Spot Market 0.00 1.50 3.00 4.50 6.00 # CPUs time Spot Market Our ultimate space filler. Spot Instances allow you to name your own price for spare AWS computing capacity. Great for workloads that aren’t time sensitive, and especially popular in research (hint: it’s really cheap).
  • 15. Cloud Growth 0.00 1.50 3.00 4.50 6.00 # CPUs time Predictable growth All of this makes it much easier for AWS to predict growth in aggregate demand, and hence to invest more to grow the cloud. As a result, we’re expanding the cloud all the time, ready for more workload.
  • 16. Time traveling workloads # CPUs time # CPUs time Wall clock time: 1 hour Wall clock time: 1 week Cost: equal
  • 17. The Solution When you only pay for what you use … • If you’re only able to use your compute, say, 30% of the time, you only pay for that time. 1Pocket the savings • Buy chocolate • Buy a spectrometer • Hire a research assistant. 2 Go faster • Use 3x the cores to run your jobs at 3x the speed. 3 Go Large • Do 3x the science, or consume 3x the data. … you have options.
  • 18. Why do researchers love using AWS? Time to Science Access research infrastructure in minutes Low Cost Pay-as-you-go pricing Elastic Easily add or remove capacity Globally Accessible Easily Collaborate with researchers around the world Secure A collection of tools to protect data and privacy Scalable Access to effectively limitless capacity
  • 19. Collaboration is easier in the cloud More time spent computing the data than moving the data.
  • 21. Cloud resources for Scientific Workloads
  • 22. Instances http://aws.amazon.com/ec2/instance-types/ There’s a couple dozen EC2 compute instance types alone, each of which is optimised for different things. One size does not fit all.
  • 23. C4 Intel Xeon E5-2666 v3, custom built for AWS. Intel Haswell, 16 FLOPS/tick 2.9 GHz, turbo to 3.5 GHz http://docs.aws.amazon.com/AWSEC2/latest/UserGuide/c4-instances.html Feature Specification Processor Number E5-2666 v3 Intel® Smart Cache 25 MiB Instruction Set 64-bit Instruction Set Extensions AVX 2.0 Lithography 22 nm Processor Base Frequency 2.9 GHz Max All Core Turbo Frequency 3.2 GHz Max Turbo Frequency 3.5 GHz (available on c4.2xLarge) Intel® Turbo Boost Technology 2.0 Intel® vPro Technology Yes Intel® Hyper-Threading Technology Yes Intel® Virtualization Technology (VT-x) Yes Intel® Virtualization Technology for Directed I/O (VT-d) Yes Intel® VT-x with Extended Page Tables (EPT) Yes Intel® 64 Yes
  • 24. boto – the AWS API Java Python Ruby PHP Perl Shell … … and may other languages. http://boto.readthedocs.org/en/latest/ Anything you can do in the GUI, you can do on the command line.
  • 25. AWS CLI – command line interface http://aws.amazon.com/cli/ Anything you can do in the GUI, you can do on the command line. as-create-launch-config spotlc-5cents
 --image-id ami-e565ba8c
 --instance-type m1.small
 --spot-price “0.05” . . . as-create-auto-scaling-group spotasg --launch-configuration spotlc-5cents --availability-zones “us-east-1a,us-east-1b” --max-size 16 --min-size 1 --desiredcapacity 3
  • 26. Bright Cluster Manager http://www.brightcomputing.com Bright cluster manager is an established very popular HPC cluster management platform that can simultaneously manage both on-premises clusters as well as infrastructure in the cloud - all using the same system images. Bright has offices in the UK, Netherlands (HQ) and US.
  • 27. Bright Cluster Manager 1. User submits job to queue 2. Bright creates “data-transfer” job 3. Bright runs compute job when data-transfer job is complete 4. Bright transfers output data back after completion
  • 28. cfnCluster – provision an HPC cluster in minutes #cfncluster https://github.com/awslabs/cfncluster cfncluster is a sample code framework that deploys and maintains clusters on AWS. It is reasonably agnostic to what the cluster is for and can easily be extended to support different frameworks. The CLI is stateless, everything is done using CloudFormation or resources within AWS. 10 minutes
  • 29. Configuration is simple …. There’s not a great deal involved getting a cluster up and running. This config file is enough to do it. There are more options available, but this is the minimum set. [global] cluster_template = default update_check = true sanity_check = true [aws] aws_region_name = ap-southeast-2 [cluster default] key_location = /Users/bouffler/.ssh key_name = boof-cluster compute_instance_type = c3.2xLarge scheduler = sge vpc_settings = public [vpc public] vpc_id = vpc-c48a4fa1 master_subnet_id = subnet-3108f146 10 minutes
  • 30. Infrastructure as code #cfncluster The creation process might take a few minutes (maybe up to 5 mins or so, depending on how you configured it. Because the API to Cloud Formation (the service that does all the orchestration) is asynchronous, we can kill the terminal session if we wanted to and watch the whole show from the AWS console (where you’ll find it all under the “Cloud Formation”dashboard in the events tab for this stack. $ cfnCluster create boof-cluster Starting: boof-cluster Status: cfncluster-boof-cluster - CREATE_COMPLETE Output:"MasterPrivateIP"="10.0.0.17" Output:"MasterPublicIP"="54.66.174.113" Output:"GangliaPrivateURL"="http://10.0.0.17/ganglia/" Output:"GangliaPublicURL"="http://54.66.174.113/ganglia/"
  • 31. Familiar HPC architecture Head node Instance Compute node Instance Compute node Instance Compute node Instance Compute node Instance 10G Network Auto-scaling group Virtual Private Cloud /shared Head Instance 2 or more cores (as needed) CentOS 6.x OpenMPI, gcc etc… Choice of scheduler: Torque, SGE, OpenLava Slurm (coming soon) Compute Instances 2 or more cores (as needed) CentOS 6.x Auto Scaling group driven by scheduler queue length. Can start with 0 (zero) nodes and only scale when there are jobs.
  • 32. Yes, it’s a real HPC cluster #cfncluster Now you have a cluster, probably running CentOS 6.x, with Sun Grid Engine as a default scheduler, and openMPI and a bunch of other great utilities installed that you’re already familiar with. You also have a shared filesystem in /shared and an autoscaling group ready to expand the number of compute nodes in the cluster when the existing ones get busy. arthur ~ [26] $ cfnCluster create boof-cluster Starting: boof-cluster Status: cfncluster-boof-cluster - CREATE_COMPLETE Output:"MasterPrivateIP"="10.0.0.17" Output:"MasterPublicIP"="54.66.174.113" Output:"GangliaPrivateURL"="http://10.0.0.17/ganglia/" Output:"GangliaPublicURL"="http://54.66.174.113/ganglia/" arthur ~ [27] $ ssh ec2-user@54.66.174.113 The authenticity of host '54.66.174.113 (54.66.174.113)' can't be established. RSA key fingerprint is 45:3e:17:76:1d:01:13:d8:d4:40:1a:74:91:77:73:31. Are you sure you want to continue connecting (yes/no)? yes Warning: Permanently added '54.66.174.113' (RSA) to the list of known hosts. [ec2-user@ip-10-0-0-17 ~]$ df Filesystem 1K-blocks Used Available Use% Mounted on /dev/xvda1 10185764 7022736 2639040 73% / tmpfs 509312 0 509312 0% /dev/shm /dev/xvdf 20961280 32928 20928352 1% /shared [ec2-user@ip-10-0-0-17 ~]$ qhost HOSTNAME ARCH NCPU NSOC NCOR NTHR LOAD MEMTOT MEMUSE SWAPTO SWAPUS ---------------------------------------------------------------------------------------------- global - - - - - - - - - - ip-10-0-0-136 lx-amd64 8 1 4 8 - 14.6G - 1024.0M - ip-10-0-0-154 lx-amd64 8 1 4 8 - 14.6G - 1024.0M - [ec2-user@ip-10-0-0-17 ~]$ qstat [ec2-user@ip-10-0-0-17 ~]$ [ec2-user@ip-10-0-0-17 ~]$ ed hw.qsub hw.qsub: No such file or directory a #!/bin/bash # #$ -cwd #$ -j y #$ -pe mpi 2 #$ -S /bin/bash # module load openmpi-x86_64 mpirun -np 2 hostname . w 110 q [ec2-user@ip-10-0-0-17 ~]$ ll total 4 -rw-rw-r-- 1 ec2-user ec2-user 110 Feb 1 05:57 hw.qsub [ec2-user@ip-10-0-0-17 ~]$ qsub hw.qsub Your job 1 ("hw.qsub") has been submitted [ec2-user@ip-10-0-0-17 ~]$ [ec2-user@ip-10-0-0-17 ~]$ qstat job-ID prior name user state submit/start at queue slots ja-task-ID --------------------------------------------------------------------------- --------------------- 1 0.55500 hw.qsub ec2-user r 02/01/2015 05:57:25 all.q@ip-10-0-0-44.ap-southeas 2 [ec2-user@ip-10-0-0-17 ~]$ qstat [ec2-user@ip-10-0-0-17 ~]$ ls -l total 8 -rw-rw-r-- 1 ec2-user ec2-user 110 Feb 1 05:57 hw.qsub -rw-r--r-- 1 ec2-user ec2-user 26 Feb 1 05:57 hw.qsub.o1 [ec2-user@ip-10-0-0-17 ~]$ cat hw.qsub.o1 ip-10-0-0-136 ip-10-0-0-154 [ec2-user@ip-10-0-0-17 ~]$
  • 33. Upgrade from Ivy Bridge to Haswell #cfncluster Yes, really :-) You can upgrade your whole cluster in a keystroke or two. It’s an easy way to test which CPUs or instance properties are important to your code’s performance. For example, you may find that Haswell doesn’t impact your code’s performance sufficiently to make the additional cost worthwhile, so you can just as easily downgrade the CPUs you’re using.
  • 34. Config options to explore … #cfncluster Many options, but the most interesting ones immediately are: # (defaults to t2.micro for default template) compute_instance_type = t2.micro # Master Server EC2 instance type # (defaults to t2.micro for default template #master_instance_type = t2.micro # Inital number of EC2 instances to launch as compute nodes in the cluster. # (defaults to 2 for default template) #initial_queue_size = 1 # Maximum number of EC2 instances that can be launched in the cluster. # (defaults to 10 for the default template) #max_queue_size = 10 # Boolean flag to set autoscaling group to maintain initial size and scale back # (defaults to false for the default template) #maintain_initial_size = true # Cluster scheduler # (defaults to sge for the default template) scheduler = sge # Type of cluster to launch i.e. ondemand or spot # (defaults to ondemand for the default template) #cluster_type = ondemand # Spot price for the ComputeFleet #spot_price = 0.00 # Cluster placement group. This placement group must already exist. # (defaults to NONE for the default template) #placement_group = NONE
  • 35. How is AWS Used for Scientific Computing? • High Performance Computing (HPC) for Engineering and Simulation • High Throughput Computing (HTC) for Data-Intensive Analytics • Hybrid Supercomputing centres • Collaborative Research Environments • Citizen Science • Science-as-a-Service • Science where the workload changes (hint: almost all science)
  • 37. Astronomy in the Cloud CHILES will produce the first neutral hydrogen deep field, to be carried out with the VLA in B array and covering a redshift range from z=0 to z=0.45. The field is centered at the COSMOS field. It will produce neutral hydrogen images of at least 300 galaxies spread over the entire redshift range. Working with AWS’s SciCo team to exploit the SPOT market in the cloud, the team at ICRAR in Australia have been able to implement the entire processing pipeline in the cloud for around $2,000 per month, which means the $1.75M they otherwise needed to spend on an HPC cluster can be spent on way cooler things that impact their research … like astronomers.
  • 38. High Throughput Computing at Scale The Large Hadron Collider @ CERN includes 6,000+ researchers from over 40 countries and produces approximately 25PB of data each year. The ATLAS and CMS experiments are using AWS for monte carlo simulations and analysis of LHC data.
  • 39. Zooniverse “The Zooniverse is heavily reliant on Amazon Web Services (AWS), particularly Elastic Compute Cloud (EC2) virtual private servers and Simple Storage Service (S3) data storage. AWS is the most cost-effective solution for the dynamic needs of Zooniverse’s infrastructure …” http://wwwconference.org/proceedings/www2014/companion/p1049.pdf The World’s Largest Citizen Science Platform … cost is a factor – running a central API means that when the Zooniverse is quiet and there aren’t many people about we can scale back the number of servers we’re running (automagically on Amazon Web Services) to a minimal level.
  • 40. Novartis 39 years of computational chemistry in 9 hours Novartis ran a project that involved virtually screening 10 million compounds against a common cancer target in less than a week. They calculated that it would take 50,000 cores and close to a $40 million investment if they wanted to run the experiment internally. Partnering with Cycle Computing and Amazon Web Services (AWS), Novartis built a platform leveraging Amazon Simple Storage Service (Amazon S3), Amazon Elastic Block Store (Amazon EBS), and four Availability Zones. The project ran across 10,600 Spot Instances (approximately 87,000 compute cores) and allowed Novartis to conduct 39 years of computational chemistry in 9 hours for a cost of $4,232. Out of the 10 million compounds screened, three were successfully identified.
  • 41. Globus Genomics Globus Genomics is an indispensible platform for Core Labs (bioinformatics, se- quencing, HPC) to meet their customers’ needs for cost-effective, large-scale NGS analysis. Globus Genomics provides a flexible, extensible solution to ad- dress the varying analysis and resource requirements of bioscience researchers, through powerful data management tools, customized workflow environments, and cloud- based elastic computational infrastructure. www.globus.org/genomics
  • 42. Aquaria – 3D Protein Visualization Aquaria is a publicly-available web tool, designed for biologists, for visualizing and working with the 3D structure of proteins. It has radically simplified the process of analyzing more than 500,000 proteins from the protein data bank. Being able to visualize the three-dimensional structure of proteins has been of great interest to scientists since long before the genomic age. The project was led by Dr Sean O’Donoghue from the CSIRO in Australia along with a team from the Garvan Institute in Sydney, and a key collaboration with Dr Andrea Schafferhans from the Technical University of Munich.
 Aquaria is fast & it comes with an easy-to-use interface and contains twice as many models as all other similar resources combined. It also allows users to view additional information, like the genetic differences between individuals, mapped onto 3D structures. http://aquaria.ws/
  • 43. What the SKA is saying about AWS “No one’s ever built anything this big before, and we really don’t understand the ins and outs of operating it…Cloud systems — which provide on-demand, ‘elastic’ access to shared, remote computing resources — would provide an amount of flexibility for the project that buying dedicated hardware might not.” - SKA architect Tim Cornwell, Nature May 27, 2014
  • 44. We are providing a grants pool of AWS credits and up to one petabyte of storage for an AWS Public Data Set. The data set will be initially provided by several of the SKA’s precursor telescopes including CSIRO’s ASKAP, ICRAR’s MWA in Australia, and KAT-7 (pathfinder to the SKA precursor telescope Meerkat) in South Africa. The grants are open to anyone who is making use of radio astronomical telescopes or radio astronomical data resources around the world. The grants will be administered by the SKA. They will be looking for innovative, cloud-based algorithms and tools that will be able to handle and process this never ending data stream. https://aws.amazon.com/blogs/aws/new-astrocompute-in-the-cloud-grants-program/ What the AWS is doing with SKA
  • 45. We Feel Emotion Explorer We Feel is a project that explores whether social media can provide an accurate, real-time signal of the world’s emotional state. A joint collaboration between CSIRO, mental health researchers at The Black Dog Institute, Amazon Web Services and GNIP. The We Feel is built on Amazon’s Big Data technologies and currently analyzes approximately 27 million tweets/day The outcomes? 1. We can now monitor, in real-time, the emotional health of the world 2. Seamlessly scale infrastructure up or down in direct relation to social activity 3. Amazon’s Big Data platform enables real-time trend analysis, queries of historical data and geospatial analytics http://wefeel.csiro.au/
  • 46. “The AWS model works when we have the greatest variety of uncoupled workloads all using the cloud. When it works, it drives the cost of computation down to trivial levels so people can concentrate more on their data, their science and their ideas, rather than bothering to worry about infrastructure. Science is one of the greatest areas of computation and also happens to be the one that can most benefit from that democratisation in cost and global accessibility and where we think Amazon can make a huge, really disruptive, impact on the world by participating - which is, at the most basic level, what we are about as a company.”