77. AMAZON VPC ARCHITECTURE
[Diagram: the customer's isolated AWS resources sit in three subnets (10.32.1.0/24, 10.32.2.0/24, 10.32.3.0/24) inside the Amazon Web Services cloud. A VPN gateway links them to your network through a secure VPN connection over the Internet. Other labels: External Customers.]
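The subnet layout on this slide can be checked with a few lines of Python. This is only a sketch: the three /24 blocks are read off the diagram, while the enclosing 10.32.0.0/16 block and the helper name `check_layout` are assumptions made for illustration and do not appear on the slide.

```python
import ipaddress

# The three /24 subnets as read from the slide.
SUBNETS = [
    ipaddress.ip_network(cidr)
    for cidr in ("10.32.1.0/24", "10.32.2.0/24", "10.32.3.0/24")
]

# Assumed enclosing address block (hypothetical, not stated on the slide).
VPC_BLOCK = ipaddress.ip_network("10.32.0.0/16")


def check_layout(vpc, subnets):
    """Return True if every subnet lies inside vpc and no two subnets overlap."""
    inside = all(s.subnet_of(vpc) for s in subnets)
    disjoint = all(
        not a.overlaps(b)
        for i, a in enumerate(subnets)
        for b in subnets[i + 1:]
    )
    return inside and disjoint


if __name__ == "__main__":
    print(check_layout(VPC_BLOCK, SUBNETS))
    print(sum(s.num_addresses for s in SUBNETS))
```

Non-overlapping blocks matter here because traffic arriving over the VPN connection has to be routed unambiguously to one subnet.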
103. BLAT @ U. PENN
Map 100 million 100-base paired-end reads
A quad-core machine with 5 GB of RAM would take 16 days
30 high-memory instances; 32 hours; $195
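The figures on this slide imply a wall-clock speedup and an average price per instance-hour. The sketch below works both out. It assumes the $195 covers instance time only and that all 30 instances ran for the full 32 hours; the slide states neither.

```python
def speedup(baseline_hours, cloud_hours):
    """Wall-clock speedup of the cloud run over the single-machine estimate."""
    return baseline_hours / cloud_hours


# Numbers taken from the slide.
baseline_hours = 16 * 24      # 16 days on the quad-core machine
cloud_hours = 32              # wall-clock time on the cloud
instances = 30                # high-memory instances
total_cost = 195.0            # US dollars

# Derived quantities (assumes every instance ran for the whole job).
instance_hours = instances * cloud_hours
cost_per_instance_hour = total_cost / instance_hours

if __name__ == "__main__":
    print(f"speedup: {speedup(baseline_hours, cloud_hours):.1f}x")
    print(f"instance-hours: {instance_hours}")
    print(f"implied cost per instance-hour: ${cost_per_instance_hour:.3f}")
```

Under those assumptions the job finishes 12 times sooner, at roughly $0.20 per instance-hour.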
104. HEAVY-ION COLLISIONS
Problem: Quark matter physics conference imminent, but no compute resources handy
Solution: NIMBUS context broker allowed researchers to provision 300 nodes and get the simulations done
122. Crossbow: Rapid whole genome SNP analysis
Ben Langmead
http://bowtie-bio.sourceforge.net/crossbow/index.shtml
123. Crossbow: Rapid whole genome SNP analysis
Preprocessed reads
Map: Bowtie
Sort: Bin and partition
Reduce: SoapSNP
Langmead B, Schatz MC, Lin J, Pop M, Salzberg SL. Genome Biol 10(11): R134.
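The map, sort, and reduce stages on this slide can be illustrated with a toy, single-process sketch. It is not Crossbow's code. A minimum-mismatch placement stands in for Bowtie, a per-position majority vote stands in for SoapSNP, and the reference string, reads, bin size, and all function names are made up for illustration.

```python
from collections import Counter, defaultdict

REFERENCE = "ACGTACGGTCAAGT"  # made-up reference sequence


def map_read(read, reference=REFERENCE):
    """Map (stand-in for Bowtie): place the read at the offset with the
    fewest mismatches and emit one (position, base) pair per read base."""
    offsets = range(len(reference) - len(read) + 1)
    best = min(
        offsets,
        key=lambda off: sum(a != b for a, b in zip(read, reference[off:])),
    )
    return [(best + i, base) for i, base in enumerate(read)]


def sort_and_bin(pairs, bin_size=4):
    """Sort: partition pairs into fixed-width position bins, each sorted."""
    bins = defaultdict(list)
    for pos, base in pairs:
        bins[pos // bin_size].append((pos, base))
    return {b: sorted(v) for b, v in sorted(bins.items())}


def reduce_bin(pairs, reference=REFERENCE):
    """Reduce (stand-in for SoapSNP): majority vote per position; report
    positions whose consensus base differs from the reference."""
    pileup = defaultdict(Counter)
    for pos, base in pairs:
        pileup[pos][base] += 1
    snps = []
    for pos in sorted(pileup):
        consensus, _ = pileup[pos].most_common(1)[0]
        if consensus != reference[pos]:
            snps.append((pos, reference[pos], consensus))
    return snps


def call_snps(reads):
    """Run the three stages in sequence over a list of reads."""
    pairs = [pair for read in reads for pair in map_read(read)]
    snps = []
    for binned in sort_and_bin(pairs).values():
        snps.extend(reduce_bin(binned))
    return snps


if __name__ == "__main__":
    # Two reads carry a T where the reference has G at position 6.
    print(call_snps(["TACTGT", "CTGTCA", "ACGTAC"]))  # prints [(6, 'G', 'T')]
```

Each bin can be reduced independently of the others, which is what allows the reduce stage to be spread across many machines.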
124. Crossbow condenses over 1,000 hours of resequencing computation into a few hours without requiring the user to own or operate a computer cluster
125. Scalable Genome Assembly
Assembly of Large Genomes with Cloud Computing.
http://contrail-bio.sourceforge.net
Schatz MC, Sommer D, Kelley D, Pop M, et al. In preparation.
126. Amazon Elastic MapReduce
[Diagram: an application is deployed through the web console or command-line tools. Elastic MapReduce runs Hadoop on Amazon EC2 instances, reading the input dataset from an input S3 bucket and writing output results to an output S3 bucket in Amazon S3. Other labels: Input Data, Notify, End, Get Results.]