Most Beautiful Call Girl in Chennai 7427069034 Contact on WhatsApp
ย
Why The Cloud Is A Computational Biologist's Best Friend
1. Amazon Cloud: A Religious Experience
Yannick Pouliot
2/23/2012
2. Amazon Cloud services in a nutshell:
Highly flexible storage
and compute power sold
on a use basis
3. Why the Cloud?
โข Complete flexibility of computing power and
storage
โข Grow or diminish as needed
โข Arbitrary number of machines
โข Ridiculously powerful machine made
affordable on a short lease basis to address
particular task (e.g., 15B ANOVAs)
โข Unusual architectures (e.g., GPUs)
4. There Are Many Cloud Providersโฆ
โฆ but Amazon is clear leader, IMO
5. Q: What does working with a Cloud
machine feel like?
A: Itโs not materially different than
accessing a machine on our cluster,
except you can do anything you want
6. Main Services Provided by Amazon Cloud
โข Storage
โซ Traditional disk volumes
โซ S3 buckets (โSimple Storage Systemโ)
โข Computing (EC2 โ โElastic Compute Cloudโ)
โซ Single machine instances
โซ Clusters of various types
โข Machine types
โซ
โซ
โซ
โซ
โซ
Compute servers
Database servers
Cluster
Specialized architectures
Variety of operating systems (LINUX flavors, Windows)
7. Types of Instances
โข Based on definition of the virtual machine
definition
โซ
โซ
โซ
โซ
I/O bus
Number of CPUs
Memory
Type of CPU, cluster
โข Deployment: Spot market vs. โReservedโ
8. Costs
โข You pay for (almost) everything you do
โซ Data transfers (out)
โซ Storage
โซ CPU cycles (depends on instance type; one
instance is free)
โข Can purchase cycles at below average market
price
โซ Can provide access to vast amounts of computing
power at a price you can afford
โข Research grants from Amazon
9. Controlling Your Services
โข Web-base console
โข Command-line tools
โซ EC2 API tools
โข Third party systems: RightScale
10. Using & Distributing Instances
โข You can always make images of your instances for
later use/backup
โข Images can be made public
โข You can launch other peopleโs images (i.e., public
images), e.g.,
โซ CloudBioLinux: pre-made biocomputational instances
โซ Galaxy Cloud: pre-made Cluster-based Galaxy
instance (Web-based, no less)
โซ PathSeq: pre-made comprehensive bowtie engine that
uses Hadoop
11. Issues
โข Security
โซ Lots of it
โข Data transfers
โซ Free for upload; $ for download
โซ No big deal, so far
โซ Can send drivesโฆ
โข Latency
โซ No big deal
โข Small โephemeralโ storage
โซ Gotcha if you donโt know
โข Max 1 terabyte per disk
โซ Humโฆ
โข โMaxโ 20 disks per instance
โซ Can be circumvented
โข No sharing of disks between instances, usually
12. Support
โข Unless you purchase support, youโre on your own
โข Hasnโt been an issue for me, though it can consume time to find
solutionโฆ
Support options: