8 mattwoodaws-intro-pdf-110411093115-phpapp01


Matt Wood of AWS
"Cloud Research"
Europe April 2011 @ the Eagle Genomics Symposium

Published in: Technology

  1. 1. Cloud Research. Matt Wood, Technology Evangelist
  2. 2. Hello.
  3. 3. Text
  4. 4. Thank you.
  5. 5. The Cloud by Example
  6. 6. The Cloud by Example
  7. 7. Infrastructure services
  8. 8. ?
  9. 9. On demand
  10. 10. Pay as you go
  11. 11. Pay for what you use
  12. 12. Elastic capacity
  13. 13. Chart (capacity over time): estimated demand
  14. 14. Chart (capacity over time): infrastructure investment vs. estimated demand
  15. 15. Chart (capacity over time): infrastructure vs. real demand
  16. 16. Chart (capacity over time): elastic capacity vs. real demand
  17. 17. Agility
  18. 18. Faster to prototype
  19. 19. Faster to production
  20. 20. Undifferentiated heavy lifting
  21. 21. Tools for accelerating research
  22. 22. Chart: y-axis 0 to 300; x-axis Q4 2006 to Q4 2010
  23. 23. The Cloud by Example
  24. 24. Data management
  25. 25. Biomarker Warehouse: pre-clinical, clinical, 3rd party data and publications. Estimated cost: 10 TB warehouse over 3 years (cost figures illegible in the transcript)
  26. 26. Data processing
  27. 27. http://cyclecomputing.com
  28. 28. http://web.mit.edu/stardev/cluster/
  29. 29. sudo gem install cloud-crowd. http://cyclecomputing.com http://wiki.github.com/documentcloud/cloud-crowd
  30. 30. http://www.rightscale.com
  31. 31. Diagram: Amazon Elastic MapReduce. Deploy an application from the web console or command line tools; the input dataset is read from an Amazon S3 input bucket, processed by Hadoop on Amazon EC2 instances, and written to an S3 output bucket; notify at the end and get results
  32. 32. Crossbow: Rapid whole genome SNP analysis. Preprocessed reads; Map: Bowtie; Sort: Bin and partition; Reduce: SoapSNP. Langmead B, Schatz MC, Lin J, Pop M, Salzberg SL. Genome Biol 10(11): R134.
  33. 33. CloudBurst: Catalog k-mers; Collect seeds; End-to-end alignment. http://cloudburst-bio.sourceforge.net; Bioinformatics 2009 25: 1363-1369
  34. 34. ASSEMBLING GENOMES: 140 million 454 reads. Image: Matt Wood
  35. 35. BLAT @ U. PENN: Map 100 million, 100 base paired end reads. Quad core with 5 GB of RAM would take 16 days. 30 high-memory instances; 32 hours; $195
  36. 36. HEAVY-ION COLLISIONS @ RHIC. Problem: Quark physics conference imminent but no compute resources handy. Solution: NIMBUS context broker allowed researchers to provision 300 nodes and get the simulations done
  37. 37. Collaboration
  38. 38. http://aws.amazon.com/publicdatasets/
  39. 39. http://www.cloudbiolinux.com/
  40. 40. http://usegalaxy.org/cloud
  41. 41. Applications and platforms
  42. 42. http://heroku.com
  43. 43. http://chempedia.com/
  44. 44. Security
  45. 45. Shared responsibility
  46. 46. Requirement based access
  47. 47. Certification
  48. 48. ISO 27001 + SAS 70 Type II
  49. 49. PCI DSS Level 1
  50. 50. Control objectives: Security organisation; Employee lifecycle; Logical security; Secure data handling; Physical security; Environmental safeguards; Change management; Incident handling; Availability and redundancy; Data integrity
  51. 51. Data access control
  52. 52. Identity and access
  53. 53. Geographically separated; Independent buildings; Separate flood zones; Redundant power; Redundant connectivity; Highly monitored
  54. 54. Default deny firewall
  55. 55. Security groups
  56. 56. DDOS; Man in the Middle; IP spoofing
  57. 57. Resource isolation: Virtual Private Cloud
  58. 58. Diagram: Customer’s network linked by a secure VPN connection over the internet to a VPN Gateway and Router in front of the customer’s isolated AWS resources (Subnet 1, Subnet 2, Subnet 3, Subnet 4) inside Amazon Web Services infrastructure
  59. 59. Dedicated instances: Virtual Private Cloud
  60. 60. aws.amazon.com/security
  61. 61. Data stays local
  62. 62. aws.amazon.com
  63. 63. Thank you!
  64. 64. Questions + Comments: matthew@amazon.com, @mza on Twitter
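
The BLAT @ U. Penn figures on slide 35 imply a cost per instance-hour and a wall-clock speedup that the slide itself does not state. The sketch below works through that arithmetic. It assumes the $195 covers all 30 instances for the full 32 hours and that "16 days" means 16 x 24 hours of continuous running; the slide confirms neither, and the variable names are illustrative only.

```python
# Figures quoted on the BLAT @ U. Penn slide.
instances = 30            # high-memory instances
cloud_hours = 32          # wall-clock hours on the cloud
total_cost_usd = 195      # quoted total cost
single_machine_days = 16  # estimate for one quad-core machine

# Assumption: every instance ran (and was billed) for the whole 32 hours.
instance_hours = instances * cloud_hours
cost_per_instance_hour = total_cost_usd / instance_hours

# Assumption: the 16-day estimate is continuous, 24 hours per day.
single_machine_hours = single_machine_days * 24
speedup = single_machine_hours / cloud_hours

print(f"instance-hours used:     {instance_hours}")
print(f"cost per instance-hour:  ${cost_per_instance_hour:.3f}")
print(f"wall-clock speedup:      {speedup:.0f}x")
```

Under those assumptions the run used 960 instance-hours at roughly $0.20 each and finished about 12 times sooner than the single-machine estimate.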
