Customer presentation: Eagle Genomics, Introduction to AWS, Cambridge

1,022 views

Published on

Published in: Technology, Business
0 Comments
1 Like
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,022
On SlideShare
0
From Embeds
0
Number of Embeds
2
Actions
Shares
0
Downloads
0
Comments
0
Likes
1
Embeds 0
No embeds

No notes for slide

Customer presentation: Eagle Genomics, Introduction to AWS, Cambridge

  1. 1. Customer Presentation Eagle Genomics William Spooner CTO
  2. 2. http://aws.amazon.com/solutions/case-studies/unilever “Unilever’s digital data program now processes genetic sequence twenty times faster – without incurring higher compute costs. – Pete Keeley, eScience IT Lead for Cloud, Unilever Research Transforming Informatics for Unilever Research William Spooner, CTO and Founder, Eagle Genomics Introduction to AWS | Cambridge 30th May 2012©Eagle Genomics Ltd ©Eagle Genomics Ltd
  3. 3. AWS Case Study: UnileverAnglo/Dutch multinationalConsumer goodsOver 400 brands in 180countries.Covers all facets of daily life, from food to cleaning to health and well-being.Unilever Research Over 6,000 specialists In twenty countrieshttp://aws.amazon.com/solutions/case-studies/unileverIntroduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 3
  4. 4. Technology Partner: EagleBabraham-based consultancyInformatics: life science R&DCustomers in US, Europe, AsiaOperating for 4 years12 EmployeesIntroduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 4
  5. 5. The DNA Path1 mile10,000 letters1 gene; BRCA2 BReast CAncer 2 Tumor suppressor © Keith Edkins (CC BY-SA 2.0)Introduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 5
  6. 6. The HumanGenome3,000,000,000 letters20,000 genesx10 round the worldFirst sequence (HGP); Released in 2000 Took 10 years Cost $100M © webdesignhot.com (CC SA 3.0)Introduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 6
  7. 7. © S. Ballard (CC BY-SA 2.0)Next Generation DNASequencing (NGS)Latest figures (2012) Takes 12 days Costs $10,000Costs still fallingrapidlyType footer in here ©Eagle Genomics Ltd June 6, 2012 7
  8. 8. 1000 HumanGenomes200 TB sequence data AWS public data setData analysis, e.g. Fragment assembly06/06/2012 8 ©Eagle Genomics Ltd
  9. 9. © T. HarrisHPC at SangerOver 10,000 coresOver 10 PB storageSupported by a largeteam © Genome Research Ltd.Introduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 9
  10. 10. HPC with AWSVirtual supercomputer 50,000 cores $5,000/hourVs. Hardware cost of: ~$15,000,000Used for Proteinsimulation experiment ©Eagle Genomics Ltd
  11. 11. Bacterial diversity: Gingivitis Health Shared Gingivitis © David Taylor, Suzi Adams, Unilever Research. Generated using CytoscapeSample (healthy site, n=40)Sample (gingivitis site, n=36) Actinobacteria Proteobacteria Bacteriodetes SR1 Cyanobacteria SpirochaetesOTU coloured by phyla Firmicutes TM7OTU size proportional to log Fusobacteria Tenericutescount Unclassified Introduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 11
  12. 12. AWS sFTP Web UI EC2 instance Main data input/output Exchange for user access Workflow Blackboard Server Instances only launched Job fetching and on workflow demand/loadS3 status updating Data input/output storage elastic EC2 instances 12 ©Eagle Genomics Ltd © David Taylor, Pete Keeley - Unilever Research
  13. 13. Results of PilotJune 2011 Jan 201210 Studies per Year 50 Studies per YearRuntime – Weeks (realtime) Runtime – Hours (realtime)1 pipeline 2 pipelinesRun jobs sequentially Run all jobs in parallel1 UK Lab 6 Global Labs12 direct users 50 direct usersFew Specialists Locally All Biologists GloballyIntroduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 13 © David Taylor, Pete Keeley - Unilever Research
  14. 14. “Unilever’s digital data program now processes genetic sequence twenty times faster – without incurring higher compute costs. “In addition, its robust architecture supports ten times as many scientists, all working simultaneously” – Pete Keeley, eScience IT Lead for Cloud, Unilever Research info@eaglegenomics.com www.eaglegenomics.com +44 (0)1223 654481 @wspoonr facebook.com/eaglegenomics blog.eaglegenomics.com @eaglegen ©Eagle Genomics LtdEagle® is a registered trademark no. 010418135 of Eagle Genomics Ltd.Postal address: Eagle Genomics Ltd., Babraham Research Campus, Cambridge CB22 3AT, United Kingdom. ©Eagle Genomics Ltd

×