• Save
Customer presentation: Eagle Genomics, Introduction to AWS, Cambridge
Upcoming SlideShare
Loading in...5
×
 

Customer presentation: Eagle Genomics, Introduction to AWS, Cambridge

on

  • 1,094 views

 

Statistics

Views

Total Views
1,094
Views on SlideShare
1,094
Embed Views
0

Actions

Likes
1
Downloads
0
Comments
0

0 Embeds 0

No embeds

Accessibility

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Customer presentation: Eagle Genomics, Introduction to AWS, Cambridge Customer presentation: Eagle Genomics, Introduction to AWS, Cambridge Presentation Transcript

  • Customer Presentation Eagle Genomics William Spooner CTO
  • http://aws.amazon.com/solutions/case-studies/unilever “Unilever’s digital data program now processes genetic sequence twenty times faster – without incurring higher compute costs. – Pete Keeley, eScience IT Lead for Cloud, Unilever Research Transforming Informatics for Unilever Research William Spooner, CTO and Founder, Eagle Genomics Introduction to AWS | Cambridge 30th May 2012©Eagle Genomics Ltd ©Eagle Genomics Ltd
  • AWS Case Study: UnileverAnglo/Dutch multinationalConsumer goodsOver 400 brands in 180countries.Covers all facets of daily life, from food to cleaning to health and well-being.Unilever Research Over 6,000 specialists In twenty countrieshttp://aws.amazon.com/solutions/case-studies/unileverIntroduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 3
  • Technology Partner: EagleBabraham-based consultancyInformatics: life science R&DCustomers in US, Europe, AsiaOperating for 4 years12 EmployeesIntroduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 4
  • The DNA Path1 mile10,000 letters1 gene; BRCA2 BReast CAncer 2 Tumor suppressor © Keith Edkins (CC BY-SA 2.0)Introduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 5
  • The HumanGenome3,000,000,000 letters20,000 genesx10 round the worldFirst sequence (HGP); Released in 2000 Took 10 years Cost $100M © webdesignhot.com (CC SA 3.0)Introduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 6
  • © S. Ballard (CC BY-SA 2.0)Next Generation DNASequencing (NGS)Latest figures (2012) Takes 12 days Costs $10,000Costs still fallingrapidlyType footer in here ©Eagle Genomics Ltd June 6, 2012 7
  • 1000 HumanGenomes200 TB sequence data AWS public data setData analysis, e.g. Fragment assembly06/06/2012 8 ©Eagle Genomics Ltd
  • © T. HarrisHPC at SangerOver 10,000 coresOver 10 PB storageSupported by a largeteam © Genome Research Ltd.Introduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 9
  • HPC with AWSVirtual supercomputer 50,000 cores $5,000/hourVs. Hardware cost of: ~$15,000,000Used for Proteinsimulation experiment ©Eagle Genomics Ltd
  • Bacterial diversity: Gingivitis Health Shared Gingivitis © David Taylor, Suzi Adams, Unilever Research. Generated using CytoscapeSample (healthy site, n=40)Sample (gingivitis site, n=36) Actinobacteria Proteobacteria Bacteriodetes SR1 Cyanobacteria SpirochaetesOTU coloured by phyla Firmicutes TM7OTU size proportional to log Fusobacteria Tenericutescount Unclassified Introduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 11
  • AWS sFTP Web UI EC2 instance Main data input/output Exchange for user access Workflow Blackboard Server Instances only launched Job fetching and on workflow demand/loadS3 status updating Data input/output storage elastic EC2 instances 12 ©Eagle Genomics Ltd © David Taylor, Pete Keeley - Unilever Research
  • Results of PilotJune 2011 Jan 201210 Studies per Year 50 Studies per YearRuntime – Weeks (realtime) Runtime – Hours (realtime)1 pipeline 2 pipelinesRun jobs sequentially Run all jobs in parallel1 UK Lab 6 Global Labs12 direct users 50 direct usersFew Specialists Locally All Biologists GloballyIntroduction to AWS | Cambridge ©Eagle Genomics Ltd May 30th 2012 13 © David Taylor, Pete Keeley - Unilever Research
  • “Unilever’s digital data program now processes genetic sequence twenty times faster – without incurring higher compute costs. “In addition, its robust architecture supports ten times as many scientists, all working simultaneously” – Pete Keeley, eScience IT Lead for Cloud, Unilever Research info@eaglegenomics.com www.eaglegenomics.com +44 (0)1223 654481 @wspoonr facebook.com/eaglegenomics blog.eaglegenomics.com @eaglegen ©Eagle Genomics LtdEagle® is a registered trademark no. 010418135 of Eagle Genomics Ltd.Postal address: Eagle Genomics Ltd., Babraham Research Campus, Cambridge CB22 3AT, United Kingdom. ©Eagle Genomics Ltd