Successfully reported this slideshow.
We use your LinkedIn profile and activity data to personalize ads and to show you more relevant ads. You can change your ad preferences anytime.
Newtons ideas and methods are     preserved forever:      how about yours? Marco Roos, Kristina Hettne, Jun Zhao, Mark Tho...
Wednesday, December 19, 2012   Digital preservation for the modern scientist   2
Reproduced workflows                  Mass                                                Power & Mass                    ...
Case studyBioinformatics analysis of Metabolic SyndromeKristina Hettne, Harish Dharuri      Genome Wide Association       ...
Reproducible SciencePreservation for the  wet laboratory  scientist             From Van Roon-Mom et al., BMC Molecular Bi...
Reproducible Science?What is the digital equivalent?Is it equally good?Can we do better?  - or worse?                     ...
Reproducible Science                      What is our incentive?                      Nobility                            ...
Reproducible Science                      What is our incentive?                                              I’ll be the ...
CHALLENGE    Stimulate preservation and    reproducibility while speeding up    the research processWednesday, December 19...
Enhance the research cycle         What slows us down? Research QuestionFind Methods        Get          Understand    For...
Bottlenecks • Loosing track of what you did • Messy storage • Preparing material for a publication • Understanding the com...
Getting on with workflowsWednesday, December 19, 2012   Towards preserving bioinformatics experiments   12
Monolithic Tool →              Web Services → Workflows → (Web) Tool              Example: Anni 2.0 → Anni workflows      ...
Digital Repository          myExperiment.orgThe recipes store•   Find workflows•   Share workflows & files•   Find people•...
Instructions for workflow authors                      10 Best Practices for creating workflows        1.         Make a s...
Reproducible Science            Is a workflow sufficient?  Useful Preservation           =Understandable ObjectsReproduce,...
Useful preservation 1                      myExperiment PacksWednesday, December 19, 2012   Towards preserving bioinformat...
Useful preservation                      Research Object Model                             Research Object Model          ...
Research Object (RO) ModelRO = ORE + AO + vocabulariesObject Re-use and Exchange (OAI-ORE)   Describes aggregations of res...
Research Object Model
Research Object: “Hello World”https://github.com/wf4ever/ro-catalogue/tree/master/v0.1/HelloWorld
Help organize the materials andmethods of computational analysisResearch Object Portal                                    ...
Expected on myExperiment  Research Objects inside!  • Packs more prominent  • Start a pack when you    upload a workflow  ...
Fame and Glory  It was  me, me,                      What    HDAC1 interacts with Parvb   me!                  I     Disco...
Nanopublication Model                         Getting credit for digital results                Nanopublication ID        ...
Nanopub.orgWednesday, December 19, 2012   Towards preserving bioinformatics experiments   26
ExamplesWednesday, December 19, 2012   Towards preserving bioinformatics experiments   27
Examples in RDF formatWednesday, December 19, 2012   Towards preserving bioinformatics experiments   28
ValidatorWednesday, December 19, 2012   Towards preserving bioinformatics experiments   29
Example: LOVD
Nanopublications of Genetic Variations          visualized on the genome                                                  ...
Fame and Glory  It was                        Nanopublication  me, me,                      What       <CS7183> <associate...
Summary (1/2)• Preservation under the hood of digital research  tools• Research Object Model: annotated aggregates• Nanopu...
Summary (2/2)• Semantic Web for exchange and interoperability• In progress: RO-enabling myExperiment  Watch myExperiment.o...
AcknowledgementsEU Wf4Ever project (270129)funded under EU FP7 (ICT- 2009.4.1).(http://www.wf4ever-project.org)
Thank you for your attention                          36http://biosemantics.org
Reproducible SciencePreserved materials  and methods for the  ‘wet laboratory’  scientist                        From Van ...
Reproducible Science?What is the digital equivalent?Is it equally good?Can we do better?  - or worse?                     ...
Reproducible ScienceWhat is the digital equivalent?Is it equally good?Can we do better?  – or worse?               Can you...
Reproducible Science                      What is our incentive?                      Nobility                            ...
Reproducible Science                      What is our incentive?                                              I’ll be the ...
Our aim                                 ‘Useful’ preservation                          Support reproducibility            ...
Preservation                                                                                What?                         ...
Preservation                                Deemed                 Deemed                                Valuable         ...
Acknowledgements                                  http://biosemantics.org/■   Erik Schultes          ■   Paul Groth       ...
Upcoming SlideShare
Loading in …5
×

Marco Roos: Newton's ideas and methods are preserved forever: how about yours?

1,074 views

Published on

Marco Roos talk at ISCB-Asia: Newton's ideas and methods are preserved forever: how about yours? December 19th 2012

  • Be the first to comment

  • Be the first to like this

Marco Roos: Newton's ideas and methods are preserved forever: how about yours?

  1. 1. Newtons ideas and methods are preserved forever: how about yours? Marco Roos, Kristina Hettne, Jun Zhao, Mark Thompson Cloud and Workflows for Reproducible Bioinformatics Shenzhen, December 19, 2012
  2. 2. Wednesday, December 19, 2012 Digital preservation for the modern scientist 2
  3. 3. Reproduced workflows Mass Power & Mass Force Web Service AccelerationWednesday, December 19, 2012 Towards preserving bioinformatics experiments 3
  4. 4. Case studyBioinformatics analysis of Metabolic SyndromeKristina Hettne, Harish Dharuri Genome Wide Association Studies What is the genetic basis for the diseases associated with Metabolic Syndrome?
  5. 5. Reproducible SciencePreservation for the wet laboratory scientist From Van Roon-Mom et al., BMC Molecular Biology 2008 doi: 10.1186/1471-2199-9-84.
  6. 6. Reproducible Science?What is the digital equivalent?Is it equally good?Can we do better? - or worse? GroundHog DB Reproduced from Jelier et al., Schuemie et al., Hettne et al., Haagen et al., http://biosemantics.org , myExperiment.org/workflows/2197
  7. 7. Reproducible Science What is our incentive? Nobility Greater Good Good Reproducible Science Serve the publicWednesday, December 19, 2012 Towards preserving bioinformatics experiments 7
  8. 8. Reproducible Science What is our incentive? I’ll be the first in Nature Fame and Glory Getting on with it...Wednesday, December 19, 2012 Towards preserving bioinformatics experiments 8
  9. 9. CHALLENGE Stimulate preservation and reproducibility while speeding up the research processWednesday, December 19, 2012 Towards preserving bioinformatics experiments 9
  10. 10. Enhance the research cycle What slows us down? Research QuestionFind Methods Get Understand Format and Data, + Methods Methods and (Align)their Owners and Data Data Data Design Interpret the Compute Publish Results Analysis 10
  11. 11. Bottlenecks • Loosing track of what you did • Messy storage • Preparing material for a publication • Understanding the computational procedure • Communication with (non-technical) colleagues • Keeping tools working • Getting credit for digital results outside of traditional publicationsWednesday, December 19, 2012 Towards preserving bioinformatics experiments 11
  12. 12. Getting on with workflowsWednesday, December 19, 2012 Towards preserving bioinformatics experiments 12
  13. 13. Monolithic Tool → Web Services → Workflows → (Web) Tool Example: Anni 2.0 → Anni workflows AnniWFhttp://workflow.biosemantics.org/t2web/workflow/2725
  14. 14. Digital Repository myExperiment.orgThe recipes store• Find workflows• Share workflows & files• Find people• Build communities• Publish packages• Tag workflows• Score, rate, comment
  15. 15. Instructions for workflow authors 10 Best Practices for creating workflows 1. Make a sketch workflow 2. Use modules 3. Think about the output 4. Provide example inputs and outputs 5. Annotate 6. Test execution from outside local environment 7. Choose services carefully 8. Reuse existing workflows 9. Advertise 10. MaintainWednesday, December 19, 2012 Towards preserving bioinformatics experiments 15
  16. 16. Reproducible Science Is a workflow sufficient? Useful Preservation =Understandable ObjectsReproduce, Reuse, Repurpose, Repair, ... What is this doing? Reproduced from Jelier et al., Schuemie et al., Hettne et al., Haagen et al., http://biosemantics.org , myExperiment.org/workflows/2197
  17. 17. Useful preservation 1 myExperiment PacksWednesday, December 19, 2012 Towards preserving bioinformatics experiments 17
  18. 18. Useful preservation Research Object Model Research Object Model Aggregation and Annotation Model for Digital Methods http://wf4ever.github.com/ro/Wednesday, December 19, 2012 Towards preserving bioinformatics experiments 18
  19. 19. Research Object (RO) ModelRO = ORE + AO + vocabulariesObject Re-use and Exchange (OAI-ORE) Describes aggregations of resources: data, metadata, papers, etc.Annotation Ontology (AO) Associates RDF metadata descriptions with resourcesGeneric and domain-specific vocabularies Used in annotation bodies to provide information about resources (types, dependencies, descriptions, etc.)Builds on RDF, leading to RDF as a natural implementation choiceModel specification: http://wf4ever.github.com/ro/
  20. 20. Research Object Model
  21. 21. Research Object: “Hello World”https://github.com/wf4ever/ro-catalogue/tree/master/v0.1/HelloWorld
  22. 22. Help organize the materials andmethods of computational analysisResearch Object Portal Materials & Methods of Metabolic Syndrome Analysis Kristina Hettne Harish Dharuri 22
  23. 23. Expected on myExperiment Research Objects inside! • Packs more prominent • Start a pack when you upload a workflow • Upload wizards, pack management, export • Checklists, automated star ratings • Add workflow runs and example data • Sticky annotations RO-enabled myExperiment mockupWednesday, December 19, 2012 Towards preserving bioinformatics experiments 23
  24. 24. Fame and Glory It was me, me, What HDAC1 interacts with Parvb me! I Discovered by: me found Published by: meResearch Object How I found it 24
  25. 25. Nanopublication Model Getting credit for digital results Nanopublication ID Integrity Key Assertion Provenance associa- sio:statis- is ticalAssociatio tion n Supporting Attribution sio:has- measure Association_1 this dcterms: mentValu _p_value nanopu created sio: e b refers-to opm: assertio was n Derived pav: From authored- is sio:has-value By opm: wasGene- … ratedBy dcterms: Sio:probability 6.56e-5 DOI -value ^^xsd:floatWednesday, December 19, 2012 Towards preserving bioinformatics experiments 25
  26. 26. Nanopub.orgWednesday, December 19, 2012 Towards preserving bioinformatics experiments 26
  27. 27. ExamplesWednesday, December 19, 2012 Towards preserving bioinformatics experiments 27
  28. 28. Examples in RDF formatWednesday, December 19, 2012 Towards preserving bioinformatics experiments 28
  29. 29. ValidatorWednesday, December 19, 2012 Towards preserving bioinformatics experiments 29
  30. 30. Example: LOVD
  31. 31. Nanopublications of Genetic Variations visualized on the genome Zuotian Tatum, Jesse van Dam Other OtherSources Tools Nanopublication Store 31
  32. 32. Fame and Glory It was Nanopublication me, me, What <CS7183> <associatedWith> <MetS> me! I Discovered by: me found Published by: meResearch Object How I found http://purl.org/nanopub/123 http://purl.org/ResObj/345 it 32
  33. 33. Summary (1/2)• Preservation under the hood of digital research tools• Research Object Model: annotated aggregates• Nanopublication: fine-grained digital credit Check Nanopub.org to stay updatedWednesday, December 19, 2012 Towards preserving bioinformatics experiments 33
  34. 34. Summary (2/2)• Semantic Web for exchange and interoperability• In progress: RO-enabling myExperiment Watch myExperiment.org in 2013!• Plans to RO-enable Taverna, Galaxy, GenomeSpaceWednesday, December 19, 2012 Towards preserving bioinformatics experiments 34
  35. 35. AcknowledgementsEU Wf4Ever project (270129)funded under EU FP7 (ICT- 2009.4.1).(http://www.wf4ever-project.org)
  36. 36. Thank you for your attention 36http://biosemantics.org
  37. 37. Reproducible SciencePreserved materials and methods for the ‘wet laboratory’ scientist From Van Roon-Mom et al., BMC Molecular Biology 2008 doi: 10.1186/1471-2199-9-84.
  38. 38. Reproducible Science?What is the digital equivalent?Is it equally good?Can we do better? - or worse? Reproduced from Jelier et al., Schuemie et al., Hettne et al., Haagen et al., http://biosemantics.org , myExperiment.org/workflows/2197
  39. 39. Reproducible ScienceWhat is the digital equivalent?Is it equally good?Can we do better? – or worse? Can you tell what this is doing? Reproduced from Jelier et al., Schuemie et al., Hettne et al., Haagen et al., http://biosemantics.org , myExperiment.org/workflows/2197
  40. 40. Reproducible Science What is our incentive? Nobility Greater Good Good Reproducible Science Serve the publicWednesday, December 19, 2012 Towards preserving bioinformatics experiments 40
  41. 41. Reproducible Science What is our incentive? I’ll be the first in Nature Fame and Glory Getting on with it...Wednesday, December 19, 2012 Towards preserving bioinformatics experiments 41
  42. 42. Our aim ‘Useful’ preservation Support reproducibility in tools and by guidelines that speed up your research get you acknowledgementWednesday, December 19, 2012 Towards preserving bioinformatics experiments 42
  43. 43. Preservation What? How? Nanopublication Assertion Research Results Provenance Attribution SupportingWednesday, December 19, 2012 Towards preserving bioinformatics experiments 43
  44. 44. Preservation Deemed Deemed Valuable of of Digital for scientific scientific Value What? value by scientists value by How? scientists scientists Nanopublication Assertion Research Results Provenance Attribution SupportingWednesday, December 19, 2012 Towards preserving bioinformatics experiments 44
  45. 45. Acknowledgements http://biosemantics.org/■ Erik Schultes ■ Paul Groth ■ Christine Chichester■ Andrew Gibson ■ Frank van ■ Kees Burger - NBIC■ Reinout van Schouwen Harmelen ■ Spyros Kotoulas - VU■ Kostas Karasavvas ■ Antonis Loizou - VU■ Kristina Hettne ■ Valery Tkachenko - RSC■ Harish Dharuri ■ Andra Waagmeester -■ Eleni Mina Maastricht■ Jesse van Dam ■ Erik van Mulligen ■ Sune Askjaer - Lundbeck■ Herman van Haagen ■ Bharat Singh ■ Steve Pettifer - Manchester■ Zuotian Tatum ■ Jan Kors ■ Lee Harland - Pfizer/CD■ Johan den Dunnen ■ Carina Haupt - Fraunhofer■ Peter-Bram ‘t Hoen ■ Colin Batchelor - RSC■ Barend Mons ■ Miguel Vazquez - CNIO■ Gert-Jan van Ommen ■ José María Fernández - CNIO ■ Jahn Saito - Maastricht ■ Andrew Gibson (Outside Expert) - Amsterdam ■ Louis Wich - DTU Melton Foundation

×