Wageningen phenotype meeting


Published on

Published in: Technology, Education
  • Be the first to comment

  • Be the first to like this

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Wageningen phenotype meeting

  1. 1. Phenotype Foundation<br />Why? What? How?<br />Kees van Bochove, 6 september 2011<br />
  2. 2. History<br />
  3. 3. Multiple instances: NMC, NuGO, TNO…<br />
  4. 4. A few screenshots…<br />
  5. 5. Example of a studytimeline<br />
  6. 6. NextGenerationSequencing module<br />
  7. 7. Transcriptomics module<br />
  8. 8. SimpleAssay Module<br />
  9. 9. Visualization – PPS1 paper figure<br />
  10. 10. Galaxy (toolbox / visualization) <br />
  11. 11. Core reasons<br />Governance of dbNP - now dbXP<br />Communication – many project, many partners<br />Professionalize open source development<br />Guarantee sustainability of software and data<br />Represent dbNP to outside<br />
  12. 12. Mission statement<br />The Phenotype Foundation mission is to empower scientists, within various communities of practice, with standardized software and knowledge stores to store, manage, and retrieve information and data on genotypes and phenotypes.<br />
  13. 13. My reasons<br />To find more testers! Engage more biologists.<br />One stop place forourefforts – currently all over the web<br />Professionalize open sourcedevelopment – itshouldbe easy, not hard, to join in<br />Deployment issues – configuring servers, databases… who is responsible?<br />Sustainability!<br />
  14. 14. Testing time<br />Estimates of testing time formoderately complex GUI applications as percentage of project time range from 30% to 60%<br />Whichmeansthatjusthiringdevelopers is notnearlyenough to accomplish a succesful software development project<br />And… testing is the most complex part of the project to manage!<br />
  15. 15. Current state ofdbXPdocumentation<br />
  16. 16. Current state of documentation<br />Source code: http://trac.nbic.nl<br />Infrastructure: http://nmcdsp.org (Nexus server)<br />Website http://dbnp.org, http://trac.nbic.nl, http://wiki.nbic.nl<br />
  17. 17. Advantages of open source- in theory<br />Lowers adaptation barrier, thereby improving collaboration and standardization<br />Transparency and clarity presumably lead to higher quality of code<br />Actually, open source is the only legally feasible way of sharing program code across multiple short-lived projects, organizations and institutions<br />
  18. 18. Advantages of open source- the reality in bioinformatics<br />The 'open source hype' has lead to countless bioinformatics open source tools, some of them mere 'code dumps', others highly active collaborative projects, with or without a (large) user base, in many different programming languages<br />Many projects are interesting, but have no documentation, or the source code is in bad shape (lack of software engineering skills among bioinformaticians)<br />Abundance of non-interoperable web-enabled databases further complicates this picture<br />
  19. 19. Bikeshed argument<br />
  20. 20. Bikeshed argument<br />
  21. 21. Bikeshed argument<br />
  22. 22. Bikeshed argument<br />
  23. 23. Case study: Plone<br />An open source CMS (Content Management System) written in Python, nowadays backing thousands of production grade websites<br />Started by 2 developers in 2000, now an active open source project with hundreds of active developers<br />In 2004, the Plone Foundation was formed to formalize IP and secure the future of Plone<br />
  24. 24. Plone - how it works, legally<br />The Plone Foundation is owner of code and serves to protect and promote Plone. It guarantees that Plone will always be available under an OSI approved license<br />Developers willing to contribute to the Plone core need to sign a contributors agreement, transferring IP of their contributions to the Foundation. In return, they get irrevocable rights to use and distribute their contributions<br />The Foundation is governed by a Board of Directors, which is elected yearly by the members of the Foundation<br />Only 1 person is employed by the Foundation: Release Manager<br />
  25. 25. Geir Baekholt,past president of the Plone Foundation<br />"The Plone Foundation is a meritocracy"<br />
  26. 26. Plone, in practice<br />The active community of developers is the basis of the success of Plone. Some are hobbyists, but many also work for companies<br />There is a large Plone Collective (marketplace for Plone add-ons). Add-ons are not subject to the contributors agreement (although of course should comply with the Plone license if they import Plone code)<br />Contrary to popular belief, developers in open source projects such as Apache or Plone meet regularly face to face: ApacheCon, Plone Developer Days, sprints..<br />
  27. 27. Plone?<br />Plone Foundation, Apache Software Foundation, Open Bioinformatics Foundation etc.all have onecrucial detail: all members and the board are programmers<br />In Phenotype Foundation, we have 2 or even3 different types of peoplethat are bothstakeholders: programmers and scientists (and senior management)<br />
  28. 28. And how does Galaxy do it?<br />Penn State University + Emoryuniversity have the lead and are the only core committers!<br />Ifyou have suggestionsforimprovement of the code, youcansuggestthemusing a ‘pull request’<br />Creation of a ‘toolshed’ wherebioinformaticianscanuploadtheirtools<br />
  29. 29. Beware the Empty Chair!<br />Especially if we start integrating different projects, we have to make sure someone is responsible for maintaining a consistent end user experience. <br />The EmptyChair is a metaphorfrom the book<br />"Adrenaline Junkies and Template Zombies"<br />
  30. 30. Beware mañana<br />Any goal furtherawaythan 90 days is not urgent enough to do anythingabouttoday.<br />Sowhat is ournext step?<br />From "Adrenaline Junkies and Template Zombies"<br />
  31. 31. caBIG<br />
  32. 32. caBIG – WG recommendations<br />Create a ScientificAdvisory Group of scientific, technology and informatics experts<br />Focus onscientificmission, notoncreating a ‘software brand’<br />Semantic data integration (interoperability) is very important<br />Create a legalframeworkfor data sharing<br />
  33. 33. Conclusions – programmerperspective<br />At this moment, communicationbetweenprogrammersgoesreasonablywell, because the samepeople are involved<br />But we are notreadyfor the future<br />For outsiders, it is hard to findourcommunity and at this moment impossible to join without a lot of insideknowledge<br />We need to improvethisif we don’t want to end up on the ‘graveyard’!<br />
  34. 34. Foundation outline<br />Board of Directors<br />TechnicalCommunity<br />Board member(s) (liason to board)<br />Release manager<br />Database managers<br />Transfer of code ownership to the Foundation?<br />ScientificCommunity<br />Board members(s)<br />Product Liaisons<br />Frequent meetings<br />