Feasting On Brains With Taverna Public

Loading...

Flash Player 9 (or above) is needed to view presentations.
We have detected that you do not have it on your computer. To install it, go here.

0 comments

Post a comment

    Post a comment
    Embed Video
    Edit your comment Cancel

    Notes on slide 1

    E-science presentation Manchester September 25 2006 06/10/09

    Favorites, Groups & Events

    Feasting On Brains With Taverna Public - Presentation Transcript

    1. Feasting on Brains with Taverna Tutorial and demonstration by Marco Roos acknowledging Carole Goble, Dave de Roure, Alan Williams, Jiten Bhagat, Katy Wolstencroft, Martijn Schuemie, Edgar Meij, Sophia Katrenko, Willem van Hage, Scott Marshall, Pieter Adriaans, NBIC, OMII-UK, the myGrid team
    2. Feasting on your brain!
        • Please help me by filling out the form
      • preferably on
        • http://www.tinyurl.com/TavernaBrains
    3. What can Taverna do for me?
      • Benelux bioinformaticians think…
    4. Introducing myself A biologist
    5. My prime interest Structure and function of DNA in the nucleus Escherichia coli Mouse fibroblast (skin) cells
    6. How did I end up here?
        • Marco Roos
        • Biologist and bioinformatician (e-bioscientist) at the Informatics Institute, University of Amsterdam (BioRange/VL-e)
        • Project or Area Liaison (PAL) OMII-UK/myGrid
        • Member BioAssist programme committee NBIC
        • Member UK All Hands e-Science Foundation
    7. Components controlling structure & function of DNA
    8. Connecting the dots (example: protein interaction network in yeast)
    9. 1070 databases Nucleic Acids Research Jan 2008 (96 in Jan 2001)
      • Proteomics, Genomics, Transcriptomics, Protein sequence prediction, Phenotypic studies, Phylogeny, Sequence analysis, Protein Structure prediction, Protein-protein interaction, Metabolomics, Model organism collections, Systems Biology, Epidemiology, etcetera …
      All with a splendid interface … all different, of course
    10. A typical biologist… A needy biologist Tiny brain Lots of data to deal with Lots of methods and algorithms to try and combine No computational superpowers Lots of knowledge to deal with
    11. Start at the beginning I have a computational question…
    12. ‘ Old school’ Bioinformatics A typical bioinformatician
    13. ‘ Old school’ Bioinformatics A biologist behind a computer who (just) learned perl
    14. /* * determines ridges in htm expression table */ #include &quot;ridge.h&quot; int selecthtm(PGconn *conn, char *htmtablename, char *chromname, PGresult *htmtable) { char querystring[256]; sprintf(&quot;SELECT * FROM %s WHERE chrom = %s ORDER BY genstart&quot;, htmtablename, chromname); htmtable = PQexec(conn, querystring); return(validquery(htmtable, querystring)); } int is_ridge(PGresult *htmtable, int row, double exprthreshold, int mincount) /* determines if mincount genes in a row are (part of) a ridge */ /* pre: htmtable is valid and sorted on genStart (ascending) /* post: { if (mincount<=0) return TRUE; if (row>=PQntuples(htmtable)) return FALSE; if(PQgetvalue(htmtable, 0, PQfnumber(htmtable, &quot;movmed39expr&quot;)) < exprthreshold) { return FALSE; } return(is_ridge(htmtable, ++row, exprthreshold, --mincount)); } int main() { PGconn *conn; /* holds database connection */ char querystring[256]; /* query string */ PGresult *result; int i; conn = PQconnectdb(&quot;dbname=htm port=6400 user=mroos password=geheim&quot;); if (PQstatus(conn)==CONNECTION_BAD) { fprintf(stderr, &quot;connection to database failed. &quot;); fprintf(stderr, &quot;%s&quot;, PQerrorMessage(conn)); exit(1); } else printf(&quot;Connection ok &quot;); sprintf(querystring, &quot;SELECT * FROM chromosomes&quot;); printf(&quot;%s &quot;, querystring); result = PQexec(conn, querystring); if (validquery(result, querystring)) { printresults(result); } else { PQclear(result); PQfinish(conn); return FALSE; } PQclear(result); PQfinish(conn); return TRUE; } int printresults(PGresult *tuples) { int i; for (i=0; i< PQntuples(tuples) && i < 10; i++) { printf(&quot;%d, &quot;, i); printf(&quot;%s &quot;, PQgetvalue(tuples,i,0)); } return TRUE; } int validquery(PGresult *result, char *querystring) { printf(&quot; in validquery &quot;); if (PQresultStatus(result) != PGRES_TUPLES_OK) { printf(&quot;Query %s failed. &quot;, querystring); fprintf(stderr, &quot;Query %s failed. &quot;, querystring); return FALSE; } return TRUE; }
    15. The ‘spaghetti’ approach
    16. Computational tools graveyard rephrasing David Shotton
    17. Database survival: <20% ‘no problems’
    18. Data graveyard quoting David Shotton
    19. Old school bioinformatics for biologists
      • Lots of data, knowledge, and methods to deal with
      • Bioinformaticians make spaghetti and graveyards
    20. e -Science? ‘ enhanced science’ Research and development for enhancing science
    21. What about…
      • e -Science?
      • ‘ enhanced science’
      • Research and development from the field of computer science to enhance science with their methodologies
    22. Which diseases are associated with my protein of interest ‘EZH2’
    23. Biological knowledge extraction Biological question/model Computational experiment Extracted knowledge >17 million citations +400,000/yr
    24. An e-science approach
      • Combining expertise
      • Collaborating and sharing
      • Technology
    25. Combining expertise Edgar Meij Information retrieval expert
    26. Combining expertise Sophia Katrenko Machine learning expert
    27. Combining expertise Willem van Hage Semantic web expert (and bass guitar player)
    28. Combining expertise Towards a knowledge framework Computer scientist and bioinformatician Scott Marshall
    29. The AIDA toolbox, Web Services for knowledge extraction and knowledge management
    30. e -Science collaboration AIDA toolbox
    31. “ Collaboration through Web Services” Bio-text mining expert BioSemantics group, Erasmus University Rotterdam Martijn Schuemie
    32. “ Collaboration through Web Services” Biological Database expert Hideaki Sugawara
    33. “ Collaboration through Web Services” e -bioscientist
    34. A nice experiment design
    35. A not so nice experiment design
    36. A workflow Protocol for a computational experiment
    37. 10/06/09 BioAID
    38. 10/06/09 BioAID
    39. Sharing and publishing my designs
    40. Bio AID Disease Discovery workflow 10/06/09 BioAID AIDA AIDA OMIM service (Japan) AIDA ‘ Taverna shim’ Taverna ‘shim’
    41. Bio AID Disease discovery workflow 10/06/09 BioAID
    42. Bio AID Disease discovery workflow 10/06/09 BioAID
    43. An insightful computational experiment
    44. Feasting on brilliant brains with Taverna! Want this…
    45. Feasting on brilliant brains with Taverna! … need this
    46. How many brains do you want to use? – One?
    47. Some?
    48. Many? Some statistics >4000 Taverna users >3000 Web Services in Taverna >400 workflows on myExperiment >1000 registered myExperiment users in one year
    49. myExperiment users
    50. What can Taverna do for me?
      • Please now fill out questions 4 - 7
      • Benelux bioinformaticians think…
    51. Take home lesson
      • Use the Taverna-users mailing list! (subscribe via http://taverna.sourceforge.net )
      • and
      • Use the BioAssist-users mailing list
      • (subscribe via nbic)
    52. Wrong reasons to use Taverna
      • Creating workflows is easy
        • No, experiment design is difficult, Taverna ones are insightful
      • It is faster than scripts
        • No, scripts will be faster
      • It solves bioinformatics data conversion issues
        • No, ‘shims’ are the most common parts of workflows; several attempt are made to improve their availability.
      • Workflows are reproducible
        • Not guaranteed, it depends on the services that are outside Taverna’s controle, but it will be addressed by BioCatalogue
      • It is for biologists
        • No, Taverna is for bioinformaticians (or specially trained biologists)
      • I can feast on brains without giving credit
        • No, you probably violate the Science Commons license provided by myExperiment
      On line
    53. Reasons to not use (Taverna) workflows
      • You want to do it all by yourself
      • You want your colleagues and peers to depend on your code and you as the only one who understands it
      • Speed is all you care about
      • You expect no-one will ever want to (re)use or extend your work
      • If you think a data integration experiment requires integrated data (do you really need yet another warehouse?)
    54. For feasting on brains with Semantic Web formats and tools (and workflows) please visit poster 31
    55. End of presentation...
      • Thank you
      • http://adaptivedisclosure.org

    custom

    205 views, 0 favs, 0 embeds more stats

    Tutorial and demonstration of AIDA and Taverna for more

    More info about this document

    © All Rights Reserved

    Go to text version

    • Total Views 205
      • 205 on SlideShare
      • 0 from embeds
    • Comments 0
    • Favorites 0
    • Downloads 0
    Most viewed embeds

    more

    All embeds

    less

    Flagged as inappropriate Flag as inappropriate
    Flag as inappropriate

    Select your reason for flagging this presentation as inappropriate. If needed, use the feedback form to let us know more details.

    Cancel
    File a copyright complaint
    Having problems? Go to our helpdesk?

    Categories