SlideShare a Scribd company logo
From Laboratory to e-Laboratory? Introduction for ‘Lab-J’ of the LUMC Human Genetics Department Marco Roos Acknowledging the colleagues from BioSemantics, myGrid, OMII-UK, AID, The LUMC BioInformatics Expertise Centre
Introducing 2 Me
Liaison biology/bioinformatics – informatics 3 Biologist and bioinformatician, e-(bio)science researcher Coordinator BioSemantics group LeidenHuman Genetics Department Leiden University Medical Centre and Informatics Institute University of Amsterdam Project or Area Liaison (PAL) OMII-UK  Member BioAssist programme committee NBIC
also about 4 You
First about 5 Me
My C.V. before e-Sciencebefore 2003 6 Molecular & Cellular biology(MSc) microscopy and image analysis of chromosome structure ‘minor’ computer science Image analysis methods to measure DNA content in bull sperm cells(civil service) Chromatin structure & function(PhD molecular cytology) F.I.S.H., microscopy, image analysis, statistics 3-D chromosome structure during cell cycle (no luck) DNA movement in Escherichia coli(success) Human Transcriptome Map (post-doc) Gene expression to human genome sequence Analysis of regions of increased gene expression
MotivationStructure and function of DNA in the nucleus Escherichia coli Muntiacusmuntjak
8 Why bioinformatics? Lab-J suggests…
25/09/2009 BioAID 9 Bioinformatics A typical bioinformatician
25/09/2009 BioAID 10 Bioinformatics A biologist behind a computer who (just) learned perl
25/09/2009 BioAID 11 /*  * determines ridges in htm expression table */ #include "ridge.h" intselecthtm(PGconn *conn, char *htmtablename, char *chromname, PGresult *htmtable) { 	char querystring[256]; sprintf("SELECT * FROM %s WHERE chrom = %s ORDER BY genstart", htmtablename, chromname); htmtable = PQexec(conn, querystring); 	return(validquery(htmtable, querystring)); } intis_ridge(PGresult *htmtable, int row, double exprthreshold, intmincount) /* determines if mincount genes in a row are (part of) a ridge */ /* pre: htmtable is valid and sorted on genStart (ascending) /* post:  { 	if (mincount<=0) return TRUE; 	if (row>=PQntuples(htmtable)) return FALSE; 	if(PQgetvalue(htmtable, 0, PQfnumber(htmtable, "movmed39expr")) < exprthreshold) 	{ 	  return FALSE; 	} 	return(is_ridge(htmtable, ++row, exprthreshold, --mincount)); } int main() { PGconn	*conn;	/* holds database connection */ 	char querystring[256]; /* query string */ PGresult *result; inti; conn = PQconnectdb("dbname=htm port=6400 user=mroos password=geheim"); 	if (PQstatus(conn)==CONNECTION_BAD) 	{ fprintf(stderr, "connection to database failed."); fprintf(stderr, "%s", PQerrorMessage(conn)); 		exit(1); 	} 	else printf("Connection ok"); sprintf(querystring, "SELECT * FROM chromosomes"); printf("%s", querystring); 	result = PQexec(conn, querystring); 	if (validquery(result, querystring)) 	{ printresults(result); 	} 	else 	{ PQclear(result); PQfinish(conn); 		return FALSE; 	} PQclear(result); PQfinish(conn); 	return TRUE; } intprintresults(PGresult *tuples) { inti; 	for (i=0; i< PQntuples(tuples) && i < 10; i++) 	{ printf("%d, ", i); printf("%s", PQgetvalue(tuples,i,0)); 	} 	return TRUE; } intvalidquery(PGresult *result, char *querystring) { printf(" in validquery"); 	if (PQresultStatus(result) != PGRES_TUPLES_OK)  	{ printf("Query %s failed.", querystring); fprintf(stderr, "Query %s failed.", querystring); 		return FALSE; 	} 	return TRUE; }
State of the art applied computer science to a biologist 12
Why e-science? What is wrong with bioinformatics? 13 Human geneticists think…
Why should a biologist be interested in e-science? 14 BioAssistantsguessed… Involves Computation Interpretation of results Biology isn’t that interesting Reduce reinvention of the wheel Current lack of standards Sharing results Reshaping biology Synergy between different sciences Emerging Data driven science
15 Why e-Science? Lots of data to deal with Single tiny brain Lots of knowledge to deal with No computationalsuperpowers Lots of methodsand algorithms to try and combine Aneedy biologist
16 1070 databasesNucleic Acids Research Jan 2008(96 in Jan 2001) Proteomics, Genomics, Transcriptomics, Protein sequence prediction, Phenotypic studies, Phylogeny, Sequence analysis, Protein Structure prediction, Protein-protein interaction, Metabolomics, Model organism collections, Systems Biology, Epidemiology, etcetera … All with a splendid interface 	… all different, of course
25/09/2009 17 Traditional data integration in bioinformatics Local Database Local Database
18 The ‘spaghetti’ approach
Some of my observations Reinvention How many reannotation pipelines do you need? Little reuse of components Reproducibility Black boxes  Emphasis not on clarity Can we understand bioinformatics as wet lab protocols? Focus on technicalities, not biological analysis Should bioinformaticians write ‘job submission’ scripts? Data graveyards Do we need >1000 databases? Can we understand our own data? 19
How did I end up here? 20 Marco Roos Biologist and bioinformatician, Post-doc e-(bio)scienceHuman Genetics Department Leiden University Medical Centre and Informatics Institute University of Amsterdam Project or Area Liaison (PAL) OMII-UK  Member BioAssist programme committee NBIC
Some examples from field of e-Science 21
Enhancement 1: Workflows(Taverna workflow) 22
Enhancement 2: exploiting brains 23
Exploiting Brains By Web Servicessource: http://biocatalogue.org(launched at ISMB2009) 24 >1000 annotated services, >3000 known to Taverna Includes BioMart, R, Text mining, Kegg, NCBI Pubmed, Ensembl, etc. Web Services run remotely
25 Exploiting more brains by sharing workflowssource: http://myExperiment.org Social community web site for scientists 2300 registered users in two years 750 workflows
Bioinformatics and e-science Customized experiments with reusable components Single purpose,single person, black box application My component Your component My component My component Your component
What do we know of our data? 27 Sufficient? ,[object Object]
Query across experiment?
Fit biological modelling?
Good basis for new experiments?
Flexible enough?,[object Object]
Model based data integrationExample: UCSC genome browser partOf
Semantic Web (Linked Open Data) 30
31 Empower me with a ‘virtual brain’ * My ws Your ws My ws My ws Your ws * From P.J. Verschure, Journal of Cellular Biochemistry 2006, vol. 99(1), pg 23-34
32 Query Add query to semantic model Retrieve documents from Medline Add documents (IDs) to semantic model Extract proteins (Homo sapiens) Add proteins to semantic model Calculate ranking scores Add scores to semantic model Create biological cross references Add cross references to semantic model Convert to table (html) Workflow and Semantic Web
Concept web from a users point of view 33
34 e-Laboratories and e-Laboratory factories
e-Galaxy for NBIC 35 ,[object Object]
Workflows & Web Services
Grid enabled Taverna
MOLGENIS
Semantic/Concept Web
myExperiment/BioCatalogue
Scientific Research ObjectsVacancy! (software engineer)
SRO = a pack of models - Tool models - Data/ui models - Flow models +Attached data SRO enactment = a running e-laboratory Tools Uses tools services Model SROs my protocols my data my protocols my data Portal to workflows 2.0 mashup data  Flows mashup tools e-biologist e-bioinformatician Uses data services Portal to workflows Data   programmatic interaction user interfacing

More Related Content

What's hot

Kanterakis bosc2010 molgenis
Kanterakis bosc2010 molgenisKanterakis bosc2010 molgenis
Kanterakis bosc2010 molgenisBOSC 2010
 
Venkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitVenkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitBOSC 2010
 
Swertz bosc2010 molgenis
Swertz bosc2010 molgenisSwertz bosc2010 molgenis
Swertz bosc2010 molgenisBOSC 2010
 
Rice bosc2010 emboss
Rice bosc2010 embossRice bosc2010 emboss
Rice bosc2010 embossBOSC 2010
 
Mercer bosc2010 microsoft_framework
Mercer bosc2010 microsoft_frameworkMercer bosc2010 microsoft_framework
Mercer bosc2010 microsoft_frameworkBOSC 2010
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsDuncan Hull
 
Mid-Ontology Learning from Linked Data @JIST2011
Mid-Ontology Learning from Linked Data @JIST2011Mid-Ontology Learning from Linked Data @JIST2011
Mid-Ontology Learning from Linked Data @JIST2011Lihua Zhao
 
Instance-Based Ontological Knowledge Acquisition
Instance-Based Ontological Knowledge AcquisitionInstance-Based Ontological Knowledge Acquisition
Instance-Based Ontological Knowledge AcquisitionLihua Zhao
 
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksResults Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksCarole Goble
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsCarole Goble
 
All together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of lifeAll together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of lifeChris Mungall
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyMelanie Courtot
 

What's hot (20)

Kanterakis bosc2010 molgenis
Kanterakis bosc2010 molgenisKanterakis bosc2010 molgenis
Kanterakis bosc2010 molgenis
 
Venkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkitVenkatesan bosc2010 onto-toolkit
Venkatesan bosc2010 onto-toolkit
 
Swertz bosc2010 molgenis
Swertz bosc2010 molgenisSwertz bosc2010 molgenis
Swertz bosc2010 molgenis
 
Rice bosc2010 emboss
Rice bosc2010 embossRice bosc2010 emboss
Rice bosc2010 emboss
 
Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...Chemistry Online and The vision and challenges associated with building the c...
Chemistry Online and The vision and challenges associated with building the c...
 
Mercer bosc2010 microsoft_framework
Mercer bosc2010 microsoft_frameworkMercer bosc2010 microsoft_framework
Mercer bosc2010 microsoft_framework
 
The Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of BioinformaticsThe Seven Deadly Sins of Bioinformatics
The Seven Deadly Sins of Bioinformatics
 
The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...
The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...
The Crop Ontology - Harmonizing Semantics for Agricultural Field Data, by Eli...
 
2015_CV_J_SHELTON_linked
2015_CV_J_SHELTON_linked2015_CV_J_SHELTON_linked
2015_CV_J_SHELTON_linked
 
Mid-Ontology Learning from Linked Data @JIST2011
Mid-Ontology Learning from Linked Data @JIST2011Mid-Ontology Learning from Linked Data @JIST2011
Mid-Ontology Learning from Linked Data @JIST2011
 
NETTAB 2013
NETTAB 2013NETTAB 2013
NETTAB 2013
 
Instance-Based Ontological Knowledge Acquisition
Instance-Based Ontological Knowledge AcquisitionInstance-Based Ontological Knowledge Acquisition
Instance-Based Ontological Knowledge Acquisition
 
Semantic (Web) Technologies for Translational Research in Life Sciences
Semantic (Web) Technologies for Translational Research in Life SciencesSemantic (Web) Technologies for Translational Research in Life Sciences
Semantic (Web) Technologies for Translational Research in Life Sciences
 
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object FrameworksResults Vary: The Pragmatics of Reproducibility and Research Object Frameworks
Results Vary: The Pragmatics of Reproducibility and Research Object Frameworks
 
2015 illinois-talk
2015 illinois-talk2015 illinois-talk
2015 illinois-talk
 
OpenTox Europe 2013
OpenTox Europe 2013OpenTox Europe 2013
OpenTox Europe 2013
 
B.3.5
B.3.5B.3.5
B.3.5
 
RARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research ObjectsRARE and FAIR Science: Reproducibility and Research Objects
RARE and FAIR Science: Reproducibility and Research Objects
 
All together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of lifeAll together now: piecing together the knowledge graph of life
All together now: piecing together the knowledge graph of life
 
Ontologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontologyOntologies for life sciences: examples from the gene ontology
Ontologies for life sciences: examples from the gene ontology
 

Similar to From Laboratory to e-Laboratory

Services For Science April 2009
Services For Science April 2009Services For Science April 2009
Services For Science April 2009Ian Foster
 
Computation and Knowledge
Computation and KnowledgeComputation and Knowledge
Computation and KnowledgeIan Foster
 
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...Carole Goble
 
Ingredients for Semantic Sensor Networks
Ingredients for Semantic Sensor NetworksIngredients for Semantic Sensor Networks
Ingredients for Semantic Sensor NetworksOscar Corcho
 
Scientific Workflows: what do we have, what do we miss?
Scientific Workflows: what do we have, what do we miss?Scientific Workflows: what do we have, what do we miss?
Scientific Workflows: what do we have, what do we miss?Paolo Romano
 
Open-source from/in the enterprise: the RDKit
Open-source from/in the enterprise: the RDKitOpen-source from/in the enterprise: the RDKit
Open-source from/in the enterprise: the RDKitGreg Landrum
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
 
Reproducible Workflow with Cytoscape and Jupyter Notebook
Reproducible Workflow with Cytoscape and Jupyter NotebookReproducible Workflow with Cytoscape and Jupyter Notebook
Reproducible Workflow with Cytoscape and Jupyter NotebookKeiichiro Ono
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTXTaverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTXStian Soiland-Reyes
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)Stian Soiland-Reyes
 
OREChem Services and Workflows
OREChem Services and WorkflowsOREChem Services and Workflows
OREChem Services and Workflowsmarpierc
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchDavid Ruau
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects Carole Goble
 
Introduction to Biological Network Analysis and Visualization with Cytoscape ...
Introduction to Biological Network Analysis and Visualization with Cytoscape ...Introduction to Biological Network Analysis and Visualization with Cytoscape ...
Introduction to Biological Network Analysis and Visualization with Cytoscape ...Keiichiro Ono
 

Similar to From Laboratory to e-Laboratory (20)

'A PAL's Life' for OMII-UK Board, May 2008
'A PAL's Life' for OMII-UK Board, May 2008'A PAL's Life' for OMII-UK Board, May 2008
'A PAL's Life' for OMII-UK Board, May 2008
 
Services For Science April 2009
Services For Science April 2009Services For Science April 2009
Services For Science April 2009
 
DCC Keynote 2007
DCC Keynote 2007DCC Keynote 2007
DCC Keynote 2007
 
Computation and Knowledge
Computation and KnowledgeComputation and Knowledge
Computation and Knowledge
 
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
 
Pine education-platform
Pine education-platformPine education-platform
Pine education-platform
 
Ingredients for Semantic Sensor Networks
Ingredients for Semantic Sensor NetworksIngredients for Semantic Sensor Networks
Ingredients for Semantic Sensor Networks
 
Scientific Workflows: what do we have, what do we miss?
Scientific Workflows: what do we have, what do we miss?Scientific Workflows: what do we have, what do we miss?
Scientific Workflows: what do we have, what do we miss?
 
Open-source from/in the enterprise: the RDKit
Open-source from/in the enterprise: the RDKitOpen-source from/in the enterprise: the RDKit
Open-source from/in the enterprise: the RDKit
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Reproducible Workflow with Cytoscape and Jupyter Notebook
Reproducible Workflow with Cytoscape and Jupyter NotebookReproducible Workflow with Cytoscape and Jupyter Notebook
Reproducible Workflow with Cytoscape and Jupyter Notebook
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTXTaverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
Taverna workflow management system (2010 11-30 Bath Workflow Tools) PPTX
 
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)Taverna workflow management system (2010 11-30 Bath Workflow Tools)
Taverna workflow management system (2010 11-30 Bath Workflow Tools)
 
Feasting On Brains With Taverna Public
Feasting On Brains With Taverna PublicFeasting On Brains With Taverna Public
Feasting On Brains With Taverna Public
 
OREChem Services and Workflows
OREChem Services and WorkflowsOREChem Services and Workflows
OREChem Services and Workflows
 
Overview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data AnalysisOverview of Next Gen Sequencing Data Analysis
Overview of Next Gen Sequencing Data Analysis
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
Cool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical ResearchCool Informatics Tools and Services for Biomedical Research
Cool Informatics Tools and Services for Biomedical Research
 
The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
 
Introduction to Biological Network Analysis and Visualization with Cytoscape ...
Introduction to Biological Network Analysis and Visualization with Cytoscape ...Introduction to Biological Network Analysis and Visualization with Cytoscape ...
Introduction to Biological Network Analysis and Visualization with Cytoscape ...
 

More from Leiden University Medical Center

More from Leiden University Medical Center (10)

Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation
Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation
Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation
 
Linked Data and Ontology Tutorial (for RD-Connect)
Linked Data and Ontology Tutorial (for RD-Connect)Linked Data and Ontology Tutorial (for RD-Connect)
Linked Data and Ontology Tutorial (for RD-Connect)
 
Nanopubs strong to_weak_semantics_vs_machine_readability
Nanopubs strong to_weak_semantics_vs_machine_readabilityNanopubs strong to_weak_semantics_vs_machine_readability
Nanopubs strong to_weak_semantics_vs_machine_readability
 
Data models for preserving and publishing digital research material beyond th...
Data models for preserving and publishing digital research material beyond th...Data models for preserving and publishing digital research material beyond th...
Data models for preserving and publishing digital research material beyond th...
 
Feasting onbrainswithworkflows
Feasting onbrainswithworkflowsFeasting onbrainswithworkflows
Feasting onbrainswithworkflows
 
Enabling Collaborative Biobank Research with feedback from audience
Enabling Collaborative Biobank Research with feedback from audienceEnabling Collaborative Biobank Research with feedback from audience
Enabling Collaborative Biobank Research with feedback from audience
 
CWA & SWAT4LS Pitch at DILS2009
CWA & SWAT4LS Pitch at DILS2009CWA & SWAT4LS Pitch at DILS2009
CWA & SWAT4LS Pitch at DILS2009
 
Demo Presentation Wageningen Text Mining Workshop 2007
Demo Presentation Wageningen Text Mining Workshop 2007Demo Presentation Wageningen Text Mining Workshop 2007
Demo Presentation Wageningen Text Mining Workshop 2007
 
Demo Presentation ISMB/ECCB 2007
Demo Presentation ISMB/ECCB 2007Demo Presentation ISMB/ECCB 2007
Demo Presentation ISMB/ECCB 2007
 
E Science4 Chromatin Research
E Science4 Chromatin ResearchE Science4 Chromatin Research
E Science4 Chromatin Research
 

Recently uploaded

NLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxNLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxssuserbdd3e8
 
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptBasic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptSourabh Kumar
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptxJosvitaDsouza2
 
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...SachinKumar945617
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXMIRIAMSALINAS13
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasGeoBlogs
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfSpecial education needs
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPCeline George
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxEduSkills OECD
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismDeeptiGupta154
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxRaedMohamed3
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...Nguyen Thanh Tu Collection
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chipsGeoBlogs
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfThiyagu K
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativePeter Windle
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345beazzy04
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfkaushalkr1407
 

Recently uploaded (20)

NCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdfNCERT Solutions Power Sharing Class 10 Notes pdf
NCERT Solutions Power Sharing Class 10 Notes pdf
 
NLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptxNLC-2024-Orientation-for-RO-SDO (1).pptx
NLC-2024-Orientation-for-RO-SDO (1).pptx
 
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.pptBasic_QTL_Marker-assisted_Selection_Sourabh.ppt
Basic_QTL_Marker-assisted_Selection_Sourabh.ppt
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
Extraction Of Natural Dye From Beetroot (Beta Vulgaris) And Preparation Of He...
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
 
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptxStudents, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
Students, digital devices and success - Andreas Schleicher - 27 May 2024..pptx
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
GIÁO ÁN DẠY THÊM (KẾ HOẠCH BÀI BUỔI 2) - TIẾNG ANH 8 GLOBAL SUCCESS (2 CỘT) N...
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
Unit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdfUnit 8 - Information and Communication Technology (Paper I).pdf
Unit 8 - Information and Communication Technology (Paper I).pdf
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
The Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdfThe Roman Empire A Historical Colossus.pdf
The Roman Empire A Historical Colossus.pdf
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 

From Laboratory to e-Laboratory

  • 1. From Laboratory to e-Laboratory? Introduction for ‘Lab-J’ of the LUMC Human Genetics Department Marco Roos Acknowledging the colleagues from BioSemantics, myGrid, OMII-UK, AID, The LUMC BioInformatics Expertise Centre
  • 3. Liaison biology/bioinformatics – informatics 3 Biologist and bioinformatician, e-(bio)science researcher Coordinator BioSemantics group LeidenHuman Genetics Department Leiden University Medical Centre and Informatics Institute University of Amsterdam Project or Area Liaison (PAL) OMII-UK Member BioAssist programme committee NBIC
  • 6. My C.V. before e-Sciencebefore 2003 6 Molecular & Cellular biology(MSc) microscopy and image analysis of chromosome structure ‘minor’ computer science Image analysis methods to measure DNA content in bull sperm cells(civil service) Chromatin structure & function(PhD molecular cytology) F.I.S.H., microscopy, image analysis, statistics 3-D chromosome structure during cell cycle (no luck) DNA movement in Escherichia coli(success) Human Transcriptome Map (post-doc) Gene expression to human genome sequence Analysis of regions of increased gene expression
  • 7. MotivationStructure and function of DNA in the nucleus Escherichia coli Muntiacusmuntjak
  • 8. 8 Why bioinformatics? Lab-J suggests…
  • 9. 25/09/2009 BioAID 9 Bioinformatics A typical bioinformatician
  • 10. 25/09/2009 BioAID 10 Bioinformatics A biologist behind a computer who (just) learned perl
  • 11. 25/09/2009 BioAID 11 /* * determines ridges in htm expression table */ #include "ridge.h" intselecthtm(PGconn *conn, char *htmtablename, char *chromname, PGresult *htmtable) { char querystring[256]; sprintf("SELECT * FROM %s WHERE chrom = %s ORDER BY genstart", htmtablename, chromname); htmtable = PQexec(conn, querystring); return(validquery(htmtable, querystring)); } intis_ridge(PGresult *htmtable, int row, double exprthreshold, intmincount) /* determines if mincount genes in a row are (part of) a ridge */ /* pre: htmtable is valid and sorted on genStart (ascending) /* post: { if (mincount<=0) return TRUE; if (row>=PQntuples(htmtable)) return FALSE; if(PQgetvalue(htmtable, 0, PQfnumber(htmtable, "movmed39expr")) < exprthreshold) { return FALSE; } return(is_ridge(htmtable, ++row, exprthreshold, --mincount)); } int main() { PGconn *conn; /* holds database connection */ char querystring[256]; /* query string */ PGresult *result; inti; conn = PQconnectdb("dbname=htm port=6400 user=mroos password=geheim"); if (PQstatus(conn)==CONNECTION_BAD) { fprintf(stderr, "connection to database failed."); fprintf(stderr, "%s", PQerrorMessage(conn)); exit(1); } else printf("Connection ok"); sprintf(querystring, "SELECT * FROM chromosomes"); printf("%s", querystring); result = PQexec(conn, querystring); if (validquery(result, querystring)) { printresults(result); } else { PQclear(result); PQfinish(conn); return FALSE; } PQclear(result); PQfinish(conn); return TRUE; } intprintresults(PGresult *tuples) { inti; for (i=0; i< PQntuples(tuples) && i < 10; i++) { printf("%d, ", i); printf("%s", PQgetvalue(tuples,i,0)); } return TRUE; } intvalidquery(PGresult *result, char *querystring) { printf(" in validquery"); if (PQresultStatus(result) != PGRES_TUPLES_OK) { printf("Query %s failed.", querystring); fprintf(stderr, "Query %s failed.", querystring); return FALSE; } return TRUE; }
  • 12. State of the art applied computer science to a biologist 12
  • 13. Why e-science? What is wrong with bioinformatics? 13 Human geneticists think…
  • 14. Why should a biologist be interested in e-science? 14 BioAssistantsguessed… Involves Computation Interpretation of results Biology isn’t that interesting Reduce reinvention of the wheel Current lack of standards Sharing results Reshaping biology Synergy between different sciences Emerging Data driven science
  • 15. 15 Why e-Science? Lots of data to deal with Single tiny brain Lots of knowledge to deal with No computationalsuperpowers Lots of methodsand algorithms to try and combine Aneedy biologist
  • 16. 16 1070 databasesNucleic Acids Research Jan 2008(96 in Jan 2001) Proteomics, Genomics, Transcriptomics, Protein sequence prediction, Phenotypic studies, Phylogeny, Sequence analysis, Protein Structure prediction, Protein-protein interaction, Metabolomics, Model organism collections, Systems Biology, Epidemiology, etcetera … All with a splendid interface … all different, of course
  • 17. 25/09/2009 17 Traditional data integration in bioinformatics Local Database Local Database
  • 19. Some of my observations Reinvention How many reannotation pipelines do you need? Little reuse of components Reproducibility Black boxes Emphasis not on clarity Can we understand bioinformatics as wet lab protocols? Focus on technicalities, not biological analysis Should bioinformaticians write ‘job submission’ scripts? Data graveyards Do we need >1000 databases? Can we understand our own data? 19
  • 20. How did I end up here? 20 Marco Roos Biologist and bioinformatician, Post-doc e-(bio)scienceHuman Genetics Department Leiden University Medical Centre and Informatics Institute University of Amsterdam Project or Area Liaison (PAL) OMII-UK Member BioAssist programme committee NBIC
  • 21. Some examples from field of e-Science 21
  • 24. Exploiting Brains By Web Servicessource: http://biocatalogue.org(launched at ISMB2009) 24 >1000 annotated services, >3000 known to Taverna Includes BioMart, R, Text mining, Kegg, NCBI Pubmed, Ensembl, etc. Web Services run remotely
  • 25. 25 Exploiting more brains by sharing workflowssource: http://myExperiment.org Social community web site for scientists 2300 registered users in two years 750 workflows
  • 26. Bioinformatics and e-science Customized experiments with reusable components Single purpose,single person, black box application My component Your component My component My component Your component
  • 27.
  • 30. Good basis for new experiments?
  • 31.
  • 32. Model based data integrationExample: UCSC genome browser partOf
  • 33. Semantic Web (Linked Open Data) 30
  • 34. 31 Empower me with a ‘virtual brain’ * My ws Your ws My ws My ws Your ws * From P.J. Verschure, Journal of Cellular Biochemistry 2006, vol. 99(1), pg 23-34
  • 35. 32 Query Add query to semantic model Retrieve documents from Medline Add documents (IDs) to semantic model Extract proteins (Homo sapiens) Add proteins to semantic model Calculate ranking scores Add scores to semantic model Create biological cross references Add cross references to semantic model Convert to table (html) Workflow and Semantic Web
  • 36. Concept web from a users point of view 33
  • 37. 34 e-Laboratories and e-Laboratory factories
  • 38.
  • 39. Workflows & Web Services
  • 44. Scientific Research ObjectsVacancy! (software engineer)
  • 45. SRO = a pack of models - Tool models - Data/ui models - Flow models +Attached data SRO enactment = a running e-laboratory Tools Uses tools services Model SROs my protocols my data my protocols my data Portal to workflows 2.0 mashup data Flows mashup tools e-biologist e-bioinformatician Uses data services Portal to workflows Data programmatic interaction user interfacing
  • 46. e-Galaxy mock-up 37 Suggestions by semantic components Your Scientific Research Object Underlying workflow Related research and documents Adlsjfladjslfadsflkjalfdadsf Adfljadlfkjaladlfjlakdjflkjadf Adflkjlakjlkjadsflakdfjlfladoioewn Jlakdsfooiuwfjaoijaoisdflvoaijdf MOLGENIS Convert Import/Export Research Objects Store Configure Run
  • 47. e-Science requirement: Reuse 38 E-Laboratorycomponent
  • 49. Research and development aims Automated support for hypothesis formation E.g. on epigenetic mechanisms Apply Workflow, Semantic Web, Concept Web Concept-based meta-analysis Automated triple creation from computational analysis 40
  • 50. Research and development ambitions Co-develop e-Laboratories e-Galaxy epiGenius BioBanking Help BEC with support environment Concept Web services Web services E-Laboratory components Transparent creation of triples Personal semantic repositories 41
  • 51. Liaison Bioinformatics Expertise Centre LUMC Statistical and computer science expertise Generic support NBIC BioAssist core software development Grid tools, Concept Web, e-Labs BioSemantics Rotterdam Text mining Concept profile meta-analysis AIDUniversity of Amsterdam e-Science experts Grid tools You? OMII-UK Manchester, Southampton, Edinburgh (ca. 30 engineers) Taverna, myExperiment, e-Labs Concept Web Content, tools and infrastructure W3C Health Care & Life Sciences Interest Group Semantic Web experts Linked Open Data
  • 52. ‘e’ for enhance, not enforce Please help me to help you Register for: http://snipurl.com/biosemanticsusers (http://www.myexperiment.org/groups/211) Allows me to Give you preferential treatment Not spam everybody Keep you informed Ask your opinion (user driven development!) 43
  • 53. Visit the BioSemantics web sitehttp://www.biosemantics.org/ 44
  • 54. Word of warning Computer scientists are scientists too! Need to publish Score by papers, not by software Addressed by OMII-UK and BioAssist Compare “How can I use it in the clinic?” “How can I use it in the lab?” 45
  • 55. Dissemination Come by for help or information Internal ‘mini-courses’? Send me suggestions! FYI: Course ‘Managing Life Science Information’ for PhD students, 2010 46
  • 56. Key points Liaisingbetween technology contacts and you, the colleagues of Human Genetics. No obligationsTry any new developments that we are involved in with our help, but don't feel obliged. Help us help youExpress your wishes, problems, try things and give feedback – and be patient sometimes Please join the biosemantics users group on myExperiment.org to help us communicate. 47
  • 57. 48 Thank you for your attention Lots of accessible data Communitybrain power Knowledge basesto query Other people’scomputationalsuperpowers Web Services, Workflows, and their creatorsavailable Anenhanced biologist Homo biologicusenhancis