SlideShare a Scribd company logo
1 of 34
MOLGENIS  and the eXtensible Genotype And Phenotype database project (xgap) Morris A. Swertz et al DAM & BOSC sigs Stockholm, June 27 2009  EBI Biobanking platform
Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
MOLGENIS, why and how biologist biologist biological challenges inbreed 100 10.000 1,000,000 100,000 10,000 10 10,000,00 QTL profiles network correlate genome strains individuals markers expressions preprocess probes microarrays 100 hybridize 100,000 genotype genotypes norm exprs. map
MOLGENIS, why and how biologist biologist biological challenges suitable infrastructure inbreed 100 10.000 1,000,000 100,000 10,000 10 10,000,00 QTL profiles network correlate genome strains individuals markers expressions preprocess probes microarrays 100 hybridize 100,000 genotype genotypes norm exprs. map bioinformatician softw engineers €
MOLGENIS, why and how biologist biologist Reinventing wheels, Wasting time Hard to integrate biological challenges suitable infrastructure bioinformatician softw engineers inbreed 100 100.000 10,000,000 1000 10,000 10 1000 genotype individuals mass peaks genotypes QTL profiles strains network SNP arrays correlate LC/MS genome map preprocess aligned peaks €
Alternative strategy http://www.molgenis.org Swertz & Jansen (2007)  Nature Reviews Genetics  8, 235-243 http://www.molgenis.org
MOLGENIS, why and how Platform and generators Little language <!-- entity organization --> <entity   name= &quot;Experiment&quot;   label= &quot;Experiment&quot; > <field   name= &quot;ExperimentID&quot;   key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field   name= &quot;Medium&quot;  type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; />   />  <field   name= &quot;Protocol&quot;  label= &quot;Experiment Protocol&quot; />  <field   name= &quot;Temperature&quot;   type= &quot;int&quot; Blueprint model <!-- entity organization --> <entity   name= &quot;Experiment&quot;   label= &quot;Experiment&quot; > <field   name= &quot;ExperimentID&quot;   key= &quot;1“ r eadonly= &quot;true&quot; label= &quot; ExperimentID(autonum )&quot;/> <field   name= &quot;Medium&quot;  type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; />   />  <field   name= &quot;Protocol&quot;  label= &quot;Experiment Protocol&quot; />  <field   name= &quot;Temperature&quot;   type= &quot;int&quot; + bioinformatician softw engineer inbreed 100 10.000 1,000,000 100,000 10,000 10 10,000,00 QTL profiles network correlate genome strains individuals markers expressions preprocess probes microarrays 100 hybridize 100,000 genotype genotypes norm exprs. 10.000 map biologist biologist http://www.molgenis.org Swertz & Jansen (2007)  Nature Reviews Genetics  8, 235-243
Upgrade to new research Platform and generators Little language <!-- entity organization --> <entity   name= &quot;Experiment&quot;   label= &quot;Experiment&quot; > <field   name= &quot;ExperimentID&quot;   key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field   name= &quot;Medium&quot;  type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; />   />  <field   name= &quot;Protocol&quot;  label= &quot;Experiment Protocol&quot; />  <field   name= &quot;Temperature&quot;   type= &quot;int&quot; Blueprint model <!-- entity organization --> <entity   name= &quot;Experiment&quot;   label= &quot;Experiment&quot; > <field   name= &quot;ExperimentID&quot;   key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field   name= &quot;Medium&quot;  type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; />   />  <field   name= &quot;Protocol&quot;  label= &quot;Experiment Protocol&quot; />  <field   name= &quot;Temperature&quot;   type= &quot;int&quot; + bioinformatician softw engineer biologist biologist New Biology New Biology http://www.molgenis.org Swertz & Jansen (2007)  Nature Reviews Genetics  8, 235-243 inbreed 100 100.000 10,000,000 1000 10,000 10 1000 genotype individuals mass peaks genotypes QTL profiles strains network SNP arrays correlate LC/MS genome map preprocess aligned peaks
Upgrade to new software tools Platform and software generators Little language <!-- entity organization --> <entity   name= &quot;Experiment&quot;   label= &quot;Experiment&quot; > <field   name= &quot;ExperimentID&quot;   key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field   name= &quot;Medium&quot;  type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; />   />  <field   name= &quot;Protocol&quot;  label= &quot;Experiment Protocol&quot; />  <field   name= &quot;Temperature&quot;   type= &quot;int&quot; Blueprint model <!-- entity organization --> <entity   name= &quot;Experiment&quot;   label= &quot;Experiment&quot; > <field   name= &quot;ExperimentID&quot;   key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field   name= &quot;Medium&quot;  type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; />   />  <field   name= &quot;Protocol&quot;  label= &quot;Experiment Protocol&quot; />  <field   name= &quot;Temperature&quot;   type= &quot;int&quot; + bioinformatician softw engineer biologist biologist inbreed 100 100.000 10,000,000 1000 10,000 10 1000 genotype individuals mass peaks genotypes QTL profiles strains network SNP arrays correlate LC/MS genome map preprocess aligned peaks http://www.molgenis.org Swertz & Jansen (2007)  Nature Reviews Genetics  8, 235-243
Demo
Step 1: model* m.a.swertz@rug.nl /  individuals expressions probes *Can also extract automatically from an existing database
m.a.swertz@rug.nl /
Step 2: generate Download and customize... Model file XML Generate APIs in Java, R, Web services and HTTP MyScript Plugins FormGen MenuGen TreeGen PluginGen MatrixGen JTypeGen JDBCMapGen JListGen JReadCsvGen HSQLGen JDatabaseGen MySQLGen    RMatrixGen WSGen RListGen data infrastructure user interaction infrastructure Communication infrastructure
Step 3: use result ,[object Object]
eXtensible Genotype And Phenotype database  for QTL and GWAS experiments
[object Object],[object Object],[object Object],[object Object],[object Object],Example projects
XGAP - DAM Challenges ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],inbreed 100 10.000 1,000,000 100,000 10,000 10 10,000,00 QTL profiles network correlate genome strains individuals markers expressions preprocess probes microarrays 100 hybridize 100,000 genotype genotypes norm exprs. 10.000 map Main work flow Data dependency Biomaterial/result Lab/analysis process Scale of information Associated data files process material 10,000
First objective m.a.swertz@rug.nl /  researcher researcher annotations Raw and processed data database my GaP
[object Object],1. Data model M A R K E R S Subjects: STRAINS DATA ELEMENTS T r a i t s: TRAIT    SUBJECT Looking at standards and existing data sets Simple enough for everybody to create
1. Data model ,[object Object],/ P R O B E S Traits: MARKERS T r a i t s:
1. Model /  DATA  (matrix) TRAIT SUBJECT DATA  ELEMENT TRAIT    SUBJECT
1. Model /  DATA  ELEMENT TRAIT SUBJECT DATA  ELEMENT ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],columns rows dimension  ELEMENT
Extensions for new experiments DATA  ELEMENT TRAIT SUBJECT ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],And so on … And so on … columns rows dimension  ELEMENT
Protocol graph from FuGE ,[object Object],/  FuGE: Jones et al  Nature Biotech 25, 1127-1133   DATA DATA Genotype data QTL data QTL  Mapping Affy Array SNP Array DATA Expression data Mapping Protocol Illumina R Software Illumina Protocol Affy M430 Protocol Bead Studio DATA application Protocol Software Equipment Bioconductor Norm. Affy M430 platform DATA DATA DATA FuGE:
UML: XGAP extends FuGE m.a.swertz@rug.nl /  Uniform core to ease sharing of data and tools Various traits for new research Various subjects for new research ? ?
2. Mode, run MOLGENIS
Connect to R statistics Workflow ready web-services UML documentation of your model Edit & trace your data Import/export to Excel plugin your own scripts (R/QTL) Tech keywords: object oriented data models, multi-platform java, tomcat/glassfish web server, mysql/postgresql database, Eclipse/Netbeans IDE, Java API, WSDL/SOAP API, R-project API, MVC, freemarker templates and css for custom layout, open source.
Proof of the pudding
Ongoing work
Next step: add processing Sheets thanks to Joeri van der Velde and Danny Arends Generalize for all MOLGENIS instances: (1) Extend MOLGENIS model for tool integration <tool name=“rqtl”> <input name=“data” entity=“data”/> … </tool> (2) Integrate workflow definition and execution Extending on Taverna/Galaxy model & APIs…
Next step: semantics ,[object Object],[object Object],Generalize for all MOLGENIS instances: Next: Add MOLGENIS components to integrate: (1) Ontology browsing  Extending on BioPortal/OLS frameworks? (2) Semantic integration layer  ???
[object Object],[object Object],[object Object],Federation? Cloud computing?
Acknowledgements ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Questions http://www.molgenis.org http:// www.xgap.org

More Related Content

Similar to Swertz Molgenis Bosc2009

Architecting Web Services
Architecting Web ServicesArchitecting Web Services
Architecting Web Services
Lorna Mitchell
 
Tutorial ESWC2011 Building Semantic Sensor Web - 04 - Querying_semantic_strea...
Tutorial ESWC2011 Building Semantic Sensor Web - 04 - Querying_semantic_strea...Tutorial ESWC2011 Building Semantic Sensor Web - 04 - Querying_semantic_strea...
Tutorial ESWC2011 Building Semantic Sensor Web - 04 - Querying_semantic_strea...
Jean-Paul Calbimonte
 
Building a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP ProjectBuilding a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP Project
Stuart Chalk
 
UM03 - Learning Know..
UM03 - Learning Know..UM03 - Learning Know..
UM03 - Learning Know..
butest
 
Jsonsaga
JsonsagaJsonsaga
Jsonsaga
nohmad
 
course slides -- powerpoint
course slides -- powerpointcourse slides -- powerpoint
course slides -- powerpoint
webhostingguy
 

Similar to Swertz Molgenis Bosc2009 (20)

PPePR Overview Web2 Ireland
PPePR Overview Web2 IrelandPPePR Overview Web2 Ireland
PPePR Overview Web2 Ireland
 
Struts2
Struts2Struts2
Struts2
 
XML and Web Services with PHP5 and PEAR
XML and Web Services with PHP5 and PEARXML and Web Services with PHP5 and PEAR
XML and Web Services with PHP5 and PEAR
 
Architecting Web Services
Architecting Web ServicesArchitecting Web Services
Architecting Web Services
 
Tutorial ESWC2011 Building Semantic Sensor Web - 04 - Querying_semantic_strea...
Tutorial ESWC2011 Building Semantic Sensor Web - 04 - Querying_semantic_strea...Tutorial ESWC2011 Building Semantic Sensor Web - 04 - Querying_semantic_strea...
Tutorial ESWC2011 Building Semantic Sensor Web - 04 - Querying_semantic_strea...
 
Introduction to XML
Introduction to XMLIntroduction to XML
Introduction to XML
 
ORM JPA
ORM JPAORM JPA
ORM JPA
 
Building a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP ProjectBuilding a Standard for Standards: The ChAMP Project
Building a Standard for Standards: The ChAMP Project
 
SAFE EDBT 2011
SAFE EDBT 2011SAFE EDBT 2011
SAFE EDBT 2011
 
Ruby on Rails
Ruby on RailsRuby on Rails
Ruby on Rails
 
E Science4 Chromatin Research
E Science4 Chromatin ResearchE Science4 Chromatin Research
E Science4 Chromatin Research
 
Easy R
Easy REasy R
Easy R
 
3 xml namespaces and xml schema
3   xml namespaces and xml schema3   xml namespaces and xml schema
3 xml namespaces and xml schema
 
UM03 - Learning Know..
UM03 - Learning Know..UM03 - Learning Know..
UM03 - Learning Know..
 
20110725 ibc xml
20110725 ibc xml20110725 ibc xml
20110725 ibc xml
 
Odp
OdpOdp
Odp
 
Lucene And Solr Intro
Lucene And Solr IntroLucene And Solr Intro
Lucene And Solr Intro
 
Jsonsaga
JsonsagaJsonsaga
Jsonsaga
 
course slides -- powerpoint
course slides -- powerpointcourse slides -- powerpoint
course slides -- powerpoint
 
Inroduction to XSLT with PHP4
Inroduction to XSLT with PHP4Inroduction to XSLT with PHP4
Inroduction to XSLT with PHP4
 

More from bosc

Bosc Intro 20090627
Bosc Intro 20090627Bosc Intro 20090627
Bosc Intro 20090627
bosc
 
Software Patterns Panel Bosc2009
Software Patterns Panel Bosc2009Software Patterns Panel Bosc2009
Software Patterns Panel Bosc2009
bosc
 
Schbath Rmes Bosc2009
Schbath Rmes Bosc2009Schbath Rmes Bosc2009
Schbath Rmes Bosc2009
bosc
 
Kallio Chipster Bosc2009
Kallio Chipster Bosc2009Kallio Chipster Bosc2009
Kallio Chipster Bosc2009
bosc
 
Welch Wordifier Bosc2009
Welch Wordifier Bosc2009Welch Wordifier Bosc2009
Welch Wordifier Bosc2009
bosc
 
Rice Emboss Bosc2009
Rice Emboss Bosc2009Rice Emboss Bosc2009
Rice Emboss Bosc2009
bosc
 
Prlic Bio Java Bosc2009
Prlic Bio Java Bosc2009Prlic Bio Java Bosc2009
Prlic Bio Java Bosc2009
bosc
 
Senger Soaplab Bosc2009
Senger Soaplab Bosc2009Senger Soaplab Bosc2009
Senger Soaplab Bosc2009
bosc
 
Cock Biopython Bosc2009
Cock Biopython Bosc2009Cock Biopython Bosc2009
Cock Biopython Bosc2009
bosc
 
Hanmer Software Patterns Bosc2009
Hanmer Software Patterns Bosc2009Hanmer Software Patterns Bosc2009
Hanmer Software Patterns Bosc2009
bosc
 
Snell Psoda Bosc2009
Snell Psoda Bosc2009Snell Psoda Bosc2009
Snell Psoda Bosc2009
bosc
 
Procter Vamsas Bosc2009
Procter Vamsas Bosc2009Procter Vamsas Bosc2009
Procter Vamsas Bosc2009
bosc
 
Drablos Composite Motifs Bosc2009
Drablos Composite Motifs Bosc2009Drablos Composite Motifs Bosc2009
Drablos Composite Motifs Bosc2009
bosc
 
Fauteux Seeder Bosc2009
Fauteux Seeder Bosc2009Fauteux Seeder Bosc2009
Fauteux Seeder Bosc2009
bosc
 
Moeller Debian Bosc2009
Moeller Debian Bosc2009Moeller Debian Bosc2009
Moeller Debian Bosc2009
bosc
 
Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009
bosc
 
Wilczynski_BNFinder_BOSC2009
Wilczynski_BNFinder_BOSC2009Wilczynski_BNFinder_BOSC2009
Wilczynski_BNFinder_BOSC2009
bosc
 
Welsh_BioHDF_BOSC2009
Welsh_BioHDF_BOSC2009Welsh_BioHDF_BOSC2009
Welsh_BioHDF_BOSC2009
bosc
 
Varre_Biomanycores_BOSC2009
Varre_Biomanycores_BOSC2009Varre_Biomanycores_BOSC2009
Varre_Biomanycores_BOSC2009
bosc
 
Trelles_QnormBOSC2009
Trelles_QnormBOSC2009Trelles_QnormBOSC2009
Trelles_QnormBOSC2009
bosc
 

More from bosc (20)

Bosc Intro 20090627
Bosc Intro 20090627Bosc Intro 20090627
Bosc Intro 20090627
 
Software Patterns Panel Bosc2009
Software Patterns Panel Bosc2009Software Patterns Panel Bosc2009
Software Patterns Panel Bosc2009
 
Schbath Rmes Bosc2009
Schbath Rmes Bosc2009Schbath Rmes Bosc2009
Schbath Rmes Bosc2009
 
Kallio Chipster Bosc2009
Kallio Chipster Bosc2009Kallio Chipster Bosc2009
Kallio Chipster Bosc2009
 
Welch Wordifier Bosc2009
Welch Wordifier Bosc2009Welch Wordifier Bosc2009
Welch Wordifier Bosc2009
 
Rice Emboss Bosc2009
Rice Emboss Bosc2009Rice Emboss Bosc2009
Rice Emboss Bosc2009
 
Prlic Bio Java Bosc2009
Prlic Bio Java Bosc2009Prlic Bio Java Bosc2009
Prlic Bio Java Bosc2009
 
Senger Soaplab Bosc2009
Senger Soaplab Bosc2009Senger Soaplab Bosc2009
Senger Soaplab Bosc2009
 
Cock Biopython Bosc2009
Cock Biopython Bosc2009Cock Biopython Bosc2009
Cock Biopython Bosc2009
 
Hanmer Software Patterns Bosc2009
Hanmer Software Patterns Bosc2009Hanmer Software Patterns Bosc2009
Hanmer Software Patterns Bosc2009
 
Snell Psoda Bosc2009
Snell Psoda Bosc2009Snell Psoda Bosc2009
Snell Psoda Bosc2009
 
Procter Vamsas Bosc2009
Procter Vamsas Bosc2009Procter Vamsas Bosc2009
Procter Vamsas Bosc2009
 
Drablos Composite Motifs Bosc2009
Drablos Composite Motifs Bosc2009Drablos Composite Motifs Bosc2009
Drablos Composite Motifs Bosc2009
 
Fauteux Seeder Bosc2009
Fauteux Seeder Bosc2009Fauteux Seeder Bosc2009
Fauteux Seeder Bosc2009
 
Moeller Debian Bosc2009
Moeller Debian Bosc2009Moeller Debian Bosc2009
Moeller Debian Bosc2009
 
Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009Prins Bio Lib Bosc 2009
Prins Bio Lib Bosc 2009
 
Wilczynski_BNFinder_BOSC2009
Wilczynski_BNFinder_BOSC2009Wilczynski_BNFinder_BOSC2009
Wilczynski_BNFinder_BOSC2009
 
Welsh_BioHDF_BOSC2009
Welsh_BioHDF_BOSC2009Welsh_BioHDF_BOSC2009
Welsh_BioHDF_BOSC2009
 
Varre_Biomanycores_BOSC2009
Varre_Biomanycores_BOSC2009Varre_Biomanycores_BOSC2009
Varre_Biomanycores_BOSC2009
 
Trelles_QnormBOSC2009
Trelles_QnormBOSC2009Trelles_QnormBOSC2009
Trelles_QnormBOSC2009
 

Recently uploaded

IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Recently uploaded (20)

04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

Swertz Molgenis Bosc2009

  • 1. MOLGENIS and the eXtensible Genotype And Phenotype database project (xgap) Morris A. Swertz et al DAM & BOSC sigs Stockholm, June 27 2009 EBI Biobanking platform
  • 2.
  • 3. MOLGENIS, why and how biologist biologist biological challenges inbreed 100 10.000 1,000,000 100,000 10,000 10 10,000,00 QTL profiles network correlate genome strains individuals markers expressions preprocess probes microarrays 100 hybridize 100,000 genotype genotypes norm exprs. map
  • 4. MOLGENIS, why and how biologist biologist biological challenges suitable infrastructure inbreed 100 10.000 1,000,000 100,000 10,000 10 10,000,00 QTL profiles network correlate genome strains individuals markers expressions preprocess probes microarrays 100 hybridize 100,000 genotype genotypes norm exprs. map bioinformatician softw engineers €
  • 5. MOLGENIS, why and how biologist biologist Reinventing wheels, Wasting time Hard to integrate biological challenges suitable infrastructure bioinformatician softw engineers inbreed 100 100.000 10,000,000 1000 10,000 10 1000 genotype individuals mass peaks genotypes QTL profiles strains network SNP arrays correlate LC/MS genome map preprocess aligned peaks €
  • 6. Alternative strategy http://www.molgenis.org Swertz & Jansen (2007) Nature Reviews Genetics 8, 235-243 http://www.molgenis.org
  • 7. MOLGENIS, why and how Platform and generators Little language <!-- entity organization --> <entity name= &quot;Experiment&quot; label= &quot;Experiment&quot; > <field name= &quot;ExperimentID&quot; key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field name= &quot;Medium&quot; type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; /> /> <field name= &quot;Protocol&quot; label= &quot;Experiment Protocol&quot; /> <field name= &quot;Temperature&quot; type= &quot;int&quot; Blueprint model <!-- entity organization --> <entity name= &quot;Experiment&quot; label= &quot;Experiment&quot; > <field name= &quot;ExperimentID&quot; key= &quot;1“ r eadonly= &quot;true&quot; label= &quot; ExperimentID(autonum )&quot;/> <field name= &quot;Medium&quot; type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; /> /> <field name= &quot;Protocol&quot; label= &quot;Experiment Protocol&quot; /> <field name= &quot;Temperature&quot; type= &quot;int&quot; + bioinformatician softw engineer inbreed 100 10.000 1,000,000 100,000 10,000 10 10,000,00 QTL profiles network correlate genome strains individuals markers expressions preprocess probes microarrays 100 hybridize 100,000 genotype genotypes norm exprs. 10.000 map biologist biologist http://www.molgenis.org Swertz & Jansen (2007) Nature Reviews Genetics 8, 235-243
  • 8. Upgrade to new research Platform and generators Little language <!-- entity organization --> <entity name= &quot;Experiment&quot; label= &quot;Experiment&quot; > <field name= &quot;ExperimentID&quot; key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field name= &quot;Medium&quot; type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; /> /> <field name= &quot;Protocol&quot; label= &quot;Experiment Protocol&quot; /> <field name= &quot;Temperature&quot; type= &quot;int&quot; Blueprint model <!-- entity organization --> <entity name= &quot;Experiment&quot; label= &quot;Experiment&quot; > <field name= &quot;ExperimentID&quot; key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field name= &quot;Medium&quot; type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; /> /> <field name= &quot;Protocol&quot; label= &quot;Experiment Protocol&quot; /> <field name= &quot;Temperature&quot; type= &quot;int&quot; + bioinformatician softw engineer biologist biologist New Biology New Biology http://www.molgenis.org Swertz & Jansen (2007) Nature Reviews Genetics 8, 235-243 inbreed 100 100.000 10,000,000 1000 10,000 10 1000 genotype individuals mass peaks genotypes QTL profiles strains network SNP arrays correlate LC/MS genome map preprocess aligned peaks
  • 9. Upgrade to new software tools Platform and software generators Little language <!-- entity organization --> <entity name= &quot;Experiment&quot; label= &quot;Experiment&quot; > <field name= &quot;ExperimentID&quot; key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field name= &quot;Medium&quot; type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; /> /> <field name= &quot;Protocol&quot; label= &quot;Experiment Protocol&quot; /> <field name= &quot;Temperature&quot; type= &quot;int&quot; Blueprint model <!-- entity organization --> <entity name= &quot;Experiment&quot; label= &quot;Experiment&quot; > <field name= &quot;ExperimentID&quot; key= &quot;1“ r eadonly= &quot;true&quot; label= &quot;ExperimentID(autonum)&quot;/> <field name= &quot;Medium&quot; type= &quot;xref&quot; xref_field= &quot;Medium.name&quot; /> /> <field name= &quot;Protocol&quot; label= &quot;Experiment Protocol&quot; /> <field name= &quot;Temperature&quot; type= &quot;int&quot; + bioinformatician softw engineer biologist biologist inbreed 100 100.000 10,000,000 1000 10,000 10 1000 genotype individuals mass peaks genotypes QTL profiles strains network SNP arrays correlate LC/MS genome map preprocess aligned peaks http://www.molgenis.org Swertz & Jansen (2007) Nature Reviews Genetics 8, 235-243
  • 10. Demo
  • 11. Step 1: model* m.a.swertz@rug.nl / individuals expressions probes *Can also extract automatically from an existing database
  • 13. Step 2: generate Download and customize... Model file XML Generate APIs in Java, R, Web services and HTTP MyScript Plugins FormGen MenuGen TreeGen PluginGen MatrixGen JTypeGen JDBCMapGen JListGen JReadCsvGen HSQLGen JDatabaseGen MySQLGen    RMatrixGen WSGen RListGen data infrastructure user interaction infrastructure Communication infrastructure
  • 14.
  • 15. eXtensible Genotype And Phenotype database for QTL and GWAS experiments
  • 16.
  • 17.
  • 18. First objective m.a.swertz@rug.nl / researcher researcher annotations Raw and processed data database my GaP
  • 19.
  • 20.
  • 21. 1. Model / DATA (matrix) TRAIT SUBJECT DATA ELEMENT TRAIT  SUBJECT
  • 22.
  • 23.
  • 24.
  • 25. UML: XGAP extends FuGE m.a.swertz@rug.nl / Uniform core to ease sharing of data and tools Various traits for new research Various subjects for new research ? ?
  • 26. 2. Mode, run MOLGENIS
  • 27. Connect to R statistics Workflow ready web-services UML documentation of your model Edit & trace your data Import/export to Excel plugin your own scripts (R/QTL) Tech keywords: object oriented data models, multi-platform java, tomcat/glassfish web server, mysql/postgresql database, Eclipse/Netbeans IDE, Java API, WSDL/SOAP API, R-project API, MVC, freemarker templates and css for custom layout, open source.
  • 28. Proof of the pudding
  • 30. Next step: add processing Sheets thanks to Joeri van der Velde and Danny Arends Generalize for all MOLGENIS instances: (1) Extend MOLGENIS model for tool integration <tool name=“rqtl”> <input name=“data” entity=“data”/> … </tool> (2) Integrate workflow definition and execution Extending on Taverna/Galaxy model & APIs…
  • 31.
  • 32.
  • 33.
  • 34.

Editor's Notes

  1. International data standards: adopt them but preserve flexibility Flexible (standard) data models: ease extension ‘beyond’ today for new research Dynamic software infrastructure: how to ensure a long life (30 yrs!)