XMLPipeDB

A Reusable, Open Source Tool Chain for Building Relational Databases from XML Sources BOSC Vienna, Austria July 20, 2007 Kam D. Dahlquist Department of Biology Jeffrey Nicholas John David N. Dionisio Department of Electrical Engineering & Computer Science http://xmlpipedb.cs.lmu.edu Loyola Marymount University

Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

How GenMAPP Works http://www.GenMAPP.org ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

GenMAPP Design and Implementation ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

MAPPFinder Determines Which GO Terms Are Overrepresented in a GenMAPP Expression Dataset

Maintaining and Updating GenMAPP Gene Databases has been a Bottleneck for Development ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

XMLPipeDB: A Reusable, Open Source Tool Chain for Building Relational Databases from XML Sources ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],First task, reported last year at BOSC, was to build a GenMAPP Gene Database for Escherichia coli K12

GenMAPP Gene Database Schema for Escherichia coli K12

Data Sources Required for a “Minimal” GenMAPP Gene Database UniProt • UniProt complete proteome sets for many species are made available as XML downloads by the Integr8 resource Gene Ontology • OBO XML format UniProt to GO associations • GOA downloads also available at Integr8

Produces: Java source code SQL DDL file Hibernate mappings Apache Ant build.xml

UniProtDB and GODB Required Only Nominal Post-processing ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

“ Rule of Three” XMLPipeDB Utilities Library is a Suite of Java Classes that Provide Functions Common to Most XMLPipeDB Database Applications ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

GenMAPP Builder Interacts with PostgreSQL in Three Ways

GenMAPP Builder Uses the XMLPipeDB Utilities Library to Configure the PostgreSQL Database … and import XML

[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

The User Chooses Which Gene ID Systems and Relations to Export to the Gene Database

GenMAPP Gene Database for Escherichia coli K12 Was the First Milestone for XMLPipeDB ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

The Next Challenge was to Create a Gene Database for the Plant, Arabidopsis thaliana

The Next Challenge was to Create a Gene Database for the Plant, Arabidopsis thaliana ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

Wish List for XMLPipeDB Development ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

XSD-to-DB Adam Carasso Jeffrey Nicholas Scott Spicer XMLPipeDBUtils David Hoffman Babak Naffas Jeffrey Nicholas Ryan Nakamoto UniProtDB Joe Boyle Joey Barrett GODB Scott Spicer Roberto Ruiz GenMAPP Builder Joey Barrett Jeffrey Nicholas Scott Spicer Special Thanks GenMAPP.org Development Group Caskey L. Dickson, Wesley T. Citti NSF CCLI Program (http://recourse.cs.lmu.edu) http://xmlpipedb.cs.lmu.edu LMU Bioinformatics Group Kam D. Dahlquist http://myweb.lmu.edu/kdahqui [email_address] John David N. Dionisio http://myweb.lmu.edu/dondi [email_address]

XMLPipeDB

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to XMLPipeDB

Similar to XMLPipeDB (20)

More from bosc

More from bosc (20)

Recently uploaded

Recently uploaded (20)

XMLPipeDB

Editor's Notes