Haider Embrace Bosc2008
Upcoming SlideShare
Loading in...5
×
 

Like this? Share it with your network

Share

Haider Embrace Bosc2008

on

  • 936 views

 

Statistics

Views

Total Views
936
Views on SlideShare
936
Embed Views
0

Actions

Likes
0
Downloads
2
Comments
0

0 Embeds 0

No embeds

Accessibility

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

Haider Embrace Bosc2008 Presentation Transcript

  • 1. EMBRACE – BioMart Developments & Future Syed Haider Rice Group - EBI July 2008
  • 2. EMBRACE www.embracegrid.info European Model for Bioinformatics Research and Community Education
    • Objective:
      • to integrate the major databases and software tools in bioinformatics
  • 3. BioMart
    • A Collaboration:
    • European Bioinformatics Institute (EBI)
    • Ontario Institute for Cancer Research (OICR)
    www.biomart.org
  • 4. BioMart
        • A generic data management system with a particular focus on supporting biological research featuring:
        • - Built-in query optimisation for fast data retrieval
        • - Data Federation
        • Easy to use interfaces and APIs
        • Web Services and DAS
  • 5. In a nutshell ATGCTGTTGTGC ATGCTGGACTGG ATGGCCCGATGG ATGCTGTTGTGC ATGCTGGACTGG ATGGCCCGATGG Source data (MySQL, Oracle, Postgres) DB Mart
  • 6. Deploying BioMart
      • STEP 1 - Transformation
      • STEP 2 - Configuration
  • 7. 1. Transformation ATGCTGTTGTGC ATGCTGGACTGG ATGGCCCGATGG ATGCTGTTGTGC ATGCTGGACTGG ATGGCCCGATGG DB Mart Source data (MySQL, Oracle, Postgres)
  • 8. 1. Transformation MartBuilder
  • 9. 2. Configuration Mart Mart Mart
  • 10. 2. Configuration MartEditor
  • 11. User Interfaces
  • 12. Concepts for End Users
    • Dataset
    • Filter
    • Attribute
  • 13. Examples
    • of all rat genes
      • located on chromosome 1, expressed in lungs
      • name, chromosome, description
      • of all mouse genes
      • ENSMUSG00000042351
      • exon sequences in FASTA format
      • of all rat genes
      • up-regulated in brain and associated with a QTL for
      • a neurological disorder
      • Upstream sequences
  • 14. Web Service Access <Query> <Dataset name=&quot; hsapiens_gene_ensembl &quot; > <Filter name=&quot; chromosome_name &quot; value=&quot; 1 &quot;/> <Attribute name=&quot; ensembl_gene_id &quot;/> <Attribute name=&quot; ensembl_transcript_id &quot;/> <Attribute name=&quot; biotype &quot;/> </Dataset> </Query> wget --post-data 'query= ‘ http://www.biomart.org/biomart/martservice
  • 15. Web Service Access <Query> <Dataset name=&quot; hsapiens_gene_ensembl &quot; > <Filter name=&quot; chromosome_name &quot; value=&quot; 1 &quot;/> <Attribute name=&quot; ensembl_gene_id &quot;/> <Attribute name=&quot; ensembl_transcript_id &quot;/> <Attribute name=&quot; biotype &quot;/> </Dataset> </Query> wget --post-data 'query= ‘ http://www.biomart.org/biomart/martservice martview
  • 16. VIRTUALSCHEMANAME=default &ATTRIBUTES = hsapiens_gene_ensembl .default.feature_page. ensembl_gene_id &FILTERS = hsapiens_gene_ensembl .default.filters. chromosome_name.&quot;1&quot; Web Service Access XML Free URL http://biomart.org/biomart/martview?
  • 17. BioMart DAS Access http://www.YourBioMart.org/biomart/das/ DATASET /features? segment= FILTERS http://www.biomart.org/biomart/das/ default__hsapiens_gene_ensembl__ensembl_das_chr /features? segment= 1:1,100000 http://www.biomart.org/biomart/das/ default__hsapiens_gene_ensembl__ensembl_das_gene /features? segment= ENSG00000197194
  • 18. Web based Access How far it has gone ?
  • 19. Taverna
  • 20. Bioma R t - BioConductor package
  • 21. Cytoscape
  • 22. Galaxy
  • 23. Template Queries
  • 24.  
  • 25.  
  • 26. Learn as you go.... Show URL Request Show XML Query Show Perl Script
  • 27.
    • Scalability
      • Maintaining large databases and configurations
    • Security
      • UserName/Password based access for clinical and experimental data etc
    • Multiple and Custom GUIs
    Future
  • 28.
    • Beyond rows and columns
      • Framework for Visualisations and Analysis Tools
    Future
  • 29. Visualisation: Gene List Analysis & Clinical Significance
    • Query
    Gene List Visualisation Gene list analysis Clinical Significance
  • 30. Map Genes onto Genome
  • 31. Visualisation: Gene List Analysis & Clinical Significance
    • Query
    Gene List Visualisation Gene list analysis Clinical Significance
  • 32. Map Genes onto GO GO Biological process (32) Cellular component (18) Molecular Function (24) Stem cell maintenance (7) Positive regulation of developmental process (8) Leukocyte mediated cytotoxicity (5) regulation of cell killing (12) Developmental process (15) Cell killing (17)
  • 33. Visualisation: Gene List Analysis & Clinical Significance
    • Query
    Gene List Visualisation Gene list analysis Clinical Significance
  • 34. Map Genes onto Pathways Reactome Apoptosis (43) Intrinsic pathway for apoptosis (26) Signaling by Wnt (10) Signaling by TGF β (23) Activation of BH3-only proteins (5) Permeabilization of mitochondria (3) Release of apoptotic factors from mitochondria (18)
  • 35. Future
    • Summary Pages
    • Annotation for each gene
    • Entrez/Ensembl gene info
    • Gene ontology/pathways
    • Biblography
    • Transcript & protein info, etc.
    • Genomic variations for each gene
    • for each cancer studied
    • Information for each patient
    • Demographics
    • History of cancer
    • Progress & outcome
    • Types of samples available
    • Histopathology of tumor
    Submission support
  • 36.
    • BioMart Team
      • Arek Kasprzyk (OICR-Toronto)‏
      • Syed Haider (Rice Group-EBI)‏
    • Acknowledgements
      • Benoit Ballester (Ensembl) Richard Holland (Ensembl)‏
      • Andreas Kahari (Ensembl) Craig Melsopp (Ensembl)‏
      • Damian Smedley (Ensembl) Arne Stabenau (Ensembl)‏
      • Asif Kibria (EBI) Gulam Patel (EBI)‏
      • Stephen Robinson (EBI) Katerina Tzouvara (EBI)‏
      • Will Spooner (CSHL) Gudmundur Thorisson (CSHL)‏
      • Darin London (Duke University) Don Gilbert (Indiana University)‏
      • Steffen Durinck (NCI NIH) Eric Just (Northwestern University)‏
      • Paul Donlon (Unilever)‏ Christina Yung (OICR)
      • Igor Antoshechkin (Caltech)
    Galaxy Credits References
  • 37.
    • Thanks.
  • 38. BioMart Central Portal – queries served