MoBeDAC
Microbiome of the Built Environment Data Analysis Core
Background

 Rob Knight’s group, University of Colorado at Boulder

 Since we’re talking about air quality…
Boulder, Colorado
Right now…
About the Knight Lab

 Rob Knight, PhD, PI, Smartypants

 Qiime – Quantitative Insights Into Microbial Ecology

 Qiime on the web, data transport, MoBeDAC, etc…

 Spend a lot of time on standards and data consistency
Why do we need standards?

 Everyone likes the idea

 Everyone uses their own standard

 Problem: Leads to situations such as this…
It’s a cat.
Or maybe a shark?
Such a nice monkey…
Err wait, is it a walrus?
A local…
Or… a dolphin?
Unexpected Results

 Different tools can (and do!) lead to different results

 Answer: speak the same language
Presenting MoBeDAC

   Central repository for microbial metadata and sequence data

   Implements and enforces metadata standards – GSC checklists

   Enforces sequence data consistency – quality filtering, trimming

   Brings together an array of utilities and resources:
       VAMPS
       MG-RAST
       Qiime
       FungiDB
       Microbe.net
       Future platforms via open API
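
As a rough illustration of the "quality filtering, trimming" step, the sketch below trims low-quality bases from the end of a read and then applies a length filter. The function names, the Phred cutoff of 20, and the minimum length of 5 are assumptions chosen for the example, not MoBeDAC's actual pipeline or settings.

```python
# Illustrative sketch only: thresholds and function names are assumptions,
# not MoBeDAC's actual pipeline.

def trim_low_quality_tail(sequence, qualities, min_quality=20):
    """Remove trailing bases whose Phred quality is below min_quality."""
    end = len(sequence)
    while end > 0 and qualities[end - 1] < min_quality:
        end -= 1
    return sequence[:end], qualities[:end]


def passes_length_filter(sequence, min_length=5):
    """Keep a read only if enough bases survive trimming."""
    return len(sequence) >= min_length


read = "ACGTACGT"
scores = [30, 30, 30, 30, 30, 10, 25, 5]
trimmed, kept_scores = trim_low_quality_tail(read, scores)
print(trimmed, passes_length_filter(trimmed))
```

Applying one agreed rule set to every submission is what makes reads from different labs comparable, whatever the exact thresholds turn out to be.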
MoBeDAC Overview
Metadata: GSC Checklists
The Technology

 Platform Agnostic – keep it really simple

 REST API for communication

 JSON for encoding
REST – Representational State Transfer

 Sounds really fancy… but it’s really simple:
     Usually runs over HTTP
     Not a standard per se, but a set of guidelines. Flexible.

 Four core commands (HTTP verbs):
     GET - List resources or elements of resource
     PUT - Replace entire collection with new data
     POST - Add new item to collection
     DELETE - Remove item or collection

 WWW is the largest REST system – everyone uses it without
  knowing
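
To make the four verbs concrete, here is a minimal in-memory sketch of how they map onto operations on a collection. The `Collection` class and the sample records are hypothetical and purely illustrative; this is not the MoBeDAC API, and a real service would expose these operations over HTTP.

```python
# Hypothetical in-memory collection showing what each verb does.
# This is NOT the MoBeDAC API; names and ids are illustrative only.

class Collection:
    def __init__(self):
        self._items = {}
        self._next_id = 1

    def get(self, item_id=None):
        """GET: list the whole collection, or return one element."""
        if item_id is None:
            return dict(self._items)
        return self._items[item_id]

    def put(self, new_items):
        """PUT: replace the entire collection with new data."""
        self._items = dict(new_items)
        self._next_id = max(self._items, default=0) + 1

    def post(self, item):
        """POST: add a new item to the collection and return its id."""
        item_id = self._next_id
        self._items[item_id] = item
        self._next_id += 1
        return item_id

    def delete(self, item_id=None):
        """DELETE: remove one item, or the whole collection."""
        if item_id is None:
            self._items.clear()
        else:
            del self._items[item_id]


samples = Collection()
office = samples.post({"name": "office_air"})
samples.post({"name": "kitchen_sink"})
samples.delete(office)
print(samples.get())
```

The point of the design is that any client that can issue these four kinds of request can talk to the repository, regardless of platform or language.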
JSON – JavaScript Object Notation

   Sounds fancy too… but it’s:
       Ubiquitous
       Simple, human readable
       If you have data, put lots of brackets around it
       Send it

   For example:
       If I have a dictionary that looks like:
        a: apple
        b: bunny
        c: kitty shark
        JSON says it should look like:
        {"a": "apple", "c": "kitty shark", "b": "bunny"}

   Current MoBeDAC API specification available at:
    http://metagenomics.anl.gov/Html/api.html
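
The dictionary example above can be reproduced with Python's standard `json` module. This is a small sketch; the variable names are arbitrary, and `sort_keys=True` is used only to make the output order predictable.

```python
import json

# The dictionary from the slide.
record = {"a": "apple", "b": "bunny", "c": "kitty shark"}

# Encode to a JSON string for sending.
encoded = json.dumps(record, sort_keys=True)

# Decode on the receiving side.
decoded = json.loads(encoded)

# Key order is not significant in a JSON object, so the ordering
# shown on the slide decodes to the same dictionary.
slide_version = json.loads('{"a": "apple", "c": "kitty shark", "b": "bunny"}')

print(encoded)
print(decoded == record, slide_version == record)
```

Because both sides only need a JSON encoder and decoder, the sender and receiver do not have to share a programming language.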
Resources
   MoBeDAC
       http://mobedac.org
   VAMPS
       http://vamps.mbl.edu
   MG-RAST
       http://metagenomics.anl.gov
   Qiime
       http://microbio.me/qiime
       http://www.qiime.org/
   FungiDB
       http://fungidb.org/fungidb.b2
   Microbe.net
       http://www.microbe.net/
   GSC
       http://gensc.org
   BE Package Terms
       http://www.microbe.net/wp-content/uploads/2012/05/built_environment-metadata-terms-v51.xls
   Sloan Foundation
       http://www.sloan.org/
Thanks!

 Questions?
