• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Eisen.All Hands
 

Eisen.All Hands

on

  • 1,953 views

Talk summarizing our GEBA Genomic Encylopedia of Bacteria and Archaea project for "All Hands" meeting at the Joint Genome Institute

Talk summarizing our GEBA Genomic Encylopedia of Bacteria and Archaea project for "All Hands" meeting at the Joint Genome Institute

Statistics

Views

Total Views
1,953
Views on SlideShare
1,394
Embed Views
559

Actions

Likes
1
Downloads
18
Comments
0

1 Embed 559

http://phylogenomics.wordpress.com 559

Accessibility

Upload Details

Uploaded via as Adobe PDF

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment

    Eisen.All Hands Eisen.All Hands Presentation Transcript

    • A phylogeny driven genomic encyclopedia of bacteria and archaea (or what is GEBA anyway?) Jonathan A. Eisen October 27, 2009
    • From http://genomesonline.org
    • rRNA Tree of Life
    • The Tree is not Happy
    • As of 2002 Proteobacteria TM6 OS-K • At least 40 Acidobacteria Termite Group OP8 phyla of Nitrospira Bacteroides bacteria Chlorobi Fibrobacteres Marine GroupA WS3 Gemmimonas Firmicutes Fusobacteria Actinobacteria OP9 Cyanobacteria Synergistes Deferribacteres Chrysiogenetes NKB19 Verrucomicrobia Chlamydia OP3 Planctomycetes Spriochaetes Coprothmermobacter OP10 Thermomicrobia Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Thermudesulfobacteria Thermotogae OP1 Based on OP11 Hugenholtz, 2002
    • As of 2002 Proteobacteria TM6 OS-K • At least 40 Acidobacteria Termite Group OP8 phyla of Nitrospira Bacteroides bacteria Chlorobi Fibrobacteres Marine GroupA • Genome WS3 Gemmimonas Firmicutes sequences are Fusobacteria Actinobacteria mostly from OP9 Cyanobacteria Synergistes three phyla Deferribacteres Chrysiogenetes NKB19 Verrucomicrobia Chlamydia OP3 Planctomycetes Spriochaetes Coprothmermobacter OP10 Thermomicrobia Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Thermudesulfobacteria Thermotogae OP1 Based on OP11 Hugenholtz, 2002
    • As of 2002 Proteobacteria TM6 OS-K • At least 40 Acidobacteria Termite Group OP8 phyla of Nitrospira Bacteroides bacteria Chlorobi Fibrobacteres Marine GroupA • Genome WS3 Gemmimonas Firmicutes sequences are Fusobacteria Actinobacteria mostly from OP9 Cyanobacteria Synergistes three phyla Deferribacteres Chrysiogenetes NKB19 • Some other Verrucomicrobia Chlamydia OP3 phyla are Planctomycetes Spriochaetes only sparsely Coprothmermobacter OP10 Thermomicrobia sampled Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Thermudesulfobacteria Thermotogae OP1 Based on OP11 Hugenholtz, 2002
    • As of 2002 Proteobacteria TM6 OS-K • At least 40 Acidobacteria Termite Group OP8 phyla of Nitrospira Bacteroides bacteria Chlorobi Fibrobacteres Marine GroupA • Genome WS3 Gemmimonas Firmicutes sequences are Fusobacteria Actinobacteria mostly from OP9 Cyanobacteria Synergistes three phyla Deferribacteres Chrysiogenetes NKB19 • Some other Verrucomicrobia Chlamydia OP3 phyla are Planctomycetes Spriochaetes only sparsely Coprothmermobacter OP10 Thermomicrobia sampled Chloroflexi TM7 Deinococcus-Thermus • Same trend in Dictyoglomus Aquificae Thermudesulfobacteria Archaea Thermotogae OP1 Based on OP11 Hugenholtz, 2002
    • Need for Tree Guidance Well Established • Common approach within some eukaryotic groups • Many small projects funded to fill in some bacterial or archaeal gaps • Phylogenetic gaps in bacterial and archaeal projects commonly lamented in literature
    • Proteobacteria TM6 OS-K • At least 100 phyla of Acidobacteria Termite Group OP8 bacteria Nitrospira Bacteroides Chlorobi • Genome sequences are Fibrobacteres Marine GroupA mostly from three phyla WS3 Gemmimonas Firmicutes • Most phyla with cultured Fusobacteria Actinobacteria species are sparsely OP9 Cyanobacteria Synergistes sampled Deferribacteres Chrysiogenetes NKB19 • Lineages with no cultured Verrucomicrobia Chlamydia OP3 taxa even more poorly Planctomycetes Spriochaetes sampled Coprothmermobacter OP10 Thermomicrobia Chloroflexi • Solution - use tree to really TM7 Deinococcus-Thermus fill gaps Dictyoglomus Aquificae Well sampled phyla Thermudesulfobacteria Thermotogae OP1 OP11
    • http://www.jgi.doe.gov/programs/GEBA/pilot.html
    • GEBA Pilot Project Overview • Identify major branches in rRNA tree for which no genomes are available • Identify a cultured representative for each group • Grow > 200 of these and prep. DNA • Sequence and finish 100 • Annotate, analyze, release data • Assess benefits of tree guided sequencing
    • GEBA Pilot Project: Components • Project overview (Phil Hugenholtz, Nikos Kyrpides, Jonathan Eisen, Eddy Rubin, Jim Bristow) • Project management (David Bruce, Eileen Dalin, Lynne Goodwin) • Culture collection and DNA prep (DSMZ, Hans-Peter Klenk) • Sequencing and closure (Eileen Dalin, Susan Lucas, Alla Lapidus, Mat Nolan, Alex Copeland, Cliff Han, Feng Chen, Jan-Fang Cheng) • Annotation and data release (Nikos Kyrpides, Victor Markowitz, et al) • Analysis (Dongying Wu, Kostas Mavrommatis, Martin Wu, Victor Kunin, Neil Rawlings, Ian Paulsen, Patrick Chain, Patrik D’Haeseleer, Sean Hooper, Iain Anderson, Amrita Pati, Natalia N. Ivanova, Athanasios Lykidis, Adam Zemla) • Adopt a microbe education project (Cheryl Kerfeld) • Outreach (David Gilbert) • $$$ (DOE, Eddy Rubin, Jim Bristow)
    • Some Lessons From GEBA
    • GEBA Lesson 1 rRNA Tree of Life is a Useful Guide and Genomes Improve Resolution
    • GEBA Lesson 2 Phylogenetically Guided Selection Can Help Annotate Other Genomes
    • Most/All Functional Prediction Improves w/ Better Phylogenetic Sampling • Better definition of protein family sequence “patterns” • Greatly improves “comparative” and “evolutionary” based predictions • Conversion of hypothetical into conserved hypotheticals • Linking distantly related members of protein families • Improved non-homology prediction Kostas Natalia Thanos Nikos Iain Mavrommatis Ivanova Lykidis Kyrpides Anderson
    • GEBA Lesson 3 Phylogenetically Guided Selection Can Help Study Uncultured Organisms
    • Environmental Shotgun Sequencing shotgun sequence
    • Binning challenge A T B U C V D W E X F Y G Z
    • Metagenomic Analysis Improves Sean Hooper • Small but real improvement in metagenomic Amrita Pati annotation and analysis
    • GEBA Lesson 4 We have still only scratched the surface of microbial diversity
    • Protein Family Rarefaction Curves • Take data set of multiple complete genomes • Identify all protein families using MCL • Plot # of genomes vs. # of protein families
    • Phylogenetic Distribution Novelty: 1st Bacterial Actin Related Protein Victor Kunin Patrik D’haeseleer Adam Zemla Haliangium ochraceum DSM 14365
    • Phylogenetic Diversity with GEBA
    • Phylogenetic Diversity: Isolates
    • Phylogenetic Diversity: All
    • Proteobacteria TM6 OS-K • At least 40 phyla of Acidobacteria Termite Group OP8 bacteria Nitrospira Bacteroides Chlorobi • Genome sequences are Fibrobacteres Marine GroupA mostly from three phyla WS3 Gemmimonas Firmicutes • Most phyla with cultured Fusobacteria Actinobacteria species are sparsely OP9 Cyanobacteria Synergistes sampled Deferribacteres Chrysiogenetes NKB19 • Lineages with no cultured Verrucomicrobia Chlamydia OP3 taxa even more poorly Planctomycetes Spriochaetes sampled Coprothmermobacter OP10 Thermomicrobia Chloroflexi TM7 Deinococcus-Thermus Dictyoglomus Aquificae Well sampled phyla Thermudesulfobacteria Thermotogae Poorly sampled OP1 OP11 No cultured taxa
    • Uncultured Lineages: Technical Approaches • Get into culture • Enrichment cultures • If abundant in low diversity ecosystems • Flow sorting • Microbeads • Microfluidic sorting • Single cell amplification
    • GEBA Lesson 6 Need Experiments from Across the Tree of Life too
    • Adopt a Microbe
    • MICROBES
    • A Happy Tree of Life
    • Related Lesson 1 METADATA ROCKS
    • SIGS • The Genomic Standards Consortium • The GSC is an open-membership working body which formed in September 2005. • The goal of this international community is to promote mechanisms that standardize the description of genomes and the exchange and integration of genomic data. • See http://gensc.org/gc_wiki/index.php/Main_Page