Discovery of Cow Rumen Biomass-Degrading Genes and Genomes through DNA Sequencing, DNA Synthesis and Single Cell Genomics, DOE Joint Genome Institute, Eddy Rubin, Copenhagenomics 2012

Gene and genome discovery through
metagenomicsgene synthesis and single cell
genomics

Eddy Rubin
Lawrence Berkeley National Lab
Berkeley CA

Reduce our Dependence
onPETROLEUM

• Decreased Consumption
(Improved Efficiency)

• New Renewable Sources
(Biofuels)

Do We Need Biofuels and Why Not Just Batteries
?
Energy Density
100 fold more energy / kilo of petroleum than in a
kilo of the very best battery

• A 747 jet needs 200,000
kilos of fuel to fly from
SF to Copenhagen.

 With a 20,000,000 kilo battery
a 747 jet won’t get off the
ground

Cellulosic Biofuels

CO2
CO2

Biomass

CO2

Land Use

Cellulosic Biofuel Production

Need to discover of enzymes of greater diversity
Inefficient
with newproperties
&
Expensive

Enzymes

Biofuels

Biomass

Deconstruction Fermentation and Fuel Synthesis

Biomass degradation communities
being studied by the JGI

Marsupials Birds Insects

Mollusks

Hoatzin (Stink Bird)

Ruminate

Cow Rumen Highly Efficient at
Biomass Deconstruction

Fistulated Cow

36 hr

Biomass ~55% reduction in
(Switchgrass) cellulose

Fistulated Cow
IlluminaHiSeq
4 billion reads
(200bp, 3kb, 5kb)
Metagenomic
Fiber-attached DNA
~ ½ Terabase of Sequence Data
microbes

½ Tb of Rumen Sequence

Gb This study
300,000
600,000 Warnecke 07
(Sanger, Termite Hindgut)

1/2 of a TB
500,000
250,000
200,000
400,000

150,000
300,000

200,000
100,000
71Mb 80Mb
100,000
50,000
0 Termite hindgut TammarWallaby

Total Assembly Based Pipeline
Gene HMMSearch
Assembly prediction (Glyco Hydrol & CB
Module domains)

179,092 scaffolds Thousands of Potential
2.5 M genes
Cellulolytic Enzymes

Genes Known to Deconstruct
Cellulose
Cellulose

-(1-4)
Endoglucanase
EC 3.2.1.4

Cellobiohydrolase
EC 3.2.1.91 & EC 3.2.1.150 -Glucosidase
EC 3.2.1.21 & EC 3.2.1.74

Previously Newly Discovered in
Known Total Rumen Data

3350
11

~556

Many in “computational” enzymes predicted

But are the in silica predictions real & functional?

Digital Information Functional Information
(Sequence Data) ???????????? (Biochemical Activity)

Large Scale Gene Synthesis

From Sequence Datato Function Info
From Sequence to Functional

SAMPLE

SEQUENCE EXPRESSION
DATA ANALYSIS SYNTHESIS
VALIDATE SYSTEM

Next-Gen Expression
Oligo synthesis and assembly Sequencing Vectors

GH1 Functional Analysis

Industrial process requires:

-Activity at 70 degrees C

Cellobiose -Stability at pH 4.5
GH1

Glucose

Select 300 Candidates for Synthesis and Functional
Characterization Maximizing Phylogenetic Space

Cellobiose

GH1

Glucose

Synthetic GH1 activity profiles
(Temperature & pH )

Settemperature optima

17 Enzymes identified active at pH 4.5 & 70 degrees C

How can we use this information
to improve bioenergy yield?

How can we use this information
to improve bioenergy yield?

Biofuel Amylase
Corn

Conclusion
Trawling deep metagenomic data is a successful
strategy to massively add to the diversity of enzymes
with desired activities

Conclusion
Trawling deep metagenomic data is a successful
strategy to massively add to the diversity of enzymes
with desired activities

Can we assemble genomes from deep metagenomic data?

Information in Genomes vs Genes

Pathways, Which organism is doing what, Capabilities of
particular organism…

Short Read Assemblers
(varied parameters combined results)

0.5 TB Metagenome

Assembly 1 Assembly 2 Assembly 3

Clustering

Contigs
&
Scaffolds

What we have are scaffolds but no Genomes

Bin of Scaffolds into Draft Genomes
Tetra Nucleotide Frequency and Sequence Coverage

Fragments (scaffolds) of DNA from the same bacterial
species have a similar tetranucleotideferquenciy

Fibrobacteres
Proteobacteria
Cyanobacteria

Assembled from Cow Rumen Microbiome 15 Draft Genomes
None of Which Have Ever Been Previously Reported
(1.8-3.3 MB)

Proof that in silica assembled genomes of hard to culture
organisms without a reference genome?

Single Cell Genome Sequencing

72h

rumen community

FACS

single cells
Multiple Displacement Amplification
(MDA)

Shot gun sequencing
and genome assembly
of DNA from isolated
single amplified genomes single cell

Metagenomic versus Single Cell
Derived Genome

Binned MetagenomicScaffolds
(Draft In Silica 3.1 Mb Genome)

Single Cell
Genome Reads
Single Cell Genome Reads Map to
every MetagenomicScaffold
Suggests that the scaffolds that bin
together are from the same
organism

>90% of single cell genome reads map to this single draft genome
Suggests that the draft genome is fairly complete

Conclusion

Ultra deep metagenomic
sequencing, even with short
reads, likely to increasingly
become a method of choice to
identify genes and characterize
the genomes of uncultured
organisms

Will enable the exploitation of the
diverse capabilities present in
environmental organisms to offer
biotech solutions

Voxelation
Gene expression tomography
Alex Sczyrba
Matthias Hess
TanjaWoyke

Voxelation
+ =
GET

Lo thruput + Hi thruput = Hi thruput
3D info 0D info 3D info

Crump Institute for Molecular Imaging

Discovery of Cow Rumen Biomass-Degrading Genes and Genomes through DNA Sequencing, DNA Synthesis and Single Cell Genomics, DOE Joint Genome Institute, Eddy Rubin, Copenhagenomics 2012

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Viewers also liked

Viewers also liked (20)

Similar to Discovery of Cow Rumen Biomass-Degrading Genes and Genomes through DNA Sequencing, DNA Synthesis and Single Cell Genomics, DOE Joint Genome Institute, Eddy Rubin, Copenhagenomics 2012

Similar to Discovery of Cow Rumen Biomass-Degrading Genes and Genomes through DNA Sequencing, DNA Synthesis and Single Cell Genomics, DOE Joint Genome Institute, Eddy Rubin, Copenhagenomics 2012 (20)

More from Copenhagenomics

More from Copenhagenomics (15)

Recently uploaded

Recently uploaded (20)

Discovery of Cow Rumen Biomass-Degrading Genes and Genomes through DNA Sequencing, DNA Synthesis and Single Cell Genomics, DOE Joint Genome Institute, Eddy Rubin, Copenhagenomics 2012

Editor's Notes