SlideShare a Scribd company logo
Lecture 3:
EVE 161:

Microbial Phylogenomics
Lecture #5:
Modern View of Tree of Life
UC Davis, Winter 2016
Instructors: Jonathan Eisen & Holly Ganz
Where we are going and where we have been
• Previous lecture:
!4. Background on Phylogeny
• Current Lecture:
!5. Modern view of Tree of Life
• Next Lecture:
!6. rRNA from environments
!2
Three papers for today
Syst. Biol. 59(5):518–533, 2010
c⃝ The Author(s) 2010. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved.
For Permissions, please email: journals.permissions@oxfordjournals.org
DOI:10.1093/sysbio/syq037
Advance Access publication on July 23, 2010
Broadly Sampled Multigene Analyses Yield a Well-Resolved Eukaryotic Tree of Life
LAURA WEGENER PARFREY1
, JESSICA GRANT2
, YONAS I. TEKLE2,6
, ERICA LASEK-NESSELQUIST3,4
,
HILARY G. MORRISON3
, MITCHELL L. SOGIN3
, DAVID J. PATTERSON5
, AND LAURA A. KATZ1,2,∗
1Program in Organismic and Evolutionary Biology, University of Massachusetts, 611 North Pleasant Street, Amherst,
MA 01003, USA; 2Department of Biological Sciences, Smith College, 44 College Lane, Northampton, MA 01063, USA; 3Bay Paul Center for
Comparative Molecular Biology and Evolution, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA; 4Department of Ecology and
Evolutionary Biology, Brown University, 80 Waterman Street, Providence, RI 02912, USA; 5Biodiversity Informatics Group, Marine Biological
Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA; 6Present address: Department of Epidemiology and Public Health, Yale University School of
Medicine, New Haven, CT 06520, USA;
∗Correspondence to be sent to: Laura A. Katz, 44 College Lane, Northampton, MA 01003, USA; E-mail: lkatz@smith.edu.
Laura Wegener Parfrey and Jessica Grant have contributed equally to this work.
Received 30 September 2009; reviews returned 1 December 2009; accepted 25 May 2010
Associate Editor: C´ecile An´e
Abstract.—An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations underlying the
diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes. Previous work has divided eukaryotic diver-
sity into a small number of high-level “supergroups,” many of which receive strong support in phylogenomic analyses.
However, the abundance of data in phylogenomic analyses can lead to highly supported but incorrect relationships due
to systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or fewer) included in these
genomic studies may exaggerate systematic error and reduce power to evaluate hypotheses. Here, we use a taxon-rich
strategy to assess eukaryotic relationships. We show that analyses emphasizing broad taxonomic sampling (up to 451 taxa
representing 72 major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic tree of life.
The consistency across analyses with varying numbers of taxa (88–451) and levels of missing data (17–69%) supports the
accuracy of the resulting topologies. The resulting stable topology emerges without the removal of rapidly evolving genes
or taxa, a practice common to phylogenomic analyses. Several major groups are stable and strongly supported in these
analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata” is rejected. Furthermore, ex-
tensive instability among photosynthetic lineages suggests the presence of systematic biases including endosymbiotic gene
transfer from symbiont (nucleus or plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary
relationships can be achieved with broad taxonomic sampling and a moderate number of genes. Finally, taxon-rich analy-
ses such as presented here provide a method for testing the accuracy of relationships that receive high bootstrap support
atUniversityohttp://sysbio.oxfordjournals.org/Downloadedfrom
first published online 24 October 2012, doi: 10.1098/rspb.2012.17952792012Proc. R. Soc. B
Tom A. Williams, Peter G. Foster, Tom M. W. Nye, Cymon J. Cox and T. Martin Embley
the Archaea
A congruent phylogenomic signal places eukaryotes within
Supplementary data
tml
http://rspb.royalsocietypublishing.org/content/suppl/2012/10/18/rspb.2012.1795.DC1.h
"Data Supplement"
References
http://rspb.royalsocietypublishing.org/content/279/1749/4870.full.html#related-urls
Article cited in:
http://rspb.royalsocietypublishing.org/content/279/1749/4870.full.html#ref-list-1
This article cites 56 articles, 35 of which can be accessed free
This article is free to access
Subject collections
(1595 articles)evolution
(25 articles)bioinformatics
Articles on similar topics can be found in the following collections
on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from
!3
ARTICLE doi:10.1038/nature14447
Complex archaea that bridge the gap
between prokaryotes and eukaryotes
Anja Spang1
*, Jimmy H. Saw1
*, Steffen L. Jørgensen2
*, Katarzyna Zaremba-Niedzwiedzka1
*, Joran Martijn1
, Anders E. Lind1
,
Roel van Eijk1
{, Christa Schleper2,3
, Lionel Guy1,4
& Thijs J. G. Ettema1
The origin of the eukaryotic cell remains one of the most contentious puzzles in modern biology. Recent studies
Palfrey et al.
Syst. Biol. 59(5):518–533, 2010
c⃝ The Author(s) 2010. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved.
For Permissions, please email: journals.permissions@oxfordjournals.org
DOI:10.1093/sysbio/syq037
Advance Access publication on July 23, 2010
Broadly Sampled Multigene Analyses Yield a Well-Resolved Eukaryotic Tree of Life
LAURA WEGENER PARFREY1
, JESSICA GRANT2
, YONAS I. TEKLE2,6
, ERICA LASEK-NESSELQUIST3,4
,
HILARY G. MORRISON3
, MITCHELL L. SOGIN3
, DAVID J. PATTERSON5
, AND LAURA A. KATZ1,2,∗
1Program in Organismic and Evolutionary Biology, University of Massachusetts, 611 North Pleasant Street, Amherst,
MA 01003, USA; 2Department of Biological Sciences, Smith College, 44 College Lane, Northampton, MA 01063, USA; 3Bay Paul Center for
Comparative Molecular Biology and Evolution, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA; 4Department of Ecology and
Evolutionary Biology, Brown University, 80 Waterman Street, Providence, RI 02912, USA; 5Biodiversity Informatics Group, Marine Biological
Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA; 6Present address: Department of Epidemiology and Public Health, Yale University School of
Medicine, New Haven, CT 06520, USA;
∗Correspondence to be sent to: Laura A. Katz, 44 College Lane, Northampton, MA 01003, USA; E-mail: lkatz@smith.edu.
Laura Wegener Parfrey and Jessica Grant have contributed equally to this work.
Received 30 September 2009; reviews returned 1 December 2009; accepted 25 May 2010
Associate Editor: C´ecile An´e
Abstract.—An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations underlying the
diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes. Previous work has divided eukaryotic diver-
sity into a small number of high-level “supergroups,” many of which receive strong support in phylogenomic analyses.
However, the abundance of data in phylogenomic analyses can lead to highly supported but incorrect relationships due
to systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or fewer) included in these
genomic studies may exaggerate systematic error and reduce power to evaluate hypotheses. Here, we use a taxon-rich
strategy to assess eukaryotic relationships. We show that analyses emphasizing broad taxonomic sampling (up to 451 taxa
representing 72 major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic tree of life.
The consistency across analyses with varying numbers of taxa (88–451) and levels of missing data (17–69%) supports the
accuracy of the resulting topologies. The resulting stable topology emerges without the removal of rapidly evolving genes
or taxa, a practice common to phylogenomic analyses. Several major groups are stable and strongly supported in these
analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata” is rejected. Furthermore, ex-
tensive instability among photosynthetic lineages suggests the presence of systematic biases including endosymbiotic gene
transfer from symbiont (nucleus or plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary
!4
Abstract
An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations
underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes.
Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,”
many of which receive strong support in phylogenomic analyses. However, the abundance of
data in phylogenomic analyses can lead to highly supported but incorrect relationships due to
systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or
fewer) included in these genomic studies may exaggerate systematic error and reduce power to
evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We
show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72
major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic
tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels
of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable
topology emerges without the removal of rapidly evolving genes or taxa, a practice common to
phylogenomic analyses. Several major groups are stable and strongly supported in these
analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata”
is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the
presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or
plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary
relationships can be achieved with broad taxonomic sampling and a moderate number of
genes. Finally, taxon-rich analyses such as presented here provide a method for testing the
accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses
and enable placement of the multitude of lineages that lack genome scale data. [Excavata;
microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.]
!5
Abstract
An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations
underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes.
Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,”
many of which receive strong support in phylogenomic analyses. However, the abundance of
data in phylogenomic analyses can lead to highly supported but incorrect relationships due to
systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or
fewer) included in these genomic studies may exaggerate systematic error and reduce power to
evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We
show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72
major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic
tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels
of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable
topology emerges without the removal of rapidly evolving genes or taxa, a practice common to
phylogenomic analyses. Several major groups are stable and strongly supported in these
analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata”
is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the
presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or
plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary
relationships can be achieved with broad taxonomic sampling and a moderate number of
genes. Finally, taxon-rich analyses such as presented here provide a method for testing the
accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses
and enable placement of the multitude of lineages that lack genome scale data. [Excavata;
microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.]
!6
Abstract
An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations
underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes.
Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,”
many of which receive strong support in phylogenomic analyses. However, the abundance of
data in phylogenomic analyses can lead to highly supported but incorrect relationships due to
systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or
fewer) included in these genomic studies may exaggerate systematic error and reduce power to
evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We
show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72
major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic
tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels
of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable
topology emerges without the removal of rapidly evolving genes or taxa, a practice common to
phylogenomic analyses. Several major groups are stable and strongly supported in these
analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata”
is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the
presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or
plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary
relationships can be achieved with broad taxonomic sampling and a moderate number of
genes. Finally, taxon-rich analyses such as presented here provide a method for testing the
accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses
and enable placement of the multitude of lineages that lack genome scale data. [Excavata;
microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.]
!7
Abstract
An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations
underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes.
Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,”
many of which receive strong support in phylogenomic analyses. However, the abundance of
data in phylogenomic analyses can lead to highly supported but incorrect relationships due to
systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or
fewer) included in these genomic studies may exaggerate systematic error and reduce power to
evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We
show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72
major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic
tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels
of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable
topology emerges without the removal of rapidly evolving genes or taxa, a practice common to
phylogenomic analyses. Several major groups are stable and strongly supported in these
analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata”
is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the
presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or
plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary
relationships can be achieved with broad taxonomic sampling and a moderate number of
genes. Finally, taxon-rich analyses such as presented here provide a method for testing the
accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses
and enable placement of the multitude of lineages that lack genome scale data. [Excavata;
microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.]
!8
Abstract
An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations
underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes.
Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,”
many of which receive strong support in phylogenomic analyses. However, the abundance of
data in phylogenomic analyses can lead to highly supported but incorrect relationships due to
systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or
fewer) included in these genomic studies may exaggerate systematic error and reduce power to
evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We
show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72
major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic
tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels
of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable
topology emerges without the removal of rapidly evolving genes or taxa, a practice common to
phylogenomic analyses. Several major groups are stable and strongly supported in these
analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata”
is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the
presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or
plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary
relationships can be achieved with broad taxonomic sampling and a moderate number of
genes. Finally, taxon-rich analyses such as presented here provide a method for testing the
accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses
and enable placement of the multitude of lineages that lack genome scale data. [Excavata;
microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.]
!9
Introduction
!10
Introduction
!11
!12
!13
Introduction
• Questions about Introduction?
Methods
!16
!17
!18
Methods
• Questions about Methods?
Results and Discussion
• Questions about Methods?
!21
Fig 1: 451 Taxa and some of the 16 genes
!22
Fig. 2: 88 Taxa each w/ 10 or more of the 16 genes
!26
!35
!36
Just Rhizaria
!37
Just Excavata
!38
Consensus
!39
Results and Discussions
• Questions
Conclusions
!41
!42
Williams et al.
first published online 24 October 2012, doi: 10.1098/rspb.2012.17952792012Proc. R. Soc. B
Tom A. Williams, Peter G. Foster, Tom M. W. Nye, Cymon J. Cox and T. Martin Embley
the Archaea
A congruent phylogenomic signal places eukaryotes within
Supplementary data
tml
http://rspb.royalsocietypublishing.org/content/suppl/2012/10/18/rspb.2012.1795.DC1.h
"Data Supplement"
References
http://rspb.royalsocietypublishing.org/content/279/1749/4870.full.html#related-urls
Article cited in:
http://rspb.royalsocietypublishing.org/content/279/1749/4870.full.html#ref-list-1
This article cites 56 articles, 35 of which can be accessed free
This article is free to access
Subject collections
(178 articles)taxonomy and systematics
(1595 articles)evolution
(25 articles)bioinformatics
Articles on similar topics can be found in the following collections
Email alerting service hereright-hand corner of the article or click
Receive free email alerts when new articles cite this article - sign up in the box at the top
!43
Abstract
Determining the relationships among the major groups of cellular life is important for
understanding the evolution of biological diversity, but is difficult given the enormous
time spans involved. In the textbook ‘three domains’ tree based on informational genes,
eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria.
However, some phylogenetic analyses of the same data have placed eukaryotes within
the Archaea, as the nearest relatives of different archaeal lineages. We compared the
support for these competing hypotheses using sophisticated phylogenetic methods and
an improved sampling of archaeal biodiversity. We also employed both new and existing
tests of phylogenetic congruence to explore the level of uncertainty and conflict in the
data. Our analyses suggested that much of the observed incongruence is weakly
supported or associated with poorly fitting evolutionary models. All of our phylogenetic
analyses, whether on small subunit and large subunit ribosomal RNA or concatenated
protein-coding genes, recovered a monophyletic group containing eukaryotes and the
TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota,
Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the
iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis
whereby vital components of the eukaryotic nuclear lineage originated from within the
archaeal radiation.
!44
Abstract
Determining the relationships among the major groups of cellular life is important for
understanding the evolution of biological diversity, but is difficult given the enormous
time spans involved. In the textbook ‘three domains’ tree based on informational genes,
eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria.
However, some phylogenetic analyses of the same data have placed eukaryotes within
the Archaea, as the nearest relatives of different archaeal lineages. We compared the
support for these competing hypotheses using sophisticated phylogenetic methods and
an improved sampling of archaeal biodiversity. We also employed both new and existing
tests of phylogenetic congruence to explore the level of uncertainty and conflict in the
data. Our analyses suggested that much of the observed incongruence is weakly
supported or associated with poorly fitting evolutionary models. All of our phylogenetic
analyses, whether on small subunit and large subunit ribosomal RNA or concatenated
protein-coding genes, recovered a monophyletic group containing eukaryotes and the
TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota,
Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the
iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis
whereby vital components of the eukaryotic nuclear lineage originated from within the
archaeal radiation.
!45
Abstract
Determining the relationships among the major groups of cellular life is important for
understanding the evolution of biological diversity, but is difficult given the enormous
time spans involved. In the textbook ‘three domains’ tree based on informational genes,
eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria.
However, some phylogenetic analyses of the same data have placed eukaryotes within
the Archaea, as the nearest relatives of different archaeal lineages. We compared the
support for these competing hypotheses using sophisticated phylogenetic methods and
an improved sampling of archaeal biodiversity. We also employed both new and existing
tests of phylogenetic congruence to explore the level of uncertainty and conflict in the
data. Our analyses suggested that much of the observed incongruence is weakly
supported or associated with poorly fitting evolutionary models. All of our phylogenetic
analyses, whether on small subunit and large subunit ribosomal RNA or concatenated
protein-coding genes, recovered a monophyletic group containing eukaryotes and the
TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota,
Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the
iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis
whereby vital components of the eukaryotic nuclear lineage originated from within the
archaeal radiation.
!46
Abstract
Determining the relationships among the major groups of cellular life is important for
understanding the evolution of biological diversity, but is difficult given the enormous
time spans involved. In the textbook ‘three domains’ tree based on informational genes,
eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria.
However, some phylogenetic analyses of the same data have placed eukaryotes within
the Archaea, as the nearest relatives of different archaeal lineages. We compared the
support for these competing hypotheses using sophisticated phylogenetic methods and
an improved sampling of archaeal biodiversity. We also employed both new and existing
tests of phylogenetic congruence to explore the level of uncertainty and conflict in the
data. Our analyses suggested that much of the observed incongruence is weakly
supported or associated with poorly fitting evolutionary models. All of our phylogenetic
analyses, whether on small subunit and large subunit ribosomal RNA or concatenated
protein-coding genes, recovered a monophyletic group containing eukaryotes and the
TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota,
Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the
iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis
whereby vital components of the eukaryotic nuclear lineage originated from within the
archaeal radiation.
!47
Abstract
Determining the relationships among the major groups of cellular life is important for
understanding the evolution of biological diversity, but is difficult given the enormous
time spans involved. In the textbook ‘three domains’ tree based on informational genes,
eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria.
However, some phylogenetic analyses of the same data have placed eukaryotes within
the Archaea, as the nearest relatives of different archaeal lineages. We compared the
support for these competing hypotheses using sophisticated phylogenetic methods and
an improved sampling of archaeal biodiversity. We also employed both new and existing
tests of phylogenetic congruence to explore the level of uncertainty and conflict in the
data. Our analyses suggested that much of the observed incongruence is weakly
supported or associated with poorly fitting evolutionary models. All of our phylogenetic
analyses, whether on small subunit and large subunit ribosomal RNA or concatenated
protein-coding genes, recovered a monophyletic group containing eukaryotes and the
TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota,
Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the
iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis
whereby vital components of the eukaryotic nuclear lineage originated from within the
archaeal radiation.
!48
Introduction
!49
!50
Introduction
!51
Introduction
!52
Introduction
!53
Introduction
Introduction Questions
Results and Discussion
Methods?????
Methods?????
Methods?????
Methods?????
Results and Discussion
!61
Results and Discussion
!62
Results and Discussion
rRNA
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Archaeoglobus fulgidus
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Methanosarcina mazei
Thermoplasma volcanium
Giardia lamblia
Trichomonas vaginalis
Naegleria gruberi
Arabidopsis thaliana
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Dictyostelium discoideum
Trypanosoma brucei
Entamoeba histolytica
Cenarchaeum symbiosum
Nitrosopumilus maritimus
Korarchaeum cryptofilum
Caldiarchaeum subterraneum
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Staphylothermus marinus
Hyperthermus butylicus
Ignicoccus hospitalis
Aeropyrum pernix
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Clostridium acetobutylicum
Synechocystis sp.
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
0.83
1
0.2
(a)
Bacteria
Euryarchaeota
Crenarchaeota
Eukaryota
Trichomonas vaginalis
Arabidopsis thaliana
Giardia lamblia
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Dictyostelium discoideum
Trypanosoma brucei
Entamoeba histolytica
Naegleria gruberi
Archaeoglobus fulgidus
Methanosarcina mazei
Thermoplasma volcanium
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Hyperthermus butylicus
Staphylothermus marinus
Ignicoccus hospitalis
Aeropyrum pernix
Clostridium acetobutylicum
Synechocystis sp.
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
1
0.2
(b)
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Archaeoglobus fulgidus
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Methanosarcina mazei
Thermoplasma volcanium
Trichomonas vaginalis
Giardia lamblia
Naegleria gruberi
Entamoeba histolytica
Dictyostelium discoideum
Trypanosoma brucei
Arabidopsis thaliana
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Cenarchaeum symbiosum
Nitrosopumilus maritimus
Korarchaeum cryptofilum
Caldiarchaeum subterraneum
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Hyperthermus butylicus
Ignicoccus hospitalis
Staphylothermus marinus
Aeropyrum pernix
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Clostridium acetobutylicum
Synechocystis sp.
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
1
1
0.2
(c)
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Methanococcus jannaschii
Thermoplasma volcanium
Methanosarcina mazei
Archaeoglobus fulgidus
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Korarchaeum cryptofilum
Nitrosopumilus maritimus
Cenarchaeum symbiosum
Caldiarchaeum subterraneum
Giardia lamblia
Homo sapiens
Thalassiosira pseudonana
Saccharomyces cerevisiae
Trypanosoma brucei
Naegleria gruberi
Entamoeba histolytica
Trichomonas vaginalis
Dictyostelium discoideum
Arabidopsis thaliana
Thermofilum pendens
Pyrobaculum aerophilum
Caldivirga maquilingensis
Sulfolobus solfataricus
Staphylothermus marinus
Aeropyrum pernix
Ignicoccus hospitalis
Hyperthermus butylicus
Rhodopirellula baltica
Synechocystis sp.
Clostridium acetobutylicum
Treponema pallidum
Chlamydia trachomatis
Rhodopseudomonas palustris
Escherichia coli
Campylobacter jejuni
1
1
0.57
1
0.97
0.2
(d)
Figure 1. Phylogenies of Bacteria, Archaea and eukaryotes inferred from concatenated rRNA. (a) A Bayesian phylogeny of Bac-
teria, Archaea and eukaryotes inferred under the GTR model, showing an eocyte-like topology in which eukaryotes emerge
from within the Archaea with maximal support (posterior probability (PP) ¼ 1). (b) Removal of recently characterized archaeal
groups (the Thaumarchaeota, Aigarchaeota and Korarchaeota) converts this tree into a canonical three-domains topology,
again with maximal support (PP ¼ 1), indicating that sampling plays an important role in the resolution of these ancient
relationships. Analyses of the full dataset using the better-fitting NDRH þ NDCH (c) and CAT (d) models recover maximally
supported eocyte-like topologies; these models also recover eocyte-like topologies on the reduced dataset, without the TAK
sequences (see the electronic supplementary material, figure S1). Branch lengths are proportional to substitutions per site.
Evolution of eukaryotes from Archaea T. A. Williams et al. 4873
Proc. R. Soc. B (2012)
on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from
!63
Figure 1. Phylogenies of Bacteria, Archaea and eukaryotes inferred
from concatenated rRNA. (a) A Bayesian phylogeny of Bacteria,
Archaea and eukaryotes inferred under the GTR model, showing an
eocyte-like topology in which eukaryotes emerge from within the
Archaea with maximal support (posterior probability (PP) 1⁄4 1). (b)
Removal of recently characterized archaeal groups (the
Thaumarchaeota, Aigarchaeota and Korarchaeota) converts this tree
into a canonical three-domains topology, again with maximal support
(PP 1⁄4 1), indicating that sampling plays an important role in the
resolution of these ancient relationships. Analyses of the full dataset
using the better-fitting NDRH þ NDCH (c) and CAT (d ) models
recover maximally supported eocyte-like topologies; these models
also recover eocyte-like topologies on the reduced dataset, without
the TAK sequences (see the electronic supplementary material,
figure S1). Branch lengths are proportional to substitutions per site.
rRNA
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Archaeoglobus fulgidus
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Methanosarcina mazei
Thermoplasma volcanium
Giardia lamblia
Trichomonas vaginalis
Naegleria gruberi
Arabidopsis thaliana
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Dictyostelium discoideum
Trypanosoma brucei
Entamoeba histolytica
Cenarchaeum symbiosum
Nitrosopumilus maritimus
Korarchaeum cryptofilum
Caldiarchaeum subterraneum
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Staphylothermus marinus
Hyperthermus butylicus
Ignicoccus hospitalis
Aeropyrum pernix
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Clostridium acetobutylicum
Synechocystis sp.
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
0.83
1
0.2
(a)
Bacteria
Euryarchaeota
Crenarchaeota
Eukaryota
Trichomonas vaginalis
Arabidopsis thaliana
Giardia lamblia
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Dictyostelium discoideum
Trypanosoma brucei
Entamoeba histolytica
Naegleria gruberi
Archaeoglobus fulgidus
Methanosarcina mazei
Thermoplasma volcanium
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Hyperthermus butylicus
Staphylothermus marinus
Ignicoccus hospitalis
Aeropyrum pernix
Clostridium acetobutylicum
Synechocystis sp.
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
1
0.2
(b)
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Archaeoglobus fulgidus
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Methanosarcina mazei
Thermoplasma volcanium
Trichomonas vaginalis
Giardia lamblia
Naegleria gruberi
Entamoeba histolytica
Dictyostelium discoideum
Trypanosoma brucei
Arabidopsis thaliana
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Cenarchaeum symbiosum
Nitrosopumilus maritimus
Korarchaeum cryptofilum
Caldiarchaeum subterraneum
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Hyperthermus butylicus
Ignicoccus hospitalis
Staphylothermus marinus
Aeropyrum pernix
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Clostridium acetobutylicum
Synechocystis sp.
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
1
1
0.2
(c)
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Methanococcus jannaschii
Thermoplasma volcanium
Methanosarcina mazei
Archaeoglobus fulgidus
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Korarchaeum cryptofilum
Nitrosopumilus maritimus
Cenarchaeum symbiosum
Caldiarchaeum subterraneum
Giardia lamblia
Homo sapiens
Thalassiosira pseudonana
Saccharomyces cerevisiae
Trypanosoma brucei
Naegleria gruberi
Entamoeba histolytica
Trichomonas vaginalis
Dictyostelium discoideum
Arabidopsis thaliana
Thermofilum pendens
Pyrobaculum aerophilum
Caldivirga maquilingensis
Sulfolobus solfataricus
Staphylothermus marinus
Aeropyrum pernix
Ignicoccus hospitalis
Hyperthermus butylicus
Rhodopirellula baltica
Synechocystis sp.
Clostridium acetobutylicum
Treponema pallidum
Chlamydia trachomatis
Rhodopseudomonas palustris
Escherichia coli
Campylobacter jejuni
1
1
0.57
1
0.97
0.2
(d)
Figure 1. Phylogenies of Bacteria, Archaea and eukaryotes inferred from concatenated rRNA. (a) A Bayesian phylogeny of Bac-
teria, Archaea and eukaryotes inferred under the GTR model, showing an eocyte-like topology in which eukaryotes emerge
from within the Archaea with maximal support (posterior probability (PP) ¼ 1). (b) Removal of recently characterized archaeal
groups (the Thaumarchaeota, Aigarchaeota and Korarchaeota) converts this tree into a canonical three-domains topology,
again with maximal support (PP ¼ 1), indicating that sampling plays an important role in the resolution of these ancient
relationships. Analyses of the full dataset using the better-fitting NDRH þ NDCH (c) and CAT (d) models recover maximally
supported eocyte-like topologies; these models also recover eocyte-like topologies on the reduced dataset, without the TAK
sequences (see the electronic supplementary material, figure S1). Branch lengths are proportional to substitutions per site.
Evolution of eukaryotes from Archaea T. A. Williams et al. 4873
Proc. R. Soc. B (2012)
on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from
!65
rRNA
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Archaeoglobus fulgidus
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Methanosarcina mazei
Thermoplasma volcanium
Giardia lamblia
Trichomonas vaginalis
Naegleria gruberi
Arabidopsis thaliana
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Dictyostelium discoideum
Trypanosoma brucei
Entamoeba histolytica
Cenarchaeum symbiosum
Nitrosopumilus maritimus
Korarchaeum cryptofilum
Caldiarchaeum subterraneum
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Staphylothermus marinus
Hyperthermus butylicus
Ignicoccus hospitalis
Aeropyrum pernix
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Clostridium acetobutylicum
Synechocystis sp.
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
0.83
1
0.2
(a)
Bacteria
Euryarchaeota
Crenarchaeota
Eukaryota
Trichomonas vaginalis
Arabidopsis thaliana
Giardia lamblia
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Dictyostelium discoideum
Trypanosoma brucei
Entamoeba histolytica
Naegleria gruberi
Archaeoglobus fulgidus
Methanosarcina mazei
Thermoplasma volcanium
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Hyperthermus butylicus
Staphylothermus marinus
Ignicoccus hospitalis
Aeropyrum pernix
Clostridium acetobutylicum
Synechocystis sp.
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
1
0.2
(b)
Evolution of eukaryotes from Archaea T. A. Williams et al.
on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from
With New Data
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Archaeoglobus fulgidus
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Methanosarcina mazei
Thermoplasma volcanium
Giardia lamblia
Trichomonas vaginalis
Naegleria gruberi
Arabidopsis thaliana
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Dictyostelium discoideum
Trypanosoma brucei
Entamoeba histolytica
Cenarchaeum symbiosum
Nitrosopumilus maritimus
Korarchaeum cryptofilum
Caldiarchaeum subterraneum
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Staphylothermus marinus
Hyperthermus butylicus
Ignicoccus hospitalis
Aeropyrum pernix
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Clostridium acetobutylicum
Synechocystis sp.
Treponema pallidum
Chlamydia trachomatis
1
1
1
1
0.83
1
(a)
Bacteria
Euryarchaeota
Crenarchaeota
Eukaryota
Trichomonas vaginalis
Arabidopsis thaliana
Giardia lamblia
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Dictyostelium discoideum
Trypanosoma brucei
Entamoeba histolytica
Naegleria gruberi
Archaeoglobus fulgidus
Methanosarcina mazei
Thermoplasma volcanium
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Hyperthermus butylicus
Staphylothermus marinus
Ignicoccus hospitalis
Aeropyrum pernix
Clostridium acetobutylicum
Synechocystis sp.
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
1
0.2
(b)
Evolution of eukaryotes from Archaea T. A. Williams et al.
Without New Data
!66
rRNA w/ Better Models
Rhodopirellula baltica
0.2
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Archaeoglobus fulgidus
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Methanosarcina mazei
Thermoplasma volcanium
Trichomonas vaginalis
Giardia lamblia
Naegleria gruberi
Entamoeba histolytica
Dictyostelium discoideum
Trypanosoma brucei
Arabidopsis thaliana
Homo sapiens
Saccharomyces cerevisiae
Thalassiosira pseudonana
Cenarchaeum symbiosum
Nitrosopumilus maritimus
Korarchaeum cryptofilum
Caldiarchaeum subterraneum
Caldivirga maquilingensis
Pyrobaculum aerophilum
Thermofilum pendens
Sulfolobus solfataricus
Hyperthermus butylicus
Ignicoccus hospitalis
Staphylothermus marinus
Aeropyrum pernix
Campylobacter jejuni
Escherichia coli
Rhodopseudomonas palustris
Clostridium acetobutylicum
Synechocystis sp.
Treponema pallidum
Chlamydia trachomatis
Rhodopirellula baltica
1
1
1
1
1
1
0.2
(c)
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Methanococcus jannaschii
Thermoplasma volcanium
Methanosarcina mazei
Archaeoglobus fulgidus
Methanothermobacter thermautotrophicus
Pyrococcus furiosus
Korarchaeum cryptofilum
Nitrosopumilus maritimus
Cenarchaeum symbiosum
Caldiarchaeum subterraneum
Giardia lamblia
Homo sapiens
Thalassiosira pseudonana
Saccharomyces cerevisiae
Trypanosoma brucei
Naegleria gruberi
Entamoeba histolytica
Trichomonas vaginalis
Dictyostelium discoideum
Arabidopsis thaliana
Thermofilum pendens
Pyrobaculum aerophilum
Caldivirga maquilingensis
Sulfolobus solfataricus
Staphylothermus marinus
Aeropyrum pernix
Ignicoccus hospitalis
Hyperthermus butylicus
Rhodopirellula baltica
Synechocystis sp.
Clostridium acetobutylicum
Treponema pallidum
Chlamydia trachomatis
Rhodopseudomonas palustris
Escherichia coli
Campylobacter jejuni
1
1
0.57
1
0.97
0.2
(d)
!67
!68
!70
!71
Concatenated Proteins
Bacteria
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Methanothermobacter thermautotrophicus
Methanococcus jannaschii
Thermoplasma volcanium
Methanosarcina mazei
Archaeoglobus fulgidus
Pyrococcus furiosus
Giardia lamblia
Trichomonas vaginalis
Thalassiosira pseudonana
Phytophthora ramorum
Saccharomyces cerevisiae
Homo sapiens
Entamoeba histolytica
Dictyostelium discoideum
Leishmania major
Arabidopsis thaliana
Korarchaeum cryptofilum
Nitrosopumilus maritimus
Nitrosoarchaeum limnia
Cenarchaeum symbiosum
Caldiarchaeum subterraneum
Thermofilum pendens
Pyrobaculum aerophilum
Caldivirga maquilingensis
Staphylothermus marinus
Sulfolobus solfataricus
Ignicoccus hospitalis
Aeropyrum pernix
Hyperthermus butylicus
Rhodopseudomonas palustris
Escherichia coli
Treponema pallidum
Rhodopirellula baltica
Chlamydia trachomatis
Synechocystis sp.
Clostridium acetobutylicum
Campylobacter jejuni
1
0.51
0.81
0.99
0.99
1
0.99
1
1
0.2
(a)
Euryarchaeota
Korarchaeota
Crenarchaeota
Aigarchaeota
Thaumarchaeota
Eukaryota
Pyrococcus furiosus
Methanococcus jannaschii
Methanothermobacter thermautotrophicus
Thermoplasma acidophilum
Archaeoglobus fulgidus
Methanosarcina mazei
Trichomonas vaginalis
Giardia lamblia
Entamoeba histolytica
Naegleria gruberi
Leishmania major
Dictyostelium discoideum
Saccharomyces cerevisiae
Homo sapiens
Arabidopsis thaliana
Thalassiosira pseudonana
Phytophthora ramorum
Korarchaeum cryptofilum
Caldiarchaeum subterraneum
Cenarchaeum symbiosum
Nitrosopumilus maritimus
Nitrosoarchaeum limnia
Thermofilum pendens
Pyrobaculum aerophilum
Caldivirga maquilingensis
Sulfolobus solfataricus
Ignicoccus hospitalis
Staphylothermus marinus
Hyperthermus butylicus
Aeropyrum pernix
1
1
1
0.99
1
1
0.5
(b)
Figure 2. Phylogenies of Bacteria, Archaea and eukaryotes inferred from conserved protein-coding genes. (a) A phylogeny
inferred from 29 concatenated proteins conserved between Bacteria, Archaea and eukaryotes. An eocyte topology was recov-
ered with strong (PP ¼ 0.99) support. In this phylogeny, the eukaryotes emerge as the sister group of Korarchaeum, nested with
the TACK superphylum. (b) A phylogeny inferred from 63 concatenated proteins shared between Archaea and eukaryotes. The
position of the root is not explicitly indicated. However, based on the result from (a) and the electronic supplementary material,
table S4, it is likely to be either within, or on the branch leading to, the Euryarchaea. If this position is correct, then the tree
shows the eukaryotes emerging as the sister group to the TACK superphylum, including Korarchaeum. These trees were
inferred using the CAT model in PHYLOBAYES. Branch lengths are proportional to substitutions per site, except the truncated
bacterial branch in (a).
4874 T. A. Williams et al. Evolution of eukaryotes from Archaea
on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from
!72
Figure 2. Phylogenies of Bacteria, Archaea and eukaryotes inferred from
conserved protein-coding genes. (a) A phylogeny inferred from 29
concatenated proteins conserved between Bacteria, Archaea and
eukaryotes. An eocyte topology was recovered with strong (PP 1⁄4 0.99)
support. In this phylogeny, the eukaryotes emerge as the sister group of
Korarchaeum, nested with the TACK superphylum. (b) A phylogeny
inferred from 63 concatenated proteins shared between Archaea and
eukaryotes. The position of the root is not explicitly indicated. However,
based on the result from (a) and the electronic supplementary material,
table S4, it is likely to be either within, or on the branch leading to, the
Euryarchaea. If this position is correct, then the tree shows the
eukaryotes emerging as the sister group to the TACK superphylum,
including Korarchaeum. These trees were inferred using the CAT model
in PHYLOBAYES. Branch lengths are proportional to substitutions per
site, except the truncated bacterial branch in (a).
!74
!75
Tree Congruence
3. CONCLUSIONS theories of eukaryotic origins [1]. Here, we have com-
distance
frequency
1 2 3 4 5
no.testspassed(P>0.05)
saturation and
homoplasy
site-specific
biochemical diversity
compositional
heterogeneity
0
10
20
30
40
50
60
model
CAT20
LG
(b)
0
50
100
150
200
250
300
(a)
1.0 1.5 2.0 2.5 3.0
density
model
CAT20
LG
0
0.2
0.4
0.6
0.8
1.0
1.2
(c)
distance
Figure 3. Analysing incongruence using a novel measure of distance between gene trees. We used distributions of pairwise geo-
desic distances between gene trees to compare levels of incongruence inferred under different evolutionary models. (a) The
distribution of distances under a single model (CAT20) can be used to identify obvious outliers corresponding to highly incon-
gruent gene trees; a single gene was responsible for the peak highlighted in red, and was removed from subsequent analyses.
(b) Overview of model-fitting tests (posterior predictive simulations) for each gene in the 64AE dataset. The height of the bars
indicates the proportion of genes that ‘passed’ a test under a particular model; we said that a test was passed when the value of
the test statistic on the real data fell within the central 95% of the distribution of values produced by posterior predictive simu-
lation. The results suggest that CAT20 fits better than LG, successfully accounting for the observed levels of saturation and
homoplasy in all but one of the alignments. Both models do a poor job of modelling the site-specific selective constraints in
our dataset, although again CAT20 performs better than LG (13 passes as opposed to 0). (c) Comparison of the distance dis-
tributions inferred under the CAT20 and LG models. The trees inferred under the better-fitting CAT20 model are significantly
more congruent than those inferred under LG (mean distance: 2.68 versus 3.22, p , 0.0001). The significance of this differ-
ence was assessed using a permutation test that took the correlations between pairwise distances into account (see §4). These
results suggest that a significant portion of the incongruence in this dataset of informational genes can be attributed to model
misspecification, rather than genuinely distinct evolutionary histories.
4876 T. A. Williams et al. Evolution of eukaryotes from Archaea
on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from
!76
Figure 3. Analysing incongruence using a novel measure of distance between gene trees. We
used distributions of pairwise geodesic distances between gene trees to compare levels of
incongruence inferred under different evolutionary models. (a) The distribution of distances
under a single model (CAT20) can be used to identify obvious outliers corresponding to
highly incongruent gene trees; a single gene was responsible for the peak highlighted in red,
and was removed from subsequent analyses. (b) Overview of model-fitting tests (posterior
predictive simulations) for each gene in the 64AE dataset. The height of the bars indicates the
proportion of genes that ‘passed’ a test under a particular model; we said that a test was
passed when the value of the test statistic on the real data fell within the central 95% of the
distribution of values produced by posterior predictive simulation. The results suggest that
CAT20 fits better than LG, successfully accounting for the observed levels of saturation and
homoplasy in all but one of the alignments. Both models do a poor job of modelling the site-
specific selective constraints in our dataset, although again CAT20 performs better than LG
(13 passes as opposed to 0). (c) Comparison of the distance distributions inferred under the
CAT20 and LG models. The trees inferred under the better-fitting CAT20 model are
significantly more congruent than those inferred under LG (mean distance: 2.68 versus 3.22,
p , 0.0001). The significance of this difference was assessed using a permutation test that took
the correlations between pairwise distances into account (see §4). These results suggest that a
significant portion of the incongruence in this dataset of informational genes can be attributed
to model misspecification, rather than genuinely distinct evolutionary histories.
Tree Congruence
distance
frequency
1 2 3 4 5
no.testspassed(P>0.05)
saturation and
homoplasy
site-specific
biochemical diversity
compositional
heterogeneity
0
10
20
30
40
50
60
model
CAT20
LG
(b)
0
50
100
150
200
250
300
(a)
1.0 1.5 2.0 2.5 3.0
density
model
CAT20
LG
0
0.2
0.4
0.6
0.8
1.0
1.2
(c)
4876 T. A. Williams et al. Evolution of eukaryotes from Archaea
on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from
!78
Conclusion
!80
!81

More Related Content

What's hot

Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
Jonathan Eisen
 
EVE 161 Winter 2018 Class 10
EVE 161 Winter 2018 Class 10EVE 161 Winter 2018 Class 10
EVE 161 Winter 2018 Class 10
Jonathan Eisen
 
UC Davis EVE161 Lecture 17 by @phylogenomics
 UC Davis EVE161 Lecture 17 by @phylogenomics UC Davis EVE161 Lecture 17 by @phylogenomics
UC Davis EVE161 Lecture 17 by @phylogenomics
Jonathan Eisen
 
EVE 161 Lecture 6
EVE 161 Lecture 6EVE 161 Lecture 6
EVE 161 Lecture 6
Jonathan Eisen
 
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
Jonathan Eisen
 
EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17
Jonathan Eisen
 
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of LifeMicrobial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
Jonathan Eisen
 
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomicsUC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
Jonathan Eisen
 
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome SequencingMicrobial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Jonathan Eisen
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18
Jonathan Eisen
 
Microbial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 16: Shotgun MetagenomicsMicrobial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
Jonathan Eisen
 
EVE 161 Winter 2018 Class 13
EVE 161 Winter 2018 Class 13EVE 161 Winter 2018 Class 13
EVE 161 Winter 2018 Class 13
Jonathan Eisen
 
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Jonathan Eisen
 
EVE 161 Winter 2018 Class 15
EVE 161 Winter 2018 Class 15EVE 161 Winter 2018 Class 15
EVE 161 Winter 2018 Class 15
Jonathan Eisen
 
Marine Host-Microbiome Interactions: Challenges and Opportunities
Marine Host-Microbiome Interactions: Challenges and OpportunitiesMarine Host-Microbiome Interactions: Challenges and Opportunities
Marine Host-Microbiome Interactions: Challenges and Opportunities
Jonathan Eisen
 
UC Davis EVE161 Lecture 10 by @phylogenomics
UC Davis EVE161 Lecture 10 by @phylogenomicsUC Davis EVE161 Lecture 10 by @phylogenomics
UC Davis EVE161 Lecture 10 by @phylogenomicsJonathan Eisen
 
The Seagrass Microbiome Project
The Seagrass Microbiome Project The Seagrass Microbiome Project
The Seagrass Microbiome Project
Jonathan Eisen
 
American Gut Project presentation at Masaryk University
American Gut Project presentation at Masaryk UniversityAmerican Gut Project presentation at Masaryk University
American Gut Project presentation at Masaryk University
mcdonadt
 
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of LifeEVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
Jonathan Eisen
 
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life ExampleEVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
Jonathan Eisen
 

What's hot (20)

Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
 
EVE 161 Winter 2018 Class 10
EVE 161 Winter 2018 Class 10EVE 161 Winter 2018 Class 10
EVE 161 Winter 2018 Class 10
 
UC Davis EVE161 Lecture 17 by @phylogenomics
 UC Davis EVE161 Lecture 17 by @phylogenomics UC Davis EVE161 Lecture 17 by @phylogenomics
UC Davis EVE161 Lecture 17 by @phylogenomics
 
EVE 161 Lecture 6
EVE 161 Lecture 6EVE 161 Lecture 6
EVE 161 Lecture 6
 
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 15: Shotgun Metagenomics
 
EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17EVE 161 Winter 2018 Class 17
EVE 161 Winter 2018 Class 17
 
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of LifeMicrobial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
 
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomicsUC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
UC Davis EVE161 Lecture 8 - rRNA ecology - by Jonathan Eisen @phylogenomics
 
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome SequencingMicrobial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
Microbial Phylogenomics (EVE161) Class 10-11: Genome Sequencing
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18
 
Microbial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 16: Shotgun MetagenomicsMicrobial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
Microbial Phylogenomics (EVE161) Class 16: Shotgun Metagenomics
 
EVE 161 Winter 2018 Class 13
EVE 161 Winter 2018 Class 13EVE 161 Winter 2018 Class 13
EVE 161 Winter 2018 Class 13
 
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
 
EVE 161 Winter 2018 Class 15
EVE 161 Winter 2018 Class 15EVE 161 Winter 2018 Class 15
EVE 161 Winter 2018 Class 15
 
Marine Host-Microbiome Interactions: Challenges and Opportunities
Marine Host-Microbiome Interactions: Challenges and OpportunitiesMarine Host-Microbiome Interactions: Challenges and Opportunities
Marine Host-Microbiome Interactions: Challenges and Opportunities
 
UC Davis EVE161 Lecture 10 by @phylogenomics
UC Davis EVE161 Lecture 10 by @phylogenomicsUC Davis EVE161 Lecture 10 by @phylogenomics
UC Davis EVE161 Lecture 10 by @phylogenomics
 
The Seagrass Microbiome Project
The Seagrass Microbiome Project The Seagrass Microbiome Project
The Seagrass Microbiome Project
 
American Gut Project presentation at Masaryk University
American Gut Project presentation at Masaryk UniversityAmerican Gut Project presentation at Masaryk University
American Gut Project presentation at Masaryk University
 
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of LifeEVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
 
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life ExampleEVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
 

Similar to Microbial Phylogenomics (EVE161) Class 5

EVE 161 Lecture 4
EVE 161 Lecture 4EVE 161 Lecture 4
EVE 161 Lecture 4
Jonathan Eisen
 
the others our biased perspective
the others our biased perspectivethe others our biased perspective
the others our biased perspective
João Soares
 
Smbe 2013 talk
Smbe 2013 talkSmbe 2013 talk
Smbe 2013 talk
James McInerney
 
Bioinformatics A Biased Overview
Bioinformatics A Biased OverviewBioinformatics A Biased Overview
Bioinformatics A Biased Overview
Philip Bourne
 
The need for a phylogeny driven genomic encyclopedia of eukaryotes #SMBEEuks
The need for a phylogeny driven genomic encyclopedia of eukaryotes #SMBEEuksThe need for a phylogeny driven genomic encyclopedia of eukaryotes #SMBEEuks
The need for a phylogeny driven genomic encyclopedia of eukaryotes #SMBEEuks
Jonathan Eisen
 
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Jonathan Eisen
 
Nrf2: A Guardian of Healthspan and Gatekeeper of Species Longevity
Nrf2:  A Guardian of Healthspan and Gatekeeper of Species LongevityNrf2:  A Guardian of Healthspan and Gatekeeper of Species Longevity
Nrf2: A Guardian of Healthspan and Gatekeeper of Species LongevityLifeVantage
 
Microbial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New CyberinfrastructureMicrobial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New Cyberinfrastructure
Larry Smarr
 
"Keeping up with the plant destroyers." My talk at The Royal Society, 7 March...
"Keeping up with the plant destroyers." My talk at The Royal Society, 7 March..."Keeping up with the plant destroyers." My talk at The Royal Society, 7 March...
"Keeping up with the plant destroyers." My talk at The Royal Society, 7 March...
Sophien Kamoun
 
MIB200A at UCDavis Module: Microbial Phylogeny; Class 3
MIB200A at UCDavis Module: Microbial Phylogeny; Class 3MIB200A at UCDavis Module: Microbial Phylogeny; Class 3
MIB200A at UCDavis Module: Microbial Phylogeny; Class 3
Jonathan Eisen
 
Using Supercomputers and Supernetworks to Explore the Ocean of Life
Using Supercomputers and Supernetworks to Explore the Ocean of LifeUsing Supercomputers and Supernetworks to Explore the Ocean of Life
Using Supercomputers and Supernetworks to Explore the Ocean of Life
Larry Smarr
 
PNAS-2013-Barr-10771-6
PNAS-2013-Barr-10771-6PNAS-2013-Barr-10771-6
PNAS-2013-Barr-10771-6Rita Auro
 
Bioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsBioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogenetics
Prof. Wim Van Criekinge
 
Plant pathology in the post-genomics era
Plant pathology in the post-genomics eraPlant pathology in the post-genomics era
Plant pathology in the post-genomics era
Sophien Kamoun
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genometcha163
 
Ap Chapter 26 Evolutionary History Of Biological Diversity
Ap Chapter 26 Evolutionary History Of Biological DiversityAp Chapter 26 Evolutionary History Of Biological Diversity
Ap Chapter 26 Evolutionary History Of Biological Diversitysmithbio
 
Aniket_An Integrated Approach to Biology
Aniket_An Integrated Approach to BiologyAniket_An Integrated Approach to Biology
Aniket_An Integrated Approach to BiologyAniket Bhattacharya
 
Molecular.biology.of.the.cell.alberts.6th.ed
Molecular.biology.of.the.cell.alberts.6th.edMolecular.biology.of.the.cell.alberts.6th.ed
Molecular.biology.of.the.cell.alberts.6th.ed
rads 99
 
Using the Semantic Web to Support Ecoinformatics
Using the Semantic Web to Support EcoinformaticsUsing the Semantic Web to Support Ecoinformatics
Using the Semantic Web to Support Ecoinformatics
ebiquity
 
Biological Diversity: a molecular re-analysis
Biological Diversity: a molecular re-analysisBiological Diversity: a molecular re-analysis
Biological Diversity: a molecular re-analysis
Jason Price, PhD
 

Similar to Microbial Phylogenomics (EVE161) Class 5 (20)

EVE 161 Lecture 4
EVE 161 Lecture 4EVE 161 Lecture 4
EVE 161 Lecture 4
 
the others our biased perspective
the others our biased perspectivethe others our biased perspective
the others our biased perspective
 
Smbe 2013 talk
Smbe 2013 talkSmbe 2013 talk
Smbe 2013 talk
 
Bioinformatics A Biased Overview
Bioinformatics A Biased OverviewBioinformatics A Biased Overview
Bioinformatics A Biased Overview
 
The need for a phylogeny driven genomic encyclopedia of eukaryotes #SMBEEuks
The need for a phylogeny driven genomic encyclopedia of eukaryotes #SMBEEuksThe need for a phylogeny driven genomic encyclopedia of eukaryotes #SMBEEuks
The need for a phylogeny driven genomic encyclopedia of eukaryotes #SMBEEuks
 
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
 
Nrf2: A Guardian of Healthspan and Gatekeeper of Species Longevity
Nrf2:  A Guardian of Healthspan and Gatekeeper of Species LongevityNrf2:  A Guardian of Healthspan and Gatekeeper of Species Longevity
Nrf2: A Guardian of Healthspan and Gatekeeper of Species Longevity
 
Microbial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New CyberinfrastructureMicrobial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New Cyberinfrastructure
 
"Keeping up with the plant destroyers." My talk at The Royal Society, 7 March...
"Keeping up with the plant destroyers." My talk at The Royal Society, 7 March..."Keeping up with the plant destroyers." My talk at The Royal Society, 7 March...
"Keeping up with the plant destroyers." My talk at The Royal Society, 7 March...
 
MIB200A at UCDavis Module: Microbial Phylogeny; Class 3
MIB200A at UCDavis Module: Microbial Phylogeny; Class 3MIB200A at UCDavis Module: Microbial Phylogeny; Class 3
MIB200A at UCDavis Module: Microbial Phylogeny; Class 3
 
Using Supercomputers and Supernetworks to Explore the Ocean of Life
Using Supercomputers and Supernetworks to Explore the Ocean of LifeUsing Supercomputers and Supernetworks to Explore the Ocean of Life
Using Supercomputers and Supernetworks to Explore the Ocean of Life
 
PNAS-2013-Barr-10771-6
PNAS-2013-Barr-10771-6PNAS-2013-Barr-10771-6
PNAS-2013-Barr-10771-6
 
Bioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsBioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogenetics
 
Plant pathology in the post-genomics era
Plant pathology in the post-genomics eraPlant pathology in the post-genomics era
Plant pathology in the post-genomics era
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genome
 
Ap Chapter 26 Evolutionary History Of Biological Diversity
Ap Chapter 26 Evolutionary History Of Biological DiversityAp Chapter 26 Evolutionary History Of Biological Diversity
Ap Chapter 26 Evolutionary History Of Biological Diversity
 
Aniket_An Integrated Approach to Biology
Aniket_An Integrated Approach to BiologyAniket_An Integrated Approach to Biology
Aniket_An Integrated Approach to Biology
 
Molecular.biology.of.the.cell.alberts.6th.ed
Molecular.biology.of.the.cell.alberts.6th.edMolecular.biology.of.the.cell.alberts.6th.ed
Molecular.biology.of.the.cell.alberts.6th.ed
 
Using the Semantic Web to Support Ecoinformatics
Using the Semantic Web to Support EcoinformaticsUsing the Semantic Web to Support Ecoinformatics
Using the Semantic Web to Support Ecoinformatics
 
Biological Diversity: a molecular re-analysis
Biological Diversity: a molecular re-analysisBiological Diversity: a molecular re-analysis
Biological Diversity: a molecular re-analysis
 

More from Jonathan Eisen

Eisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfEisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdf
Jonathan Eisen
 
Phylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesPhylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of Microbes
Jonathan Eisen
 
Talk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingTalk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meeting
Jonathan Eisen
 
Thoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsThoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current Actions
Jonathan Eisen
 
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Jonathan Eisen
 
A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2
Jonathan Eisen
 
EVE198 Summer Session Class 4
EVE198 Summer Session Class 4EVE198 Summer Session Class 4
EVE198 Summer Session Class 4
Jonathan Eisen
 
EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1
Jonathan Eisen
 
EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines
Jonathan Eisen
 
EVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionEVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 Introduction
Jonathan Eisen
 
EVE198 Spring2021 Class2
EVE198 Spring2021 Class2EVE198 Spring2021 Class2
EVE198 Spring2021 Class2
Jonathan Eisen
 
EVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesEVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 Vaccines
Jonathan Eisen
 
EVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionEVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA Detection
Jonathan Eisen
 
EVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionEVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 Introduction
Jonathan Eisen
 
EVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingEVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID Testing
Jonathan Eisen
 
EVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesEVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID Vaccines
Jonathan Eisen
 
EVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionEVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID Transmission
Jonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesEVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
Jonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingEVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
Jonathan Eisen
 
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionEVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
Jonathan Eisen
 

More from Jonathan Eisen (20)

Eisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdfEisen.CentralValley2024.pdf
Eisen.CentralValley2024.pdf
 
Phylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of MicrobesPhylogenomics and the Diversity and Diversification of Microbes
Phylogenomics and the Diversity and Diversification of Microbes
 
Talk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meetingTalk by Jonathan Eisen for LAMG2022 meeting
Talk by Jonathan Eisen for LAMG2022 meeting
 
Thoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current ActionsThoughts on UC Davis' COVID Current Actions
Thoughts on UC Davis' COVID Current Actions
 
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
Phylogenetic and Phylogenomic Approaches to the Study of Microbes and Microbi...
 
A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2A Field Guide to Sars-CoV-2
A Field Guide to Sars-CoV-2
 
EVE198 Summer Session Class 4
EVE198 Summer Session Class 4EVE198 Summer Session Class 4
EVE198 Summer Session Class 4
 
EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1 EVE198 Summer Session 2 Class 1
EVE198 Summer Session 2 Class 1
 
EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines EVE198 Summer Session 2 Class 2 Vaccines
EVE198 Summer Session 2 Class 2 Vaccines
 
EVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 IntroductionEVE198 Spring2021 Class1 Introduction
EVE198 Spring2021 Class1 Introduction
 
EVE198 Spring2021 Class2
EVE198 Spring2021 Class2EVE198 Spring2021 Class2
EVE198 Spring2021 Class2
 
EVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 VaccinesEVE198 Spring2021 Class5 Vaccines
EVE198 Spring2021 Class5 Vaccines
 
EVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA DetectionEVE198 Winter2020 Class 8 - COVID RNA Detection
EVE198 Winter2020 Class 8 - COVID RNA Detection
 
EVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 IntroductionEVE198 Winter2020 Class 1 Introduction
EVE198 Winter2020 Class 1 Introduction
 
EVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID TestingEVE198 Winter2020 Class 3 - COVID Testing
EVE198 Winter2020 Class 3 - COVID Testing
 
EVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID VaccinesEVE198 Winter2020 Class 5 - COVID Vaccines
EVE198 Winter2020 Class 5 - COVID Vaccines
 
EVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID TransmissionEVE198 Winter2020 Class 9 - COVID Transmission
EVE198 Winter2020 Class 9 - COVID Transmission
 
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 VaccinesEVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
EVE198 Fall2020 "Covid Mass Testing" Class 8 Vaccines
 
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and TestingEVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
EVE198 Fall2020 "Covid Mass Testing" Class 2: Viruses, COIVD and Testing
 
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 IntroductionEVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
EVE198 Fall2020 "Covid Mass Testing" Class 1 Introduction
 

Recently uploaded

Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
AADYARAJPANDEY1
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
Richard Gill
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Erdal Coalmaker
 
Viksit bharat till 2047 India@2047.pptx
Viksit bharat till 2047  India@2047.pptxViksit bharat till 2047  India@2047.pptx
Viksit bharat till 2047 India@2047.pptx
rakeshsharma20142015
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
insect morphology and physiology of insect
insect morphology and physiology of insectinsect morphology and physiology of insect
insect morphology and physiology of insect
anitaento25
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
Cherry
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
muralinath2
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
azzyixes
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
kumarmathi863
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 

Recently uploaded (20)

Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCINGRNA INTERFERENCE: UNRAVELING GENETIC SILENCING
RNA INTERFERENCE: UNRAVELING GENETIC SILENCING
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
Viksit bharat till 2047 India@2047.pptx
Viksit bharat till 2047  India@2047.pptxViksit bharat till 2047  India@2047.pptx
Viksit bharat till 2047 India@2047.pptx
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
insect morphology and physiology of insect
insect morphology and physiology of insectinsect morphology and physiology of insect
insect morphology and physiology of insect
 
Penicillin...........................pptx
Penicillin...........................pptxPenicillin...........................pptx
Penicillin...........................pptx
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 

Microbial Phylogenomics (EVE161) Class 5

  • 1. Lecture 3: EVE 161:
 Microbial Phylogenomics Lecture #5: Modern View of Tree of Life UC Davis, Winter 2016 Instructors: Jonathan Eisen & Holly Ganz
  • 2. Where we are going and where we have been • Previous lecture: !4. Background on Phylogeny • Current Lecture: !5. Modern view of Tree of Life • Next Lecture: !6. rRNA from environments !2
  • 3. Three papers for today Syst. Biol. 59(5):518–533, 2010 c⃝ The Author(s) 2010. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org DOI:10.1093/sysbio/syq037 Advance Access publication on July 23, 2010 Broadly Sampled Multigene Analyses Yield a Well-Resolved Eukaryotic Tree of Life LAURA WEGENER PARFREY1 , JESSICA GRANT2 , YONAS I. TEKLE2,6 , ERICA LASEK-NESSELQUIST3,4 , HILARY G. MORRISON3 , MITCHELL L. SOGIN3 , DAVID J. PATTERSON5 , AND LAURA A. KATZ1,2,∗ 1Program in Organismic and Evolutionary Biology, University of Massachusetts, 611 North Pleasant Street, Amherst, MA 01003, USA; 2Department of Biological Sciences, Smith College, 44 College Lane, Northampton, MA 01063, USA; 3Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA; 4Department of Ecology and Evolutionary Biology, Brown University, 80 Waterman Street, Providence, RI 02912, USA; 5Biodiversity Informatics Group, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA; 6Present address: Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT 06520, USA; ∗Correspondence to be sent to: Laura A. Katz, 44 College Lane, Northampton, MA 01003, USA; E-mail: lkatz@smith.edu. Laura Wegener Parfrey and Jessica Grant have contributed equally to this work. Received 30 September 2009; reviews returned 1 December 2009; accepted 25 May 2010 Associate Editor: C´ecile An´e Abstract.—An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes. Previous work has divided eukaryotic diver- sity into a small number of high-level “supergroups,” many of which receive strong support in phylogenomic analyses. However, the abundance of data in phylogenomic analyses can lead to highly supported but incorrect relationships due to systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or fewer) included in these genomic studies may exaggerate systematic error and reduce power to evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72 major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable topology emerges without the removal of rapidly evolving genes or taxa, a practice common to phylogenomic analyses. Several major groups are stable and strongly supported in these analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata” is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary relationships can be achieved with broad taxonomic sampling and a moderate number of genes. Finally, taxon-rich analy- ses such as presented here provide a method for testing the accuracy of relationships that receive high bootstrap support atUniversityohttp://sysbio.oxfordjournals.org/Downloadedfrom first published online 24 October 2012, doi: 10.1098/rspb.2012.17952792012Proc. R. Soc. B Tom A. Williams, Peter G. Foster, Tom M. W. Nye, Cymon J. Cox and T. Martin Embley the Archaea A congruent phylogenomic signal places eukaryotes within Supplementary data tml http://rspb.royalsocietypublishing.org/content/suppl/2012/10/18/rspb.2012.1795.DC1.h "Data Supplement" References http://rspb.royalsocietypublishing.org/content/279/1749/4870.full.html#related-urls Article cited in: http://rspb.royalsocietypublishing.org/content/279/1749/4870.full.html#ref-list-1 This article cites 56 articles, 35 of which can be accessed free This article is free to access Subject collections (1595 articles)evolution (25 articles)bioinformatics Articles on similar topics can be found in the following collections on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from !3 ARTICLE doi:10.1038/nature14447 Complex archaea that bridge the gap between prokaryotes and eukaryotes Anja Spang1 *, Jimmy H. Saw1 *, Steffen L. Jørgensen2 *, Katarzyna Zaremba-Niedzwiedzka1 *, Joran Martijn1 , Anders E. Lind1 , Roel van Eijk1 {, Christa Schleper2,3 , Lionel Guy1,4 & Thijs J. G. Ettema1 The origin of the eukaryotic cell remains one of the most contentious puzzles in modern biology. Recent studies
  • 4. Palfrey et al. Syst. Biol. 59(5):518–533, 2010 c⃝ The Author(s) 2010. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oxfordjournals.org DOI:10.1093/sysbio/syq037 Advance Access publication on July 23, 2010 Broadly Sampled Multigene Analyses Yield a Well-Resolved Eukaryotic Tree of Life LAURA WEGENER PARFREY1 , JESSICA GRANT2 , YONAS I. TEKLE2,6 , ERICA LASEK-NESSELQUIST3,4 , HILARY G. MORRISON3 , MITCHELL L. SOGIN3 , DAVID J. PATTERSON5 , AND LAURA A. KATZ1,2,∗ 1Program in Organismic and Evolutionary Biology, University of Massachusetts, 611 North Pleasant Street, Amherst, MA 01003, USA; 2Department of Biological Sciences, Smith College, 44 College Lane, Northampton, MA 01063, USA; 3Bay Paul Center for Comparative Molecular Biology and Evolution, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA; 4Department of Ecology and Evolutionary Biology, Brown University, 80 Waterman Street, Providence, RI 02912, USA; 5Biodiversity Informatics Group, Marine Biological Laboratory, 7 MBL Street, Woods Hole, MA 02543, USA; 6Present address: Department of Epidemiology and Public Health, Yale University School of Medicine, New Haven, CT 06520, USA; ∗Correspondence to be sent to: Laura A. Katz, 44 College Lane, Northampton, MA 01003, USA; E-mail: lkatz@smith.edu. Laura Wegener Parfrey and Jessica Grant have contributed equally to this work. Received 30 September 2009; reviews returned 1 December 2009; accepted 25 May 2010 Associate Editor: C´ecile An´e Abstract.—An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes. Previous work has divided eukaryotic diver- sity into a small number of high-level “supergroups,” many of which receive strong support in phylogenomic analyses. However, the abundance of data in phylogenomic analyses can lead to highly supported but incorrect relationships due to systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or fewer) included in these genomic studies may exaggerate systematic error and reduce power to evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72 major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable topology emerges without the removal of rapidly evolving genes or taxa, a practice common to phylogenomic analyses. Several major groups are stable and strongly supported in these analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata” is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary !4
  • 5. Abstract An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes. Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,” many of which receive strong support in phylogenomic analyses. However, the abundance of data in phylogenomic analyses can lead to highly supported but incorrect relationships due to systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or fewer) included in these genomic studies may exaggerate systematic error and reduce power to evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72 major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable topology emerges without the removal of rapidly evolving genes or taxa, a practice common to phylogenomic analyses. Several major groups are stable and strongly supported in these analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata” is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary relationships can be achieved with broad taxonomic sampling and a moderate number of genes. Finally, taxon-rich analyses such as presented here provide a method for testing the accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses and enable placement of the multitude of lineages that lack genome scale data. [Excavata; microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.] !5
  • 6. Abstract An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes. Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,” many of which receive strong support in phylogenomic analyses. However, the abundance of data in phylogenomic analyses can lead to highly supported but incorrect relationships due to systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or fewer) included in these genomic studies may exaggerate systematic error and reduce power to evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72 major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable topology emerges without the removal of rapidly evolving genes or taxa, a practice common to phylogenomic analyses. Several major groups are stable and strongly supported in these analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata” is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary relationships can be achieved with broad taxonomic sampling and a moderate number of genes. Finally, taxon-rich analyses such as presented here provide a method for testing the accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses and enable placement of the multitude of lineages that lack genome scale data. [Excavata; microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.] !6
  • 7. Abstract An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes. Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,” many of which receive strong support in phylogenomic analyses. However, the abundance of data in phylogenomic analyses can lead to highly supported but incorrect relationships due to systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or fewer) included in these genomic studies may exaggerate systematic error and reduce power to evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72 major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable topology emerges without the removal of rapidly evolving genes or taxa, a practice common to phylogenomic analyses. Several major groups are stable and strongly supported in these analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata” is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary relationships can be achieved with broad taxonomic sampling and a moderate number of genes. Finally, taxon-rich analyses such as presented here provide a method for testing the accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses and enable placement of the multitude of lineages that lack genome scale data. [Excavata; microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.] !7
  • 8. Abstract An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes. Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,” many of which receive strong support in phylogenomic analyses. However, the abundance of data in phylogenomic analyses can lead to highly supported but incorrect relationships due to systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or fewer) included in these genomic studies may exaggerate systematic error and reduce power to evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72 major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable topology emerges without the removal of rapidly evolving genes or taxa, a practice common to phylogenomic analyses. Several major groups are stable and strongly supported in these analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata” is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary relationships can be achieved with broad taxonomic sampling and a moderate number of genes. Finally, taxon-rich analyses such as presented here provide a method for testing the accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses and enable placement of the multitude of lineages that lack genome scale data. [Excavata; microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.] !8
  • 9. Abstract An accurate reconstruction of the eukaryotic tree of life is essential to identify the innovations underlying the diversity of microbial and macroscopic (e.g., plants and animals) eukaryotes. Previous work has divided eukaryotic diversity into a small number of high-level “supergroups,” many of which receive strong support in phylogenomic analyses. However, the abundance of data in phylogenomic analyses can lead to highly supported but incorrect relationships due to systematic phylogenetic error. Furthermore, the paucity of major eukaryotic lineages (19 or fewer) included in these genomic studies may exaggerate systematic error and reduce power to evaluate hypotheses. Here, we use a taxon-rich strategy to assess eukaryotic relationships. We show that analyses emphasizing broad taxonomic sampling (up to 451 taxa representing 72 major lineages) combined with a moderate number of genes yield a well-resolved eukaryotic tree of life. The consistency across analyses with varying numbers of taxa (88–451) and levels of missing data (17–69%) supports the accuracy of the resulting topologies. The resulting stable topology emerges without the removal of rapidly evolving genes or taxa, a practice common to phylogenomic analyses. Several major groups are stable and strongly supported in these analyses (e.g., SAR, Rhizaria, Excavata), whereas the proposed supergroup “Chromalveolata” is rejected. Furthermore, ex- tensive instability among photosynthetic lineages suggests the presence of systematic biases including endosymbiotic gene transfer from symbiont (nucleus or plastid) to host. Our analyses demonstrate that stable topologies of ancient evolutionary relationships can be achieved with broad taxonomic sampling and a moderate number of genes. Finally, taxon-rich analyses such as presented here provide a method for testing the accuracy of relationships that receive high bootstrap support (BS) in phylogenomic analyses and enable placement of the multitude of lineages that lack genome scale data. [Excavata; microbial eukaryotes; Rhizaria; supergroups; systematic error; taxon sampling.] !9
  • 12. !12
  • 13. !13
  • 16. !16
  • 17. !17
  • 18. !18
  • 20. Results and Discussion • Questions about Methods?
  • 21. !21
  • 22. Fig 1: 451 Taxa and some of the 16 genes !22
  • 23.
  • 24.
  • 25.
  • 26. Fig. 2: 88 Taxa each w/ 10 or more of the 16 genes !26
  • 27.
  • 28.
  • 29.
  • 30.
  • 31.
  • 32.
  • 33.
  • 34.
  • 35. !35
  • 36. !36
  • 42. !42
  • 43. Williams et al. first published online 24 October 2012, doi: 10.1098/rspb.2012.17952792012Proc. R. Soc. B Tom A. Williams, Peter G. Foster, Tom M. W. Nye, Cymon J. Cox and T. Martin Embley the Archaea A congruent phylogenomic signal places eukaryotes within Supplementary data tml http://rspb.royalsocietypublishing.org/content/suppl/2012/10/18/rspb.2012.1795.DC1.h "Data Supplement" References http://rspb.royalsocietypublishing.org/content/279/1749/4870.full.html#related-urls Article cited in: http://rspb.royalsocietypublishing.org/content/279/1749/4870.full.html#ref-list-1 This article cites 56 articles, 35 of which can be accessed free This article is free to access Subject collections (178 articles)taxonomy and systematics (1595 articles)evolution (25 articles)bioinformatics Articles on similar topics can be found in the following collections Email alerting service hereright-hand corner of the article or click Receive free email alerts when new articles cite this article - sign up in the box at the top !43
  • 44. Abstract Determining the relationships among the major groups of cellular life is important for understanding the evolution of biological diversity, but is difficult given the enormous time spans involved. In the textbook ‘three domains’ tree based on informational genes, eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria. However, some phylogenetic analyses of the same data have placed eukaryotes within the Archaea, as the nearest relatives of different archaeal lineages. We compared the support for these competing hypotheses using sophisticated phylogenetic methods and an improved sampling of archaeal biodiversity. We also employed both new and existing tests of phylogenetic congruence to explore the level of uncertainty and conflict in the data. Our analyses suggested that much of the observed incongruence is weakly supported or associated with poorly fitting evolutionary models. All of our phylogenetic analyses, whether on small subunit and large subunit ribosomal RNA or concatenated protein-coding genes, recovered a monophyletic group containing eukaryotes and the TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota, Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis whereby vital components of the eukaryotic nuclear lineage originated from within the archaeal radiation. !44
  • 45. Abstract Determining the relationships among the major groups of cellular life is important for understanding the evolution of biological diversity, but is difficult given the enormous time spans involved. In the textbook ‘three domains’ tree based on informational genes, eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria. However, some phylogenetic analyses of the same data have placed eukaryotes within the Archaea, as the nearest relatives of different archaeal lineages. We compared the support for these competing hypotheses using sophisticated phylogenetic methods and an improved sampling of archaeal biodiversity. We also employed both new and existing tests of phylogenetic congruence to explore the level of uncertainty and conflict in the data. Our analyses suggested that much of the observed incongruence is weakly supported or associated with poorly fitting evolutionary models. All of our phylogenetic analyses, whether on small subunit and large subunit ribosomal RNA or concatenated protein-coding genes, recovered a monophyletic group containing eukaryotes and the TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota, Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis whereby vital components of the eukaryotic nuclear lineage originated from within the archaeal radiation. !45
  • 46. Abstract Determining the relationships among the major groups of cellular life is important for understanding the evolution of biological diversity, but is difficult given the enormous time spans involved. In the textbook ‘three domains’ tree based on informational genes, eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria. However, some phylogenetic analyses of the same data have placed eukaryotes within the Archaea, as the nearest relatives of different archaeal lineages. We compared the support for these competing hypotheses using sophisticated phylogenetic methods and an improved sampling of archaeal biodiversity. We also employed both new and existing tests of phylogenetic congruence to explore the level of uncertainty and conflict in the data. Our analyses suggested that much of the observed incongruence is weakly supported or associated with poorly fitting evolutionary models. All of our phylogenetic analyses, whether on small subunit and large subunit ribosomal RNA or concatenated protein-coding genes, recovered a monophyletic group containing eukaryotes and the TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota, Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis whereby vital components of the eukaryotic nuclear lineage originated from within the archaeal radiation. !46
  • 47. Abstract Determining the relationships among the major groups of cellular life is important for understanding the evolution of biological diversity, but is difficult given the enormous time spans involved. In the textbook ‘three domains’ tree based on informational genes, eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria. However, some phylogenetic analyses of the same data have placed eukaryotes within the Archaea, as the nearest relatives of different archaeal lineages. We compared the support for these competing hypotheses using sophisticated phylogenetic methods and an improved sampling of archaeal biodiversity. We also employed both new and existing tests of phylogenetic congruence to explore the level of uncertainty and conflict in the data. Our analyses suggested that much of the observed incongruence is weakly supported or associated with poorly fitting evolutionary models. All of our phylogenetic analyses, whether on small subunit and large subunit ribosomal RNA or concatenated protein-coding genes, recovered a monophyletic group containing eukaryotes and the TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota, Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis whereby vital components of the eukaryotic nuclear lineage originated from within the archaeal radiation. !47
  • 48. Abstract Determining the relationships among the major groups of cellular life is important for understanding the evolution of biological diversity, but is difficult given the enormous time spans involved. In the textbook ‘three domains’ tree based on informational genes, eukaryotes and Archaea share a common ancestor to the exclusion of Bacteria. However, some phylogenetic analyses of the same data have placed eukaryotes within the Archaea, as the nearest relatives of different archaeal lineages. We compared the support for these competing hypotheses using sophisticated phylogenetic methods and an improved sampling of archaeal biodiversity. We also employed both new and existing tests of phylogenetic congruence to explore the level of uncertainty and conflict in the data. Our analyses suggested that much of the observed incongruence is weakly supported or associated with poorly fitting evolutionary models. All of our phylogenetic analyses, whether on small subunit and large subunit ribosomal RNA or concatenated protein-coding genes, recovered a monophyletic group containing eukaryotes and the TACK archaeal superphylum comprising the Thaumarchaeota, Aigarchaeota, Crenarchaeota and Korarchaeota. Hence, while our results provide no support for the iconic three-domain tree of life, they are consistent with an extended eocyte hypothesis whereby vital components of the eukaryotic nuclear lineage originated from within the archaeal radiation. !48
  • 63. rRNA Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Archaeoglobus fulgidus Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Methanosarcina mazei Thermoplasma volcanium Giardia lamblia Trichomonas vaginalis Naegleria gruberi Arabidopsis thaliana Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Dictyostelium discoideum Trypanosoma brucei Entamoeba histolytica Cenarchaeum symbiosum Nitrosopumilus maritimus Korarchaeum cryptofilum Caldiarchaeum subterraneum Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Staphylothermus marinus Hyperthermus butylicus Ignicoccus hospitalis Aeropyrum pernix Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Clostridium acetobutylicum Synechocystis sp. Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 0.83 1 0.2 (a) Bacteria Euryarchaeota Crenarchaeota Eukaryota Trichomonas vaginalis Arabidopsis thaliana Giardia lamblia Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Dictyostelium discoideum Trypanosoma brucei Entamoeba histolytica Naegleria gruberi Archaeoglobus fulgidus Methanosarcina mazei Thermoplasma volcanium Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Hyperthermus butylicus Staphylothermus marinus Ignicoccus hospitalis Aeropyrum pernix Clostridium acetobutylicum Synechocystis sp. Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 1 0.2 (b) Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Archaeoglobus fulgidus Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Methanosarcina mazei Thermoplasma volcanium Trichomonas vaginalis Giardia lamblia Naegleria gruberi Entamoeba histolytica Dictyostelium discoideum Trypanosoma brucei Arabidopsis thaliana Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Cenarchaeum symbiosum Nitrosopumilus maritimus Korarchaeum cryptofilum Caldiarchaeum subterraneum Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Hyperthermus butylicus Ignicoccus hospitalis Staphylothermus marinus Aeropyrum pernix Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Clostridium acetobutylicum Synechocystis sp. Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 1 1 0.2 (c) Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Methanococcus jannaschii Thermoplasma volcanium Methanosarcina mazei Archaeoglobus fulgidus Methanothermobacter thermautotrophicus Pyrococcus furiosus Korarchaeum cryptofilum Nitrosopumilus maritimus Cenarchaeum symbiosum Caldiarchaeum subterraneum Giardia lamblia Homo sapiens Thalassiosira pseudonana Saccharomyces cerevisiae Trypanosoma brucei Naegleria gruberi Entamoeba histolytica Trichomonas vaginalis Dictyostelium discoideum Arabidopsis thaliana Thermofilum pendens Pyrobaculum aerophilum Caldivirga maquilingensis Sulfolobus solfataricus Staphylothermus marinus Aeropyrum pernix Ignicoccus hospitalis Hyperthermus butylicus Rhodopirellula baltica Synechocystis sp. Clostridium acetobutylicum Treponema pallidum Chlamydia trachomatis Rhodopseudomonas palustris Escherichia coli Campylobacter jejuni 1 1 0.57 1 0.97 0.2 (d) Figure 1. Phylogenies of Bacteria, Archaea and eukaryotes inferred from concatenated rRNA. (a) A Bayesian phylogeny of Bac- teria, Archaea and eukaryotes inferred under the GTR model, showing an eocyte-like topology in which eukaryotes emerge from within the Archaea with maximal support (posterior probability (PP) ¼ 1). (b) Removal of recently characterized archaeal groups (the Thaumarchaeota, Aigarchaeota and Korarchaeota) converts this tree into a canonical three-domains topology, again with maximal support (PP ¼ 1), indicating that sampling plays an important role in the resolution of these ancient relationships. Analyses of the full dataset using the better-fitting NDRH þ NDCH (c) and CAT (d) models recover maximally supported eocyte-like topologies; these models also recover eocyte-like topologies on the reduced dataset, without the TAK sequences (see the electronic supplementary material, figure S1). Branch lengths are proportional to substitutions per site. Evolution of eukaryotes from Archaea T. A. Williams et al. 4873 Proc. R. Soc. B (2012) on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from !63
  • 64. Figure 1. Phylogenies of Bacteria, Archaea and eukaryotes inferred from concatenated rRNA. (a) A Bayesian phylogeny of Bacteria, Archaea and eukaryotes inferred under the GTR model, showing an eocyte-like topology in which eukaryotes emerge from within the Archaea with maximal support (posterior probability (PP) 1⁄4 1). (b) Removal of recently characterized archaeal groups (the Thaumarchaeota, Aigarchaeota and Korarchaeota) converts this tree into a canonical three-domains topology, again with maximal support (PP 1⁄4 1), indicating that sampling plays an important role in the resolution of these ancient relationships. Analyses of the full dataset using the better-fitting NDRH þ NDCH (c) and CAT (d ) models recover maximally supported eocyte-like topologies; these models also recover eocyte-like topologies on the reduced dataset, without the TAK sequences (see the electronic supplementary material, figure S1). Branch lengths are proportional to substitutions per site.
  • 65. rRNA Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Archaeoglobus fulgidus Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Methanosarcina mazei Thermoplasma volcanium Giardia lamblia Trichomonas vaginalis Naegleria gruberi Arabidopsis thaliana Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Dictyostelium discoideum Trypanosoma brucei Entamoeba histolytica Cenarchaeum symbiosum Nitrosopumilus maritimus Korarchaeum cryptofilum Caldiarchaeum subterraneum Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Staphylothermus marinus Hyperthermus butylicus Ignicoccus hospitalis Aeropyrum pernix Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Clostridium acetobutylicum Synechocystis sp. Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 0.83 1 0.2 (a) Bacteria Euryarchaeota Crenarchaeota Eukaryota Trichomonas vaginalis Arabidopsis thaliana Giardia lamblia Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Dictyostelium discoideum Trypanosoma brucei Entamoeba histolytica Naegleria gruberi Archaeoglobus fulgidus Methanosarcina mazei Thermoplasma volcanium Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Hyperthermus butylicus Staphylothermus marinus Ignicoccus hospitalis Aeropyrum pernix Clostridium acetobutylicum Synechocystis sp. Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 1 0.2 (b) Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Archaeoglobus fulgidus Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Methanosarcina mazei Thermoplasma volcanium Trichomonas vaginalis Giardia lamblia Naegleria gruberi Entamoeba histolytica Dictyostelium discoideum Trypanosoma brucei Arabidopsis thaliana Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Cenarchaeum symbiosum Nitrosopumilus maritimus Korarchaeum cryptofilum Caldiarchaeum subterraneum Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Hyperthermus butylicus Ignicoccus hospitalis Staphylothermus marinus Aeropyrum pernix Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Clostridium acetobutylicum Synechocystis sp. Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 1 1 0.2 (c) Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Methanococcus jannaschii Thermoplasma volcanium Methanosarcina mazei Archaeoglobus fulgidus Methanothermobacter thermautotrophicus Pyrococcus furiosus Korarchaeum cryptofilum Nitrosopumilus maritimus Cenarchaeum symbiosum Caldiarchaeum subterraneum Giardia lamblia Homo sapiens Thalassiosira pseudonana Saccharomyces cerevisiae Trypanosoma brucei Naegleria gruberi Entamoeba histolytica Trichomonas vaginalis Dictyostelium discoideum Arabidopsis thaliana Thermofilum pendens Pyrobaculum aerophilum Caldivirga maquilingensis Sulfolobus solfataricus Staphylothermus marinus Aeropyrum pernix Ignicoccus hospitalis Hyperthermus butylicus Rhodopirellula baltica Synechocystis sp. Clostridium acetobutylicum Treponema pallidum Chlamydia trachomatis Rhodopseudomonas palustris Escherichia coli Campylobacter jejuni 1 1 0.57 1 0.97 0.2 (d) Figure 1. Phylogenies of Bacteria, Archaea and eukaryotes inferred from concatenated rRNA. (a) A Bayesian phylogeny of Bac- teria, Archaea and eukaryotes inferred under the GTR model, showing an eocyte-like topology in which eukaryotes emerge from within the Archaea with maximal support (posterior probability (PP) ¼ 1). (b) Removal of recently characterized archaeal groups (the Thaumarchaeota, Aigarchaeota and Korarchaeota) converts this tree into a canonical three-domains topology, again with maximal support (PP ¼ 1), indicating that sampling plays an important role in the resolution of these ancient relationships. Analyses of the full dataset using the better-fitting NDRH þ NDCH (c) and CAT (d) models recover maximally supported eocyte-like topologies; these models also recover eocyte-like topologies on the reduced dataset, without the TAK sequences (see the electronic supplementary material, figure S1). Branch lengths are proportional to substitutions per site. Evolution of eukaryotes from Archaea T. A. Williams et al. 4873 Proc. R. Soc. B (2012) on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from !65
  • 66. rRNA Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Archaeoglobus fulgidus Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Methanosarcina mazei Thermoplasma volcanium Giardia lamblia Trichomonas vaginalis Naegleria gruberi Arabidopsis thaliana Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Dictyostelium discoideum Trypanosoma brucei Entamoeba histolytica Cenarchaeum symbiosum Nitrosopumilus maritimus Korarchaeum cryptofilum Caldiarchaeum subterraneum Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Staphylothermus marinus Hyperthermus butylicus Ignicoccus hospitalis Aeropyrum pernix Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Clostridium acetobutylicum Synechocystis sp. Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 0.83 1 0.2 (a) Bacteria Euryarchaeota Crenarchaeota Eukaryota Trichomonas vaginalis Arabidopsis thaliana Giardia lamblia Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Dictyostelium discoideum Trypanosoma brucei Entamoeba histolytica Naegleria gruberi Archaeoglobus fulgidus Methanosarcina mazei Thermoplasma volcanium Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Hyperthermus butylicus Staphylothermus marinus Ignicoccus hospitalis Aeropyrum pernix Clostridium acetobutylicum Synechocystis sp. Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 1 0.2 (b) Evolution of eukaryotes from Archaea T. A. Williams et al. on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from With New Data Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Archaeoglobus fulgidus Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Methanosarcina mazei Thermoplasma volcanium Giardia lamblia Trichomonas vaginalis Naegleria gruberi Arabidopsis thaliana Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Dictyostelium discoideum Trypanosoma brucei Entamoeba histolytica Cenarchaeum symbiosum Nitrosopumilus maritimus Korarchaeum cryptofilum Caldiarchaeum subterraneum Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Staphylothermus marinus Hyperthermus butylicus Ignicoccus hospitalis Aeropyrum pernix Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Clostridium acetobutylicum Synechocystis sp. Treponema pallidum Chlamydia trachomatis 1 1 1 1 0.83 1 (a) Bacteria Euryarchaeota Crenarchaeota Eukaryota Trichomonas vaginalis Arabidopsis thaliana Giardia lamblia Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Dictyostelium discoideum Trypanosoma brucei Entamoeba histolytica Naegleria gruberi Archaeoglobus fulgidus Methanosarcina mazei Thermoplasma volcanium Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Hyperthermus butylicus Staphylothermus marinus Ignicoccus hospitalis Aeropyrum pernix Clostridium acetobutylicum Synechocystis sp. Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 1 0.2 (b) Evolution of eukaryotes from Archaea T. A. Williams et al. Without New Data !66
  • 67. rRNA w/ Better Models Rhodopirellula baltica 0.2 Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Archaeoglobus fulgidus Methanococcus jannaschii Methanothermobacter thermautotrophicus Pyrococcus furiosus Methanosarcina mazei Thermoplasma volcanium Trichomonas vaginalis Giardia lamblia Naegleria gruberi Entamoeba histolytica Dictyostelium discoideum Trypanosoma brucei Arabidopsis thaliana Homo sapiens Saccharomyces cerevisiae Thalassiosira pseudonana Cenarchaeum symbiosum Nitrosopumilus maritimus Korarchaeum cryptofilum Caldiarchaeum subterraneum Caldivirga maquilingensis Pyrobaculum aerophilum Thermofilum pendens Sulfolobus solfataricus Hyperthermus butylicus Ignicoccus hospitalis Staphylothermus marinus Aeropyrum pernix Campylobacter jejuni Escherichia coli Rhodopseudomonas palustris Clostridium acetobutylicum Synechocystis sp. Treponema pallidum Chlamydia trachomatis Rhodopirellula baltica 1 1 1 1 1 1 0.2 (c) Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Methanococcus jannaschii Thermoplasma volcanium Methanosarcina mazei Archaeoglobus fulgidus Methanothermobacter thermautotrophicus Pyrococcus furiosus Korarchaeum cryptofilum Nitrosopumilus maritimus Cenarchaeum symbiosum Caldiarchaeum subterraneum Giardia lamblia Homo sapiens Thalassiosira pseudonana Saccharomyces cerevisiae Trypanosoma brucei Naegleria gruberi Entamoeba histolytica Trichomonas vaginalis Dictyostelium discoideum Arabidopsis thaliana Thermofilum pendens Pyrobaculum aerophilum Caldivirga maquilingensis Sulfolobus solfataricus Staphylothermus marinus Aeropyrum pernix Ignicoccus hospitalis Hyperthermus butylicus Rhodopirellula baltica Synechocystis sp. Clostridium acetobutylicum Treponema pallidum Chlamydia trachomatis Rhodopseudomonas palustris Escherichia coli Campylobacter jejuni 1 1 0.57 1 0.97 0.2 (d) !67
  • 68. !68
  • 69.
  • 70. !70
  • 71. !71
  • 72. Concatenated Proteins Bacteria Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Methanothermobacter thermautotrophicus Methanococcus jannaschii Thermoplasma volcanium Methanosarcina mazei Archaeoglobus fulgidus Pyrococcus furiosus Giardia lamblia Trichomonas vaginalis Thalassiosira pseudonana Phytophthora ramorum Saccharomyces cerevisiae Homo sapiens Entamoeba histolytica Dictyostelium discoideum Leishmania major Arabidopsis thaliana Korarchaeum cryptofilum Nitrosopumilus maritimus Nitrosoarchaeum limnia Cenarchaeum symbiosum Caldiarchaeum subterraneum Thermofilum pendens Pyrobaculum aerophilum Caldivirga maquilingensis Staphylothermus marinus Sulfolobus solfataricus Ignicoccus hospitalis Aeropyrum pernix Hyperthermus butylicus Rhodopseudomonas palustris Escherichia coli Treponema pallidum Rhodopirellula baltica Chlamydia trachomatis Synechocystis sp. Clostridium acetobutylicum Campylobacter jejuni 1 0.51 0.81 0.99 0.99 1 0.99 1 1 0.2 (a) Euryarchaeota Korarchaeota Crenarchaeota Aigarchaeota Thaumarchaeota Eukaryota Pyrococcus furiosus Methanococcus jannaschii Methanothermobacter thermautotrophicus Thermoplasma acidophilum Archaeoglobus fulgidus Methanosarcina mazei Trichomonas vaginalis Giardia lamblia Entamoeba histolytica Naegleria gruberi Leishmania major Dictyostelium discoideum Saccharomyces cerevisiae Homo sapiens Arabidopsis thaliana Thalassiosira pseudonana Phytophthora ramorum Korarchaeum cryptofilum Caldiarchaeum subterraneum Cenarchaeum symbiosum Nitrosopumilus maritimus Nitrosoarchaeum limnia Thermofilum pendens Pyrobaculum aerophilum Caldivirga maquilingensis Sulfolobus solfataricus Ignicoccus hospitalis Staphylothermus marinus Hyperthermus butylicus Aeropyrum pernix 1 1 1 0.99 1 1 0.5 (b) Figure 2. Phylogenies of Bacteria, Archaea and eukaryotes inferred from conserved protein-coding genes. (a) A phylogeny inferred from 29 concatenated proteins conserved between Bacteria, Archaea and eukaryotes. An eocyte topology was recov- ered with strong (PP ¼ 0.99) support. In this phylogeny, the eukaryotes emerge as the sister group of Korarchaeum, nested with the TACK superphylum. (b) A phylogeny inferred from 63 concatenated proteins shared between Archaea and eukaryotes. The position of the root is not explicitly indicated. However, based on the result from (a) and the electronic supplementary material, table S4, it is likely to be either within, or on the branch leading to, the Euryarchaea. If this position is correct, then the tree shows the eukaryotes emerging as the sister group to the TACK superphylum, including Korarchaeum. These trees were inferred using the CAT model in PHYLOBAYES. Branch lengths are proportional to substitutions per site, except the truncated bacterial branch in (a). 4874 T. A. Williams et al. Evolution of eukaryotes from Archaea on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from !72
  • 73. Figure 2. Phylogenies of Bacteria, Archaea and eukaryotes inferred from conserved protein-coding genes. (a) A phylogeny inferred from 29 concatenated proteins conserved between Bacteria, Archaea and eukaryotes. An eocyte topology was recovered with strong (PP 1⁄4 0.99) support. In this phylogeny, the eukaryotes emerge as the sister group of Korarchaeum, nested with the TACK superphylum. (b) A phylogeny inferred from 63 concatenated proteins shared between Archaea and eukaryotes. The position of the root is not explicitly indicated. However, based on the result from (a) and the electronic supplementary material, table S4, it is likely to be either within, or on the branch leading to, the Euryarchaea. If this position is correct, then the tree shows the eukaryotes emerging as the sister group to the TACK superphylum, including Korarchaeum. These trees were inferred using the CAT model in PHYLOBAYES. Branch lengths are proportional to substitutions per site, except the truncated bacterial branch in (a).
  • 74. !74
  • 75. !75
  • 76. Tree Congruence 3. CONCLUSIONS theories of eukaryotic origins [1]. Here, we have com- distance frequency 1 2 3 4 5 no.testspassed(P>0.05) saturation and homoplasy site-specific biochemical diversity compositional heterogeneity 0 10 20 30 40 50 60 model CAT20 LG (b) 0 50 100 150 200 250 300 (a) 1.0 1.5 2.0 2.5 3.0 density model CAT20 LG 0 0.2 0.4 0.6 0.8 1.0 1.2 (c) distance Figure 3. Analysing incongruence using a novel measure of distance between gene trees. We used distributions of pairwise geo- desic distances between gene trees to compare levels of incongruence inferred under different evolutionary models. (a) The distribution of distances under a single model (CAT20) can be used to identify obvious outliers corresponding to highly incon- gruent gene trees; a single gene was responsible for the peak highlighted in red, and was removed from subsequent analyses. (b) Overview of model-fitting tests (posterior predictive simulations) for each gene in the 64AE dataset. The height of the bars indicates the proportion of genes that ‘passed’ a test under a particular model; we said that a test was passed when the value of the test statistic on the real data fell within the central 95% of the distribution of values produced by posterior predictive simu- lation. The results suggest that CAT20 fits better than LG, successfully accounting for the observed levels of saturation and homoplasy in all but one of the alignments. Both models do a poor job of modelling the site-specific selective constraints in our dataset, although again CAT20 performs better than LG (13 passes as opposed to 0). (c) Comparison of the distance dis- tributions inferred under the CAT20 and LG models. The trees inferred under the better-fitting CAT20 model are significantly more congruent than those inferred under LG (mean distance: 2.68 versus 3.22, p , 0.0001). The significance of this differ- ence was assessed using a permutation test that took the correlations between pairwise distances into account (see §4). These results suggest that a significant portion of the incongruence in this dataset of informational genes can be attributed to model misspecification, rather than genuinely distinct evolutionary histories. 4876 T. A. Williams et al. Evolution of eukaryotes from Archaea on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from !76
  • 77. Figure 3. Analysing incongruence using a novel measure of distance between gene trees. We used distributions of pairwise geodesic distances between gene trees to compare levels of incongruence inferred under different evolutionary models. (a) The distribution of distances under a single model (CAT20) can be used to identify obvious outliers corresponding to highly incongruent gene trees; a single gene was responsible for the peak highlighted in red, and was removed from subsequent analyses. (b) Overview of model-fitting tests (posterior predictive simulations) for each gene in the 64AE dataset. The height of the bars indicates the proportion of genes that ‘passed’ a test under a particular model; we said that a test was passed when the value of the test statistic on the real data fell within the central 95% of the distribution of values produced by posterior predictive simulation. The results suggest that CAT20 fits better than LG, successfully accounting for the observed levels of saturation and homoplasy in all but one of the alignments. Both models do a poor job of modelling the site- specific selective constraints in our dataset, although again CAT20 performs better than LG (13 passes as opposed to 0). (c) Comparison of the distance distributions inferred under the CAT20 and LG models. The trees inferred under the better-fitting CAT20 model are significantly more congruent than those inferred under LG (mean distance: 2.68 versus 3.22, p , 0.0001). The significance of this difference was assessed using a permutation test that took the correlations between pairwise distances into account (see §4). These results suggest that a significant portion of the incongruence in this dataset of informational genes can be attributed to model misspecification, rather than genuinely distinct evolutionary histories.
  • 78. Tree Congruence distance frequency 1 2 3 4 5 no.testspassed(P>0.05) saturation and homoplasy site-specific biochemical diversity compositional heterogeneity 0 10 20 30 40 50 60 model CAT20 LG (b) 0 50 100 150 200 250 300 (a) 1.0 1.5 2.0 2.5 3.0 density model CAT20 LG 0 0.2 0.4 0.6 0.8 1.0 1.2 (c) 4876 T. A. Williams et al. Evolution of eukaryotes from Archaea on January 16, 2014rspb.royalsocietypublishing.orgDownloaded from !78
  • 80. !80
  • 81. !81