SlideShare a Scribd company logo
1 of 44
1
2
“If a tree dies, plant another in its place.”
- Carolus Linnaeus
https://bigpictureeducation.com/tree-life
Part 1
Tree Thinking
Hug, Laura A., et al. "A new view of the tree of life." Nature Microbiology 1 (2016): 16048.
*
3
CPR
Proteobacteria
4
From the same paper
CPR
Proteobacteria
What’s going on?
5
Concatenated universal proteins Single gene (16S ribosomal RNA gene)
Another example – Aquificae +
Epsilonproteobacteria
6
Wu and Eisen (2008)
AMPHORA pipeline
Wu et al. (2009)
AMPHORA pipeline
Huh?
7
2008 – one Aquificae 2009 – two Aquificae!
WTF??
8
Aquifex aeolicus genes
have high similarity with
genes from…
Adding Sulfurihydrogenibium
is enough to swing it from
“early branching” to “exotic
Epsilonproteobacterium”
Eveleigh et al. Genome Biol Evol (2013)
What I’m trying to say here
Lateral gene transfer and other processes, coupled with
old, old divergence times, leads to:
1. Phylogenetic instability
2. Artefactual “early branching”
3. Invalid representations of evolutionary relationships
One possible solution: try to release the "phylogenetic
pressure" with a network representation
9
10
Emma-Allen Vercoe
“I am not one and simple, but complex and many.”
- Virginia Woolf, The Waves
Part 2: On to networks!
So you want to build a network.
• Things to think about:
• What types of relationship are you trying to show?
• Do evolutionary distances matter?
• Rooted or unrooted?
• ALL relationships or just the most interesting / important
ones?
• How long will it take to build?
11
Implicit networks
• Good at showing uncertainty, ambiguity and
conflict in the data, BUT which type of confusion
are we looking at??
• Showing alternative bipartitions:
12
Neighbor-net: networks from
distance matrices
13
Complexity: O(n3)
Bryant and Moulton, Mol Biol Evol (2004)
14
Neighbor-net for 298
bacterial genomes
Acidithiobacillus is weird,
but why..?
Beiko, Biol Direct (2011)
Z-closure: supernetworks from
trees (Huson et al., 2004)
15
• We have a set of trees T that define a set of splits
• These splits are not all necessarily compatible, nor
do they necessarily all cover the same set of taxa
• Reconcile them by merging splits from overlapping
trees (the Z-rule)
• The result is a supernetwork constructed directly
from many trees
16
17
18
Core genes (n = 2317) of 30 E. coli genomes (red = pathogen, blue = non-pathogen)
Beauregard-Racine, middle authors, and Bapteste, Biol Direct (2011)
Chaos and confusion
So…
Explicit networks
• Show specific merging relationships (LGT,
hybridization) between lineages
• Need to balance complexity of calculation with
complexity of representation
19
Nodes in an explicit network
20
Huson and Scornavacca,
Genome Biol Evol (2010)
Tree node
(1 parent, 2 children)
Reticulation node
(2 parents, 1 child)
Cluster networks (Huson and Rupp, 2008)
• "Hardwired": each edge defines a bipartition,
therefore every displayed bipartition needs a
supporting edge
21
Galled networks (Kanj et al., 2008)
• "Softwired": switching reticulation edges on or off
gives different relationships
22
Cluster network
Galled network
So...
• "Softwired" clusters are easier to interpret
• But their complexity is much worse! (exponential vs
polynomial)
23
Affinity
• Network / matrix that shows the existence (and
possibly magnitude) of direct relationships
between entities
• Unconstrained in the relationships they can show,
but not phylogenetic per se
Pseudomonas affinities (Holloway and Beiko, 2010)
24
25
Sharing of antibiotic-resistance genes among bacteria
Hu et al., Appl Environ Microbiol (2016)
What should I do?????
• The best option may depend on just *how*
incompatible your sequences / trees are, and how
much information you're willing to set aside
• Tree = "la la la..."
• Cluster network = "it's complicated..."
• Affinity = "it's complicated, let me lay it out for you"
26
27
"Divide each difficulty into as many parts as is
feasible and necessary to resolve it."
- René Descartes
Part 3: Unwinding Gene Transfer
LGT networks based on the subtree
prune-and-regraft distance
28
SPR
Move this
Over here
29
SPR
SPR and LGT
“Species” tree LGT Gene tree
30
Identifying LGT events: permute species tree via SPRs until it agrees with the gene tree
Exponential in the size of the tree (n taxa)
PAINFULLY SLOW
(Beiko and Hamilton, 2006)
SPR and LGT and MAF
AGREEMENT FOREST (AF) – a set of subtrees that are compatible between S and G
MAXIMUM AGREEMENT FOREST (MAF) – the AF with the smallest number of subtrees
SPRS,G = |MAF|S,G – 1 (Bordewich and Semple, Ann Combinatorics 2005)
31
Building a MAF by cutting
Example case: a & c are sisters in the species tree, but not in the gene tree (where
b1, b2, …, bn are intervening). What can we do to the gene tree?
Do this recursively until there are no discrepancies between the two trees
32
Cut a Cut c Cut all of the bis
FIXED PARAMETER TRACTABLE –
Exponential in the distance between trees (k), not
the number of leaves (n)
k is almost always << n
AND IT’S EVEN BETTER –
Separate a larger tree (distance k) into independent
subtrees (distances j and k – j)
33
34
A supertree based on 244
microbial genomes
35
Alternative relationships
36
This comparison step took < 1 minute for all 40,000 trees
(and about a week to build the supertree)
37
Remember that the happiest people are not those
getting more, but those giving more.
- H. Jackson Brown
Part 4: A Couple of New Directions
Thinking about metagenomes
• Sure, we can infer LGT between genomes, but what
do these genomes have to do with each other?
• We can look for evidence of within-habitat transfer
by examining metagenomic samples
38
Hsu et al. (submitted)
WAAFLE
Workflow for Annotating Assemblies and Finding Lateral gene transfer Events
39
Genes on a contig
Genomic Epidemiology
• Inferring movement of pathogens between
habitats, and evolutionary events (such as LGT of
antimicrobial resistance elements) during the
spread of a pathogen
• Thousands of genomes create challenges!!
40
41
Keddy and Beiko (2018) Glob Ecol Biogeogr
And on that note…
We’re hiring!!!
42
http://arete-amr.ca/
Putting it all together...
• Networks: order out of chaos?
• Slightly less chaos out of chaos
• More importantly: less misleading about the extent of
agreement in your data
• Strong treelike signal can still come through
• Capturing LGT information from environmental
samples?
• Oy.
• Still lots of assumptions.
43
End
44
END

More Related Content

What's hot

Isolation of bacterial genomes from soil metagenomes
Isolation of bacterial genomes from soil metagenomes Isolation of bacterial genomes from soil metagenomes
Isolation of bacterial genomes from soil metagenomes Dan Burkhardt
 
60 ch14dn ahistory2008
60 ch14dn ahistory200860 ch14dn ahistory2008
60 ch14dn ahistory2008sbarkanic
 
Karl kjer : Thinking about science
Karl kjer : Thinking about scienceKarl kjer : Thinking about science
Karl kjer : Thinking about scienceKarl Kjer
 
Molecular clock, Neutral hypothesis
Molecular clock, Neutral hypothesisMolecular clock, Neutral hypothesis
Molecular clock, Neutral hypothesisFreya Cardozo
 
Open stax biology (nonmajors) ch10
Open stax biology (nonmajors) ch10Open stax biology (nonmajors) ch10
Open stax biology (nonmajors) ch10Lumen Learning
 
Transgenic animals mice for students
Transgenic animals mice  for studentsTransgenic animals mice  for students
Transgenic animals mice for studentsthirupathiSathya
 
Open stax biology(nonmajors) ch07
Open stax biology(nonmajors) ch07Open stax biology(nonmajors) ch07
Open stax biology(nonmajors) ch07Lumen Learning
 
Open stax biology (nonmajors) ch08
Open stax biology (nonmajors) ch08Open stax biology (nonmajors) ch08
Open stax biology (nonmajors) ch08Lumen Learning
 
Form, function, and evolution of living organisms
Form, function, and evolution of living organismsForm, function, and evolution of living organisms
Form, function, and evolution of living organismsJosé Luis Moreno Garvayo
 
23 lecture evolution_populations
23 lecture evolution_populations23 lecture evolution_populations
23 lecture evolution_populationsveneethmathew
 

What's hot (13)

Isolation of bacterial genomes from soil metagenomes
Isolation of bacterial genomes from soil metagenomes Isolation of bacterial genomes from soil metagenomes
Isolation of bacterial genomes from soil metagenomes
 
60 ch14dn ahistory2008
60 ch14dn ahistory200860 ch14dn ahistory2008
60 ch14dn ahistory2008
 
nplants201542
nplants201542nplants201542
nplants201542
 
Karl kjer : Thinking about science
Karl kjer : Thinking about scienceKarl kjer : Thinking about science
Karl kjer : Thinking about science
 
Molecular clock, Neutral hypothesis
Molecular clock, Neutral hypothesisMolecular clock, Neutral hypothesis
Molecular clock, Neutral hypothesis
 
Open stax biology (nonmajors) ch10
Open stax biology (nonmajors) ch10Open stax biology (nonmajors) ch10
Open stax biology (nonmajors) ch10
 
Transgenic animals mice for students
Transgenic animals mice  for studentsTransgenic animals mice  for students
Transgenic animals mice for students
 
TRANSGENIC MICE
TRANSGENIC MICETRANSGENIC MICE
TRANSGENIC MICE
 
Open stax biology(nonmajors) ch07
Open stax biology(nonmajors) ch07Open stax biology(nonmajors) ch07
Open stax biology(nonmajors) ch07
 
Open stax biology (nonmajors) ch08
Open stax biology (nonmajors) ch08Open stax biology (nonmajors) ch08
Open stax biology (nonmajors) ch08
 
Form, function, and evolution of living organisms
Form, function, and evolution of living organismsForm, function, and evolution of living organisms
Form, function, and evolution of living organisms
 
23 lecture evolution_populations
23 lecture evolution_populations23 lecture evolution_populations
23 lecture evolution_populations
 
Transgenic technology
Transgenic technologyTransgenic technology
Transgenic technology
 

Similar to Beiko networks 2019_final

Lecture 02 (2 04-2021) phylogeny
Lecture 02 (2 04-2021) phylogenyLecture 02 (2 04-2021) phylogeny
Lecture 02 (2 04-2021) phylogenyKristen DeAngelis
 
Multiple Sequence Alignment-just glims of viewes on bioinformatics.
 Multiple Sequence Alignment-just glims of viewes on bioinformatics. Multiple Sequence Alignment-just glims of viewes on bioinformatics.
Multiple Sequence Alignment-just glims of viewes on bioinformatics.Arghadip Samanta
 
Microbial Phylogenomics (EVE161) Class 5
Microbial Phylogenomics (EVE161) Class 5Microbial Phylogenomics (EVE161) Class 5
Microbial Phylogenomics (EVE161) Class 5Jonathan Eisen
 
BiPday 2014 -- Vicario Saverio
BiPday 2014 -- Vicario SaverioBiPday 2014 -- Vicario Saverio
BiPday 2014 -- Vicario Saverioeventi-ITBbari
 
Introduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysisIntroduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysisJosh Neufeld
 
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups Jonathan Eisen
 
Ap Chapter 26 Evolutionary History Of Biological Diversity
Ap Chapter 26 Evolutionary History Of Biological DiversityAp Chapter 26 Evolutionary History Of Biological Diversity
Ap Chapter 26 Evolutionary History Of Biological Diversitysmithbio
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18Jonathan Eisen
 
2014 talk at NYU CUSP: "Biology Caught the Bus: Now what? Sequencing, Big Dat...
2014 talk at NYU CUSP: "Biology Caught the Bus: Now what? Sequencing, Big Dat...2014 talk at NYU CUSP: "Biology Caught the Bus: Now what? Sequencing, Big Dat...
2014 talk at NYU CUSP: "Biology Caught the Bus: Now what? Sequencing, Big Dat...c.titus.brown
 
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Jonathan Eisen
 
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of LifeMicrobial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of LifeJonathan Eisen
 
Effects of density on spacing patterns and habitat associations of a Neotropi...
Effects of density on spacing patterns and habitat associations of a Neotropi...Effects of density on spacing patterns and habitat associations of a Neotropi...
Effects of density on spacing patterns and habitat associations of a Neotropi...Nicole Angeli
 
Towards inferring the history of life in the presence of lateral gene transfe...
Towards inferring the history of life in the presence of lateral gene transfe...Towards inferring the history of life in the presence of lateral gene transfe...
Towards inferring the history of life in the presence of lateral gene transfe...boussau
 
Franz et al evol 2016 representing phylogeny as a logically tractable variable
Franz et al evol 2016 representing phylogeny as a logically tractable variable  Franz et al evol 2016 representing phylogeny as a logically tractable variable
Franz et al evol 2016 representing phylogeny as a logically tractable variable taxonbytes
 
Bioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsBioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsProf. Wim Van Criekinge
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genometcha163
 

Similar to Beiko networks 2019_final (20)

07_Phylogeny_2022.pdf
07_Phylogeny_2022.pdf07_Phylogeny_2022.pdf
07_Phylogeny_2022.pdf
 
Lecture 02 (2 04-2021) phylogeny
Lecture 02 (2 04-2021) phylogenyLecture 02 (2 04-2021) phylogeny
Lecture 02 (2 04-2021) phylogeny
 
bai2
bai2bai2
bai2
 
Multiple Sequence Alignment-just glims of viewes on bioinformatics.
 Multiple Sequence Alignment-just glims of viewes on bioinformatics. Multiple Sequence Alignment-just glims of viewes on bioinformatics.
Multiple Sequence Alignment-just glims of viewes on bioinformatics.
 
Microbial Phylogenomics (EVE161) Class 5
Microbial Phylogenomics (EVE161) Class 5Microbial Phylogenomics (EVE161) Class 5
Microbial Phylogenomics (EVE161) Class 5
 
phylogenetics.pdf
phylogenetics.pdfphylogenetics.pdf
phylogenetics.pdf
 
BiPday 2014 -- Vicario Saverio
BiPday 2014 -- Vicario SaverioBiPday 2014 -- Vicario Saverio
BiPday 2014 -- Vicario Saverio
 
Introduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysisIntroduction to 16S rRNA gene multivariate analysis
Introduction to 16S rRNA gene multivariate analysis
 
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
Microbial Phylogenomics (EVE161) Class 7: rRNA PCR and Major Groups
 
Ap Chapter 26 Evolutionary History Of Biological Diversity
Ap Chapter 26 Evolutionary History Of Biological DiversityAp Chapter 26 Evolutionary History Of Biological Diversity
Ap Chapter 26 Evolutionary History Of Biological Diversity
 
EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18EVE 161 Winter 2018 Class 18
EVE 161 Winter 2018 Class 18
 
2014 talk at NYU CUSP: "Biology Caught the Bus: Now what? Sequencing, Big Dat...
2014 talk at NYU CUSP: "Biology Caught the Bus: Now what? Sequencing, Big Dat...2014 talk at NYU CUSP: "Biology Caught the Bus: Now what? Sequencing, Big Dat...
2014 talk at NYU CUSP: "Biology Caught the Bus: Now what? Sequencing, Big Dat...
 
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
Phylogeny of Bacterial and Archaeal Genomes Using Conserved Genes: Supertrees...
 
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of LifeMicrobial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
Microbial Phylogenomics (EVE161) Class 3: Woese and the Tree of Life
 
Effects of density on spacing patterns and habitat associations of a Neotropi...
Effects of density on spacing patterns and habitat associations of a Neotropi...Effects of density on spacing patterns and habitat associations of a Neotropi...
Effects of density on spacing patterns and habitat associations of a Neotropi...
 
Towards inferring the history of life in the presence of lateral gene transfe...
Towards inferring the history of life in the presence of lateral gene transfe...Towards inferring the history of life in the presence of lateral gene transfe...
Towards inferring the history of life in the presence of lateral gene transfe...
 
Franz et al evol 2016 representing phylogeny as a logically tractable variable
Franz et al evol 2016 representing phylogeny as a logically tractable variable  Franz et al evol 2016 representing phylogeny as a logically tractable variable
Franz et al evol 2016 representing phylogeny as a logically tractable variable
 
Bioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsBioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogenetics
 
Smbe 2013 talk
Smbe 2013 talkSmbe 2013 talk
Smbe 2013 talk
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genome
 

More from beiko

ASMNGS_ARETE_Beiko_2022Oct19.pptx
ASMNGS_ARETE_Beiko_2022Oct19.pptxASMNGS_ARETE_Beiko_2022Oct19.pptx
ASMNGS_ARETE_Beiko_2022Oct19.pptxbeiko
 
Beiko cmo gen_epi_monday
Beiko cmo gen_epi_mondayBeiko cmo gen_epi_monday
Beiko cmo gen_epi_mondaybeiko
 
Biomedical data
Biomedical dataBiomedical data
Biomedical databeiko
 
Rob csm2018
Rob csm2018Rob csm2018
Rob csm2018beiko
 
Beiko taconic-nov3
Beiko taconic-nov3Beiko taconic-nov3
Beiko taconic-nov3beiko
 
CCBC tutorial beiko
CCBC tutorial beikoCCBC tutorial beiko
CCBC tutorial beikobeiko
 
GenGIS presentation at Vizbi 2016
GenGIS presentation at Vizbi 2016GenGIS presentation at Vizbi 2016
GenGIS presentation at Vizbi 2016beiko
 
Beiko ANL Soil Metagenomics presentation
Beiko ANL Soil Metagenomics presentationBeiko ANL Soil Metagenomics presentation
Beiko ANL Soil Metagenomics presentationbeiko
 
DCSI presentation 2015
DCSI presentation 2015DCSI presentation 2015
DCSI presentation 2015beiko
 
2015 06-12-beiko-irida-big data
2015 06-12-beiko-irida-big data2015 06-12-beiko-irida-big data
2015 06-12-beiko-irida-big databeiko
 
Is microbial ecology driven by roaming genes?
Is microbial ecology driven by roaming genes?Is microbial ecology driven by roaming genes?
Is microbial ecology driven by roaming genes?beiko
 
Beiko hpcs
Beiko hpcsBeiko hpcs
Beiko hpcsbeiko
 
Beiko biogeography
Beiko biogeographyBeiko biogeography
Beiko biogeographybeiko
 
2014 04-beiko-biology
2014 04-beiko-biology2014 04-beiko-biology
2014 04-beiko-biologybeiko
 
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"beiko
 
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)beiko
 
Beiko smbe2013-final
Beiko smbe2013-finalBeiko smbe2013-final
Beiko smbe2013-finalbeiko
 
Rob Beiko - #SMBE12 presentation
Rob Beiko - #SMBE12 presentationRob Beiko - #SMBE12 presentation
Rob Beiko - #SMBE12 presentationbeiko
 
Beiko gen gis2-share
Beiko gen gis2-shareBeiko gen gis2-share
Beiko gen gis2-sharebeiko
 

More from beiko (19)

ASMNGS_ARETE_Beiko_2022Oct19.pptx
ASMNGS_ARETE_Beiko_2022Oct19.pptxASMNGS_ARETE_Beiko_2022Oct19.pptx
ASMNGS_ARETE_Beiko_2022Oct19.pptx
 
Beiko cmo gen_epi_monday
Beiko cmo gen_epi_mondayBeiko cmo gen_epi_monday
Beiko cmo gen_epi_monday
 
Biomedical data
Biomedical dataBiomedical data
Biomedical data
 
Rob csm2018
Rob csm2018Rob csm2018
Rob csm2018
 
Beiko taconic-nov3
Beiko taconic-nov3Beiko taconic-nov3
Beiko taconic-nov3
 
CCBC tutorial beiko
CCBC tutorial beikoCCBC tutorial beiko
CCBC tutorial beiko
 
GenGIS presentation at Vizbi 2016
GenGIS presentation at Vizbi 2016GenGIS presentation at Vizbi 2016
GenGIS presentation at Vizbi 2016
 
Beiko ANL Soil Metagenomics presentation
Beiko ANL Soil Metagenomics presentationBeiko ANL Soil Metagenomics presentation
Beiko ANL Soil Metagenomics presentation
 
DCSI presentation 2015
DCSI presentation 2015DCSI presentation 2015
DCSI presentation 2015
 
2015 06-12-beiko-irida-big data
2015 06-12-beiko-irida-big data2015 06-12-beiko-irida-big data
2015 06-12-beiko-irida-big data
 
Is microbial ecology driven by roaming genes?
Is microbial ecology driven by roaming genes?Is microbial ecology driven by roaming genes?
Is microbial ecology driven by roaming genes?
 
Beiko hpcs
Beiko hpcsBeiko hpcs
Beiko hpcs
 
Beiko biogeography
Beiko biogeographyBeiko biogeography
Beiko biogeography
 
2014 04-beiko-biology
2014 04-beiko-biology2014 04-beiko-biology
2014 04-beiko-biology
 
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
Beiko Deep Genomics presentation - "Grand theft operon - lateral city"
 
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
Rob's GenGIS presentation at IBS Special Meeting (Montreal 2013)
 
Beiko smbe2013-final
Beiko smbe2013-finalBeiko smbe2013-final
Beiko smbe2013-final
 
Rob Beiko - #SMBE12 presentation
Rob Beiko - #SMBE12 presentationRob Beiko - #SMBE12 presentation
Rob Beiko - #SMBE12 presentation
 
Beiko gen gis2-share
Beiko gen gis2-shareBeiko gen gis2-share
Beiko gen gis2-share
 

Recently uploaded

Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxRenuJangid3
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Silpa
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry Areesha Ahmad
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...Monika Rani
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIADr. TATHAGAT KHOBRAGADE
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLkantirani197
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professormuralinath2
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceAlex Henderson
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Silpa
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsbassianu17
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Silpa
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsOrtegaSyrineMay
 

Recently uploaded (20)

Use of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptxUse of mutants in understanding seedling development.pptx
Use of mutants in understanding seedling development.pptx
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICEPATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
PATNA CALL GIRLS 8617370543 LOW PRICE ESCORT SERVICE
 
Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.Selaginella: features, morphology ,anatomy and reproduction.
Selaginella: features, morphology ,anatomy and reproduction.
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRLGwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
Gwalior ❤CALL GIRL 84099*07087 ❤CALL GIRLS IN Gwalior ESCORT SERVICE❤CALL GIRL
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate ProfessorThyroid Physiology_Dr.E. Muralinath_ Associate Professor
Thyroid Physiology_Dr.E. Muralinath_ Associate Professor
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.Phenolics: types, biosynthesis and functions.
Phenolics: types, biosynthesis and functions.
 
Genetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditionsGenetics and epigenetics of ADHD and comorbid conditions
Genetics and epigenetics of ADHD and comorbid conditions
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.Molecular markers- RFLP, RAPD, AFLP, SNP etc.
Molecular markers- RFLP, RAPD, AFLP, SNP etc.
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
Grade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its FunctionsGrade 7 - Lesson 1 - Microscope and Its Functions
Grade 7 - Lesson 1 - Microscope and Its Functions
 

Beiko networks 2019_final

  • 1. 1
  • 2. 2 “If a tree dies, plant another in its place.” - Carolus Linnaeus https://bigpictureeducation.com/tree-life Part 1 Tree Thinking
  • 3. Hug, Laura A., et al. "A new view of the tree of life." Nature Microbiology 1 (2016): 16048. * 3 CPR Proteobacteria
  • 4. 4 From the same paper CPR Proteobacteria
  • 5. What’s going on? 5 Concatenated universal proteins Single gene (16S ribosomal RNA gene)
  • 6. Another example – Aquificae + Epsilonproteobacteria 6 Wu and Eisen (2008) AMPHORA pipeline Wu et al. (2009) AMPHORA pipeline
  • 7. Huh? 7 2008 – one Aquificae 2009 – two Aquificae!
  • 8. WTF?? 8 Aquifex aeolicus genes have high similarity with genes from… Adding Sulfurihydrogenibium is enough to swing it from “early branching” to “exotic Epsilonproteobacterium” Eveleigh et al. Genome Biol Evol (2013)
  • 9. What I’m trying to say here Lateral gene transfer and other processes, coupled with old, old divergence times, leads to: 1. Phylogenetic instability 2. Artefactual “early branching” 3. Invalid representations of evolutionary relationships One possible solution: try to release the "phylogenetic pressure" with a network representation 9
  • 10. 10 Emma-Allen Vercoe “I am not one and simple, but complex and many.” - Virginia Woolf, The Waves Part 2: On to networks!
  • 11. So you want to build a network. • Things to think about: • What types of relationship are you trying to show? • Do evolutionary distances matter? • Rooted or unrooted? • ALL relationships or just the most interesting / important ones? • How long will it take to build? 11
  • 12. Implicit networks • Good at showing uncertainty, ambiguity and conflict in the data, BUT which type of confusion are we looking at?? • Showing alternative bipartitions: 12
  • 13. Neighbor-net: networks from distance matrices 13 Complexity: O(n3) Bryant and Moulton, Mol Biol Evol (2004)
  • 14. 14 Neighbor-net for 298 bacterial genomes Acidithiobacillus is weird, but why..? Beiko, Biol Direct (2011)
  • 15. Z-closure: supernetworks from trees (Huson et al., 2004) 15 • We have a set of trees T that define a set of splits • These splits are not all necessarily compatible, nor do they necessarily all cover the same set of taxa • Reconcile them by merging splits from overlapping trees (the Z-rule) • The result is a supernetwork constructed directly from many trees
  • 16. 16
  • 17. 17
  • 18. 18 Core genes (n = 2317) of 30 E. coli genomes (red = pathogen, blue = non-pathogen) Beauregard-Racine, middle authors, and Bapteste, Biol Direct (2011) Chaos and confusion
  • 19. So… Explicit networks • Show specific merging relationships (LGT, hybridization) between lineages • Need to balance complexity of calculation with complexity of representation 19
  • 20. Nodes in an explicit network 20 Huson and Scornavacca, Genome Biol Evol (2010) Tree node (1 parent, 2 children) Reticulation node (2 parents, 1 child)
  • 21. Cluster networks (Huson and Rupp, 2008) • "Hardwired": each edge defines a bipartition, therefore every displayed bipartition needs a supporting edge 21 Galled networks (Kanj et al., 2008) • "Softwired": switching reticulation edges on or off gives different relationships
  • 23. So... • "Softwired" clusters are easier to interpret • But their complexity is much worse! (exponential vs polynomial) 23
  • 24. Affinity • Network / matrix that shows the existence (and possibly magnitude) of direct relationships between entities • Unconstrained in the relationships they can show, but not phylogenetic per se Pseudomonas affinities (Holloway and Beiko, 2010) 24
  • 25. 25 Sharing of antibiotic-resistance genes among bacteria Hu et al., Appl Environ Microbiol (2016)
  • 26. What should I do????? • The best option may depend on just *how* incompatible your sequences / trees are, and how much information you're willing to set aside • Tree = "la la la..." • Cluster network = "it's complicated..." • Affinity = "it's complicated, let me lay it out for you" 26
  • 27. 27 "Divide each difficulty into as many parts as is feasible and necessary to resolve it." - René Descartes Part 3: Unwinding Gene Transfer
  • 28. LGT networks based on the subtree prune-and-regraft distance 28
  • 30. SPR SPR and LGT “Species” tree LGT Gene tree 30 Identifying LGT events: permute species tree via SPRs until it agrees with the gene tree Exponential in the size of the tree (n taxa) PAINFULLY SLOW (Beiko and Hamilton, 2006)
  • 31. SPR and LGT and MAF AGREEMENT FOREST (AF) – a set of subtrees that are compatible between S and G MAXIMUM AGREEMENT FOREST (MAF) – the AF with the smallest number of subtrees SPRS,G = |MAF|S,G – 1 (Bordewich and Semple, Ann Combinatorics 2005) 31
  • 32. Building a MAF by cutting Example case: a & c are sisters in the species tree, but not in the gene tree (where b1, b2, …, bn are intervening). What can we do to the gene tree? Do this recursively until there are no discrepancies between the two trees 32 Cut a Cut c Cut all of the bis
  • 33. FIXED PARAMETER TRACTABLE – Exponential in the distance between trees (k), not the number of leaves (n) k is almost always << n AND IT’S EVEN BETTER – Separate a larger tree (distance k) into independent subtrees (distances j and k – j) 33
  • 34. 34
  • 35. A supertree based on 244 microbial genomes 35
  • 36. Alternative relationships 36 This comparison step took < 1 minute for all 40,000 trees (and about a week to build the supertree)
  • 37. 37 Remember that the happiest people are not those getting more, but those giving more. - H. Jackson Brown Part 4: A Couple of New Directions
  • 38. Thinking about metagenomes • Sure, we can infer LGT between genomes, but what do these genomes have to do with each other? • We can look for evidence of within-habitat transfer by examining metagenomic samples 38
  • 39. Hsu et al. (submitted) WAAFLE Workflow for Annotating Assemblies and Finding Lateral gene transfer Events 39 Genes on a contig
  • 40. Genomic Epidemiology • Inferring movement of pathogens between habitats, and evolutionary events (such as LGT of antimicrobial resistance elements) during the spread of a pathogen • Thousands of genomes create challenges!! 40
  • 41. 41 Keddy and Beiko (2018) Glob Ecol Biogeogr
  • 42. And on that note… We’re hiring!!! 42 http://arete-amr.ca/
  • 43. Putting it all together... • Networks: order out of chaos? • Slightly less chaos out of chaos • More importantly: less misleading about the extent of agreement in your data • Strong treelike signal can still come through • Capturing LGT information from environmental samples? • Oy. • Still lots of assumptions. 43