SlideShare a Scribd company logo
1 of 1
Literature Cited
Preliminary Results
Introduction
What are transposable elements?
A transposable element (TE) is a segment of DNA that can copy itself and/or move from place to
place in a genome. There are many types of TEs, or transposons, as they are also called, and they range
in length according to their type.
Transposable elements can be divided into two major classes by mechanism of action. Class 1
transposons follow a “copy and paste” mechanism, leaving multiple copies of themselves throughout
the genome they occupy. Class 2 transposons follow a “cut and paste” mechanism, moving around the
genome, but maintaining only a single copy at any given time.
Acknowledgments
Transposon Analysis in the Aardvark Genome
Alicia Wafa, Sarah Mangum, Laura Blanco-Berdugo, Roy Platt II, David Ray
Methods
Purpose
What does TE analysis tell us?
By looking at the types, ages, and amounts of transposable element in a species, we can identify
patterns of evolution within the genomes of the organisms of interest.
Traditional Transposable Element Analysis: Homology
Preliminary transposable element analysis using homology usually includes comparing the
species to a previously analyzed relative. This method does not allow identification of TEs not
previously identified in the relative. This method also produces inaccuracies in the aging of
transposable elements, as well as their numbers and types.
A Better Way: De Novo Analysis and Manual Curation
By beginning “from scratch”, modeling and identifying TEs in a genome with the use of certain
computer programs (a de novo analysis), there is no need to rely on homology. By supplementing
this framework with manual curation done by the researcher, full TE consensus sequences can be
reconstructed from the fragments often identified from the programs used. Implementing both of
these techniques allows us to create a much more accurate data set from which to draw conclusions
about what forces have influenced the species’ genomes and events in their evolution.
Figure 1. Classes and types of transposable elements and their movement through the genome.
A bigger problem
The data produced from homology based analysis does more than skew the accuracy of one paper.
It presents a false foundation upon which to base hypotheses related to genome structure and
function. Not only can this data distort conclusions about this species, but if it is used to run
homology-dependent analysis of related species, it can skew assumption about the evolutionary
history of whole clades.
Discussion
A shifting landscape
Multiple rounds of manual curation have been completed but the project is still ongoing. However,
results from the first three rounds indicate that the de novo annotation is yielding a distinct picture of
the genome. For example, as seen in figures 3 and 5, a homology-based analysis suggests that there has
been little SINE accumulation in the aardvark genome. However, the more accurate de novo annotation
suggests that SINEs have accumulated a higher rates than any other category of TE. Other categories
exhibit similar shifts. As more iterations of annotation are completed, the divergence between the
homology dependent analysis of the aardvark genome and that accomplished with de novo
identification and manual curation will increase. At the completion of this project, one may expect to
see results similar to those shown below. On the left, the genome of this butterfly has been analyzed
using homology alone. The corrected version on the right, accomplished through de novo identification
and manual curation, shows how skewed the original data was. This is a common problem, especially
when working with species that do not have close relatives for which extensive transposable element
analysis has been accomplished. The more genetic distance between two species, the less accurate
homology-based annotation becomes.
Figure 6. TE distributions before and after manual curation of the Heliconius Melpomene
transposable elements. (C) Homology only based distribution. (F) Distribution after
manual curation. Original studies from Platt 2016 and Lavoie 2014..
Chicken
Painted Turtle
Elephant Shark
Lizard
Manatee
Gharial
Panda
DogKiller Whale
Platypus
WallabySloth Gorilla Armadillo
Aardvark
0
10
20
30
40
50
60
300 800 1300 1800 2300 2800 3300 3800
GenomeTE%Coverage
Genome Size (Mb)
Figure 5. Preliminary results of the transposon content of the Aardvark genome after manual
curation.
Figure 4. TE content in other well studied genomes.
Figure 3. Graph of TE content in the aardvark genome using only homology dependent
techniques.
Special thanks to the Honors College Undergraduate Research Scholars Program
supported by the CH and Helen Jones Foundation.
Platt, Roy N., Laura Blanco-Berdugo, and David A. Ray. "Accurate transposable element annotation is vital
when analyzing new genome assemblies." Genome Biology and Evolution (2016): evw009
Methods (cont)
Figure 2. Typical alignment of the 40 most similar elements. (a) beginning of element. (b) end of element.
a bManual Annotation
Once the sequences have
been aligned, it is the job of
the researcher to manually
identify the beginning and
end of an element. If both
are present, the researcher
classifies the identification of
that element as complete. If
either end is absent, then the
researcher will extend the
element until both end are
found.
0
0.001
0.002
0.003
0.004
0.005
0.006
0.007
PercentGenomeCoverage
Millions of Years
others
SINE
LINE
LTR
ERV
DNA
0
0.002
0.004
0.006
0.008
0.01
0.012
0.014
0.00E+00
4.55E+06
9.09E+06
1.36E+07
1.82E+07
2.27E+07
2.73E+07
3.18E+07
3.64E+07
4.09E+07
4.55E+07
5.00E+07
5.45E+07
5.91E+07
6.36E+07
6.82E+07
7.27E+07
7.73E+07
8.18E+07
8.64E+07
9.09E+07
9.55E+07
1.00E+08
1.05E+08
1.09E+08
1.14E+08
1.18E+08
1.23E+08
1.27E+08
1.32E+08
1.36E+08
1.41E+08
1.45E+08
1.50E+08
1.55E+08
1.59E+08
1.64E+08
1.68E+08
1.73E+08
1.77E+08
1.82E+08
1.86E+08
1.91E+08
1.95E+08
2.00E+08
PercentGenomeCoverage
Millions of Years
Others
SINE
LINE
LTR
ERV
DNA

More Related Content

What's hot

Molecular pathology in microbiology and metagenomics
Molecular pathology in microbiology and metagenomicsMolecular pathology in microbiology and metagenomics
Molecular pathology in microbiology and metagenomicsCharithRanatunga
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applicationsSham Sadiq
 
Systems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systemsSystems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systemsLars Juhl Jensen
 
Tom Delmont: From the Terragenome Project to Global Metagenomic Comparisons: ...
Tom Delmont: From the Terragenome Project to Global Metagenomic Comparisons: ...Tom Delmont: From the Terragenome Project to Global Metagenomic Comparisons: ...
Tom Delmont: From the Terragenome Project to Global Metagenomic Comparisons: ...GigaScience, BGI Hong Kong
 
Metagenomic analysis
Metagenomic analysisMetagenomic analysis
Metagenomic analysisAnimesh Kumar
 
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life ExampleEVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life ExampleJonathan Eisen
 
Comparative genomics 2
Comparative genomics 2Comparative genomics 2
Comparative genomics 2GCUF
 
DNA Sequencing in Phylogeny
DNA Sequencing in PhylogenyDNA Sequencing in Phylogeny
DNA Sequencing in PhylogenyBikash1489
 
Systems biology: Bioinformatics on complete biological system
Systems biology: Bioinformatics on complete biological systemSystems biology: Bioinformatics on complete biological system
Systems biology: Bioinformatics on complete biological systemLars Juhl Jensen
 
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of LifeEVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of LifeJonathan Eisen
 
Microbial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New CyberinfrastructureMicrobial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New CyberinfrastructureLarry Smarr
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 
Beiko taconic-nov3
Beiko taconic-nov3Beiko taconic-nov3
Beiko taconic-nov3beiko
 
EVE 161 Winter 2018 Class 16
EVE 161 Winter 2018 Class 16EVE 161 Winter 2018 Class 16
EVE 161 Winter 2018 Class 16Jonathan Eisen
 
Advanced Systems Biology Methods in Drug Discovery
Advanced Systems Biology Methods in Drug DiscoveryAdvanced Systems Biology Methods in Drug Discovery
Advanced Systems Biology Methods in Drug DiscoveryMikel Txopitea Elorriaga
 
Systems biology - Understanding biology at the systems level
Systems biology - Understanding biology at the systems levelSystems biology - Understanding biology at the systems level
Systems biology - Understanding biology at the systems levelLars Juhl Jensen
 
Systems biology: Bioinformatics on complete biological systems
Systems biology: Bioinformatics on complete biological systemsSystems biology: Bioinformatics on complete biological systems
Systems biology: Bioinformatics on complete biological systemsLars Juhl Jensen
 

What's hot (20)

Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
Introduction to Metagenomics Data Analysis - UEB-VHIR - 2013
 
Molecular pathology in microbiology and metagenomics
Molecular pathology in microbiology and metagenomicsMolecular pathology in microbiology and metagenomics
Molecular pathology in microbiology and metagenomics
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applications
 
Systems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systemsSystems biology - Bioinformatics on complete biological systems
Systems biology - Bioinformatics on complete biological systems
 
genomic comparison
genomic comparison genomic comparison
genomic comparison
 
Tom Delmont: From the Terragenome Project to Global Metagenomic Comparisons: ...
Tom Delmont: From the Terragenome Project to Global Metagenomic Comparisons: ...Tom Delmont: From the Terragenome Project to Global Metagenomic Comparisons: ...
Tom Delmont: From the Terragenome Project to Global Metagenomic Comparisons: ...
 
Metagenomic analysis
Metagenomic analysisMetagenomic analysis
Metagenomic analysis
 
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life ExampleEVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
EVE161: Microbial Phylogenomics - Class 6 - Tree of Life Example
 
Comparative genomics 2
Comparative genomics 2Comparative genomics 2
Comparative genomics 2
 
DNA Sequencing in Phylogeny
DNA Sequencing in PhylogenyDNA Sequencing in Phylogeny
DNA Sequencing in Phylogeny
 
Systems biology: Bioinformatics on complete biological system
Systems biology: Bioinformatics on complete biological systemSystems biology: Bioinformatics on complete biological system
Systems biology: Bioinformatics on complete biological system
 
Folker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data AnnotationFolker Meyer: Metagenomic Data Annotation
Folker Meyer: Metagenomic Data Annotation
 
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of LifeEVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
EVE161: Microbial Phylogenomics - Class 3 - Woese and the Tree of Life
 
Microbial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New CyberinfrastructureMicrobial Metagenomics Drives a New Cyberinfrastructure
Microbial Metagenomics Drives a New Cyberinfrastructure
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Beiko taconic-nov3
Beiko taconic-nov3Beiko taconic-nov3
Beiko taconic-nov3
 
EVE 161 Winter 2018 Class 16
EVE 161 Winter 2018 Class 16EVE 161 Winter 2018 Class 16
EVE 161 Winter 2018 Class 16
 
Advanced Systems Biology Methods in Drug Discovery
Advanced Systems Biology Methods in Drug DiscoveryAdvanced Systems Biology Methods in Drug Discovery
Advanced Systems Biology Methods in Drug Discovery
 
Systems biology - Understanding biology at the systems level
Systems biology - Understanding biology at the systems levelSystems biology - Understanding biology at the systems level
Systems biology - Understanding biology at the systems level
 
Systems biology: Bioinformatics on complete biological systems
Systems biology: Bioinformatics on complete biological systemsSystems biology: Bioinformatics on complete biological systems
Systems biology: Bioinformatics on complete biological systems
 

Similar to AliciaPoster (Wafa%2c Alicia)

Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Kate Hertweck
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genometcha163
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 pptrehman2009
 
Genetic representation
Genetic representationGenetic representation
Genetic representationDEEPIKA T
 
Degenerate Transposable Elements Have Been Found In Human DNA | The Lifescien...
Degenerate Transposable Elements Have Been Found In Human DNA | The Lifescien...Degenerate Transposable Elements Have Been Found In Human DNA | The Lifescien...
Degenerate Transposable Elements Have Been Found In Human DNA | The Lifescien...The Lifesciences Magazine
 
Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Leighton Pritchard
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAmol Kunde
 
Bioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsBioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsProf. Wim Van Criekinge
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAthira RG
 
How Can Ngs Forward Research Essay
How Can Ngs Forward Research EssayHow Can Ngs Forward Research Essay
How Can Ngs Forward Research EssayStefanie Yang
 
Hertweck AB3ACBS presentation
Hertweck AB3ACBS presentationHertweck AB3ACBS presentation
Hertweck AB3ACBS presentationKate Hertweck
 
2014 whitney-research
2014 whitney-research2014 whitney-research
2014 whitney-researchc.titus.brown
 
Particle Swarm Optimization for Gene cluster Identification
Particle Swarm Optimization for Gene cluster IdentificationParticle Swarm Optimization for Gene cluster Identification
Particle Swarm Optimization for Gene cluster IdentificationEditor IJCATR
 
Group 5 DNA Tech - Ecology & Envt
Group 5 DNA Tech - Ecology & EnvtGroup 5 DNA Tech - Ecology & Envt
Group 5 DNA Tech - Ecology & EnvtJessica Kabigting
 

Similar to AliciaPoster (Wafa%2c Alicia) (20)

Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
Evolution of transposons, genomes, and organisms (Hertweck Fall 2014)
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genome
 
te_poster_sping_2016
te_poster_sping_2016te_poster_sping_2016
te_poster_sping_2016
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 ppt
 
Nature Of Gene.pdf
Nature Of Gene.pdfNature Of Gene.pdf
Nature Of Gene.pdf
 
Nature Of Gene.pdf
Nature Of Gene.pdfNature Of Gene.pdf
Nature Of Gene.pdf
 
Genetic representation
Genetic representationGenetic representation
Genetic representation
 
Degenerate Transposable Elements Have Been Found In Human DNA | The Lifescien...
Degenerate Transposable Elements Have Been Found In Human DNA | The Lifescien...Degenerate Transposable Elements Have Been Found In Human DNA | The Lifescien...
Degenerate Transposable Elements Have Been Found In Human DNA | The Lifescien...
 
Pathogen Genome Data
Pathogen Genome DataPathogen Genome Data
Pathogen Genome Data
 
Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1Comparative Genomics and Visualisation - Part 1
Comparative Genomics and Visualisation - Part 1
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Bioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogeneticsBioinformatica 24-11-2011-t6-phylogenetics
Bioinformatica 24-11-2011-t6-phylogenetics
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
How Can Ngs Forward Research Essay
How Can Ngs Forward Research EssayHow Can Ngs Forward Research Essay
How Can Ngs Forward Research Essay
 
Hertweck AB3ACBS presentation
Hertweck AB3ACBS presentationHertweck AB3ACBS presentation
Hertweck AB3ACBS presentation
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
2014 whitney-research
2014 whitney-research2014 whitney-research
2014 whitney-research
 
Particle Swarm Optimization for Gene cluster Identification
Particle Swarm Optimization for Gene cluster IdentificationParticle Swarm Optimization for Gene cluster Identification
Particle Swarm Optimization for Gene cluster Identification
 
Bio
BioBio
Bio
 
Group 5 DNA Tech - Ecology & Envt
Group 5 DNA Tech - Ecology & EnvtGroup 5 DNA Tech - Ecology & Envt
Group 5 DNA Tech - Ecology & Envt
 

AliciaPoster (Wafa%2c Alicia)

  • 1. Literature Cited Preliminary Results Introduction What are transposable elements? A transposable element (TE) is a segment of DNA that can copy itself and/or move from place to place in a genome. There are many types of TEs, or transposons, as they are also called, and they range in length according to their type. Transposable elements can be divided into two major classes by mechanism of action. Class 1 transposons follow a “copy and paste” mechanism, leaving multiple copies of themselves throughout the genome they occupy. Class 2 transposons follow a “cut and paste” mechanism, moving around the genome, but maintaining only a single copy at any given time. Acknowledgments Transposon Analysis in the Aardvark Genome Alicia Wafa, Sarah Mangum, Laura Blanco-Berdugo, Roy Platt II, David Ray Methods Purpose What does TE analysis tell us? By looking at the types, ages, and amounts of transposable element in a species, we can identify patterns of evolution within the genomes of the organisms of interest. Traditional Transposable Element Analysis: Homology Preliminary transposable element analysis using homology usually includes comparing the species to a previously analyzed relative. This method does not allow identification of TEs not previously identified in the relative. This method also produces inaccuracies in the aging of transposable elements, as well as their numbers and types. A Better Way: De Novo Analysis and Manual Curation By beginning “from scratch”, modeling and identifying TEs in a genome with the use of certain computer programs (a de novo analysis), there is no need to rely on homology. By supplementing this framework with manual curation done by the researcher, full TE consensus sequences can be reconstructed from the fragments often identified from the programs used. Implementing both of these techniques allows us to create a much more accurate data set from which to draw conclusions about what forces have influenced the species’ genomes and events in their evolution. Figure 1. Classes and types of transposable elements and their movement through the genome. A bigger problem The data produced from homology based analysis does more than skew the accuracy of one paper. It presents a false foundation upon which to base hypotheses related to genome structure and function. Not only can this data distort conclusions about this species, but if it is used to run homology-dependent analysis of related species, it can skew assumption about the evolutionary history of whole clades. Discussion A shifting landscape Multiple rounds of manual curation have been completed but the project is still ongoing. However, results from the first three rounds indicate that the de novo annotation is yielding a distinct picture of the genome. For example, as seen in figures 3 and 5, a homology-based analysis suggests that there has been little SINE accumulation in the aardvark genome. However, the more accurate de novo annotation suggests that SINEs have accumulated a higher rates than any other category of TE. Other categories exhibit similar shifts. As more iterations of annotation are completed, the divergence between the homology dependent analysis of the aardvark genome and that accomplished with de novo identification and manual curation will increase. At the completion of this project, one may expect to see results similar to those shown below. On the left, the genome of this butterfly has been analyzed using homology alone. The corrected version on the right, accomplished through de novo identification and manual curation, shows how skewed the original data was. This is a common problem, especially when working with species that do not have close relatives for which extensive transposable element analysis has been accomplished. The more genetic distance between two species, the less accurate homology-based annotation becomes. Figure 6. TE distributions before and after manual curation of the Heliconius Melpomene transposable elements. (C) Homology only based distribution. (F) Distribution after manual curation. Original studies from Platt 2016 and Lavoie 2014.. Chicken Painted Turtle Elephant Shark Lizard Manatee Gharial Panda DogKiller Whale Platypus WallabySloth Gorilla Armadillo Aardvark 0 10 20 30 40 50 60 300 800 1300 1800 2300 2800 3300 3800 GenomeTE%Coverage Genome Size (Mb) Figure 5. Preliminary results of the transposon content of the Aardvark genome after manual curation. Figure 4. TE content in other well studied genomes. Figure 3. Graph of TE content in the aardvark genome using only homology dependent techniques. Special thanks to the Honors College Undergraduate Research Scholars Program supported by the CH and Helen Jones Foundation. Platt, Roy N., Laura Blanco-Berdugo, and David A. Ray. "Accurate transposable element annotation is vital when analyzing new genome assemblies." Genome Biology and Evolution (2016): evw009 Methods (cont) Figure 2. Typical alignment of the 40 most similar elements. (a) beginning of element. (b) end of element. a bManual Annotation Once the sequences have been aligned, it is the job of the researcher to manually identify the beginning and end of an element. If both are present, the researcher classifies the identification of that element as complete. If either end is absent, then the researcher will extend the element until both end are found. 0 0.001 0.002 0.003 0.004 0.005 0.006 0.007 PercentGenomeCoverage Millions of Years others SINE LINE LTR ERV DNA 0 0.002 0.004 0.006 0.008 0.01 0.012 0.014 0.00E+00 4.55E+06 9.09E+06 1.36E+07 1.82E+07 2.27E+07 2.73E+07 3.18E+07 3.64E+07 4.09E+07 4.55E+07 5.00E+07 5.45E+07 5.91E+07 6.36E+07 6.82E+07 7.27E+07 7.73E+07 8.18E+07 8.64E+07 9.09E+07 9.55E+07 1.00E+08 1.05E+08 1.09E+08 1.14E+08 1.18E+08 1.23E+08 1.27E+08 1.32E+08 1.36E+08 1.41E+08 1.45E+08 1.50E+08 1.55E+08 1.59E+08 1.64E+08 1.68E+08 1.73E+08 1.77E+08 1.82E+08 1.86E+08 1.91E+08 1.95E+08 2.00E+08 PercentGenomeCoverage Millions of Years Others SINE LINE LTR ERV DNA