Raunak Shrestha
27th October 2011
Source:
Angiuoli SV, Hotopp JC, Salzberg SL, Tettelin H. Improving pan-genome annotation
using whole genome multiple alignment. BMC Bioinformatics. 2011 Jun 30;12:272.
Background
• Describing the genetic
diversity of an organism
is difficult on the basis
of a single reference genome
• Pan-genomes
• reveal greater intra-specific
genetic variation, even among
closely related strains
• To aid gene prediction and
annotation, the genome
sequences of several
closely related strains
are required
http://en.wikipedia.org/wiki/File:Pan-genome-graphics.png
Background
Schnoes et al., 2009
The change in
misannotation over time
in the NR database for
the 37 families
investigated.
Mugsy-Annotator (http://mugsy.sf.net)
• Steps:
1. aligning multiple whole genomes,
2. mapping orthologs among the genomes,
3. identifying annotation anomalies
• Objectives:
1) identifying orthologs and
2) evaluating the quality of
annotated gene
structures in prokaryotic
genomes.
Determining Orthologs
• Identifies orthologs on the basis of whole genome alignment
(WGA), sequence position and sequence length.
• Expects one segment per organism in the whole genome
alignment.
• For segmental duplications:
• It reports separate ortholog groups for each copy only if the whole
genome alignment identifies orthologous copies in other
genomes
• Otherwise, it does not recognize the duplication and groups the
copies under a single ortholog group
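The position-based idea above can be illustrated with a small sketch. This is not Mugsy-Annotator's actual algorithm: it assumes ungapped, forward-strand alignment blocks with one segment per genome, and the names `Gene`, `Segment`, `project`, `group_orthologs` and the `min_cov` threshold are all hypothetical.

```python
from typing import NamedTuple


class Gene(NamedTuple):
    genome: str
    name: str
    start: int  # 0-based, inclusive
    end: int    # exclusive


class Segment(NamedTuple):
    genome: str
    start: int
    end: int


def project(gene, seg):
    """Return the gene interval in block coordinates, or None if it misses the segment."""
    lo = max(gene.start, seg.start) - seg.start
    hi = min(gene.end, seg.end) - seg.start
    return (lo, hi) if hi > lo else None


def group_orthologs(blocks, genes, min_cov=0.5):
    """Group genes from different genomes that occupy the same aligned position.

    blocks: list of alignment blocks, each a list with one Segment per genome.
    Assumes ungapped blocks, so block coordinates are comparable across genomes.
    """
    groups = []
    for block in blocks:
        block_groups = []  # pairs of (representative interval, member genes)
        for seg in block:
            for gene in genes:
                if gene.genome != seg.genome:
                    continue
                iv = project(gene, seg)
                if iv is None:
                    continue
                for rep, members in block_groups:
                    shared = min(iv[1], rep[1]) - max(iv[0], rep[0])
                    longest = max(iv[1] - iv[0], rep[1] - rep[0])
                    # Position and length must both agree for the ratio to be high.
                    if shared / longest >= min_cov:
                        members.append(gene)
                        break
                else:
                    block_groups.append((iv, [gene]))
        # Keep only groups that span more than one genome.
        groups.extend(m for _, m in block_groups if len({g.genome for g in m}) > 1)
    return groups
```

Because the grouping is driven by where a gene sits in the alignment, a duplicated copy that is not aligned to a separate segment never receives a group of its own, which mirrors the limitation noted on the slide.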
Identification of annotation inconsistencies
• Evaluates start codons, stop codons and translation initiation
sites (TIS)
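A minimal sketch of such a consistency check, assuming the start and stop of each gene in an ortholog group have already been mapped to alignment columns. The function name `boundary_report` and the input layout are hypothetical and are not taken from Mugsy-Annotator.

```python
from collections import Counter


def boundary_report(group):
    """Summarise start/stop agreement within one ortholog group.

    group: dict mapping genome name -> (start_column, stop_column),
    where columns refer to positions in the whole genome alignment.
    """
    starts = Counter(start for start, _ in group.values())
    stops = Counter(stop for _, stop in group.values())
    majority_start = starts.most_common(1)[0][0]
    majority_stop = stops.most_common(1)[0][0]
    return {
        "consistent_start": len(starts) == 1,
        "consistent_stop": len(stops) == 1,
        "majority_start": majority_start,
        "majority_stop": majority_stop,
        # Genomes whose annotated start differs from the most common one.
        "start_outliers": sorted(
            genome for genome, (start, _) in group.items() if start != majority_start
        ),
    }
```

A group whose members share a stop column but disagree on the start column is the typical candidate for a TIS re-annotation.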
Data set
• Neisseria meningitidis (Nmen) dataset of 20 genomes
• Nmen verA contained 13 genomes
• Nmen verB contained 7 genomes
• Annotation pipeline differs between Nmen verA and Nmen verB
• Genome datasets for 9 other bacterial species from the RefSeq
database.
Comparison of the groups of
orthologs for 20 Nmen genomes
• Among the genes reported exclusively by one method
• intra-genome BLASTP matches indicate that many of these genes are
paralogs (40% for Mugsy-Annotator and 60% for OrthoMCL)
• Some have functional names that indicate transposases
• Some are hypothetical proteins
• The paper claims that OrthoMCL clusters paralogs and orthologs into a
single group
Run Time Performance
• Nmen dataset of 20 genomes
• Mugsy-Annotator pipeline: ~4 h on a single CPU
• ~2 h for WGA with Mugsy
• ~2 h for comparing annotations with Mugsy-Annotator
• OrthoMCL: ~32 CPU hours
• The WGA method is computationally efficient and has a significant
runtime advantage over BLAST-based OrthoMCL
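As a quick check of the figures above, the ratio can be worked out directly. The inputs are the approximate values quoted on this slide, so the result is only a rough estimate; the variable names are illustrative.

```python
# Approximate figures quoted on the slide (hours on a single CPU / CPU hours).
mugsy_wga_h = 2.0
mugsy_annotator_h = 2.0
orthomcl_cpu_h = 32.0

total_h = mugsy_wga_h + mugsy_annotator_h  # whole WGA-based pipeline
speedup = orthomcl_cpu_h / total_h         # ratio of CPU time spent

print(f"WGA pipeline: ~{total_h:.0f} h, OrthoMCL: ~{orthomcl_cpu_h:.0f} CPU h, "
      f"ratio ~{speedup:.0f}x")
```

On these numbers the WGA-based pipeline uses roughly one eighth of the CPU time.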
Consistency of annotated gene structures in several
species pan-genomes as reported by Mugsy-Annotator
Improving annotation consistency
• When the TIS is
inconsistent, Mugsy-Annotator
suggests alternative
gene structures that
improve annotation
consistency
• Strategy: look for
a conserved TIS in
close proximity to the
previously annotated
TIS
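The strategy can be sketched as a search for alignment columns near the annotated starts where every genome carries a start codon. This is an illustration only, not the tool's implementation: it assumes a gapless alignment of equal-length sequences on the forward strand, and the function name, the `window` parameter and the start codon set are assumptions.

```python
START_CODONS = {"ATG", "GTG", "TTG"}  # common bacterial start codons (assumed set)


def conserved_tis_candidates(aligned, annotated, window=30):
    """Find columns where all genomes have an in-frame start codon.

    aligned:   dict genome -> aligned sequence (equal lengths, no gaps assumed)
    annotated: dict genome -> column of the currently annotated start
    Returns (column, support) pairs, where support is the number of genomes
    already annotated at that column, sorted by support and then by column.
    """
    genomes = list(aligned)
    length = len(aligned[genomes[0]])
    if any(len(aligned[g]) != length for g in genomes):
        raise ValueError("aligned sequences must have equal length")

    lo = max(0, min(annotated.values()) - window)
    hi = min(length - 3, max(annotated.values()) + window)

    candidates = []
    for col in range(lo, hi + 1):
        # Stay in the reading frame of every existing annotation.
        in_frame = all((col - annotated[g]) % 3 == 0 for g in genomes)
        has_start = all(aligned[g][col:col + 3].upper() in START_CODONS for g in genomes)
        if in_frame and has_start:
            support = sum(1 for g in genomes if annotated[g] == col)
            candidates.append((col, support))
    return sorted(candidates, key=lambda c: (-c[1], c[0]))
```

A candidate column that differs from a genome's annotated start is then a possible alternative gene structure for that genome.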
Conclusion
• Aids in identifying and comparing gene content across a
pan-genome
• Aids annotation and re-annotation of genes within a
pan-genome rather than in a single genome
• The study demonstrates significant variation in annotation,
primarily due to the different bioinformatics approaches
used rather than true biological variation
• Mugsy-Annotator: an efficient, accurate method for finding
orthologs within a pan-genome
• Mugsy (WGA approach) is computationally efficient compared
to BLAST-based approaches for finding orthologs
Critique
• Mugsy-Annotator requires pre-predicted annotation
information and is therefore not an independent annotation
tool
• Mugsy-Annotator still has difficulty identifying
segmental duplications and paralogs
• It would have been better if the authors had measured
the performance of Mugsy-Annotator on pan-genome
datasets with larger evolutionary distances.
QUESTIONS?

Editor's Notes

  1. OrthoMCL: a popular BLAST-based method that clusters Reciprocal Best BLAST (RBB) matches between conceptual translations of genes to identify orthologs.
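
The reciprocal-best-match step mentioned in this note can be sketched as follows. This shows only the RBB pairing, not OrthoMCL itself, which goes on to cluster the matches; the function name and the `(query, subject, bitscore)` input layout are assumptions made for illustration.

```python
def reciprocal_best_hits(hits_ab, hits_ba):
    """Return (a, b) pairs where a's best hit is b and b's best hit is a.

    hits_ab: iterable of (query_in_A, subject_in_B, bitscore)
    hits_ba: iterable of (query_in_B, subject_in_A, bitscore)
    """
    def best(hits):
        top = {}
        for query, subject, score in hits:
            # Keep the highest-scoring subject for each query.
            if query not in top or score > top[query][1]:
                top[query] = (subject, score)
        return {query: subject for query, (subject, _) in top.items()}

    best_ab = best(hits_ab)
    best_ba = best(hits_ba)
    return sorted((a, b) for a, b in best_ab.items() if best_ba.get(b) == a)
```

In the toy case where `a2` matches `b2` best but `b2` prefers `a1`, the pair is dropped, which shows why paralogs often fail the reciprocal test.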