SlideShare a Scribd company logo
Topic: COGs and
Comparative Genomics
Durdam M.Sc. Bioinformatics sem-2
Some important terminologies:
Orthologs are genes in different species that evolved from a
common ancestral gene by speciation. Normally, orthologs retain the
same function in the course of evolution. Identification of orthologs
is critical for reliable prediction of gene function in newly sequenced
genomes.
Paralogs are genes related by duplication within a genome.
Orthologs retain the same function in the course of evolution,
whereas paralogs evolve new functions, even if these are related to
the original one.
Speciation is the origin of a new species capable of making a living
in a new way from the species from which it arose. As part of this
process it has also acquired some barrier to genetic exchange with
the parent species.
Speciation
COGs
• Cluster of orthologous genes.
• Clusters of Orthologous Groups, are groups of three or more ortholog
genes, meaning they are direct evolutionary counter parts and are
considered to be part of an 'ancient conserved domain'. A COG is defined
as three or more proteins from the genomes of distant species that are
more similar to each other than to any other protein within the individual
genome.
• COGs can be used to predict the function of homologous proteins in
poorly studied species and can also be used to track the evolutionary
divergence from a common ancestor, hence providing a powerful tool for
functional annotation of uncharacterized proteins.
• Important in comparative genomics studies
Application of COG
• The most straightforward application of the COGs is for the prediction of
functions of individual proteins or protein sets, including those from newly
completed genomes.
NCBI provides a COG database that consists of 4,873 COGs that code for over
136,000 proteins from the genomes of 50 bacteria, 13 archaea and 3 unicellular
eukaryotes. This database uses completely sequenced genomes to classify
proteins using the orthology concept.
The COG database
What are some questions that comparative
genomics can address?
• How has the organism evolved?
• What differentiates species?
• Which non-coding regions are important?
• Which genes are required for organisms to survive in a
certain environment?
What is Comparative Genomics?
It is the comparison of one genome to another.
Genomics DNA (Gene)
Functional
Genomics
Transcriptomics RNA
Proteomics PROTEIN
Metabolomics METABOLITE
Transcription
Translation
Enzymatic
reaction
Difference is in Scale and Direction
One or several genes
compared against all
other known genes.
Use genome to
inform us about the
entire organism.
Use information
from many
genomes to learn
more about the
individual genes.
Entire Genome
compared to
other entire
genomes.
Other “omics” Comparative
Comparative genomics
• Discover what lies hidden in genomic
sequence by comparing sequence
information.
• Main areas
– Whole genome alignment
– Gene prediction
– Regulatory element prediction
– Phylogenomics
– Pharmacogenetics
Comparative Genomics
Comparative genomics is a powerful tool for identifying the features and dissecting
the functions of genomes. The approach is based on selection for the gene or
regulatory region constraining the evolution of the sequence. Comparison with other
genomes has become an integral part of the analysis of the human genome sequence
and is one of the most effective methods for identifying genes (Batzoglou et al. ,
2000; Roest Crollius et al. , 2000)
Comparative genomics is a field of biological research in which the genomic features of
different organisms are compared. The genomic features may include the DNA sequence,
genes, gene order, regulatory sequences, and other genomic structural landmarks
Figure: Species tree of different organisms
Figure: Distribution and clustering of orthologous genes of Tulsi genome to other related plant
genomes. a. Distribution of gene families among five plant genomes. Ocimum tenuiflorum (Ote
- green), Arabidopsis thaliana (Ath – black rectangle), Oryza sativa (Osa – red), Solanum
lycopersicum (Sly – blue) and Mimulus guttatus (Mgu – black circle). The numbers in the Venn
diagram represent shared and unique gene families across these 5 species obtained by
OrthoMCL.
b. Horizontal stacked bar plot of all the genes in 23 different genomes. This figure shows
ortholog group distribution in all 23 plant species including Tulsi. Each row represents a plant
species - Physcomitrella patens (Ppa), Selaginella moellendorffii (Smo), Oryza sativa (Osa),
Setaria italic (Sit), Zea mays (Zma), Sorghum bicolor (Sbi), Aquilegia caerulea (Aca), Ocimum
tenuiflorum (Ote), Mimulus guttatus (Mgu), Solanum lycopersicum (Sly), Solanum tuberosum
(Stu), Vitis vinifera (Vvi), Eucalyptus grandis (Egr), Citrus sinensis (Csi), Theobroma cacao (Tca),
Carica papaya (Cpa), Brassica rapa (Bra), Arabidopsis thaliana (Ath), Fragaria vesca (Fve), Prunus
persica (Ppe), Glycine max (Gma), Medicago truncatula (Mtr), Populus trichocarpa (Ptr). The bar
graph represents ortholog protein groups for that species subdivided into 22 categories
depending on the degree of sharing with the other 22 plant species e.g., category 2 represents
the number of orthologous groups that have representatives from the species of interest and
from one more species out of the 23 species selected for the study
Background: Shortly after multiple
genome sequences of bacteria, archae
and unicellular eukaryotes became
available, an attempt on such a
classification was implemented in
Cluster of Orthologous Groups of
proteins (COGs). Rapid accumulation of
genome sequences creates
opportunities for refining COGs but also
represents a challenge because of error
amplification.
Conclusion: The arCOGs provide a
convenient, flexible framework for
functional annotation of archael
genomes, comparative genomics and
evolutionary reconstructions. Genomic
reconstructions suggest that the last
common ancestor of archaea might
have been (nearly) as advanced as the
modern archael hyperthermophiles.
For more info:
ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.
MBGD Database
MBGD is a database for comparative analysis of completely sequenced microbial
genomes, the number of which is now growing rapidly. The aim of MBGD is to
facilitate comparative genomics from various points of view such as ortholog
identification, paralog clustering, motif analysis and gene order comparison.
Conclusion
The study of Cluster of Orthologous
Genes play a vital role in the
Comparative genomic studies.
References and links
• NCBI COGs database
• Chapter 22 of the NCBI handbook: The Clusters of Orthologous Groups (COGs)
Database: Phylogenetic Classification of Proteins from Complete Genomes. NCBI
Bookshelf ID: NBK21101.
• NCBI News Letter: Protein Families and Genome Evolution. Published Feb 1998.
• http://homepage.usask.ca/~ctl271/857/def_homolog.shtml
• http://biologydirect.biomedcentral.com/articles/10.1186/1745-6150-2-33
• Nucleic Acids Res. 2015 Jan;43(Database issue):D261-9. doi: 10.1093/nar/gku1223.
Epub 2014 Nov 26.
• http://www.ncbi.nlm.nih.gov/pubmed/25428365
• http://biologydirect.biomedcentral.com/articles/10.1186/1745-6150-2-33
THANK YOU

More Related Content

What's hot

Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)
IndrajaDoradla
 
Functional proteomics, methods and tools
Functional proteomics, methods and toolsFunctional proteomics, methods and tools
Functional proteomics, methods and tools
KAUSHAL SAHU
 
Genomic databases
Genomic databasesGenomic databases
Genomic databases
DrSatyabrataSahoo
 
Scop database
Scop databaseScop database
Scop database
Sayantani Roy
 
Cath
CathCath
Cath
Ramya S
 
Gene prediction method
Gene prediction method Gene prediction method
Gene prediction method
Nusrat Gulbarga
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
Karan Veer Singh
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
YEAST TWO HYBRID SYSTEM
 YEAST TWO HYBRID SYSTEM YEAST TWO HYBRID SYSTEM
YEAST TWO HYBRID SYSTEM
Md Nahidul Islam
 
Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
Dhananjay Desai
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
Pawan Kumar
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
naveed ul mushtaq
 
2d analysis for protein ppt
2d analysis for protein ppt2d analysis for protein ppt
2d analysis for protein ppt
ANSHULKUMAR218
 
Analysis of gene expression
Analysis of gene expressionAnalysis of gene expression
Analysis of gene expressionTapeshwar Yadav
 
Comparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organellesComparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organelles
KAUSHAL SAHU
 
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGDNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
Puneet Kulyana
 
Proteomics
Proteomics   Proteomics
Proteomics
Mohit Bharti
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
ShivaniShewale2
 
Sts
StsSts

What's hot (20)

Genomics(functional genomics)
Genomics(functional genomics)Genomics(functional genomics)
Genomics(functional genomics)
 
Functional proteomics, methods and tools
Functional proteomics, methods and toolsFunctional proteomics, methods and tools
Functional proteomics, methods and tools
 
Genomic databases
Genomic databasesGenomic databases
Genomic databases
 
Scop database
Scop databaseScop database
Scop database
 
Cath
CathCath
Cath
 
Gene prediction method
Gene prediction method Gene prediction method
Gene prediction method
 
genomic comparison
genomic comparison genomic comparison
genomic comparison
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
YEAST TWO HYBRID SYSTEM
 YEAST TWO HYBRID SYSTEM YEAST TWO HYBRID SYSTEM
YEAST TWO HYBRID SYSTEM
 
Express sequence tags
Express sequence tagsExpress sequence tags
Express sequence tags
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
2d analysis for protein ppt
2d analysis for protein ppt2d analysis for protein ppt
2d analysis for protein ppt
 
Analysis of gene expression
Analysis of gene expressionAnalysis of gene expression
Analysis of gene expression
 
Comparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organellesComparative genomics in eukaryotes, organelles
Comparative genomics in eukaryotes, organelles
 
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCINGDNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
DNA SEQUENCING METHODS AND STRATEGIES FOR GENOME SEQUENCING
 
Proteomics
Proteomics   Proteomics
Proteomics
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 
Sts
StsSts
Sts
 

Viewers also liked

What is comparative genomics
What is comparative genomicsWhat is comparative genomics
What is comparative genomics
Usman Arshad
 
Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentation
Emmanuel Aguon
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAmol Kunde
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomicsNikhil Aggarwal
 
Comparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatComparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 format
sidjena70
 
Comparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerlComparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerl
Jason Stajich
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
kiran singh
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
ajay301
 
Application of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesApplication of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciences
Sobia
 
Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2
Leighton Pritchard
 
Database management system
Database management systemDatabase management system
Database management system
ashishkthakur94
 
Ozonolysis of 2-Butenes
Ozonolysis of 2-ButenesOzonolysis of 2-Butenes
Ozonolysis of 2-Butenes
Mixtli Campos-Pineda
 
Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)
Leighton Pritchard
 
03 Object Dbms Technology
03 Object Dbms Technology03 Object Dbms Technology
03 Object Dbms Technology
Laguna State Polytechnic University
 
Nomenclature
NomenclatureNomenclature
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...CGIAR Generation Challenge Programme
 
Trans 2 butene
Trans 2 buteneTrans 2 butene
Trans 2 buteneimanijc
 
Unit 2 8 Alcohols And Halogenoalkanes Notes
Unit 2 8 Alcohols And Halogenoalkanes NotesUnit 2 8 Alcohols And Halogenoalkanes Notes
Unit 2 8 Alcohols And Halogenoalkanes Notes
M F Ebden
 
rules in naming organic compound
rules in naming organic compoundrules in naming organic compound
rules in naming organic compound
vxiiayah
 

Viewers also liked (20)

What is comparative genomics
What is comparative genomicsWhat is comparative genomics
What is comparative genomics
 
Comparative genomics presentation
Comparative genomics presentationComparative genomics presentation
Comparative genomics presentation
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Comparative genomics and proteomics
Comparative genomics and proteomicsComparative genomics and proteomics
Comparative genomics and proteomics
 
Comparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 formatComparative genomics @ sid 2003 format
Comparative genomics @ sid 2003 format
 
Comparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerlComparative Genomics with GMOD and BioPerl
Comparative Genomics with GMOD and BioPerl
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Application of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesApplication of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciences
 
Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2Comparative Genomics and Visualisation - Part 2
Comparative Genomics and Visualisation - Part 2
 
GRM 2011: Theme 1 -- Comparative genomics
GRM 2011: Theme 1 -- Comparative genomicsGRM 2011: Theme 1 -- Comparative genomics
GRM 2011: Theme 1 -- Comparative genomics
 
Database management system
Database management systemDatabase management system
Database management system
 
Ozonolysis of 2-Butenes
Ozonolysis of 2-ButenesOzonolysis of 2-Butenes
Ozonolysis of 2-Butenes
 
Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)Microbial Genomics and Bioinformatics: BM405 (2015)
Microbial Genomics and Bioinformatics: BM405 (2015)
 
03 Object Dbms Technology
03 Object Dbms Technology03 Object Dbms Technology
03 Object Dbms Technology
 
Nomenclature
NomenclatureNomenclature
Nomenclature
 
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
TLIII: Overview of TLII achievements, lessons and challenges for Phase III – ...
 
Trans 2 butene
Trans 2 buteneTrans 2 butene
Trans 2 butene
 
Unit 2 8 Alcohols And Halogenoalkanes Notes
Unit 2 8 Alcohols And Halogenoalkanes NotesUnit 2 8 Alcohols And Halogenoalkanes Notes
Unit 2 8 Alcohols And Halogenoalkanes Notes
 
rules in naming organic compound
rules in naming organic compoundrules in naming organic compound
rules in naming organic compound
 

Similar to Comparative genomics

Comparative genomics ................pptx
Comparative genomics ................pptxComparative genomics ................pptx
Comparative genomics ................pptx
Cherry
 
Gene order
Gene orderGene order
Gene order
Prasanthperceptron
 
Gene order
Gene orderGene order
Gene order
Prasanthperceptron
 
Genetic fine structure
Genetic fine structureGenetic fine structure
Genetic fine structure
Sujan Karki
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 pptrehman2009
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
Jajati Keshari Nayak
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genometcha163
 
Tools of Genomics
Tools of GenomicsTools of Genomics
Tools of Genomics
Meghaj Mallick
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function prediction
Lars Juhl Jensen
 
Molecular Biology
Molecular BiologyMolecular Biology
Molecular BiologyBealise Sc
 
Comparative transcriptomics
Comparative transcriptomicsComparative transcriptomics
Comparative transcriptomics
Sayak Ghosh
 
BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomics
BITS
 
Plant genomics general overview
Plant genomics general overviewPlant genomics general overview
Plant genomics general overview
KAUSHAL SAHU
 
genomic proteomic changes by Suyash Garg.pptx
genomic proteomic changes by Suyash Garg.pptxgenomic proteomic changes by Suyash Garg.pptx
genomic proteomic changes by Suyash Garg.pptx
suyashempire
 
Basic Formal Ontology (BFO) and Disease
 Basic Formal Ontology (BFO) and Disease Basic Formal Ontology (BFO) and Disease
Basic Formal Ontology (BFO) and Disease
Barry Smith
 
THE human genome
THE human genomeTHE human genome
THE human genome
rokanuzzaman moschus
 
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
DevikaPatel12
 
Bioinformatics A Biased Overview
Bioinformatics A Biased OverviewBioinformatics A Biased Overview
Bioinformatics A Biased Overview
Philip Bourne
 
Molecular systematics.pdf
Molecular systematics.pdfMolecular systematics.pdf
Molecular systematics.pdf
Aartisoni17
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applications
Sham Sadiq
 

Similar to Comparative genomics (20)

Comparative genomics ................pptx
Comparative genomics ................pptxComparative genomics ................pptx
Comparative genomics ................pptx
 
Gene order
Gene orderGene order
Gene order
 
Gene order
Gene orderGene order
Gene order
 
Genetic fine structure
Genetic fine structureGenetic fine structure
Genetic fine structure
 
Chapter 20 ppt
Chapter 20 pptChapter 20 ppt
Chapter 20 ppt
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Plang functional genome
Plang functional genomePlang functional genome
Plang functional genome
 
Tools of Genomics
Tools of GenomicsTools of Genomics
Tools of Genomics
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function prediction
 
Molecular Biology
Molecular BiologyMolecular Biology
Molecular Biology
 
Comparative transcriptomics
Comparative transcriptomicsComparative transcriptomics
Comparative transcriptomics
 
BITS - Introduction to comparative genomics
BITS - Introduction to comparative genomicsBITS - Introduction to comparative genomics
BITS - Introduction to comparative genomics
 
Plant genomics general overview
Plant genomics general overviewPlant genomics general overview
Plant genomics general overview
 
genomic proteomic changes by Suyash Garg.pptx
genomic proteomic changes by Suyash Garg.pptxgenomic proteomic changes by Suyash Garg.pptx
genomic proteomic changes by Suyash Garg.pptx
 
Basic Formal Ontology (BFO) and Disease
 Basic Formal Ontology (BFO) and Disease Basic Formal Ontology (BFO) and Disease
Basic Formal Ontology (BFO) and Disease
 
THE human genome
THE human genomeTHE human genome
THE human genome
 
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
Functional Genomic l Genomes l proteomic l DNA l #genomics #proteomics #scien...
 
Bioinformatics A Biased Overview
Bioinformatics A Biased OverviewBioinformatics A Biased Overview
Bioinformatics A Biased Overview
 
Molecular systematics.pdf
Molecular systematics.pdfMolecular systematics.pdf
Molecular systematics.pdf
 
Metagenomics and it’s applications
Metagenomics and it’s applicationsMetagenomics and it’s applications
Metagenomics and it’s applications
 

More from prateek kumar

Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarray
prateek kumar
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
prateek kumar
 
docking
docking docking
docking
prateek kumar
 
RAPD, AFLP AND RFLP ANALYSIS
RAPD, AFLP AND RFLP ANALYSISRAPD, AFLP AND RFLP ANALYSIS
RAPD, AFLP AND RFLP ANALYSIS
prateek kumar
 
Bhageerath h
Bhageerath  h Bhageerath  h
Bhageerath h
prateek kumar
 
DNA sequencing
DNA sequencingDNA sequencing
DNA sequencing
prateek kumar
 
Genomic variation
Genomic variationGenomic variation
Genomic variation
prateek kumar
 
2 d gel analysis
2 d gel analysis2 d gel analysis
2 d gel analysis
prateek kumar
 
2 d gel analysis
2 d gel analysis2 d gel analysis
2 d gel analysis
prateek kumar
 

More from prateek kumar (9)

Applications of microarray
Applications of microarrayApplications of microarray
Applications of microarray
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
 
docking
docking docking
docking
 
RAPD, AFLP AND RFLP ANALYSIS
RAPD, AFLP AND RFLP ANALYSISRAPD, AFLP AND RFLP ANALYSIS
RAPD, AFLP AND RFLP ANALYSIS
 
Bhageerath h
Bhageerath  h Bhageerath  h
Bhageerath h
 
DNA sequencing
DNA sequencingDNA sequencing
DNA sequencing
 
Genomic variation
Genomic variationGenomic variation
Genomic variation
 
2 d gel analysis
2 d gel analysis2 d gel analysis
2 d gel analysis
 
2 d gel analysis
2 d gel analysis2 d gel analysis
2 d gel analysis
 

Recently uploaded

Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
Celine George
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
GeoBlogs
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
AzmatAli747758
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
Fundacja Rozwoju Społeczeństwa Przedsiębiorczego
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Thiyagu K
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
TechSoup
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
Celine George
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 

Recently uploaded (20)

Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
How to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS ModuleHow to Split Bills in the Odoo 17 POS Module
How to Split Bills in the Odoo 17 POS Module
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...Cambridge International AS  A Level Biology Coursebook - EBook (MaryFosbery J...
Cambridge International AS A Level Biology Coursebook - EBook (MaryFosbery J...
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
Unit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdfUnit 2- Research Aptitude (UGC NET Paper I).pdf
Unit 2- Research Aptitude (UGC NET Paper I).pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup   New Member Orientation and Q&A (May 2024).pdfWelcome to TechSoup   New Member Orientation and Q&A (May 2024).pdf
Welcome to TechSoup New Member Orientation and Q&A (May 2024).pdf
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
 
Supporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptxSupporting (UKRI) OA monographs at Salford.pptx
Supporting (UKRI) OA monographs at Salford.pptx
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 

Comparative genomics

  • 1. Topic: COGs and Comparative Genomics Durdam M.Sc. Bioinformatics sem-2
  • 2. Some important terminologies: Orthologs are genes in different species that evolved from a common ancestral gene by speciation. Normally, orthologs retain the same function in the course of evolution. Identification of orthologs is critical for reliable prediction of gene function in newly sequenced genomes. Paralogs are genes related by duplication within a genome. Orthologs retain the same function in the course of evolution, whereas paralogs evolve new functions, even if these are related to the original one. Speciation is the origin of a new species capable of making a living in a new way from the species from which it arose. As part of this process it has also acquired some barrier to genetic exchange with the parent species.
  • 4. COGs • Cluster of orthologous genes. • Clusters of Orthologous Groups, are groups of three or more ortholog genes, meaning they are direct evolutionary counter parts and are considered to be part of an 'ancient conserved domain'. A COG is defined as three or more proteins from the genomes of distant species that are more similar to each other than to any other protein within the individual genome. • COGs can be used to predict the function of homologous proteins in poorly studied species and can also be used to track the evolutionary divergence from a common ancestor, hence providing a powerful tool for functional annotation of uncharacterized proteins. • Important in comparative genomics studies
  • 5. Application of COG • The most straightforward application of the COGs is for the prediction of functions of individual proteins or protein sets, including those from newly completed genomes. NCBI provides a COG database that consists of 4,873 COGs that code for over 136,000 proteins from the genomes of 50 bacteria, 13 archaea and 3 unicellular eukaryotes. This database uses completely sequenced genomes to classify proteins using the orthology concept. The COG database
  • 6. What are some questions that comparative genomics can address? • How has the organism evolved? • What differentiates species? • Which non-coding regions are important? • Which genes are required for organisms to survive in a certain environment?
  • 7. What is Comparative Genomics? It is the comparison of one genome to another. Genomics DNA (Gene) Functional Genomics Transcriptomics RNA Proteomics PROTEIN Metabolomics METABOLITE Transcription Translation Enzymatic reaction
  • 8. Difference is in Scale and Direction One or several genes compared against all other known genes. Use genome to inform us about the entire organism. Use information from many genomes to learn more about the individual genes. Entire Genome compared to other entire genomes. Other “omics” Comparative
  • 9. Comparative genomics • Discover what lies hidden in genomic sequence by comparing sequence information. • Main areas – Whole genome alignment – Gene prediction – Regulatory element prediction – Phylogenomics – Pharmacogenetics
  • 10. Comparative Genomics Comparative genomics is a powerful tool for identifying the features and dissecting the functions of genomes. The approach is based on selection for the gene or regulatory region constraining the evolution of the sequence. Comparison with other genomes has become an integral part of the analysis of the human genome sequence and is one of the most effective methods for identifying genes (Batzoglou et al. , 2000; Roest Crollius et al. , 2000) Comparative genomics is a field of biological research in which the genomic features of different organisms are compared. The genomic features may include the DNA sequence, genes, gene order, regulatory sequences, and other genomic structural landmarks
  • 11. Figure: Species tree of different organisms
  • 12.
  • 13. Figure: Distribution and clustering of orthologous genes of Tulsi genome to other related plant genomes. a. Distribution of gene families among five plant genomes. Ocimum tenuiflorum (Ote - green), Arabidopsis thaliana (Ath – black rectangle), Oryza sativa (Osa – red), Solanum lycopersicum (Sly – blue) and Mimulus guttatus (Mgu – black circle). The numbers in the Venn diagram represent shared and unique gene families across these 5 species obtained by OrthoMCL. b. Horizontal stacked bar plot of all the genes in 23 different genomes. This figure shows ortholog group distribution in all 23 plant species including Tulsi. Each row represents a plant species - Physcomitrella patens (Ppa), Selaginella moellendorffii (Smo), Oryza sativa (Osa), Setaria italic (Sit), Zea mays (Zma), Sorghum bicolor (Sbi), Aquilegia caerulea (Aca), Ocimum tenuiflorum (Ote), Mimulus guttatus (Mgu), Solanum lycopersicum (Sly), Solanum tuberosum (Stu), Vitis vinifera (Vvi), Eucalyptus grandis (Egr), Citrus sinensis (Csi), Theobroma cacao (Tca), Carica papaya (Cpa), Brassica rapa (Bra), Arabidopsis thaliana (Ath), Fragaria vesca (Fve), Prunus persica (Ppe), Glycine max (Gma), Medicago truncatula (Mtr), Populus trichocarpa (Ptr). The bar graph represents ortholog protein groups for that species subdivided into 22 categories depending on the degree of sharing with the other 22 plant species e.g., category 2 represents the number of orthologous groups that have representatives from the species of interest and from one more species out of the 23 species selected for the study
  • 14. Background: Shortly after multiple genome sequences of bacteria, archae and unicellular eukaryotes became available, an attempt on such a classification was implemented in Cluster of Orthologous Groups of proteins (COGs). Rapid accumulation of genome sequences creates opportunities for refining COGs but also represents a challenge because of error amplification. Conclusion: The arCOGs provide a convenient, flexible framework for functional annotation of archael genomes, comparative genomics and evolutionary reconstructions. Genomic reconstructions suggest that the last common ancestor of archaea might have been (nearly) as advanced as the modern archael hyperthermophiles. For more info: ftp://ftp.ncbi.nih.gov/pub/koonin/arCOGs/.
  • 15.
  • 16. MBGD Database MBGD is a database for comparative analysis of completely sequenced microbial genomes, the number of which is now growing rapidly. The aim of MBGD is to facilitate comparative genomics from various points of view such as ortholog identification, paralog clustering, motif analysis and gene order comparison.
  • 17. Conclusion The study of Cluster of Orthologous Genes play a vital role in the Comparative genomic studies.
  • 18. References and links • NCBI COGs database • Chapter 22 of the NCBI handbook: The Clusters of Orthologous Groups (COGs) Database: Phylogenetic Classification of Proteins from Complete Genomes. NCBI Bookshelf ID: NBK21101. • NCBI News Letter: Protein Families and Genome Evolution. Published Feb 1998. • http://homepage.usask.ca/~ctl271/857/def_homolog.shtml • http://biologydirect.biomedcentral.com/articles/10.1186/1745-6150-2-33 • Nucleic Acids Res. 2015 Jan;43(Database issue):D261-9. doi: 10.1093/nar/gku1223. Epub 2014 Nov 26. • http://www.ncbi.nlm.nih.gov/pubmed/25428365 • http://biologydirect.biomedcentral.com/articles/10.1186/1745-6150-2-33

Editor's Notes

  1. Homology: Is the relationship between biological structures or sequences that are derived from a common ancestry. Two things are homologous if they bear same relationship to one another, such as a certain bone in various forms of the “hand”.