SlideShare a Scribd company logo
Dot plot interpretationDot plot interpretation
Submitted by:
Shweta Kumari
Roll no: 21
M.Sc Bioinformatics
2nd semester
Session: 2014-16
ContentContent

Introduction

Principle

Example

Dot plot interpretation

Analysis of dot plot matrix

Identical sequence

Direct repeat

Inverted repeat

Palindromic sequence

Frame shifts

Low complexity region

Application

Limitation

Dot plot software

References
IntroductionIntroduction

In bioinformatics a dot plot is a graphical method that allows
the comparison of two biological sequences and identify
regions of close similarity between them.

Introduced by GIBBS and MCLNTYE in 1970.

It is the one way to visualize that similarity between two
protein and nucleotide sequences by uses a similarity matrix.
PrinciplePrinciple

Dot plot are two dimensional graphs, showing a comarision of two sequences.

The principle used to generate the dot plot is:
The top X and the left y axes of a rectangular array are used to represent the
two sequences to be compared.

Calculation:
Matrix
• Columns = residues of sequence 1
• Rows = residues of sequence 2

A dot is plotted at every co-ordinate where there is similarity between the bases.
ExampleExample
Seq 1: TWILIGHTZONE
Seq 2: MIDNIGHTZONE
Matrix= 12 * 12

A dot is plotted at every co-ordinate where there is similarity between the
bases.
Dot plot interpretationDot plot interpretation
Seq1: ATGATAT
Seq2: ATGATAT
Analysis of dot plot matrixAnalysis of dot plot matrix

Region of similarity appears as diagonal run of dots.

Principal diagonal shows identical sequence.

Global and local alignment are shown.

Multiple diagonal indicate repeatation

Reverse diagonal (perpendicular to diagonal) indicate
INVERSION.

Reverse diagonal crossing diagonal (X) indicate
PALINDROMES.

Formation of box indicate the low complexity region.
Identical sequenceIdentical sequence

These are the two identical sequences:

Seq1: MALWGRL

Seq2: MALWGRL
Direct repeatDirect repeat
Inverted repeatInverted repeat
An inverted repeat is sequence of nucleotides followed downstream by its
reverse complement.
Inverted repeat: abcdeedcbafghijklmno
Palindromic sequencesPalindromic sequences
A palindromic sequence is a nucleic acid sequence (DNA or
RNA) tha is same whether read 5' to 3' on one strand or 5'
to 3' on the complementary strand with which it forms a
double helix.
Frame shiftsFrame shifts
Frame shifts in a nucleotide
sequence can occur due to
insertions, deletions or
mutations.
1. Deletion of nucleotides
2.Insertion of nucleotides
3.Mutation (out of frame)
Low cmplexity regionLow cmplexity region

Low-complexity regions in sequences can be found as regions around the diagonal all
obtaining a high score. Low complexity regions are calculated from the redundancy of
amino acids within a limited region [Wootton and Federhen,1993].
ApplicationApplication

Shows the all possible alignment between two nucleic acid
and amino acid sequences.

All kind of local and global aligment can be traped.

Help to recognise large region of simiarity.

To find self base pairing of RNA (eg, tRNA) by comparing a
sequence to itself complemented and reverse.

An excellent approach for finding sequence transposition.

To find the location of genes between two genomes.

To find the non sequential alignment.
LimitationLimitation

For longer sequence, memory required for the graphical
representation is very high. So long sequnece can not be
aligned.

Lots of insignifcant matches makes it noisy (so many off
diagonal appear).

Time required to compare two sequences is proportional to
the product of length of the squences time of the search
window.

i.e, higher efficiency of short sequence.

Low efficiency of long sequence.
Dot plot softwareDot plot software

GCG is a commercial software, hence not possible to use all
the time.

Instead of this, we can use the EMBOSS package, which are
followig:

Dotmatcher

Dotpath

Polydot

Dottup
(http://emboss.bioinformatics.nl/cgi-bin/emboss/dottup)
ReferencesReferences
●
Bioinformatics Principal and Applications by Zhumur Ghosh
and Bibekanand Mallick
●
Bioinformatics concepts, skill & applications, second edition by
S.C.Rastogi, Namita Mendriatta, Parag Rastogi

http://en.wikipedia.org/wiki/Dot_plot_%28bioinformatics%29

http://www.code10.info/index.php?option=com_content&view=ar
ticle&id=64:inroduction-to-dot-plots&catid=52:cat_coding_al
gorithms_dot-plots&Itemid=76

http://lectures.molgen.mpg.de/Pairwise/DotPlots/

https://ugene.unipro.ru/wiki/pages/viewpage.action?pageId=4
227426

http://www.clcsupport.com/clcgenomicsworkbench/650/Examples
_interpretations_dot_plots.html
dot plot analysis

More Related Content

What's hot

Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
ALLIENU
 
Blast
BlastBlast
Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins
Vijay Hemmadi
 
BLAST
BLASTBLAST
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicshemantbreeder
 
Genome sequencing
Genome sequencingGenome sequencing
Genome sequencing
Shital Pal
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
nadeem akhter
 
Dot matrix
Dot matrixDot matrix
Dot matrix
Tania Khan
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
Karan Veer Singh
 
Cath
CathCath
Cath
Ramya S
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
Vijay Hemmadi
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Subhranil Bhattacharjee
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
Yogesh Joshi
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
Siva Dharshini R
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
Vidya Kalaivani Rajkumar
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
prashant tripathi
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Ramya S
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
Thapar Institute of Engineering & Technology, Patiala, Punjab, India
 
Biological database
Biological databaseBiological database
Biological database
Iqbal college Peringammala TVM
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
karamveer prajapat
 

What's hot (20)

Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Blast
BlastBlast
Blast
 
Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins
 
BLAST
BLASTBLAST
BLAST
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Genome sequencing
Genome sequencingGenome sequencing
Genome sequencing
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Dot matrix
Dot matrixDot matrix
Dot matrix
 
Genome annotation 2013
Genome annotation 2013Genome annotation 2013
Genome annotation 2013
 
Cath
CathCath
Cath
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
NCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology InformationNCBI National Center for Biotechnology Information
NCBI National Center for Biotechnology Information
 
Biological database
Biological databaseBiological database
Biological database
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
 

Viewers also liked

Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformaticsavrilcoghlan
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignmentavrilcoghlan
 
Multiple alignment
Multiple alignmentMultiple alignment
Multiple alignmentavrilcoghlan
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
Nikesh Narayanan
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
Kubuldinho
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
ammar kareem
 
paper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular proteinpaper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular protein
ShwetA Kumari
 
Final shweta protein kinase cascade
Final shweta protein kinase cascadeFinal shweta protein kinase cascade
Final shweta protein kinase cascade
ShwetA Kumari
 
modelling assignment
modelling assignmentmodelling assignment
modelling assignment
ShwetA Kumari
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
goneshwar
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Malla Reddy College of Pharmacy
 
Shweta ppt I1FFL
Shweta ppt I1FFLShweta ppt I1FFL
Shweta ppt I1FFL
ShwetA Kumari
 
Aminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and PyrrolysineAminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and Pyrrolysine
ShwetA Kumari
 
Distance measure between two biological sequences
Distance  measure between  two biological  sequences Distance  measure between  two biological  sequences
Distance measure between two biological sequences
ShwetA Kumari
 
protein kinase cascade
protein kinase cascadeprotein kinase cascade
protein kinase cascade
ShwetA Kumari
 
Data mining
Data miningData mining
Data mining
ShwetA Kumari
 
Genetic drift
Genetic drift Genetic drift
Genetic drift
ShwetA Kumari
 
Natural selection
Natural selection Natural selection
Natural selection
ShwetA Kumari
 
B.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentB.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignment
Rai University
 
Social Network ส่งท้ายปี 2014
Social Network ส่งท้ายปี 2014Social Network ส่งท้ายปี 2014
Social Network ส่งท้ายปี 2014
Thanawat Boontan
 

Viewers also liked (20)

Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformatics
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignment
 
Multiple alignment
Multiple alignmentMultiple alignment
Multiple alignment
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
paper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular proteinpaper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular protein
 
Final shweta protein kinase cascade
Final shweta protein kinase cascadeFinal shweta protein kinase cascade
Final shweta protein kinase cascade
 
modelling assignment
modelling assignmentmodelling assignment
modelling assignment
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Shweta ppt I1FFL
Shweta ppt I1FFLShweta ppt I1FFL
Shweta ppt I1FFL
 
Aminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and PyrrolysineAminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and Pyrrolysine
 
Distance measure between two biological sequences
Distance  measure between  two biological  sequences Distance  measure between  two biological  sequences
Distance measure between two biological sequences
 
protein kinase cascade
protein kinase cascadeprotein kinase cascade
protein kinase cascade
 
Data mining
Data miningData mining
Data mining
 
Genetic drift
Genetic drift Genetic drift
Genetic drift
 
Natural selection
Natural selection Natural selection
Natural selection
 
B.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentB.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignment
 
Social Network ส่งท้ายปี 2014
Social Network ส่งท้ายปี 2014Social Network ส่งท้ายปี 2014
Social Network ส่งท้ายปี 2014
 

Similar to dot plot analysis

A tutorial in Connectome Analysis (1) - Marcus Kaiser
A tutorial in Connectome Analysis (1) - Marcus KaiserA tutorial in Connectome Analysis (1) - Marcus Kaiser
A tutorial in Connectome Analysis (1) - Marcus Kaiser
Lake Como School of Advanced Studies
 
Sequence alignment
Sequence alignmentSequence alignment
Computational Analysis with ICM
Computational Analysis with ICMComputational Analysis with ICM
Computational Analysis with ICMVernon D Dutch Jr
 
Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105Robin Gutell
 
Urop poster 2014 benson and mat 5 13 14 (2)
Urop poster 2014 benson and mat 5 13 14 (2)Urop poster 2014 benson and mat 5 13 14 (2)
Urop poster 2014 benson and mat 5 13 14 (2)
Mathew Shum
 
Disintegration of the small world property with increasing diversity of chemi...
Disintegration of the small world property with increasing diversity of chemi...Disintegration of the small world property with increasing diversity of chemi...
Disintegration of the small world property with increasing diversity of chemi...
N. Sukumar
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
prateek kumar
 
On the identifiability of phylogenetic networks under a pseudolikelihood model
On the identifiability of phylogenetic networks under a pseudolikelihood modelOn the identifiability of phylogenetic networks under a pseudolikelihood model
On the identifiability of phylogenetic networks under a pseudolikelihood model
Arrigo Coen
 
Application of graph theory in drug design
Application of graph theory in drug designApplication of graph theory in drug design
Application of graph theory in drug designReihaneh Safavi
 
Bio process
Bio processBio process
Bio processsun777
 
Bio process
Bio processBio process
Bio processsun777
 
20080110 Genome exploration in A-T G-C space: an introduction to DNA walking
20080110 Genome exploration in A-T G-C space: an introduction to DNA walking20080110 Genome exploration in A-T G-C space: an introduction to DNA walking
20080110 Genome exploration in A-T G-C space: an introduction to DNA walking
Jonathan Blakes
 
Seq alignment
Seq alignment Seq alignment
Seq alignment
Nagendrasahu6
 
MULTI RESOLUTION LATTICE DISCRETE FOURIER TRANSFORM (MRL-DFT)
MULTI RESOLUTION LATTICE DISCRETE FOURIER TRANSFORM (MRL-DFT)MULTI RESOLUTION LATTICE DISCRETE FOURIER TRANSFORM (MRL-DFT)
MULTI RESOLUTION LATTICE DISCRETE FOURIER TRANSFORM (MRL-DFT)
csandit
 
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphsUse of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
csandit
 
jin-HMG2014-post
jin-HMG2014-postjin-HMG2014-post
jin-HMG2014-postJin Yu
 
intracell-networks.ppt
intracell-networks.pptintracell-networks.ppt
intracell-networks.ppt
Khush318896
 
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Jonathan Eisen
 
Aligning Subunits of Internally Symmetric Proteins with CE-Symm
Aligning Subunits of Internally Symmetric Proteins with CE-SymmAligning Subunits of Internally Symmetric Proteins with CE-Symm
Aligning Subunits of Internally Symmetric Proteins with CE-Symm
Spencer Bliven
 

Similar to dot plot analysis (20)

A tutorial in Connectome Analysis (1) - Marcus Kaiser
A tutorial in Connectome Analysis (1) - Marcus KaiserA tutorial in Connectome Analysis (1) - Marcus Kaiser
A tutorial in Connectome Analysis (1) - Marcus Kaiser
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Computational Analysis with ICM
Computational Analysis with ICMComputational Analysis with ICM
Computational Analysis with ICM
 
Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105
 
Urop poster 2014 benson and mat 5 13 14 (2)
Urop poster 2014 benson and mat 5 13 14 (2)Urop poster 2014 benson and mat 5 13 14 (2)
Urop poster 2014 benson and mat 5 13 14 (2)
 
Disintegration of the small world property with increasing diversity of chemi...
Disintegration of the small world property with increasing diversity of chemi...Disintegration of the small world property with increasing diversity of chemi...
Disintegration of the small world property with increasing diversity of chemi...
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
 
Bloom Lab Summary
Bloom Lab SummaryBloom Lab Summary
Bloom Lab Summary
 
On the identifiability of phylogenetic networks under a pseudolikelihood model
On the identifiability of phylogenetic networks under a pseudolikelihood modelOn the identifiability of phylogenetic networks under a pseudolikelihood model
On the identifiability of phylogenetic networks under a pseudolikelihood model
 
Application of graph theory in drug design
Application of graph theory in drug designApplication of graph theory in drug design
Application of graph theory in drug design
 
Bio process
Bio processBio process
Bio process
 
Bio process
Bio processBio process
Bio process
 
20080110 Genome exploration in A-T G-C space: an introduction to DNA walking
20080110 Genome exploration in A-T G-C space: an introduction to DNA walking20080110 Genome exploration in A-T G-C space: an introduction to DNA walking
20080110 Genome exploration in A-T G-C space: an introduction to DNA walking
 
Seq alignment
Seq alignment Seq alignment
Seq alignment
 
MULTI RESOLUTION LATTICE DISCRETE FOURIER TRANSFORM (MRL-DFT)
MULTI RESOLUTION LATTICE DISCRETE FOURIER TRANSFORM (MRL-DFT)MULTI RESOLUTION LATTICE DISCRETE FOURIER TRANSFORM (MRL-DFT)
MULTI RESOLUTION LATTICE DISCRETE FOURIER TRANSFORM (MRL-DFT)
 
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphsUse of eigenvalues and eigenvectors to analyze bipartivity of network graphs
Use of eigenvalues and eigenvectors to analyze bipartivity of network graphs
 
jin-HMG2014-post
jin-HMG2014-postjin-HMG2014-post
jin-HMG2014-post
 
intracell-networks.ppt
intracell-networks.pptintracell-networks.ppt
intracell-networks.ppt
 
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from UnculturedMicrobial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
Microbial Phylogenomics (EVE161) Class 17: Genomes from Uncultured
 
Aligning Subunits of Internally Symmetric Proteins with CE-Symm
Aligning Subunits of Internally Symmetric Proteins with CE-SymmAligning Subunits of Internally Symmetric Proteins with CE-Symm
Aligning Subunits of Internally Symmetric Proteins with CE-Symm
 

Recently uploaded

The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
ssuserbfdca9
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
silvermistyshot
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
Richard Gill
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 

Recently uploaded (20)

The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
 
Lateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensiveLateral Ventricles.pdf very easy good diagrams comprehensive
Lateral Ventricles.pdf very easy good diagrams comprehensive
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 

dot plot analysis

  • 1. Dot plot interpretationDot plot interpretation Submitted by: Shweta Kumari Roll no: 21 M.Sc Bioinformatics 2nd semester Session: 2014-16
  • 2. ContentContent  Introduction  Principle  Example  Dot plot interpretation  Analysis of dot plot matrix  Identical sequence  Direct repeat  Inverted repeat  Palindromic sequence  Frame shifts  Low complexity region  Application  Limitation  Dot plot software  References
  • 3. IntroductionIntroduction  In bioinformatics a dot plot is a graphical method that allows the comparison of two biological sequences and identify regions of close similarity between them.  Introduced by GIBBS and MCLNTYE in 1970.  It is the one way to visualize that similarity between two protein and nucleotide sequences by uses a similarity matrix.
  • 4. PrinciplePrinciple  Dot plot are two dimensional graphs, showing a comarision of two sequences.  The principle used to generate the dot plot is: The top X and the left y axes of a rectangular array are used to represent the two sequences to be compared.  Calculation: Matrix • Columns = residues of sequence 1 • Rows = residues of sequence 2  A dot is plotted at every co-ordinate where there is similarity between the bases.
  • 5. ExampleExample Seq 1: TWILIGHTZONE Seq 2: MIDNIGHTZONE Matrix= 12 * 12  A dot is plotted at every co-ordinate where there is similarity between the bases.
  • 6. Dot plot interpretationDot plot interpretation Seq1: ATGATAT Seq2: ATGATAT
  • 7. Analysis of dot plot matrixAnalysis of dot plot matrix  Region of similarity appears as diagonal run of dots.  Principal diagonal shows identical sequence.  Global and local alignment are shown.  Multiple diagonal indicate repeatation  Reverse diagonal (perpendicular to diagonal) indicate INVERSION.  Reverse diagonal crossing diagonal (X) indicate PALINDROMES.  Formation of box indicate the low complexity region.
  • 8. Identical sequenceIdentical sequence  These are the two identical sequences:  Seq1: MALWGRL  Seq2: MALWGRL
  • 10. Inverted repeatInverted repeat An inverted repeat is sequence of nucleotides followed downstream by its reverse complement. Inverted repeat: abcdeedcbafghijklmno
  • 11. Palindromic sequencesPalindromic sequences A palindromic sequence is a nucleic acid sequence (DNA or RNA) tha is same whether read 5' to 3' on one strand or 5' to 3' on the complementary strand with which it forms a double helix.
  • 12. Frame shiftsFrame shifts Frame shifts in a nucleotide sequence can occur due to insertions, deletions or mutations. 1. Deletion of nucleotides 2.Insertion of nucleotides 3.Mutation (out of frame)
  • 13. Low cmplexity regionLow cmplexity region  Low-complexity regions in sequences can be found as regions around the diagonal all obtaining a high score. Low complexity regions are calculated from the redundancy of amino acids within a limited region [Wootton and Federhen,1993].
  • 14. ApplicationApplication  Shows the all possible alignment between two nucleic acid and amino acid sequences.  All kind of local and global aligment can be traped.  Help to recognise large region of simiarity.  To find self base pairing of RNA (eg, tRNA) by comparing a sequence to itself complemented and reverse.  An excellent approach for finding sequence transposition.  To find the location of genes between two genomes.  To find the non sequential alignment.
  • 15. LimitationLimitation  For longer sequence, memory required for the graphical representation is very high. So long sequnece can not be aligned.  Lots of insignifcant matches makes it noisy (so many off diagonal appear).  Time required to compare two sequences is proportional to the product of length of the squences time of the search window.  i.e, higher efficiency of short sequence.  Low efficiency of long sequence.
  • 16. Dot plot softwareDot plot software  GCG is a commercial software, hence not possible to use all the time.  Instead of this, we can use the EMBOSS package, which are followig:  Dotmatcher  Dotpath  Polydot  Dottup (http://emboss.bioinformatics.nl/cgi-bin/emboss/dottup)
  • 17. ReferencesReferences ● Bioinformatics Principal and Applications by Zhumur Ghosh and Bibekanand Mallick ● Bioinformatics concepts, skill & applications, second edition by S.C.Rastogi, Namita Mendriatta, Parag Rastogi  http://en.wikipedia.org/wiki/Dot_plot_%28bioinformatics%29  http://www.code10.info/index.php?option=com_content&view=ar ticle&id=64:inroduction-to-dot-plots&catid=52:cat_coding_al gorithms_dot-plots&Itemid=76  http://lectures.molgen.mpg.de/Pairwise/DotPlots/  https://ugene.unipro.ru/wiki/pages/viewpage.action?pageId=4 227426  http://www.clcsupport.com/clcgenomicsworkbench/650/Examples _interpretations_dot_plots.html