SlideShare a Scribd company logo
1 of 20
Dot plots
Alignment
• In molecular biology, a common question is to
ask whether or not two sequences are related.
• The most common way to tell whether or not
they are related is to compare them to one
another to see if they are similar.
• Biological sequences that are similar (but not exact)
provide useful information to help discover functional,
structural, and evolutionary information.
• One common mistake is to describe two sequences as
having some sort of homology or a percent homology
based on their sequence similarity.
• This is a misuse of the biological term.
• Two sequences in different organisms are homologous if
they have been derived from a common ancestor
sequence.
• However, the greater the sequence similarity, the greater
chance there is that they share similar function and/or
structure.
Biological Definitions for Related
Sequences
• Homologs are similar sequences in two different
organisms that have been derived from a
common ancestor sequence.
– Homologs can be described as either orthologous or
paralogous.
• Orthologs are similar sequences in two different
organisms that have arisen due to a speciation
event.
Orthologs typically retain their functionality
throughout evolution.
• Paralogs are similar sequences within a single
organism that have arisen due to a gene
duplication event.
• Xenologs are similar sequences that do not
share the same evolutionary origin, but rather
have arisen out of horizontal transfer events
through symbiosis, viruses, etc
Hamming or edit distance
• One method in determining sequence
similarity is to determine the edit distance
between two sequences.
• If we take the example of pear and tear, how
similar are these two words?
• There is a mismatch in the first letter, and
matches in the last three
• An alignment of these two is as follows
• P E A R
| | |
• T E A R
One way to score this alignment is to calculate the Hamming distance,
which is the minimum number of letters by which the two words
differ.
In this example, the Hamming distance would be 1.
The Hamming distance is calculated by summing up the
number of mismatches when two words are aligned to one another.
• Which alignment is the better alignment? One
way to judge this is to assign a positive score for
each match, and a negative score for each
mismatch, and a negative score for each
insertion/deletion
– (collectively referred to as indels).
• One scoring scheme might assign the following
values:
– match: +2
– mismatch: -1
– indel –2
Scoring alignment
Using this scoring scheme, the first alignment has 5 matches, 1 mismatch, and 4
indels.
The score for this alignment is: 5 * 2 – 1(1) – 4(2) = 10 – 1 – 8 = 1.
The second alignment has 6 matches, 1 mismatch, and 2 indels. The score for the
second
alignment is 6 * 2 – 1(1) – 2 (2) = 12 – 1 – 4 = 7.
Therefore, using the above scoring scheme, the second alignment is a
better alignment, since it produces a higher alignment score.
Dot Plots
• One of the more basic, yet important techniques
for determining the alignment between two
sequences is by using a visual alignment known
as dot plots.
• Dot plots of sequence similarity are created
using a matrix where the rows in the matrix
correspond to the characters in the first sequence
and the columns in the matrix correspond to the
characters in the second sequence.
• The dot plot is created as follows:
• loop through each row. For the current row,
take the character in that row and compare it
to the character in each column.
• If they are equal, place a dot in the matrix.
• Continue until all nodes in the matrix have
been considered.
Information within Dot Plots
• Dot plots are useful as a first-level filter for
determining an alignment between two
sequences.
• Regions of similarity will show up as diagonals
within the dot plot matrix
• Regions containing insertions/deletions can be
readily determined
• One potential application is to determine the
number of coding regions (exons) contained
within a processed mRNA.
• Regions of genomic DNA can contain
repetitive regions.
• For instance, approximately 50 percent of the
human genome -repetitive elements, which
can be on the order of a few hundred bases
• low complexity are present as well
• In addition to repetitive elements, regions of a
genome can be duplicated
• The duplicated region can be found either as a
direct repeat or as an inverted repeat meaning
– Direct - it occurs in the same direction,
- inverted repeat - meaning that the sequence
of the duplicated region is found in the
reverse complement direction
• Dot plots can readily show regions of direct
and inverted repeats.
• Dot plots show all possible matches of
residues
• the researcher can decide which alignments
are the most significant.
Available Dot Plot Packages
Dot Plot Articles
Notes – online - Complete notes on Bioinformatics

More Related Content

What's hot

Multiple Sequence Alignment
Multiple Sequence AlignmentMultiple Sequence Alignment
Multiple Sequence AlignmentMeghaj Mallick
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure predictionSiva Dharshini R
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-naveed ul mushtaq
 
Multiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
Multiple Alignment Sequence using Clustal Omega/ Shumaila RiazMultiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
Multiple Alignment Sequence using Clustal Omega/ Shumaila RiazShumailaRiaz6
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fastaALLIENU
 
Microarray Data Analysis
Microarray Data AnalysisMicroarray Data Analysis
Microarray Data Analysisyuvraj404
 
Shuttle vector - a plasmid vector used in rDNA technology.
Shuttle vector - a plasmid vector used in rDNA technology. Shuttle vector - a plasmid vector used in rDNA technology.
Shuttle vector - a plasmid vector used in rDNA technology. neeru02
 
Scoring schemes in bioinformatics (blosum)
Scoring schemes in bioinformatics (blosum)Scoring schemes in bioinformatics (blosum)
Scoring schemes in bioinformatics (blosum)SumatiHajela
 
protein sequence analysis
protein sequence analysisprotein sequence analysis
protein sequence analysisRamikaSingla
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsNikesh Narayanan
 
Motif & Domain
Motif & DomainMotif & Domain
Motif & DomainAnik Banik
 

What's hot (20)

Multiple Sequence Alignment
Multiple Sequence AlignmentMultiple Sequence Alignment
Multiple Sequence Alignment
 
BLAST Search tool
BLAST Search toolBLAST Search tool
BLAST Search tool
 
Protein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modelingProtein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modeling
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
 
Sequence Analysis
Sequence AnalysisSequence Analysis
Sequence Analysis
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
Blast Algorithm
Blast AlgorithmBlast Algorithm
Blast Algorithm
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Multiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
Multiple Alignment Sequence using Clustal Omega/ Shumaila RiazMultiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
Multiple Alignment Sequence using Clustal Omega/ Shumaila Riaz
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
The Cre-LoxP System
The Cre-LoxP SystemThe Cre-LoxP System
The Cre-LoxP System
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Microarray Data Analysis
Microarray Data AnalysisMicroarray Data Analysis
Microarray Data Analysis
 
Shuttle vector - a plasmid vector used in rDNA technology.
Shuttle vector - a plasmid vector used in rDNA technology. Shuttle vector - a plasmid vector used in rDNA technology.
Shuttle vector - a plasmid vector used in rDNA technology.
 
Illumina Sequencing
Illumina SequencingIllumina Sequencing
Illumina Sequencing
 
Scoring schemes in bioinformatics (blosum)
Scoring schemes in bioinformatics (blosum)Scoring schemes in bioinformatics (blosum)
Scoring schemes in bioinformatics (blosum)
 
protein sequence analysis
protein sequence analysisprotein sequence analysis
protein sequence analysis
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
Fasta
FastaFasta
Fasta
 
Motif & Domain
Motif & DomainMotif & Domain
Motif & Domain
 

Similar to Dot plots-1.ppt

AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfH K Yoon
 
Bls 303 l1.phylogenetics
Bls 303 l1.phylogeneticsBls 303 l1.phylogenetics
Bls 303 l1.phylogeneticsBruno Mmassy
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptxArupKhakhlari1
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its applicationprateek kumar
 
bioinformatics lecture 2.pptx and computational Boilogygy
bioinformatics lecture 2.pptx and computational Boilogygybioinformatics lecture 2.pptx and computational Boilogygy
bioinformatics lecture 2.pptx and computational BoilogygyMUHAMMEDBAWAYUSUF
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastRai University
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastRai University
 
CSA 3702 machine learning module 4
CSA 3702 machine learning module 4CSA 3702 machine learning module 4
CSA 3702 machine learning module 4Nandhini S
 
Sequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfSequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfsriaisvariyasundar
 
Sequence alignment unit 3
Sequence alignment unit 3Sequence alignment unit 3
Sequence alignment unit 3gyanikashukla
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. localbenazeer fathima
 
Chromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysisChromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysisV.V.V.College for Women
 
local and global allignment
local and global allignmentlocal and global allignment
local and global allignmentSHOBI KHAKHI
 
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013Prof. Wim Van Criekinge
 

Similar to Dot plots-1.ppt (20)

AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdf
 
Bls 303 l1.phylogenetics
Bls 303 l1.phylogeneticsBls 303 l1.phylogenetics
Bls 303 l1.phylogenetics
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptx
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
 
bioinformatics lecture 2.pptx and computational Boilogygy
bioinformatics lecture 2.pptx and computational Boilogygybioinformatics lecture 2.pptx and computational Boilogygy
bioinformatics lecture 2.pptx and computational Boilogygy
 
CI_L11_Optimization_ag2_eng.pptx
CI_L11_Optimization_ag2_eng.pptxCI_L11_Optimization_ag2_eng.pptx
CI_L11_Optimization_ag2_eng.pptx
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blast
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blast
 
CSA 3702 machine learning module 4
CSA 3702 machine learning module 4CSA 3702 machine learning module 4
CSA 3702 machine learning module 4
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
CI_L02_Optimization_ag2_eng.pdf
CI_L02_Optimization_ag2_eng.pdfCI_L02_Optimization_ag2_eng.pdf
CI_L02_Optimization_ag2_eng.pdf
 
Sequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfSequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdf
 
Sequence alignment unit 3
Sequence alignment unit 3Sequence alignment unit 3
Sequence alignment unit 3
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Chromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysisChromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysis
 
local and global allignment
local and global allignmentlocal and global allignment
local and global allignment
 
Numerical taxonomy or taximetrics
Numerical taxonomy or taximetricsNumerical taxonomy or taximetrics
Numerical taxonomy or taximetrics
 
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
 

More from Swaminathan34154

2. Abiotic- Salt stress.ppt
2. Abiotic- Salt stress.ppt2. Abiotic- Salt stress.ppt
2. Abiotic- Salt stress.pptSwaminathan34154
 
Lec 1-16.06.2021-final (4).pptx
Lec 1-16.06.2021-final (4).pptxLec 1-16.06.2021-final (4).pptx
Lec 1-16.06.2021-final (4).pptxSwaminathan34154
 
3 & 4 T. AGM Significance of Microbial spoilage of food.pptx
3 & 4 T. AGM  Significance of Microbial spoilage of food.pptx3 & 4 T. AGM  Significance of Microbial spoilage of food.pptx
3 & 4 T. AGM Significance of Microbial spoilage of food.pptxSwaminathan34154
 
11 & 12T. AGM 353 Food standards.pptx
11 & 12T. AGM 353 Food standards.pptx11 & 12T. AGM 353 Food standards.pptx
11 & 12T. AGM 353 Food standards.pptxSwaminathan34154
 
6T. AGM Mycotoxicoses in food.pptx
6T. AGM Mycotoxicoses in food.pptx6T. AGM Mycotoxicoses in food.pptx
6T. AGM Mycotoxicoses in food.pptxSwaminathan34154
 
2T. AGM Physical and chemical changes during food spoilage.pptx
2T. AGM Physical and chemical changes during food spoilage.pptx2T. AGM Physical and chemical changes during food spoilage.pptx
2T. AGM Physical and chemical changes during food spoilage.pptxSwaminathan34154
 
Biological data – types and characteristics.ppt
Biological data – types and characteristics.pptBiological data – types and characteristics.ppt
Biological data – types and characteristics.pptSwaminathan34154
 

More from Swaminathan34154 (8)

2. Abiotic- Salt stress.ppt
2. Abiotic- Salt stress.ppt2. Abiotic- Salt stress.ppt
2. Abiotic- Salt stress.ppt
 
Lec 1-16.06.2021-final (4).pptx
Lec 1-16.06.2021-final (4).pptxLec 1-16.06.2021-final (4).pptx
Lec 1-16.06.2021-final (4).pptx
 
Lecture 2&3.pptx
Lecture 2&3.pptxLecture 2&3.pptx
Lecture 2&3.pptx
 
3 & 4 T. AGM Significance of Microbial spoilage of food.pptx
3 & 4 T. AGM  Significance of Microbial spoilage of food.pptx3 & 4 T. AGM  Significance of Microbial spoilage of food.pptx
3 & 4 T. AGM Significance of Microbial spoilage of food.pptx
 
11 & 12T. AGM 353 Food standards.pptx
11 & 12T. AGM 353 Food standards.pptx11 & 12T. AGM 353 Food standards.pptx
11 & 12T. AGM 353 Food standards.pptx
 
6T. AGM Mycotoxicoses in food.pptx
6T. AGM Mycotoxicoses in food.pptx6T. AGM Mycotoxicoses in food.pptx
6T. AGM Mycotoxicoses in food.pptx
 
2T. AGM Physical and chemical changes during food spoilage.pptx
2T. AGM Physical and chemical changes during food spoilage.pptx2T. AGM Physical and chemical changes during food spoilage.pptx
2T. AGM Physical and chemical changes during food spoilage.pptx
 
Biological data – types and characteristics.ppt
Biological data – types and characteristics.pptBiological data – types and characteristics.ppt
Biological data – types and characteristics.ppt
 

Recently uploaded

Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptMAESTRELLAMesa2
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxUmerFayaz5
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 

Recently uploaded (20)

Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
9953056974 Young Call Girls In Mahavir enclave Indian Quality Escort service
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
G9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.pptG9 Science Q4- Week 1-2 Projectile Motion.ppt
G9 Science Q4- Week 1-2 Projectile Motion.ppt
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 

Dot plots-1.ppt

  • 2. Alignment • In molecular biology, a common question is to ask whether or not two sequences are related. • The most common way to tell whether or not they are related is to compare them to one another to see if they are similar.
  • 3. • Biological sequences that are similar (but not exact) provide useful information to help discover functional, structural, and evolutionary information. • One common mistake is to describe two sequences as having some sort of homology or a percent homology based on their sequence similarity. • This is a misuse of the biological term. • Two sequences in different organisms are homologous if they have been derived from a common ancestor sequence. • However, the greater the sequence similarity, the greater chance there is that they share similar function and/or structure.
  • 4. Biological Definitions for Related Sequences • Homologs are similar sequences in two different organisms that have been derived from a common ancestor sequence. – Homologs can be described as either orthologous or paralogous. • Orthologs are similar sequences in two different organisms that have arisen due to a speciation event. Orthologs typically retain their functionality throughout evolution.
  • 5. • Paralogs are similar sequences within a single organism that have arisen due to a gene duplication event. • Xenologs are similar sequences that do not share the same evolutionary origin, but rather have arisen out of horizontal transfer events through symbiosis, viruses, etc
  • 6. Hamming or edit distance • One method in determining sequence similarity is to determine the edit distance between two sequences. • If we take the example of pear and tear, how similar are these two words? • There is a mismatch in the first letter, and matches in the last three • An alignment of these two is as follows
  • 7. • P E A R | | | • T E A R One way to score this alignment is to calculate the Hamming distance, which is the minimum number of letters by which the two words differ. In this example, the Hamming distance would be 1. The Hamming distance is calculated by summing up the number of mismatches when two words are aligned to one another.
  • 8. • Which alignment is the better alignment? One way to judge this is to assign a positive score for each match, and a negative score for each mismatch, and a negative score for each insertion/deletion – (collectively referred to as indels). • One scoring scheme might assign the following values: – match: +2 – mismatch: -1 – indel –2 Scoring alignment
  • 9. Using this scoring scheme, the first alignment has 5 matches, 1 mismatch, and 4 indels. The score for this alignment is: 5 * 2 – 1(1) – 4(2) = 10 – 1 – 8 = 1. The second alignment has 6 matches, 1 mismatch, and 2 indels. The score for the second alignment is 6 * 2 – 1(1) – 2 (2) = 12 – 1 – 4 = 7. Therefore, using the above scoring scheme, the second alignment is a better alignment, since it produces a higher alignment score.
  • 10. Dot Plots • One of the more basic, yet important techniques for determining the alignment between two sequences is by using a visual alignment known as dot plots. • Dot plots of sequence similarity are created using a matrix where the rows in the matrix correspond to the characters in the first sequence and the columns in the matrix correspond to the characters in the second sequence.
  • 11. • The dot plot is created as follows: • loop through each row. For the current row, take the character in that row and compare it to the character in each column. • If they are equal, place a dot in the matrix. • Continue until all nodes in the matrix have been considered.
  • 12.
  • 13.
  • 14. Information within Dot Plots • Dot plots are useful as a first-level filter for determining an alignment between two sequences. • Regions of similarity will show up as diagonals within the dot plot matrix • Regions containing insertions/deletions can be readily determined • One potential application is to determine the number of coding regions (exons) contained within a processed mRNA.
  • 15. • Regions of genomic DNA can contain repetitive regions. • For instance, approximately 50 percent of the human genome -repetitive elements, which can be on the order of a few hundred bases • low complexity are present as well • In addition to repetitive elements, regions of a genome can be duplicated
  • 16. • The duplicated region can be found either as a direct repeat or as an inverted repeat meaning – Direct - it occurs in the same direction, - inverted repeat - meaning that the sequence of the duplicated region is found in the reverse complement direction • Dot plots can readily show regions of direct and inverted repeats.
  • 17.
  • 18. • Dot plots show all possible matches of residues • the researcher can decide which alignments are the most significant.
  • 19. Available Dot Plot Packages
  • 20. Dot Plot Articles Notes – online - Complete notes on Bioinformatics