SlideShare a Scribd company logo
Dot plots
Alignment
• In molecular biology, a common question is to
ask whether or not two sequences are related.
• The most common way to tell whether or not
they are related is to compare them to one
another to see if they are similar.
• Biological sequences that are similar (but not exact)
provide useful information to help discover functional,
structural, and evolutionary information.
• One common mistake is to describe two sequences as
having some sort of homology or a percent homology
based on their sequence similarity.
• This is a misuse of the biological term.
• Two sequences in different organisms are homologous if
they have been derived from a common ancestor
sequence.
• However, the greater the sequence similarity, the greater
chance there is that they share similar function and/or
structure.
Biological Definitions for Related
Sequences
• Homologs are similar sequences in two different
organisms that have been derived from a
common ancestor sequence.
– Homologs can be described as either orthologous or
paralogous.
• Orthologs are similar sequences in two different
organisms that have arisen due to a speciation
event.
Orthologs typically retain their functionality
throughout evolution.
• Paralogs are similar sequences within a single
organism that have arisen due to a gene
duplication event.
• Xenologs are similar sequences that do not
share the same evolutionary origin, but rather
have arisen out of horizontal transfer events
through symbiosis, viruses, etc
Hamming or edit distance
• One method in determining sequence
similarity is to determine the edit distance
between two sequences.
• If we take the example of pear and tear, how
similar are these two words?
• There is a mismatch in the first letter, and
matches in the last three
• An alignment of these two is as follows
• P E A R
| | |
• T E A R
One way to score this alignment is to calculate the Hamming distance,
which is the minimum number of letters by which the two words
differ.
In this example, the Hamming distance would be 1.
The Hamming distance is calculated by summing up the
number of mismatches when two words are aligned to one another.
• Which alignment is the better alignment? One
way to judge this is to assign a positive score for
each match, and a negative score for each
mismatch, and a negative score for each
insertion/deletion
– (collectively referred to as indels).
• One scoring scheme might assign the following
values:
– match: +2
– mismatch: -1
– indel –2
Scoring alignment
Using this scoring scheme, the first alignment has 5 matches, 1 mismatch, and 4
indels.
The score for this alignment is: 5 * 2 – 1(1) – 4(2) = 10 – 1 – 8 = 1.
The second alignment has 6 matches, 1 mismatch, and 2 indels. The score for the
second
alignment is 6 * 2 – 1(1) – 2 (2) = 12 – 1 – 4 = 7.
Therefore, using the above scoring scheme, the second alignment is a
better alignment, since it produces a higher alignment score.
Dot Plots
• One of the more basic, yet important techniques
for determining the alignment between two
sequences is by using a visual alignment known
as dot plots.
• Dot plots of sequence similarity are created
using a matrix where the rows in the matrix
correspond to the characters in the first sequence
and the columns in the matrix correspond to the
characters in the second sequence.
• The dot plot is created as follows:
• loop through each row. For the current row,
take the character in that row and compare it
to the character in each column.
• If they are equal, place a dot in the matrix.
• Continue until all nodes in the matrix have
been considered.
Information within Dot Plots
• Dot plots are useful as a first-level filter for
determining an alignment between two
sequences.
• Regions of similarity will show up as diagonals
within the dot plot matrix
• Regions containing insertions/deletions can be
readily determined
• One potential application is to determine the
number of coding regions (exons) contained
within a processed mRNA.
• Regions of genomic DNA can contain
repetitive regions.
• For instance, approximately 50 percent of the
human genome -repetitive elements, which
can be on the order of a few hundred bases
• low complexity are present as well
• In addition to repetitive elements, regions of a
genome can be duplicated
• The duplicated region can be found either as a
direct repeat or as an inverted repeat meaning
– Direct - it occurs in the same direction,
- inverted repeat - meaning that the sequence
of the duplicated region is found in the
reverse complement direction
• Dot plots can readily show regions of direct
and inverted repeats.
• Dot plots show all possible matches of
residues
• the researcher can decide which alignments
are the most significant.
Available Dot Plot Packages
Dot Plot Articles
Notes – online - Complete notes on Bioinformatics

More Related Content

What's hot

Genomic variation
Genomic variationGenomic variation
Genomic variation
prateek kumar
 
PHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICSPHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICS
Usman Arshad
 
Difference between genetic linkage and physical map
Difference between genetic  linkage and physical  mapDifference between genetic  linkage and physical  map
Difference between genetic linkage and physical map
Kanimoli Mathivathana
 
Chromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysisChromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysis
V.V.V.College for Women
 
GENE TRANSFER METHODS IN ANIMALS
GENE TRANSFER METHODS IN ANIMALSGENE TRANSFER METHODS IN ANIMALS
GENE TRANSFER METHODS IN ANIMALS
HUZAIFA GULZAR
 
Prosite
PrositeProsite
genetic linkage and gene mapping
genetic linkage and gene mappinggenetic linkage and gene mapping
genetic linkage and gene mapping
Mahammed Faizan
 
Cell cell hybridization or somatic cell hybridization
Cell cell hybridization or somatic cell hybridizationCell cell hybridization or somatic cell hybridization
Cell cell hybridization or somatic cell hybridization
Subhradeep sarkar
 
Chromosome structure
Chromosome structureChromosome structure
Chromosome structure
Arshad Al-Ghafour
 
Chromosome walking
Chromosome walkingChromosome walking
Chromosome walking
Saajida Sultaana
 
Fine structureof gene,allelic complementation,and split gene
Fine structureof gene,allelic complementation,and split gene Fine structureof gene,allelic complementation,and split gene
Fine structureof gene,allelic complementation,and split gene
Junagadh Agricultural University, Junagadh (Guj.), India
 
Genome Mapping
Genome MappingGenome Mapping
Genome Mapping
ruchibioinfo
 
Sequence file formats
Sequence file formatsSequence file formats
Sequence file formats
Alphonsa Joseph
 
Ultra fine structure of gene
Ultra fine structure of geneUltra fine structure of gene
Ultra fine structure of gene
GauravRajSinhVaghela
 
BITS: Basics of Sequence similarity
BITS: Basics of Sequence similarityBITS: Basics of Sequence similarity
BITS: Basics of Sequence similarity
BITS
 
Nucleosome and histones
Nucleosome and histonesNucleosome and histones
Nucleosome and histones
HARINATHA REDDY ASWARTHA
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
Jajati Keshari Nayak
 
Genome evolution
Genome evolutionGenome evolution
Genome evolution
ruchibioinfo
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Subhranil Bhattacharjee
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
Arindam Ghosh
 

What's hot (20)

Genomic variation
Genomic variationGenomic variation
Genomic variation
 
PHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICSPHYSICAL MAPPING STRATEGIES IN GENOMICS
PHYSICAL MAPPING STRATEGIES IN GENOMICS
 
Difference between genetic linkage and physical map
Difference between genetic  linkage and physical  mapDifference between genetic  linkage and physical  map
Difference between genetic linkage and physical map
 
Chromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysisChromosome or gene mapping &Linkage analysis
Chromosome or gene mapping &Linkage analysis
 
GENE TRANSFER METHODS IN ANIMALS
GENE TRANSFER METHODS IN ANIMALSGENE TRANSFER METHODS IN ANIMALS
GENE TRANSFER METHODS IN ANIMALS
 
Prosite
PrositeProsite
Prosite
 
genetic linkage and gene mapping
genetic linkage and gene mappinggenetic linkage and gene mapping
genetic linkage and gene mapping
 
Cell cell hybridization or somatic cell hybridization
Cell cell hybridization or somatic cell hybridizationCell cell hybridization or somatic cell hybridization
Cell cell hybridization or somatic cell hybridization
 
Chromosome structure
Chromosome structureChromosome structure
Chromosome structure
 
Chromosome walking
Chromosome walkingChromosome walking
Chromosome walking
 
Fine structureof gene,allelic complementation,and split gene
Fine structureof gene,allelic complementation,and split gene Fine structureof gene,allelic complementation,and split gene
Fine structureof gene,allelic complementation,and split gene
 
Genome Mapping
Genome MappingGenome Mapping
Genome Mapping
 
Sequence file formats
Sequence file formatsSequence file formats
Sequence file formats
 
Ultra fine structure of gene
Ultra fine structure of geneUltra fine structure of gene
Ultra fine structure of gene
 
BITS: Basics of Sequence similarity
BITS: Basics of Sequence similarityBITS: Basics of Sequence similarity
BITS: Basics of Sequence similarity
 
Nucleosome and histones
Nucleosome and histonesNucleosome and histones
Nucleosome and histones
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
Genome evolution
Genome evolutionGenome evolution
Genome evolution
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 

Similar to Dot plots-1.ppt

AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdf
H K Yoon
 
Bls 303 l1.phylogenetics
Bls 303 l1.phylogeneticsBls 303 l1.phylogenetics
Bls 303 l1.phylogeneticsBruno Mmassy
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptx
ArupKhakhlari1
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
Ravi Gandham
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
prateek kumar
 
bioinformatics lecture 2.pptx and computational Boilogygy
bioinformatics lecture 2.pptx and computational Boilogygybioinformatics lecture 2.pptx and computational Boilogygy
bioinformatics lecture 2.pptx and computational Boilogygy
MUHAMMEDBAWAYUSUF
 
CI_L11_Optimization_ag2_eng.pptx
CI_L11_Optimization_ag2_eng.pptxCI_L11_Optimization_ag2_eng.pptx
CI_L11_Optimization_ag2_eng.pptx
SantiagoGarridoBulln
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blast
Rai University
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastRai University
 
CSA 3702 machine learning module 4
CSA 3702 machine learning module 4CSA 3702 machine learning module 4
CSA 3702 machine learning module 4
Nandhini S
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
Zeeshan Hanjra
 
CI_L02_Optimization_ag2_eng.pdf
CI_L02_Optimization_ag2_eng.pdfCI_L02_Optimization_ag2_eng.pdf
CI_L02_Optimization_ag2_eng.pdf
SantiagoGarridoBulln
 
Sequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfSequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdf
sriaisvariyasundar
 
Sequence alignment unit 3
Sequence alignment unit 3Sequence alignment unit 3
Sequence alignment unit 3
gyanikashukla
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
benazeer fathima
 
Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014
Prof. Wim Van Criekinge
 
local and global allignment
local and global allignmentlocal and global allignment
local and global allignment
SHOBI KHAKHI
 
Numerical taxonomy or taximetrics
Numerical taxonomy or taximetricsNumerical taxonomy or taximetrics
Numerical taxonomy or taximetrics
Nistarini College, Purulia (W.B) India
 
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Prof. Wim Van Criekinge
 
Genetic Algorithm (1).pdf
Genetic Algorithm (1).pdfGenetic Algorithm (1).pdf
Genetic Algorithm (1).pdf
AzmiNizar1
 

Similar to Dot plots-1.ppt (20)

AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdf
 
Bls 303 l1.phylogenetics
Bls 303 l1.phylogeneticsBls 303 l1.phylogenetics
Bls 303 l1.phylogenetics
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptx
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
 
bioinformatics lecture 2.pptx and computational Boilogygy
bioinformatics lecture 2.pptx and computational Boilogygybioinformatics lecture 2.pptx and computational Boilogygy
bioinformatics lecture 2.pptx and computational Boilogygy
 
CI_L11_Optimization_ag2_eng.pptx
CI_L11_Optimization_ag2_eng.pptxCI_L11_Optimization_ag2_eng.pptx
CI_L11_Optimization_ag2_eng.pptx
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blast
 
B.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blastB.sc biochem i bobi u 3.2 algorithm + blast
B.sc biochem i bobi u 3.2 algorithm + blast
 
CSA 3702 machine learning module 4
CSA 3702 machine learning module 4CSA 3702 machine learning module 4
CSA 3702 machine learning module 4
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
CI_L02_Optimization_ag2_eng.pdf
CI_L02_Optimization_ag2_eng.pdfCI_L02_Optimization_ag2_eng.pdf
CI_L02_Optimization_ag2_eng.pdf
 
Sequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfSequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdf
 
Sequence alignment unit 3
Sequence alignment unit 3Sequence alignment unit 3
Sequence alignment unit 3
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014Bioinformatics t8-go-hmm v2014
Bioinformatics t8-go-hmm v2014
 
local and global allignment
local and global allignmentlocal and global allignment
local and global allignment
 
Numerical taxonomy or taximetrics
Numerical taxonomy or taximetricsNumerical taxonomy or taximetrics
Numerical taxonomy or taximetrics
 
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013Bioinformatics t8-go-hmm wim-vancriekinge_v2013
Bioinformatics t8-go-hmm wim-vancriekinge_v2013
 
Genetic Algorithm (1).pdf
Genetic Algorithm (1).pdfGenetic Algorithm (1).pdf
Genetic Algorithm (1).pdf
 

More from Swaminathan34154

2. Abiotic- Salt stress.ppt
2. Abiotic- Salt stress.ppt2. Abiotic- Salt stress.ppt
2. Abiotic- Salt stress.ppt
Swaminathan34154
 
Lec 1-16.06.2021-final (4).pptx
Lec 1-16.06.2021-final (4).pptxLec 1-16.06.2021-final (4).pptx
Lec 1-16.06.2021-final (4).pptx
Swaminathan34154
 
Lecture 2&3.pptx
Lecture 2&3.pptxLecture 2&3.pptx
Lecture 2&3.pptx
Swaminathan34154
 
3 & 4 T. AGM Significance of Microbial spoilage of food.pptx
3 & 4 T. AGM  Significance of Microbial spoilage of food.pptx3 & 4 T. AGM  Significance of Microbial spoilage of food.pptx
3 & 4 T. AGM Significance of Microbial spoilage of food.pptx
Swaminathan34154
 
11 & 12T. AGM 353 Food standards.pptx
11 & 12T. AGM 353 Food standards.pptx11 & 12T. AGM 353 Food standards.pptx
11 & 12T. AGM 353 Food standards.pptx
Swaminathan34154
 
6T. AGM Mycotoxicoses in food.pptx
6T. AGM Mycotoxicoses in food.pptx6T. AGM Mycotoxicoses in food.pptx
6T. AGM Mycotoxicoses in food.pptx
Swaminathan34154
 
2T. AGM Physical and chemical changes during food spoilage.pptx
2T. AGM Physical and chemical changes during food spoilage.pptx2T. AGM Physical and chemical changes during food spoilage.pptx
2T. AGM Physical and chemical changes during food spoilage.pptx
Swaminathan34154
 
Biological data – types and characteristics.ppt
Biological data – types and characteristics.pptBiological data – types and characteristics.ppt
Biological data – types and characteristics.ppt
Swaminathan34154
 

More from Swaminathan34154 (8)

2. Abiotic- Salt stress.ppt
2. Abiotic- Salt stress.ppt2. Abiotic- Salt stress.ppt
2. Abiotic- Salt stress.ppt
 
Lec 1-16.06.2021-final (4).pptx
Lec 1-16.06.2021-final (4).pptxLec 1-16.06.2021-final (4).pptx
Lec 1-16.06.2021-final (4).pptx
 
Lecture 2&3.pptx
Lecture 2&3.pptxLecture 2&3.pptx
Lecture 2&3.pptx
 
3 & 4 T. AGM Significance of Microbial spoilage of food.pptx
3 & 4 T. AGM  Significance of Microbial spoilage of food.pptx3 & 4 T. AGM  Significance of Microbial spoilage of food.pptx
3 & 4 T. AGM Significance of Microbial spoilage of food.pptx
 
11 & 12T. AGM 353 Food standards.pptx
11 & 12T. AGM 353 Food standards.pptx11 & 12T. AGM 353 Food standards.pptx
11 & 12T. AGM 353 Food standards.pptx
 
6T. AGM Mycotoxicoses in food.pptx
6T. AGM Mycotoxicoses in food.pptx6T. AGM Mycotoxicoses in food.pptx
6T. AGM Mycotoxicoses in food.pptx
 
2T. AGM Physical and chemical changes during food spoilage.pptx
2T. AGM Physical and chemical changes during food spoilage.pptx2T. AGM Physical and chemical changes during food spoilage.pptx
2T. AGM Physical and chemical changes during food spoilage.pptx
 
Biological data – types and characteristics.ppt
Biological data – types and characteristics.pptBiological data – types and characteristics.ppt
Biological data – types and characteristics.ppt
 

Recently uploaded

Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
ossaicprecious19
 
plant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptxplant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptx
yusufzako14
 
Large scale production of streptomycin.pptx
Large scale production of streptomycin.pptxLarge scale production of streptomycin.pptx
Large scale production of streptomycin.pptx
Cherry
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
kumarmathi863
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
Richard Gill
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
Richard Gill
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Sérgio Sacani
 
FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
Michel Dumontier
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
Areesha Ahmad
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
azzyixes
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
Viksit bharat till 2047 India@2047.pptx
Viksit bharat till 2047  India@2047.pptxViksit bharat till 2047  India@2047.pptx
Viksit bharat till 2047 India@2047.pptx
rakeshsharma20142015
 

Recently uploaded (20)

Lab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerinLab report on liquid viscosity of glycerin
Lab report on liquid viscosity of glycerin
 
plant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptxplant biotechnology Lecture note ppt.pptx
plant biotechnology Lecture note ppt.pptx
 
Large scale production of streptomycin.pptx
Large scale production of streptomycin.pptxLarge scale production of streptomycin.pptx
Large scale production of streptomycin.pptx
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
Earliest Galaxies in the JADES Origins Field: Luminosity Function and Cosmic ...
 
FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of LipidsGBSN - Biochemistry (Unit 5) Chemistry of Lipids
GBSN - Biochemistry (Unit 5) Chemistry of Lipids
 
justice-and-fairness-ethics with example
justice-and-fairness-ethics with examplejustice-and-fairness-ethics with example
justice-and-fairness-ethics with example
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
Viksit bharat till 2047 India@2047.pptx
Viksit bharat till 2047  India@2047.pptxViksit bharat till 2047  India@2047.pptx
Viksit bharat till 2047 India@2047.pptx
 

Dot plots-1.ppt

  • 2. Alignment • In molecular biology, a common question is to ask whether or not two sequences are related. • The most common way to tell whether or not they are related is to compare them to one another to see if they are similar.
  • 3. • Biological sequences that are similar (but not exact) provide useful information to help discover functional, structural, and evolutionary information. • One common mistake is to describe two sequences as having some sort of homology or a percent homology based on their sequence similarity. • This is a misuse of the biological term. • Two sequences in different organisms are homologous if they have been derived from a common ancestor sequence. • However, the greater the sequence similarity, the greater chance there is that they share similar function and/or structure.
  • 4. Biological Definitions for Related Sequences • Homologs are similar sequences in two different organisms that have been derived from a common ancestor sequence. – Homologs can be described as either orthologous or paralogous. • Orthologs are similar sequences in two different organisms that have arisen due to a speciation event. Orthologs typically retain their functionality throughout evolution.
  • 5. • Paralogs are similar sequences within a single organism that have arisen due to a gene duplication event. • Xenologs are similar sequences that do not share the same evolutionary origin, but rather have arisen out of horizontal transfer events through symbiosis, viruses, etc
  • 6. Hamming or edit distance • One method in determining sequence similarity is to determine the edit distance between two sequences. • If we take the example of pear and tear, how similar are these two words? • There is a mismatch in the first letter, and matches in the last three • An alignment of these two is as follows
  • 7. • P E A R | | | • T E A R One way to score this alignment is to calculate the Hamming distance, which is the minimum number of letters by which the two words differ. In this example, the Hamming distance would be 1. The Hamming distance is calculated by summing up the number of mismatches when two words are aligned to one another.
  • 8. • Which alignment is the better alignment? One way to judge this is to assign a positive score for each match, and a negative score for each mismatch, and a negative score for each insertion/deletion – (collectively referred to as indels). • One scoring scheme might assign the following values: – match: +2 – mismatch: -1 – indel –2 Scoring alignment
  • 9. Using this scoring scheme, the first alignment has 5 matches, 1 mismatch, and 4 indels. The score for this alignment is: 5 * 2 – 1(1) – 4(2) = 10 – 1 – 8 = 1. The second alignment has 6 matches, 1 mismatch, and 2 indels. The score for the second alignment is 6 * 2 – 1(1) – 2 (2) = 12 – 1 – 4 = 7. Therefore, using the above scoring scheme, the second alignment is a better alignment, since it produces a higher alignment score.
  • 10. Dot Plots • One of the more basic, yet important techniques for determining the alignment between two sequences is by using a visual alignment known as dot plots. • Dot plots of sequence similarity are created using a matrix where the rows in the matrix correspond to the characters in the first sequence and the columns in the matrix correspond to the characters in the second sequence.
  • 11. • The dot plot is created as follows: • loop through each row. For the current row, take the character in that row and compare it to the character in each column. • If they are equal, place a dot in the matrix. • Continue until all nodes in the matrix have been considered.
  • 12.
  • 13.
  • 14. Information within Dot Plots • Dot plots are useful as a first-level filter for determining an alignment between two sequences. • Regions of similarity will show up as diagonals within the dot plot matrix • Regions containing insertions/deletions can be readily determined • One potential application is to determine the number of coding regions (exons) contained within a processed mRNA.
  • 15. • Regions of genomic DNA can contain repetitive regions. • For instance, approximately 50 percent of the human genome -repetitive elements, which can be on the order of a few hundred bases • low complexity are present as well • In addition to repetitive elements, regions of a genome can be duplicated
  • 16. • The duplicated region can be found either as a direct repeat or as an inverted repeat meaning – Direct - it occurs in the same direction, - inverted repeat - meaning that the sequence of the duplicated region is found in the reverse complement direction • Dot plots can readily show regions of direct and inverted repeats.
  • 17.
  • 18. • Dot plots show all possible matches of residues • the researcher can decide which alignments are the most significant.
  • 19. Available Dot Plot Packages
  • 20. Dot Plot Articles Notes – online - Complete notes on Bioinformatics