SlideShare a Scribd company logo
1 of 44
M.SRI ARAVIND
LAL
B841018
INTRODUCTION
• In computional biology a dot plot is a graphical methods
for comparing two biological sequences and identifying
region of close similarity
• It is type of recurrence plot (graph of horizontal and
vertical axis
HISTORY
• These are introduced by Gibbs and Mclntyre in 1970
• These plot are two dimensional matrices that have
sequences of the proteins being compared along the vertical
and horizontal axis.
• Individual cells in matrix can be shaded black,if the residue
are identical
• Thus matched sequences run of diagonal lines across the
matrix.
PRINCIPLE
• The principle used to generate the dot plot is:
• The top X and the left y axes of a rectangular array are used to represent the two
sequences to be compared
• Calculation:
• Matrix Columns = residues of sequence 1
Rows = residues of sequence 2
EXAMPLE
• Seq 1: TWILIGHTZONE
• Seq 2: MIDNIGHTZONE Matrix= 12 * 12
• A dot is plotted at every co-ordinate where there is similarity between the bases
DOT PLOT INTERPRETATION
• Seq1: ATGATAT
• Seq2: ATGATAT
SIMPLE PLOT TERMS
• Window: size of sequence block used for comparison.
example:
window = 1
• Stringency = Number of matches required to score
positive.
example:
stringency = 1 (required exact match)
DOTPLOT SCORING
• Dotplot- matrix, with one sequence across top, other down side. Put a dot, or 1, where ever
there is identity.
G A T C T
G
A
T
C
T
DOTPLOT SCORING
• Dotplot- matrix, with one sequence across top, other down side. Put a dot, or 1, where ever
there is identity.
G A T C T
G
A
T
C
T
.
DOTPLOT SCORING
• Dotplot- matrix, with one sequence across top, other down side. Put a dot, or 1, where ever
there is identity.
G A T C T
G
A
T
C
T
... .
DOTPLOT SCORING
• Dotplot- matrix, with one sequence across top, other down side. Put a dot, or 1, where ever
there is identity.
G A T C T
G
A
T
C
T
... ... .
G A T A C T G C G A T A C T G C G C A
G 1 1 1 1 1
A 1 1 1 1 1
T 1 1 1 1 1
A 1 1 1 1
C 1 1 1 1 1
T 1 1 1 1
G 1 1 1 1
C 1 1 1 1
G 1 1 1
A 1 1 1
T 1 1
A 1
C 1 1 1
T 1
G 1 1
C 1 1
G 1
C 1
A 1
G A T A C T G C A T C G T C A C T C A
G 1 1 1
A 1 1 1 1 1
T 1 1 1 1 1
A 1 1 1 1
C 1 1 1 1 1 1
T 1 1 1 1
G 1 1
C 1 1 1 1 1
A 1 1 1
T 1 1 1
C 1 1 1 1
G 1
T 1 1
C 1 1 1
A 1 1
C 1 1
T 1
C 1
A 1
INTRAGENIC COMPARISON
• Rat Groucho Gene
INTERGENIC COMPARISON
• Rat and Drosophila Groucho Gene
INTERGENIC COMPARISON
• Nucleotide sequence contains three
domains.
INTERGENIC COMPARISON
• Nucleotide sequence contains three
domains.
• 50 - 350 - Strong conservation
• Indel places comparison out of register
INTERGENIC COMPARISON
• Nucleotide sequence contains three
domains.
• 50 - 350 - Strong conservation
• Indel places comparison out of register
• 450 - 1300 - Slightly weaker conservation
INTERGENIC COMPARISON
• Nucleotide sequence contains three
domains.
• 50 - 350 - Strong conservation
• Indel places comparison out of register
• 450 - 1300 - Slightly weaker conservation
• 1300 - 2400 - Strong conservation
ANALYSIS OF DOT PLOT MATRIX
• Principal diagonal shows identical sequence.
• Global and local alignment are shown.
• Multiple diagonal indicate repeatation
• Reverse diagonal (perpendicular to diagonal) indicate
INVERSION.
• Reverse diagonal crossing diagonal (X) indicate
PALINDROMES.
• Formation of box indicate the low complexity region
DIRECT REPEAT
PALINDROMIC SEQUENCE
• A palindromic sequence is a nucleic acid sequence (DNA or RNA) tha is same
whether read 5' to 3' on one strand or 5' to 3' on the complementary strand with
which it forms a double helix.
INVERTED REPEAT
• An inverted repeat is sequence of nucleotides followed downstream by its
reverse complement.
• Inverted repeat: abcdeedcbafghijklmno
LOW-COMPLEXITY REGIONS
• Low-complexity regions in sequences can be found as regions around the
diagonal all obtaining a high score. Low complexity regions are calculated from
the redundancy of amino acids within a limited region.
DOT PLOT SOFTWARE
• we can use the EMBOSS package, which are following:
 Dotmatcher
 Dotpath
 Polydot
 Dottup
(http://emboss.bioinformatics.nl/cgi-
bin/emboss/dottup
JOURNALS
APPLICATION
• Shows the all possible alignment between two nucleic
acid and amino acid sequences.
• Help to recognise large region of simiarity.
• An excellent approach for finding sequence transposition.
• To find the location of genes between two genomes.
• To find the non sequential alignment.
LIMITATION
• For longer sequence, memory required for the graphical
representation is very high. So long sequence can not be aligned.
(only 2 sequence can align at a time)
• Lots of insignifcant matches makes it noisy (so many off diagonal
appear).
• Time required to compare two sequences is proportional to the
product of length of the sequences time of the search window. (not
very quick)
i.e, higher efficiency of short sequence.
Low efficiency of long sequence.
GAP PENALITY
• Gap penality is a method of scoring alignment of two or more sequence.
• when a gap is inserted in an sequence it matches more than the sequence
without gap insertion.
• Too many gap can cause an alignment to become meaningless.
Types of gap penality
Constant
Linear
affine
SCORING SCHEMES
TYPES OF GAP PENALITY
Constant
This is the simplest type of gap penality and a fixed negative score is given to
every gap, regardless of its length.
ATTGACCTGA EACH MATCH=1 SCORE 7-1=6
AT CCTGA WHOLE GAP=1
TYPES OF GAP PENALITY
Linear
The linear gap penalty takes into account the length (L) of each insertion/deletion
in the gap.
ATTGACCTGA EACH MATCH =1
AT CCTGA EACH GAP = -1
The score here is (7 − 3 = 4).
TYPES OF GAP PENALITY
Affine
 Most widely used gap penality and it combines both linear and
constant gap penality.
 Penality is based on form of A+B.L
 A is known as the gap opening penalty, B the gap extension penalty
and L the length of the gap.
 Gap opening refers to the cost required to open a gap of any length,
and gap extension the cost to extend the length of an existing gap
by 1.
VALUE IS 26
VALUE IS
7
REFERENCES
• Bioinformatics concepts, skill & applications, second edition by
S.C.Rastogi, Namita Mendriatta, Parag Rastogi
• http://en.wikipedia.org/wiki/Dot_plot_%28bioinformatics%29
• http://lectures.molgen.mpg.de/Pairwise/DotPlots/
• https://ugene.unipro.ru/wiki/pages/viewpage.action?pageId=4
227426
• http://www.clcsupport.com/clcgenomicsworkbench/650/Examples
_interpretations_dot_plots.html
Dot matrix seminar
Dot matrix seminar
Dot matrix seminar
Dot matrix seminar

More Related Content

What's hot

Sequence analysis - Bioinformatics
Sequence analysis - BioinformaticsSequence analysis - Bioinformatics
Sequence analysis - BioinformaticsPratik Parikh
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure predictionSiva Dharshini R
 
Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformaticsavrilcoghlan
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fastaALLIENU
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matricesAshwini
 
Needleman-Wunsch Algorithm
Needleman-Wunsch AlgorithmNeedleman-Wunsch Algorithm
Needleman-Wunsch AlgorithmProshantaShil
 
Multiple Sequence Alignment
Multiple Sequence AlignmentMultiple Sequence Alignment
Multiple Sequence AlignmentMeghaj Mallick
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsNikesh Narayanan
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijayVijay Hemmadi
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignmentRamya S
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. localbenazeer fathima
 

What's hot (20)

Protein database
Protein databaseProtein database
Protein database
 
Sequence analysis - Bioinformatics
Sequence analysis - BioinformaticsSequence analysis - Bioinformatics
Sequence analysis - Bioinformatics
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
 
Scop database
Scop databaseScop database
Scop database
 
Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformatics
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
222397 lecture 16 17
222397 lecture 16 17222397 lecture 16 17
222397 lecture 16 17
 
Needleman-Wunsch Algorithm
Needleman-Wunsch AlgorithmNeedleman-Wunsch Algorithm
Needleman-Wunsch Algorithm
 
Multiple Sequence Alignment
Multiple Sequence AlignmentMultiple Sequence Alignment
Multiple Sequence Alignment
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
SEQUENCE ANALYSIS
SEQUENCE ANALYSISSEQUENCE ANALYSIS
SEQUENCE ANALYSIS
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Swiss prot database
Swiss prot databaseSwiss prot database
Swiss prot database
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 

Similar to Dot matrix seminar

Sequence alignment unit 3
Sequence alignment unit 3Sequence alignment unit 3
Sequence alignment unit 3gyanikashukla
 
The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment Parinda Rajapaksha
 
Sequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfSequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfsriaisvariyasundar
 
Schelkunoff Polynomial Method for Antenna Synthesis
Schelkunoff Polynomial Method for Antenna SynthesisSchelkunoff Polynomial Method for Antenna Synthesis
Schelkunoff Polynomial Method for Antenna SynthesisSwapnil Bangera
 
Quality control of sequencing with fast qc obtained with
Quality control of sequencing with fast qc obtained withQuality control of sequencing with fast qc obtained with
Quality control of sequencing with fast qc obtained withHafiz Muhammad Zeeshan Raza
 
DOT MATRIX DOT MATRIX DOT MATRIX DOT MATRIX
DOT MATRIX DOT MATRIX DOT MATRIX DOT MATRIXDOT MATRIX DOT MATRIX DOT MATRIX DOT MATRIX
DOT MATRIX DOT MATRIX DOT MATRIX DOT MATRIXnanamimomozano4562
 
Paper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satPaper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satChenYiHuang5
 
Contrast sensitivity
Contrast sensitivity Contrast sensitivity
Contrast sensitivity Khulesh Sahu
 
Bioinformatics t4-alignments wim_vancriekingev2013
Bioinformatics t4-alignments wim_vancriekingev2013Bioinformatics t4-alignments wim_vancriekingev2013
Bioinformatics t4-alignments wim_vancriekingev2013Prof. Wim Van Criekinge
 
sequence alignment
sequence alignmentsequence alignment
sequence alignmentammar kareem
 

Similar to Dot matrix seminar (20)

Ch06 multalign
Ch06 multalignCh06 multalign
Ch06 multalign
 
seq alignment.ppt
seq alignment.pptseq alignment.ppt
seq alignment.ppt
 
Sequence alignment unit 3
Sequence alignment unit 3Sequence alignment unit 3
Sequence alignment unit 3
 
The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment
 
Ch06 alignment
Ch06 alignmentCh06 alignment
Ch06 alignment
 
Bioinformatics lesson
Bioinformatics lessonBioinformatics lesson
Bioinformatics lesson
 
Bioinformatics lesson
Bioinformatics lessonBioinformatics lesson
Bioinformatics lesson
 
Sequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdfSequence-analysis-pairwise-alignment.pdf
Sequence-analysis-pairwise-alignment.pdf
 
Schelkunoff Polynomial Method for Antenna Synthesis
Schelkunoff Polynomial Method for Antenna SynthesisSchelkunoff Polynomial Method for Antenna Synthesis
Schelkunoff Polynomial Method for Antenna Synthesis
 
Quality control of sequencing with fast qc obtained with
Quality control of sequencing with fast qc obtained withQuality control of sequencing with fast qc obtained with
Quality control of sequencing with fast qc obtained with
 
Biological sequences analysis
Biological sequences analysisBiological sequences analysis
Biological sequences analysis
 
DOT MATRIX DOT MATRIX DOT MATRIX DOT MATRIX
DOT MATRIX DOT MATRIX DOT MATRIX DOT MATRIXDOT MATRIX DOT MATRIX DOT MATRIX DOT MATRIX
DOT MATRIX DOT MATRIX DOT MATRIX DOT MATRIX
 
Paper study: Learning to solve circuit sat
Paper study: Learning to solve circuit satPaper study: Learning to solve circuit sat
Paper study: Learning to solve circuit sat
 
Stats chapter 3
Stats chapter 3Stats chapter 3
Stats chapter 3
 
UNIT III.pptx
UNIT III.pptxUNIT III.pptx
UNIT III.pptx
 
Unit V - ppt.pptx
Unit V - ppt.pptxUnit V - ppt.pptx
Unit V - ppt.pptx
 
Bioinformatica t4-alignments
Bioinformatica t4-alignmentsBioinformatica t4-alignments
Bioinformatica t4-alignments
 
Contrast sensitivity
Contrast sensitivity Contrast sensitivity
Contrast sensitivity
 
Bioinformatics t4-alignments wim_vancriekingev2013
Bioinformatics t4-alignments wim_vancriekingev2013Bioinformatics t4-alignments wim_vancriekingev2013
Bioinformatics t4-alignments wim_vancriekingev2013
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 

Recently uploaded

internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxUnboundStockton
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxmanuelaromero2013
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaVirag Sontakke
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 

Recently uploaded (20)

internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Blooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docxBlooming Together_ Growing a Community Garden Worksheet.docx
Blooming Together_ Growing a Community Garden Worksheet.docx
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
How to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptxHow to Make a Pirate ship Primary Education.pptx
How to Make a Pirate ship Primary Education.pptx
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Painted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of IndiaPainted Grey Ware.pptx, PGW Culture of India
Painted Grey Ware.pptx, PGW Culture of India
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 

Dot matrix seminar

  • 2. INTRODUCTION • In computional biology a dot plot is a graphical methods for comparing two biological sequences and identifying region of close similarity • It is type of recurrence plot (graph of horizontal and vertical axis
  • 3. HISTORY • These are introduced by Gibbs and Mclntyre in 1970 • These plot are two dimensional matrices that have sequences of the proteins being compared along the vertical and horizontal axis. • Individual cells in matrix can be shaded black,if the residue are identical • Thus matched sequences run of diagonal lines across the matrix.
  • 4. PRINCIPLE • The principle used to generate the dot plot is: • The top X and the left y axes of a rectangular array are used to represent the two sequences to be compared • Calculation: • Matrix Columns = residues of sequence 1 Rows = residues of sequence 2
  • 5. EXAMPLE • Seq 1: TWILIGHTZONE • Seq 2: MIDNIGHTZONE Matrix= 12 * 12 • A dot is plotted at every co-ordinate where there is similarity between the bases
  • 6. DOT PLOT INTERPRETATION • Seq1: ATGATAT • Seq2: ATGATAT
  • 7. SIMPLE PLOT TERMS • Window: size of sequence block used for comparison. example: window = 1 • Stringency = Number of matches required to score positive. example: stringency = 1 (required exact match)
  • 8. DOTPLOT SCORING • Dotplot- matrix, with one sequence across top, other down side. Put a dot, or 1, where ever there is identity. G A T C T G A T C T
  • 9. DOTPLOT SCORING • Dotplot- matrix, with one sequence across top, other down side. Put a dot, or 1, where ever there is identity. G A T C T G A T C T .
  • 10. DOTPLOT SCORING • Dotplot- matrix, with one sequence across top, other down side. Put a dot, or 1, where ever there is identity. G A T C T G A T C T ... .
  • 11. DOTPLOT SCORING • Dotplot- matrix, with one sequence across top, other down side. Put a dot, or 1, where ever there is identity. G A T C T G A T C T ... ... .
  • 12. G A T A C T G C G A T A C T G C G C A G 1 1 1 1 1 A 1 1 1 1 1 T 1 1 1 1 1 A 1 1 1 1 C 1 1 1 1 1 T 1 1 1 1 G 1 1 1 1 C 1 1 1 1 G 1 1 1 A 1 1 1 T 1 1 A 1 C 1 1 1 T 1 G 1 1 C 1 1 G 1 C 1 A 1
  • 13. G A T A C T G C A T C G T C A C T C A G 1 1 1 A 1 1 1 1 1 T 1 1 1 1 1 A 1 1 1 1 C 1 1 1 1 1 1 T 1 1 1 1 G 1 1 C 1 1 1 1 1 A 1 1 1 T 1 1 1 C 1 1 1 1 G 1 T 1 1 C 1 1 1 A 1 1 C 1 1 T 1 C 1 A 1
  • 15.
  • 16.
  • 17.
  • 18. INTERGENIC COMPARISON • Rat and Drosophila Groucho Gene
  • 19.
  • 20. INTERGENIC COMPARISON • Nucleotide sequence contains three domains.
  • 21. INTERGENIC COMPARISON • Nucleotide sequence contains three domains. • 50 - 350 - Strong conservation • Indel places comparison out of register
  • 22. INTERGENIC COMPARISON • Nucleotide sequence contains three domains. • 50 - 350 - Strong conservation • Indel places comparison out of register • 450 - 1300 - Slightly weaker conservation
  • 23. INTERGENIC COMPARISON • Nucleotide sequence contains three domains. • 50 - 350 - Strong conservation • Indel places comparison out of register • 450 - 1300 - Slightly weaker conservation • 1300 - 2400 - Strong conservation
  • 24. ANALYSIS OF DOT PLOT MATRIX • Principal diagonal shows identical sequence. • Global and local alignment are shown. • Multiple diagonal indicate repeatation • Reverse diagonal (perpendicular to diagonal) indicate INVERSION. • Reverse diagonal crossing diagonal (X) indicate PALINDROMES. • Formation of box indicate the low complexity region
  • 26. PALINDROMIC SEQUENCE • A palindromic sequence is a nucleic acid sequence (DNA or RNA) tha is same whether read 5' to 3' on one strand or 5' to 3' on the complementary strand with which it forms a double helix.
  • 27. INVERTED REPEAT • An inverted repeat is sequence of nucleotides followed downstream by its reverse complement. • Inverted repeat: abcdeedcbafghijklmno
  • 28. LOW-COMPLEXITY REGIONS • Low-complexity regions in sequences can be found as regions around the diagonal all obtaining a high score. Low complexity regions are calculated from the redundancy of amino acids within a limited region.
  • 29. DOT PLOT SOFTWARE • we can use the EMBOSS package, which are following:  Dotmatcher  Dotpath  Polydot  Dottup (http://emboss.bioinformatics.nl/cgi- bin/emboss/dottup
  • 31. APPLICATION • Shows the all possible alignment between two nucleic acid and amino acid sequences. • Help to recognise large region of simiarity. • An excellent approach for finding sequence transposition. • To find the location of genes between two genomes. • To find the non sequential alignment.
  • 32. LIMITATION • For longer sequence, memory required for the graphical representation is very high. So long sequence can not be aligned. (only 2 sequence can align at a time) • Lots of insignifcant matches makes it noisy (so many off diagonal appear). • Time required to compare two sequences is proportional to the product of length of the sequences time of the search window. (not very quick) i.e, higher efficiency of short sequence. Low efficiency of long sequence.
  • 33. GAP PENALITY • Gap penality is a method of scoring alignment of two or more sequence. • when a gap is inserted in an sequence it matches more than the sequence without gap insertion. • Too many gap can cause an alignment to become meaningless. Types of gap penality Constant Linear affine
  • 35. TYPES OF GAP PENALITY Constant This is the simplest type of gap penality and a fixed negative score is given to every gap, regardless of its length. ATTGACCTGA EACH MATCH=1 SCORE 7-1=6 AT CCTGA WHOLE GAP=1
  • 36. TYPES OF GAP PENALITY Linear The linear gap penalty takes into account the length (L) of each insertion/deletion in the gap. ATTGACCTGA EACH MATCH =1 AT CCTGA EACH GAP = -1 The score here is (7 − 3 = 4).
  • 37. TYPES OF GAP PENALITY Affine  Most widely used gap penality and it combines both linear and constant gap penality.  Penality is based on form of A+B.L  A is known as the gap opening penalty, B the gap extension penalty and L the length of the gap.  Gap opening refers to the cost required to open a gap of any length, and gap extension the cost to extend the length of an existing gap by 1.
  • 40. REFERENCES • Bioinformatics concepts, skill & applications, second edition by S.C.Rastogi, Namita Mendriatta, Parag Rastogi • http://en.wikipedia.org/wiki/Dot_plot_%28bioinformatics%29 • http://lectures.molgen.mpg.de/Pairwise/DotPlots/ • https://ugene.unipro.ru/wiki/pages/viewpage.action?pageId=4 227426 • http://www.clcsupport.com/clcgenomicsworkbench/650/Examples _interpretations_dot_plots.html