SlideShare a Scribd company logo
1 of 39
Definition:
•Procedure for comparing two or more sequences by
searching for a series of individual characters or
character patterns that are in the same order in the
sequences.
– Pair-wise alignment: compare two sequences.
– Multiple sequence alignment: compare more than
two sequences.
Sequence Alignment
Need
• To find whether two (or more) genes or
proteins are evolutionarily related to each
Other.
• To find structurally or functionally similar
regions within proteins.
• Align abcdef with abdgf
• Write second sequence below the first
abcdef
abdgf
• Move sequences to give maximum match
between them.
• Show characters that match using vertical bar.
Example
Example sequence alignment
abcdef
||
abdgf
abcdef
|| || |
ab-dgf
Matching Similarity vs. Identity
• Alignments can be based on finding only identical
characters, or (more commonly) can be based on
finding similar characters.
• Conserved substitutions, semi-conserved
substitution, and non conserved substitution are
the terms used to define similarity.
Matching Similarity vs. Identity
Different colours denote different chemical groups of amino acids, i.e. hydrophobic, acidic, etc.
Inferences
• In sequence alignment, the degree of similarity
between particular regions in the sequences can
be interpreted as a rough measure of how
conserved a particular region or sequence motif is
among lineages
• In conserved regions the order of letters does not
change during evolution or changes only slightly,
i.e. has mostly conservative substitutions.
• Sequence motif = sequence pattern that is
widespread and has a biological significance
resulting in its conservation through evolution.
Global vs. Local Alignment
– Global alignment algorithms which optimize
overall alignment between two sequences.
– Local alignment algorithms which seek only
relatively conserved pieces of sequence.
Alignment stops at the ends of regions of
strong similarity.
– Favors finding conserved patterns in different pairs
of sequences.
• Global
LGPSSKQTGKGS-SRIWDN
LN-TKSAGKGAIMRLGDA
• Local
--------GKG--------
|||
--------GKG--------
Methods for Pair-wise Alignment
• Dot matrix analysis
• Dynamic Programming (Needleman- Wunsch,
Smith-Waterman algorithms)
• Word or k-tuple methods (BLAST and FASTA)
DOT MATRIX
Interpretation of Dot Matrices
• Regions of similarity appear as diagonal runs
of dots.
• Interruption in middle of diagonal line
indicates insertions or deletions.
• Parallel diagonal line within the matrix
represents repetitive regions of the sequence.
Uses
• Can use dot matrices to align two proteins or two
nucleic acid sequences.
• Can use to find amino acid repeats within a
protein by comparing a protein sequence to itself.
• Used in identifying Nucleic Acids secondary
structure detecting self complementarily of the
sequence.
• Used in comparative genomics by predicting gene
order conservation between closely related
genomes.
Limitations
• A problem with dot matrices for long
sequences is that they can be very noisy due
to lots of insignificant matches.
• Only a pairwise alignment method not
suitable for multiple alignment of sequences.
• It lacks statistically rigor in assessing the
quality of alignment.
Solution
• By using a window (W)/ tuple.
– compare character by character within a
window (have to choose window size).
– require certain fraction of matches within
window in order to display it with a “dot”.
W=23
set of
stacked
diagonals
in upper
Left
• Initialisation
• Matrix fill (scoring)
• Traceback (alignment)
Dynamic Programming Approach
Steps
M= (length of sequence i)
N= (Length of sequence ii)
Initialization Step
Scoring
• For each position, Mi,j is defined to be the
maximum score at position i,j; i.e.
Mi,j = MAXIMUM [ Mi-1, j-1 + Si,j (match/mismatch in the diagonal)
Mi,j-1 + w (gap in sequence #1),
Mi-1,j + w (gap in sequence #2)]
• In the following case, Mi-1,j-1 will be red, Mi,j-1 will
be green and Mi-1,j will be blue.
• A simple scoring scheme is assumed where
– Si,j = 1 if the residue at position i of sequence #1 is
the same as the residue at position j of sequence
#2 (match score); otherwise
– Si,j = 0 (mismatch score)
– w = 0 (gap penalty)
Matrix Fill Step
Traceback Step
Final Alignment
G A A T T C A G T T A
| | | | | |
G G A _ T C _ G _ _ A
Summary
• The NW alignment is over the entire length of two Sequences (the
traceback starts from the lower right corner of the traceback
matrix, and completes in the upper left cell of this matrix).
• The Needleman-Wunsch algorithm works in the same way
regardless of the length or complexity of sequences and guarantees
to find the best alignment.
• The Needleman-Wunsch algorithm is appropriate for finding the
best alignment of two sequences which are
(i) of the similar length.
(ii) similar across their entire lengths.
seq alignment.ppt
seq alignment.ppt
seq alignment.ppt
seq alignment.ppt
seq alignment.ppt

More Related Content

Similar to seq alignment.ppt

Similar to seq alignment.ppt (20)

The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdf
 
Ch06 multalign
Ch06 multalignCh06 multalign
Ch06 multalign
 
Sequence Alignment.pptx
Sequence Alignment.pptxSequence Alignment.pptx
Sequence Alignment.pptx
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Ch06 alignment
Ch06 alignmentCh06 alignment
Ch06 alignment
 
Biological sequences analysis
Biological sequences analysisBiological sequences analysis
Biological sequences analysis
 
02-alignment.pdf
02-alignment.pdf02-alignment.pdf
02-alignment.pdf
 
Global and Local Sequence Alignment
Global and Local Sequence AlignmentGlobal and Local Sequence Alignment
Global and Local Sequence Alignment
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptx
 
Swaati algorithm of alignment ppt
Swaati algorithm of alignment pptSwaati algorithm of alignment ppt
Swaati algorithm of alignment ppt
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadf
 
Lecture 4
Lecture 4Lecture 4
Lecture 4
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
GRAPH - DISCRETE STRUCTURE AND ALGORITHM
GRAPH - DISCRETE STRUCTURE AND ALGORITHMGRAPH - DISCRETE STRUCTURE AND ALGORITHM
GRAPH - DISCRETE STRUCTURE AND ALGORITHM
 
Datamining
DataminingDatamining
Datamining
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Dot plots-1.ppt
Dot plots-1.pptDot plots-1.ppt
Dot plots-1.ppt
 

Recently uploaded

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Callshivangimorya083
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxJohnnyPlasten
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFxolyaivanovalion
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxStephen266013
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxolyaivanovalion
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 

Recently uploaded (20)

꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
꧁❤ Greater Noida Call Girls Delhi ❤꧂ 9711199171 ☎️ Hard And Sexy Vip Call
 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
 
Log Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptxLog Analysis using OSSEC sasoasasasas.pptx
Log Analysis using OSSEC sasoasasasas.pptx
 
Halmar dropshipping via API with DroFx
Halmar  dropshipping  via API with DroFxHalmar  dropshipping  via API with DroFx
Halmar dropshipping via API with DroFx
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
B2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docxB2 Creative Industry Response Evaluation.docx
B2 Creative Industry Response Evaluation.docx
 
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiVIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati
 
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxBPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx
 
Mature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptxMature dropshipping via API with DroFx.pptx
Mature dropshipping via API with DroFx.pptx
 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 

seq alignment.ppt

  • 1. Definition: •Procedure for comparing two or more sequences by searching for a series of individual characters or character patterns that are in the same order in the sequences. – Pair-wise alignment: compare two sequences. – Multiple sequence alignment: compare more than two sequences. Sequence Alignment
  • 2. Need • To find whether two (or more) genes or proteins are evolutionarily related to each Other. • To find structurally or functionally similar regions within proteins.
  • 3. • Align abcdef with abdgf • Write second sequence below the first abcdef abdgf • Move sequences to give maximum match between them. • Show characters that match using vertical bar. Example
  • 5. Matching Similarity vs. Identity • Alignments can be based on finding only identical characters, or (more commonly) can be based on finding similar characters. • Conserved substitutions, semi-conserved substitution, and non conserved substitution are the terms used to define similarity.
  • 6. Matching Similarity vs. Identity Different colours denote different chemical groups of amino acids, i.e. hydrophobic, acidic, etc.
  • 7. Inferences • In sequence alignment, the degree of similarity between particular regions in the sequences can be interpreted as a rough measure of how conserved a particular region or sequence motif is among lineages • In conserved regions the order of letters does not change during evolution or changes only slightly, i.e. has mostly conservative substitutions. • Sequence motif = sequence pattern that is widespread and has a biological significance resulting in its conservation through evolution.
  • 8. Global vs. Local Alignment – Global alignment algorithms which optimize overall alignment between two sequences. – Local alignment algorithms which seek only relatively conserved pieces of sequence. Alignment stops at the ends of regions of strong similarity. – Favors finding conserved patterns in different pairs of sequences.
  • 10. Methods for Pair-wise Alignment • Dot matrix analysis • Dynamic Programming (Needleman- Wunsch, Smith-Waterman algorithms) • Word or k-tuple methods (BLAST and FASTA)
  • 12. Interpretation of Dot Matrices • Regions of similarity appear as diagonal runs of dots. • Interruption in middle of diagonal line indicates insertions or deletions. • Parallel diagonal line within the matrix represents repetitive regions of the sequence.
  • 13. Uses • Can use dot matrices to align two proteins or two nucleic acid sequences. • Can use to find amino acid repeats within a protein by comparing a protein sequence to itself. • Used in identifying Nucleic Acids secondary structure detecting self complementarily of the sequence. • Used in comparative genomics by predicting gene order conservation between closely related genomes.
  • 14. Limitations • A problem with dot matrices for long sequences is that they can be very noisy due to lots of insignificant matches. • Only a pairwise alignment method not suitable for multiple alignment of sequences. • It lacks statistically rigor in assessing the quality of alignment.
  • 15. Solution • By using a window (W)/ tuple. – compare character by character within a window (have to choose window size). – require certain fraction of matches within window in order to display it with a “dot”.
  • 17. • Initialisation • Matrix fill (scoring) • Traceback (alignment) Dynamic Programming Approach Steps M= (length of sequence i) N= (Length of sequence ii)
  • 19. Scoring • For each position, Mi,j is defined to be the maximum score at position i,j; i.e. Mi,j = MAXIMUM [ Mi-1, j-1 + Si,j (match/mismatch in the diagonal) Mi,j-1 + w (gap in sequence #1), Mi-1,j + w (gap in sequence #2)] • In the following case, Mi-1,j-1 will be red, Mi,j-1 will be green and Mi-1,j will be blue.
  • 20. • A simple scoring scheme is assumed where – Si,j = 1 if the residue at position i of sequence #1 is the same as the residue at position j of sequence #2 (match score); otherwise – Si,j = 0 (mismatch score) – w = 0 (gap penalty)
  • 22.
  • 23.
  • 24.
  • 25.
  • 27.
  • 28.
  • 29.
  • 30. Final Alignment G A A T T C A G T T A | | | | | | G G A _ T C _ G _ _ A
  • 31.
  • 32.
  • 33.
  • 34. Summary • The NW alignment is over the entire length of two Sequences (the traceback starts from the lower right corner of the traceback matrix, and completes in the upper left cell of this matrix). • The Needleman-Wunsch algorithm works in the same way regardless of the length or complexity of sequences and guarantees to find the best alignment. • The Needleman-Wunsch algorithm is appropriate for finding the best alignment of two sequences which are (i) of the similar length. (ii) similar across their entire lengths.

Editor's Notes

  1. Conserved amino acid substitution are the replacement of an amino acid residue with another one with similar properties such as aspartate and glutamate. They are both negatively charged. Semi-conserved amino acid substitution replaces one residue with another one that has similar steric conformation but does not share chemical properties like substitution of cysteine for alanine or leucine.