SlideShare a Scribd company logo
1 of 19
Sequence Alignment
Lecture – 4
Nafis Neehal, Lecturer, Department of CSE, DIU
CONTENTS
1. Sequence Alignment
2. Sequence Alignment Methods
- Pairwise Alignment
- Multiple Sequence Alignment
3. Pairwise Sequence Alignment Methods
-Global Alignment (Needleman-Wunsch)
- Local Alignment (Smith-Waterman)
4. Multiple Sequence Alignment
- Progressive Method
- Iterative Method
- MSA Challenges
1. Sequence Alignment
Why and how align sequences
Sequence
Alignment
A way of arranging the sequences of DNA, RNA, or
protein to identify regions of similarity that may
be a consequence of functional, structural, or
evolutionary relationships between the
sequences
CTGTCG-CTGCACG
-TGC-CG-TG----
2. Sequence Alignment Methods
Pairwise and Multiple
Pairwise Sequence Alignment
▹A pair of sequences as input
▹Align them in such a way that, for that particular
alignment the assumed region of similarity produces
higher score than all the other alignments
▹Methods
- Global Alignment (Needleman-Wunsch)
- Local Alignment (Smith-Waterman)
CTGTCGCTGCACG--
-------TGC-CGTG
Multiple Sequence Alignment
• Three or more than three sequences as
input
• Align all the sequences altogether in such a
manner that the alignment produces
highest score
3. Pairwise Sequence Alignment
Global and Local methods
Global Alignment (Needleman-Wunsch)
3 Major Steps
-Create 2D Matrix
-Trace back
-Final Alignment
Create 2D Matrix
- Row x Col 2D matrix draw (Row , Col
size of seq1 and seq2 respectively)
- Place 2 seqs as Row and Column
Header
- Cell (0,0) = 0
- Cell (0,1) to Cell (0,Column) and Cell
(1,0) to Cell (Row,0) value = delete
gap value from previous cell value
- For other cell values, follow
equation in (1)
Trace back
- Start from Cell (Row, Col)
- Go back up to Cell (0,0)
Final Alignment
- Start from Cell (Row, Col)
- If then, place
character in both seq
- If or then
character in start seq &
gap in end seq
Global Alignment (Needleman-Wunsch) - Example
Input
- seq1 = AAAC
- seq2 = AGC
Scoring Scheme
δ(x, x) = 1 (Match)
δ(x,-) = -2 (Gap)
δ(x, y) = -1 (Mis match)
Eq. 1: Cell Value
A G C
A
A
A
C
0 -2 -4 -6
-2 1 -1 -3
-4
-6
-8
-1
-3
-5
0
-2
-4
-2
-1
-1
Final
Alignment
-AGC
AAAC
Local Alignment (Smith-Waterman)
3 Major Steps
-Create 2D Matrix
-Trace back
-Final Alignment
Create 2D Matrix
- Row x Col 2D matrix draw (Row , Col
size of seq1 and seq2 respectively)
- Place 2 seqs as Row and Column
Header
- First Row, First Column all value = 0
- For other cell values, follow
equation in (2)
Trace back
- Start from each Cell which has the maximum
value in the entire matrix
- Go back up to the Cell where first time 0
occurs
Final Alignment
- Start from each Cell with max value
- If then, place character in both seq
- If or then character in start
seq & gap in end seq
Local Alignment (Smith-Waterman) - Example
Input
- seq1 = AAAC
- seq2 = AAG
Scoring Scheme
δ(x, x) = 1 (Match)
δ(x,-) = -2 (Gap)
δ(x, y) = -1 (Mis match)
Eq. 2: Cell Value
A A G
A
A
A
C
0 0 0 0
0 1 1 0
0
0
0
1
1
0
2
2
0
0
1
1
Final
Alignment
-AAG
AAAC
4. Multiple Sequence Alignment
Progressive, Iterative
Progressive Method
▹ Two major steps – Guide Tree build up and
Multiple Pairwise Alignment
▹ Steps
- Take each pair, align
- Generate consensus of that
alignment
- Align new sequence with the
consensus of the previous one
- Go back, Until all sequences are
finished
▹ Example
- Clustal ω
- MAFFT
- KALIGN
- T-COFFEE
Iterative Method
▹ Works similarly to progressive
methods
▹ Repeatedly realign the initial
sequences as well as add new
sequences to the growing MSA
▹ Example
- DIALIGN
- MUSCLE
- POA
MSA Challanges
▹ Computationally Expensive
▹ Difficult to score. Multiple comparison necessary in each
column of the MSA for a cumulative score
▹ Placement of gaps and scoring of substitution is more difficult
▹ Difficulty increases with diversity
▹ Relatively easy for a set of closely related sequences.
Identifying the correct ancestry relationships for a set of
distantly related sequences is more challenging
▹ Even difficult if some members are more alike compared to
others
95%
Of Human DNA is identical to Chimpanzees
510 DNA Codes
Lost throughout human evolution
2 gm DNA
Can contain digital information of whole world
1.8 Meter
Long DNA is squeezed into a space of 0.09 µm
Shocked?
Youtube Links
▹Global Alignment Part 1 - https://www.youtube.com/watch?v=vqxc2EfPWdk
▹Global Alignment Part 2 - https://www.youtube.com/watch?v=zwA-6_1bLgE
▹Local Alignment - https://www.youtube.com/watch?v=IatoWOsJ35Q

More Related Content

What's hot

Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. localbenazeer fathima
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysismikaelhuss
 
PCR Methods and applications
PCR Methods and applicationsPCR Methods and applications
PCR Methods and applicationsBehzad Milani
 
Protein sequencing
Protein sequencingProtein sequencing
Protein sequencingHafsaJamil1
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahuKAUSHAL SAHU
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPuneet Kulyana
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencingSwathi Prabakar
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformaticsatmapandey
 
DNA & RNA isolation
DNA & RNA isolationDNA & RNA isolation
DNA & RNA isolationgptgeetika
 
Gene Expression Analysis by Real Time PCR
Gene Expression Analysis by Real Time PCRGene Expression Analysis by Real Time PCR
Gene Expression Analysis by Real Time PCRSuresh Antre
 
Nucleic acid hybridization by kk
Nucleic acid hybridization by kkNucleic acid hybridization by kk
Nucleic acid hybridization by kkKAUSHAL SAHU
 
VNTR and RFLP
VNTR and RFLPVNTR and RFLP
VNTR and RFLPUlaa Iman
 

What's hot (20)

Jan2016 pac bio giab
Jan2016 pac bio giabJan2016 pac bio giab
Jan2016 pac bio giab
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysis
 
Maximum parsimony
Maximum parsimonyMaximum parsimony
Maximum parsimony
 
Dot matrix
Dot matrixDot matrix
Dot matrix
 
PCR Methods and applications
PCR Methods and applicationsPCR Methods and applications
PCR Methods and applications
 
Protein sequencing
Protein sequencingProtein sequencing
Protein sequencing
 
Gene bank by kk sahu
Gene bank by kk sahuGene bank by kk sahu
Gene bank by kk sahu
 
Primary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyanaPrimary and secondary databases ppt by puneet kulyana
Primary and secondary databases ppt by puneet kulyana
 
Dna sequencing and its types
Dna sequencing and its typesDna sequencing and its types
Dna sequencing and its types
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Next generation sequencing
Next generation sequencingNext generation sequencing
Next generation sequencing
 
Blast bioinformatics
Blast bioinformaticsBlast bioinformatics
Blast bioinformatics
 
DNA & RNA isolation
DNA & RNA isolationDNA & RNA isolation
DNA & RNA isolation
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
UPGMA
UPGMAUPGMA
UPGMA
 
Ngs ppt
Ngs pptNgs ppt
Ngs ppt
 
Gene Expression Analysis by Real Time PCR
Gene Expression Analysis by Real Time PCRGene Expression Analysis by Real Time PCR
Gene Expression Analysis by Real Time PCR
 
Nucleic acid hybridization by kk
Nucleic acid hybridization by kkNucleic acid hybridization by kk
Nucleic acid hybridization by kk
 
VNTR and RFLP
VNTR and RFLPVNTR and RFLP
VNTR and RFLP
 

Similar to SeqAlignLec4

Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignmentSanaym
 
20100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture0720100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture07Computer Science Club
 
The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment Parinda Rajapaksha
 
Needleman-wunch algorithm harshita
Needleman-wunch algorithm  harshitaNeedleman-wunch algorithm  harshita
Needleman-wunch algorithm harshitaHarshita Bhawsar
 
sequence alignment
sequence alignmentsequence alignment
sequence alignmentammar kareem
 
T coffee algorithm dissection
T coffee algorithm dissectionT coffee algorithm dissection
T coffee algorithm dissectionGui Chen
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence AlignmentRavi Gandham
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignmentKubuldinho
 
Statistical Clustering
Statistical ClusteringStatistical Clustering
Statistical Clusteringtim_hare
 
Swaati algorithm of alignment ppt
Swaati algorithm of alignment pptSwaati algorithm of alignment ppt
Swaati algorithm of alignment pptSwati Kumari
 
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptxPRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptxBulBulsTutorial
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadfalizain9604
 
PCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdfPCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdfssusera1eccd
 
Msa & rooted/unrooted tree
Msa & rooted/unrooted treeMsa & rooted/unrooted tree
Msa & rooted/unrooted treeSamiul Ehsan
 

Similar to SeqAlignLec4 (20)

Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Ch06 multalign
Ch06 multalignCh06 multalign
Ch06 multalign
 
Bioinformatics lesson
Bioinformatics lessonBioinformatics lesson
Bioinformatics lesson
 
Bioinformatics lesson
Bioinformatics lessonBioinformatics lesson
Bioinformatics lesson
 
20100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture0720100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture07
 
The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment
 
Needleman-wunch algorithm harshita
Needleman-wunch algorithm  harshitaNeedleman-wunch algorithm  harshita
Needleman-wunch algorithm harshita
 
Seq alignment
Seq alignment Seq alignment
Seq alignment
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
T coffee algorithm dissection
T coffee algorithm dissectionT coffee algorithm dissection
T coffee algorithm dissection
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
 
Backpropagation - Elisa Sayrol - UPC Barcelona 2018
Backpropagation - Elisa Sayrol - UPC Barcelona 2018Backpropagation - Elisa Sayrol - UPC Barcelona 2018
Backpropagation - Elisa Sayrol - UPC Barcelona 2018
 
Statistical Clustering
Statistical ClusteringStatistical Clustering
Statistical Clustering
 
Swaati algorithm of alignment ppt
Swaati algorithm of alignment pptSwaati algorithm of alignment ppt
Swaati algorithm of alignment ppt
 
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptxPRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadf
 
PCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdfPCB_Lect02_Pairwise_allign (1).pdf
PCB_Lect02_Pairwise_allign (1).pdf
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Msa & rooted/unrooted tree
Msa & rooted/unrooted treeMsa & rooted/unrooted tree
Msa & rooted/unrooted tree
 

More from Owali Shawon (10)

Lecture 7
Lecture 7Lecture 7
Lecture 7
 
Lecture 8
Lecture 8Lecture 8
Lecture 8
 
Lecture 6
Lecture 6Lecture 6
Lecture 6
 
Lecture 2
Lecture 2Lecture 2
Lecture 2
 
Lecture 5
Lecture 5Lecture 5
Lecture 5
 
Lecture 3
Lecture 3Lecture 3
Lecture 3
 
Lecture 1
Lecture 1Lecture 1
Lecture 1
 
ABOUT ME!
ABOUT ME! ABOUT ME!
ABOUT ME!
 
Electrical Circuit
Electrical CircuitElectrical Circuit
Electrical Circuit
 
Short Review of Stack
Short Review of StackShort Review of Stack
Short Review of Stack
 

Recently uploaded

Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsAArockiyaNisha
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...ssifa0344
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...anilsa9823
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...RohitNehra6
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )aarthirajkumar25
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |aasikanpl
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 sciencefloriejanemacaya1
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡anilsa9823
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhousejana861314
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptxanandsmhk
 

Recently uploaded (20)

Natural Polymer Based Nanomaterials
Natural Polymer Based NanomaterialsNatural Polymer Based Nanomaterials
Natural Polymer Based Nanomaterials
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
TEST BANK For Radiologic Science for Technologists, 12th Edition by Stewart C...
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )Recombination DNA Technology (Nucleic Acid Hybridization )
Recombination DNA Technology (Nucleic Acid Hybridization )
 
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Mukherjee Nagar(Delhi) |
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Boyles law module in the grade 10 science
Boyles law module in the grade 10 scienceBoyles law module in the grade 10 science
Boyles law module in the grade 10 science
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Orientation, design and principles of polyhouse
Orientation, design and principles of polyhouseOrientation, design and principles of polyhouse
Orientation, design and principles of polyhouse
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptxUnlocking  the Potential: Deep dive into ocean of Ceramic Magnets.pptx
Unlocking the Potential: Deep dive into ocean of Ceramic Magnets.pptx
 

SeqAlignLec4

  • 1. Sequence Alignment Lecture – 4 Nafis Neehal, Lecturer, Department of CSE, DIU
  • 2. CONTENTS 1. Sequence Alignment 2. Sequence Alignment Methods - Pairwise Alignment - Multiple Sequence Alignment 3. Pairwise Sequence Alignment Methods -Global Alignment (Needleman-Wunsch) - Local Alignment (Smith-Waterman) 4. Multiple Sequence Alignment - Progressive Method - Iterative Method - MSA Challenges
  • 3. 1. Sequence Alignment Why and how align sequences
  • 4. Sequence Alignment A way of arranging the sequences of DNA, RNA, or protein to identify regions of similarity that may be a consequence of functional, structural, or evolutionary relationships between the sequences CTGTCG-CTGCACG -TGC-CG-TG----
  • 5. 2. Sequence Alignment Methods Pairwise and Multiple
  • 6. Pairwise Sequence Alignment ▹A pair of sequences as input ▹Align them in such a way that, for that particular alignment the assumed region of similarity produces higher score than all the other alignments ▹Methods - Global Alignment (Needleman-Wunsch) - Local Alignment (Smith-Waterman) CTGTCGCTGCACG-- -------TGC-CGTG
  • 7. Multiple Sequence Alignment • Three or more than three sequences as input • Align all the sequences altogether in such a manner that the alignment produces highest score
  • 8. 3. Pairwise Sequence Alignment Global and Local methods
  • 9. Global Alignment (Needleman-Wunsch) 3 Major Steps -Create 2D Matrix -Trace back -Final Alignment Create 2D Matrix - Row x Col 2D matrix draw (Row , Col size of seq1 and seq2 respectively) - Place 2 seqs as Row and Column Header - Cell (0,0) = 0 - Cell (0,1) to Cell (0,Column) and Cell (1,0) to Cell (Row,0) value = delete gap value from previous cell value - For other cell values, follow equation in (1) Trace back - Start from Cell (Row, Col) - Go back up to Cell (0,0) Final Alignment - Start from Cell (Row, Col) - If then, place character in both seq - If or then character in start seq & gap in end seq
  • 10. Global Alignment (Needleman-Wunsch) - Example Input - seq1 = AAAC - seq2 = AGC Scoring Scheme δ(x, x) = 1 (Match) δ(x,-) = -2 (Gap) δ(x, y) = -1 (Mis match) Eq. 1: Cell Value A G C A A A C 0 -2 -4 -6 -2 1 -1 -3 -4 -6 -8 -1 -3 -5 0 -2 -4 -2 -1 -1 Final Alignment -AGC AAAC
  • 11. Local Alignment (Smith-Waterman) 3 Major Steps -Create 2D Matrix -Trace back -Final Alignment Create 2D Matrix - Row x Col 2D matrix draw (Row , Col size of seq1 and seq2 respectively) - Place 2 seqs as Row and Column Header - First Row, First Column all value = 0 - For other cell values, follow equation in (2) Trace back - Start from each Cell which has the maximum value in the entire matrix - Go back up to the Cell where first time 0 occurs Final Alignment - Start from each Cell with max value - If then, place character in both seq - If or then character in start seq & gap in end seq
  • 12. Local Alignment (Smith-Waterman) - Example Input - seq1 = AAAC - seq2 = AAG Scoring Scheme δ(x, x) = 1 (Match) δ(x,-) = -2 (Gap) δ(x, y) = -1 (Mis match) Eq. 2: Cell Value A A G A A A C 0 0 0 0 0 1 1 0 0 0 0 1 1 0 2 2 0 0 1 1 Final Alignment -AAG AAAC
  • 13. 4. Multiple Sequence Alignment Progressive, Iterative
  • 14. Progressive Method ▹ Two major steps – Guide Tree build up and Multiple Pairwise Alignment ▹ Steps - Take each pair, align - Generate consensus of that alignment - Align new sequence with the consensus of the previous one - Go back, Until all sequences are finished ▹ Example - Clustal ω - MAFFT - KALIGN - T-COFFEE
  • 15. Iterative Method ▹ Works similarly to progressive methods ▹ Repeatedly realign the initial sequences as well as add new sequences to the growing MSA ▹ Example - DIALIGN - MUSCLE - POA
  • 16. MSA Challanges ▹ Computationally Expensive ▹ Difficult to score. Multiple comparison necessary in each column of the MSA for a cumulative score ▹ Placement of gaps and scoring of substitution is more difficult ▹ Difficulty increases with diversity ▹ Relatively easy for a set of closely related sequences. Identifying the correct ancestry relationships for a set of distantly related sequences is more challenging ▹ Even difficult if some members are more alike compared to others
  • 17. 95% Of Human DNA is identical to Chimpanzees 510 DNA Codes Lost throughout human evolution 2 gm DNA Can contain digital information of whole world 1.8 Meter Long DNA is squeezed into a space of 0.09 µm
  • 19. Youtube Links ▹Global Alignment Part 1 - https://www.youtube.com/watch?v=vqxc2EfPWdk ▹Global Alignment Part 2 - https://www.youtube.com/watch?v=zwA-6_1bLgE ▹Local Alignment - https://www.youtube.com/watch?v=IatoWOsJ35Q