SlideShare a Scribd company logo
MULTIPLE SEQUENCE ALINGMENT
HARSHITA AGARWAL
KUMAR SHASHANK
SHAURYA
NANCY
BACKGROUND
In bioinformatics, a sequence alignment is a way of arranging the
sequences of DNA, RNA or protein to identify regions of similarity that
may be a consequence of functional, structural or evolutionary
relationships between the sequences. One could align these sequences
either pair wise against each other or against whole database, or by
performing multiple sequence alignment where one align 3-many
sequences at a time. Multiple sequence alignment (MSA) helps us to
find conserved domain among a whole set of sequences that is difficult
to find if pair wise alignment is done. Like pair wise alignment MSA
could also be done locally or globally. Global alignment includes gaps
while local alignment could avoid these gaps. One could Take MSA an
extension of pair wise alignment, as primary step of MSA is pair wise
alignment between all the possible pairs followed by formation of a
guided tree ultimately providing the final MSA report. There are various
ways of doing MSA: Progressive, Iterative, and Block based method.
These methods are explained in following text. There are various
statistical model of MSA which provides us a table or file containing
information regarding probability of amino acid or nucleotide at each
position of MSA file. These models help us to correlate aligned
sequences and study insertions and deletions more easily and efficiently.
In our present study we have explained various ways of performing
MSA and some statistical model of MSA (PSSM, Profile, and PSI-
BLAST).
Multiple Sequence Alignment
A natural extension of pairwise alignment is multiple sequence
alignment, which is to align multiple related sequences to achieve
optimal matching of the sequences. There is a unique advantage of
multiple sequence alignment because it reveals more biological
information than many pairwise alignments can.
SCORING FUNCTION
The scoring function for multiple sequence alignment is based on the
concept of sum of pairs (SP). As the name suggests, it is the sum of the
scores of all possible pairs of sequences in a multiple alignment based
on a particular scoring matrix. In calculating the SP scores, each column
is scored by summing the scores for all possible pairwise matches,
mismatches and gap costs. The score of the entire alignment is the sum
of all of the column scores .
Multiple
sequence
alignment
heurestic
progressive itertative
block based
method
exhaustive
EXHAUSTIVE ALGORITHMS
The exhaustive alignment method involves examining all possible
aligned positions simultaneously. Similar to dynamic programming in
pairwise alignment, which involves the use of a two-dimensional matrix
to search for an optimal alignment, to use dynamic programming for
multiple sequence alignment, extra dimensions are needed to take all
possible ways of sequence matching into consideration. This means to
establish a multidimensional search matrix. For aligning N sequences, an
N-dimensional matrix is needed to be filled with alignment scores. As
the amount of computational time and memory space required increases
exponentially with the number of sequences, it makes the method
computationally prohibitive to use for a large data set. For this reason,
full dynamic programming is limited to small datasets of less than ten
short sequences.
HEURISTIC ALGORITHMS
Because the use of dynamic programming is not feasible for routine
multiple sequence alignment, faster and heuristic algorithms have been
developed. The heuristic algorithms fall into three categories:
 progressive alignment type
 iterative alignment type
 and block-based alignment type.
Progressive Alignment Method
Progressive alignment depends on the stepwise assembly of multiple
alignment and is heuristic in nature . it does not seprate the process of
scoring an alignment from the optimization algorithm. It is fast and
efficient and does not directly optimize any global scoring function of
alignment correctness.
Steps:
1. perform pairwise alignment for all sequences
2. use the alignment score, that gives a phylogenetic tree using
neighbor-joining(NJ) method.
3. The sequences are aligned using the phylogenetic relationships
indicated by the tree.
CLUSTALW produces the best match for the sequences and arranges
them so that the similarities and differences can be seen. It works on the
hypothesis that sequence in an alignment will reflect their evolutionary
history.
Iterative Alignment
The iterative approach is based on the idea that an optimal solution can
be found by repeatedly modifying existing suboptimal solutions. The
procedure starts by producing a low-quality alignment and gradually
improves it by iterative realignment through well-defined procedures
until no more improvements in the alignment scores.
Steps remain similar to progressive alignment, only pre-alignment is
done multiple times.
Software used: T-coffee
Block-Based Alignment
The progressive and iterative alignment strategies are largely global
alignment based and may therefore fail to recognize conserved domains
and motifs among highly divergent sequences of varying lengths. For
such divergent sequences that share only regional similarities, a local
alignment based approach has to be used. The strategy identifies a block
of ungapped alignment shared by all the sequences, hence, the block-
based local alignment strategy is used.
APPLICATION OF M.S.A
 phylogenetic tree construction
 illumination of functionally important regions
 prediction of structure of proteins
REFERENCES
 S.C. Rastogi and N. Mendiratla and P.Rastogi. Bioinformatics
methods and applicationsGenomics, Proteomics and Drug
Discovery. Prentice Hall India, 2004.
 Multiple sequence alignment Introduction to Computational
Biology Teresa Przytycka, PhD, https://www.ncbi.nlm.nih.gov
 https://www.ebi.ac.uk/Tools/msa/clustal
 http://www.iiserpune.ac.in/~farhat/wordpress/wpcontent/uploads/2
011/06/lecture_MSA.pdf

More Related Content

What's hot

Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
Siva Dharshini R
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
The Oxford College Engineering
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
Mariya Raju
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
nadeem akhter
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
Hafiz Muhammad Zeeshan Raza
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
Pranavathiyani G
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
AnkitTiwari354
 
Dot matrix
Dot matrixDot matrix
Dot matrix
Tania Khan
 
Site directed mutagenesis
Site  directed mutagenesisSite  directed mutagenesis
Site directed mutagenesis
Zain Khadim
 
Cath
CathCath
Cath
Ramya S
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
benazeer fathima
 
Data mining
Data miningData mining
Data mining
ShwetA Kumari
 
Needleman-Wunsch Algorithm
Needleman-Wunsch AlgorithmNeedleman-Wunsch Algorithm
Needleman-Wunsch Algorithm
ProshantaShil
 
Peptide Mass Fingerprinting
Peptide Mass FingerprintingPeptide Mass Fingerprinting
Peptide Mass Fingerprinting
Rida Khalid
 
Tree building
Tree buildingTree building
Tree building
deepalakshmi59
 
Multiple Sequence Alignment
Multiple Sequence AlignmentMultiple Sequence Alignment
Multiple Sequence Alignment
Meghaj Mallick
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
ALLIENU
 
Phylogenetic analysis
Phylogenetic analysis Phylogenetic analysis
Phylogenetic analysis
Nitin Naik
 
Scop database
Scop databaseScop database
Scop database
Sayantani Roy
 

What's hot (20)

Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
sequence of file formats in bioinformatics
sequence of file formats in bioinformaticssequence of file formats in bioinformatics
sequence of file formats in bioinformatics
 
Entrez databases
Entrez databasesEntrez databases
Entrez databases
 
Nucleic Acid Sequence databases
Nucleic Acid Sequence databasesNucleic Acid Sequence databases
Nucleic Acid Sequence databases
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
Dot matrix
Dot matrixDot matrix
Dot matrix
 
Site directed mutagenesis
Site  directed mutagenesisSite  directed mutagenesis
Site directed mutagenesis
 
Cath
CathCath
Cath
 
Sequence alignment global vs. local
Sequence alignment  global vs. localSequence alignment  global vs. local
Sequence alignment global vs. local
 
Data mining
Data miningData mining
Data mining
 
Needleman-Wunsch Algorithm
Needleman-Wunsch AlgorithmNeedleman-Wunsch Algorithm
Needleman-Wunsch Algorithm
 
Peptide Mass Fingerprinting
Peptide Mass FingerprintingPeptide Mass Fingerprinting
Peptide Mass Fingerprinting
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Tree building
Tree buildingTree building
Tree building
 
Multiple Sequence Alignment
Multiple Sequence AlignmentMultiple Sequence Alignment
Multiple Sequence Alignment
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
Phylogenetic analysis
Phylogenetic analysis Phylogenetic analysis
Phylogenetic analysis
 
Scop database
Scop databaseScop database
Scop database
 

Similar to multiple sequence alignment

Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Subhranil Bhattacharjee
 
multiple sequence and pairwise alignment.pdf
multiple sequence and pairwise alignment.pdfmultiple sequence and pairwise alignment.pdf
multiple sequence and pairwise alignment.pdf
sriaisvariyasundar
 
Parwati sihag
Parwati sihagParwati sihag
Parwati sihag
parwati sihag
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
IJCSEIT Journal
 
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptxPRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
BulBulsTutorial
 
Bioinformatics_Sequence Analysis
Bioinformatics_Sequence AnalysisBioinformatics_Sequence Analysis
Bioinformatics_Sequence Analysis
Sangeeta Das
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
journal ijrtem
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
IJRTEMJOURNAL
 
Sequence Analysis
Sequence AnalysisSequence Analysis
Sequence Analysis
Meghaj Mallick
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
Meghaj Mallick
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
Zeeshan Hanjra
 
Sequence alignment.pptx
Sequence alignment.pptxSequence alignment.pptx
Sequence alignment.pptx
PagudalaSangeetha
 
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
International Journal of Engineering Inventions www.ijeijournal.com
 
Seq alignment
Seq alignment Seq alignment
Seq alignment
Nagendrasahu6
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
PRUTHVIRAJ K
 
Sequence database
Sequence databaseSequence database
Sequence database
Dr.M.Prasad Naidu
 
Machine learning and reinforcement learning
Machine learning and reinforcement learningMachine learning and reinforcement learning
Machine learning and reinforcement learning
jenil desai
 
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdfBIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
sirwansleman
 

Similar to multiple sequence alignment (20)

Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
multiple sequence and pairwise alignment.pdf
multiple sequence and pairwise alignment.pdfmultiple sequence and pairwise alignment.pdf
multiple sequence and pairwise alignment.pdf
 
Parwati sihag
Parwati sihagParwati sihag
Parwati sihag
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
 
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptxPRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
PRESENTATION MULTIPLE SEQUENCE ALIGNMENT.pptx
 
Bioinformatics_Sequence Analysis
Bioinformatics_Sequence AnalysisBioinformatics_Sequence Analysis
Bioinformatics_Sequence Analysis
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
 
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
Performance Improvement of BLAST with Use of MSA Techniques to Search Ancesto...
 
Sequence Analysis
Sequence AnalysisSequence Analysis
Sequence Analysis
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Sequence alignment.pptx
Sequence alignment.pptxSequence alignment.pptx
Sequence alignment.pptx
 
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Seq alignment
Seq alignment Seq alignment
Seq alignment
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Sequence database
Sequence databaseSequence database
Sequence database
 
Machine learning and reinforcement learning
Machine learning and reinforcement learningMachine learning and reinforcement learning
Machine learning and reinforcement learning
 
20120140505011
2012014050501120120140505011
20120140505011
 
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdfBIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
BIOINFORMATICS_AND_PHYLOGENY.pdf.pdf
 

Recently uploaded

The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
thanhdowork
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
EduSkills OECD
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
timhan337
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
Kartik Tiwari
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
Vivekanand Anglo Vedic Academy
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
DeeptiGupta154
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
Mohammed Sikander
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
Peter Windle
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
camakaiclarkmusic
 
Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
deeptiverma2406
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
Celine George
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
Pavel ( NSTU)
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
chanes7
 

Recently uploaded (20)

The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
A Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptxA Survey of Techniques for Maximizing LLM Performance.pptx
A Survey of Techniques for Maximizing LLM Performance.pptx
 
Francesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptxFrancesca Gottschalk - How can education support child empowerment.pptx
Francesca Gottschalk - How can education support child empowerment.pptx
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Honest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptxHonest Reviews of Tim Han LMA Course Program.pptx
Honest Reviews of Tim Han LMA Course Program.pptx
 
Chapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdfChapter -12, Antibiotics (One Page Notes).pdf
Chapter -12, Antibiotics (One Page Notes).pdf
 
The French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free downloadThe French Revolution Class 9 Study Material pdf free download
The French Revolution Class 9 Study Material pdf free download
 
Overview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with MechanismOverview on Edible Vaccine: Pros & Cons with Mechanism
Overview on Edible Vaccine: Pros & Cons with Mechanism
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
Multithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race conditionMultithreading_in_C++ - std::thread, race condition
Multithreading_in_C++ - std::thread, race condition
 
A Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in EducationA Strategic Approach: GenAI in Education
A Strategic Approach: GenAI in Education
 
CACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdfCACJapan - GROUP Presentation 1- Wk 4.pdf
CACJapan - GROUP Presentation 1- Wk 4.pdf
 
Best Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDABest Digital Marketing Institute In NOIDA
Best Digital Marketing Institute In NOIDA
 
Model Attribute Check Company Auto Property
Model Attribute  Check Company Auto PropertyModel Attribute  Check Company Auto Property
Model Attribute Check Company Auto Property
 
Synthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptxSynthetic Fiber Construction in lab .pptx
Synthetic Fiber Construction in lab .pptx
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
Digital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion DesignsDigital Artifact 2 - Investigating Pavilion Designs
Digital Artifact 2 - Investigating Pavilion Designs
 

multiple sequence alignment

  • 1. MULTIPLE SEQUENCE ALINGMENT HARSHITA AGARWAL KUMAR SHASHANK SHAURYA NANCY
  • 2. BACKGROUND In bioinformatics, a sequence alignment is a way of arranging the sequences of DNA, RNA or protein to identify regions of similarity that may be a consequence of functional, structural or evolutionary relationships between the sequences. One could align these sequences either pair wise against each other or against whole database, or by performing multiple sequence alignment where one align 3-many sequences at a time. Multiple sequence alignment (MSA) helps us to find conserved domain among a whole set of sequences that is difficult to find if pair wise alignment is done. Like pair wise alignment MSA could also be done locally or globally. Global alignment includes gaps while local alignment could avoid these gaps. One could Take MSA an extension of pair wise alignment, as primary step of MSA is pair wise alignment between all the possible pairs followed by formation of a guided tree ultimately providing the final MSA report. There are various ways of doing MSA: Progressive, Iterative, and Block based method. These methods are explained in following text. There are various statistical model of MSA which provides us a table or file containing information regarding probability of amino acid or nucleotide at each position of MSA file. These models help us to correlate aligned sequences and study insertions and deletions more easily and efficiently. In our present study we have explained various ways of performing MSA and some statistical model of MSA (PSSM, Profile, and PSI- BLAST).
  • 3. Multiple Sequence Alignment A natural extension of pairwise alignment is multiple sequence alignment, which is to align multiple related sequences to achieve optimal matching of the sequences. There is a unique advantage of multiple sequence alignment because it reveals more biological information than many pairwise alignments can. SCORING FUNCTION The scoring function for multiple sequence alignment is based on the concept of sum of pairs (SP). As the name suggests, it is the sum of the scores of all possible pairs of sequences in a multiple alignment based on a particular scoring matrix. In calculating the SP scores, each column is scored by summing the scores for all possible pairwise matches, mismatches and gap costs. The score of the entire alignment is the sum of all of the column scores . Multiple sequence alignment heurestic progressive itertative block based method exhaustive
  • 4. EXHAUSTIVE ALGORITHMS The exhaustive alignment method involves examining all possible aligned positions simultaneously. Similar to dynamic programming in pairwise alignment, which involves the use of a two-dimensional matrix to search for an optimal alignment, to use dynamic programming for multiple sequence alignment, extra dimensions are needed to take all possible ways of sequence matching into consideration. This means to establish a multidimensional search matrix. For aligning N sequences, an N-dimensional matrix is needed to be filled with alignment scores. As the amount of computational time and memory space required increases exponentially with the number of sequences, it makes the method computationally prohibitive to use for a large data set. For this reason, full dynamic programming is limited to small datasets of less than ten short sequences. HEURISTIC ALGORITHMS Because the use of dynamic programming is not feasible for routine multiple sequence alignment, faster and heuristic algorithms have been developed. The heuristic algorithms fall into three categories:  progressive alignment type  iterative alignment type  and block-based alignment type. Progressive Alignment Method Progressive alignment depends on the stepwise assembly of multiple alignment and is heuristic in nature . it does not seprate the process of scoring an alignment from the optimization algorithm. It is fast and
  • 5. efficient and does not directly optimize any global scoring function of alignment correctness. Steps: 1. perform pairwise alignment for all sequences 2. use the alignment score, that gives a phylogenetic tree using neighbor-joining(NJ) method. 3. The sequences are aligned using the phylogenetic relationships indicated by the tree. CLUSTALW produces the best match for the sequences and arranges them so that the similarities and differences can be seen. It works on the hypothesis that sequence in an alignment will reflect their evolutionary history.
  • 6. Iterative Alignment The iterative approach is based on the idea that an optimal solution can be found by repeatedly modifying existing suboptimal solutions. The procedure starts by producing a low-quality alignment and gradually improves it by iterative realignment through well-defined procedures until no more improvements in the alignment scores. Steps remain similar to progressive alignment, only pre-alignment is done multiple times. Software used: T-coffee Block-Based Alignment The progressive and iterative alignment strategies are largely global alignment based and may therefore fail to recognize conserved domains and motifs among highly divergent sequences of varying lengths. For such divergent sequences that share only regional similarities, a local alignment based approach has to be used. The strategy identifies a block of ungapped alignment shared by all the sequences, hence, the block- based local alignment strategy is used.
  • 7. APPLICATION OF M.S.A  phylogenetic tree construction  illumination of functionally important regions  prediction of structure of proteins
  • 8. REFERENCES  S.C. Rastogi and N. Mendiratla and P.Rastogi. Bioinformatics methods and applicationsGenomics, Proteomics and Drug Discovery. Prentice Hall India, 2004.  Multiple sequence alignment Introduction to Computational Biology Teresa Przytycka, PhD, https://www.ncbi.nlm.nih.gov  https://www.ebi.ac.uk/Tools/msa/clustal  http://www.iiserpune.ac.in/~farhat/wordpress/wpcontent/uploads/2 011/06/lecture_MSA.pdf