SlideShare a Scribd company logo
MSAT
MULTIPLE SEQUENCE ALIGNMENT TOOL
BY
GROUP 2
2/22/2018
1
OVERVIEW
 Sequence alignment
 Types of sequence alignments
 Multiple sequence alignment
 Purpose of MSA
 Types of MSA
 Progressive alignment
 Pros & Cons
2/22/2018
2
SEQUENCE ALIGNMENT
 In bioinformatics, a sequence alignments a way of arranging the sequences of DNA, RNA
, or protein to identify regions of similarity that may be a consequence of functional, stru
ctural, or evolutionary relationships between the sequences.
2/22/2018
3
SEQUENCE ALIGNMENT
 SEQUENCE ALIGNMENT Sequences often contain highly conserved regions
 These regions can be used for initial alignments
2/22/2018
4
TYPES OF SEQUENCE ALIGNMENTS
Pair‐wise alignment
 Dot matrix method
 Dynamic programming
 Word methods
Multiple sequence alignment
 Dynamic programming
 Progressive methods
 Iterative methods 2/22/2018
5
MULTIPLE SEQUENCE ALIGNMENT
 A multiple sequence alignment is tool that simultaneously aligns multiple protein
sequences, automatically utilizes information about protein domains, and has a good
compromise between speed and accuracy will have practical advantages over
current tools
 The principle is that multiple alignments are achieved by successive application of p
airwise methods.
2/22/2018
6
PURPOSE OF MSA
 In order to characterize protein families, identify shared regions of homology in a
multiple sequence alignment
 Determination of the consensus sequence of several aligned sequences.
 Consensus sequences can help to develop a sequence “finger print” which allo
ws the identification of members of distantly related protein family (motifs)
 MSA can help us to reveal biological facts
about proteins, like analysis of the secondary/tertiary structure
2/22/2018
7
2/22/2018
8
TYPES OF MSA
 Dynamic programming approach
Computes an optimal alignment for a given score function. Because of its high ru
nning time , it is not typically used in practice.
 Progressive method
This approach repeatedly aligns two sequences, two alignments, or a sequence
with an alignment.
 Iterative method
Works similarly to progressive methods but repeatedly realigns the initial sequence
s as well as adding new sequences to the growing MSA. 2/22/2018
9
PROGRESSIVE ALIGNMENT
 The most widely used approach
 Builds up a final MSA by combining pairwise
alignments beginning with the most similar pair and progressing to the most distantly r
elated
 Progressive alignment methods require two stages:
First stage in which the relationships between the sequences are represented as a tree,
called a guide tree
‐Second step in which the MSA is built by adding
the sequences sequentially to the growing MSA according to the guide tree
2/22/2018
10
USING COBALT NCBI
 Constraint based alignment tool that implements a general framework for
multiple alignment of protein sequences
 COBALT finds a collection of pairwise constraints derived from database
searches, sequence similarity and user input, combines these pairwise
constraints, and then incorporates them into a progressive multiple
alignment
 COBALT has reasonable runtime performance and alignment accuracy
comparable to or exceeding that of other tools for a broad range of
problems
2/22/2018
11
USING COBALT NCBI
 COBALT has a general framework that uses progressive multiple alignment
to combine pairwise constraints from different sources into a multiple
alignment
 When the same domain matches to multiple sequences, we can infer
several potential pairwise constraints based on these domain matches
 CDD ( Conserved Domains Database ) also contains auxiliary information
that allows COBALT to create partial profiles for input sequences before
progressive alignment begins, and this avoids computationally expensive
procedures for building profiles
2/22/2018
12
RUNTIME OF COBALT
 The runtime performance of COBALT is highly data driven
 COBALT is about five times faster than ProbCons
 COBALT is included in the NCBI C++ Toolkit
 Numerous auxiliary programs were written in C, C++ and Perl to automate
testing and summarize results
2/22/2018
13
AVAILABILITY
 COBALT is included in the NCBI C++ toolkit. A Linux executable for COBALT,
and CDD and PROSITE data used is available at:
https://www.ncbi.nlm.nih.gov/tools/cobalt/re_cobalt.cgi
 Contact: richa@helix.nih.gov
2/22/2018
14
STEP 1
2/22/2018
15
 Go to https://www.ncbi.nlm.nih.gov/
STEP 2
 The Swiss-Prot protein sequence for Schizosaccharomyces pombe Clr4 is
O60016.2.
2/22/2018
16
STEP 3
2/22/2018
17
Step 4
2/22/2018
18
STEP 5
2/22/2018
19
STEP 6
2/22/2018
20
Your patience is greatly appreciated….
2/22/2018
21
RESULT
2/22/2018
22
STEP 6
2/22/2018
23
Select First 11
STEP 7
2/22/2018
24
Your patience is greatly appreciated….
2/22/2018
25
RESULTS
2/22/2018
26
RESULTS
2/22/2018
27
STEP 8
 Notice that the above multiple alignment cant be edited “Edit and
Resubmit” link at the top of the COBALT results to remove the undesired
protein than search again.
2/22/2018
28
STEP 8 (a)
2/22/2018
29
STEP 8 (b)
2/22/2018
30
PROS AND CONS OF PROGRESSIVE
METHOD OF ALIGNMENT
 PROS:
Efficient enough to implement on a large scale for
many (100s to 1000s) sequences.
 Progressive alignment services are commonly available
on publicly accessible web servers, so users need not
locally install the applications of interest.
 Most widely used method of multiple sequence
alignment because of speed and accuracy.
2/22/2018
31
CONS…….
 Progressive alignments are not guaranteed to be
globally optimal.
 The primary problem is that when errors are made at
any stage in growing the MSA, these errors are then
propagated through to the final result.
 Performance is also particularly bad when all of the
sequences in the set are rather distantly related
2/22/2018
32
REFERENCES
 https://insidescienceresources.wordpress.com/2017/05/15/ncbi-
bioinformatics-tools-protein-blast-cobalt-and-cn3d-structure-viewer/
 https://academic.oup.com/bioinformatics/article/23/9/1073/272774
 https://www.ncbi.nlm.nih.gov/
2/22/2018
33
2/22/2018
34

More Related Content

What's hot

Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
AnkitTiwari354
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
Arindam Ghosh
 
Global and Local Sequence Alignment
Global and Local Sequence AlignmentGlobal and Local Sequence Alignment
Global and Local Sequence Alignment
AjayPatil210
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
PRUTHVIRAJ K
 
Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins
Vijay Hemmadi
 
BLAST and sequence alignment
BLAST and sequence alignmentBLAST and sequence alignment
Phylogenetic prediction - maximum parsimony method
Phylogenetic prediction - maximum parsimony methodPhylogenetic prediction - maximum parsimony method
Phylogenetic prediction - maximum parsimony method
Afnan Zuiter
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function prediction
Lars Juhl Jensen
 
Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)
Pritom Chaki
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
Vidya Kalaivani Rajkumar
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
Ashwini
 
String.pptx
String.pptxString.pptx
String.pptx
RitikaChoudhary57
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
karamveer prajapat
 
Prosite
PrositeProsite
Phylogenetic data analysis
Phylogenetic data analysisPhylogenetic data analysis
Phylogenetic data analysis
Md. Dilshad karim
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
Kubuldinho
 
Cath
CathCath
Cath
Ramya S
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Ramya S
 

What's hot (20)

Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Global and Local Sequence Alignment
Global and Local Sequence AlignmentGlobal and Local Sequence Alignment
Global and Local Sequence Alignment
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins
 
Est database
Est databaseEst database
Est database
 
BLAST and sequence alignment
BLAST and sequence alignmentBLAST and sequence alignment
BLAST and sequence alignment
 
Phylogenetic prediction - maximum parsimony method
Phylogenetic prediction - maximum parsimony methodPhylogenetic prediction - maximum parsimony method
Phylogenetic prediction - maximum parsimony method
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function prediction
 
Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
Fasta
FastaFasta
Fasta
 
String.pptx
String.pptxString.pptx
String.pptx
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
 
Prosite
PrositeProsite
Prosite
 
Phylogenetic data analysis
Phylogenetic data analysisPhylogenetic data analysis
Phylogenetic data analysis
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
 
Cath
CathCath
Cath
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 

Similar to Multiple Sequence Alignment Tool Using NCBI COBALT

OpenACC and Hackathons Monthly Highlights: April 2023
OpenACC and Hackathons Monthly Highlights: April  2023OpenACC and Hackathons Monthly Highlights: April  2023
OpenACC and Hackathons Monthly Highlights: April 2023
OpenACC
 
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACHGPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
ijdms
 
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug TriageSurvey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
IRJET Journal
 
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming DataIRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET Journal
 
IRJET- Big Data Processes and Analysis using Hadoop Framework
IRJET- Big Data Processes and Analysis using Hadoop FrameworkIRJET- Big Data Processes and Analysis using Hadoop Framework
IRJET- Big Data Processes and Analysis using Hadoop Framework
IRJET Journal
 
Networking project list for java and dotnet
Networking project list for java and dotnetNetworking project list for java and dotnet
Networking project list for java and dotnet
redpel dot com
 
An effective adaptive approach for joining data in data
An effective adaptive approach for joining data in dataAn effective adaptive approach for joining data in data
An effective adaptive approach for joining data in data
eSAT Publishing House
 
An Efficient Approach for Requirement Traceability Integrated With Software R...
An Efficient Approach for Requirement Traceability Integrated With Software R...An Efficient Approach for Requirement Traceability Integrated With Software R...
An Efficient Approach for Requirement Traceability Integrated With Software R...
IOSR Journals
 
Hybrid Task Scheduling Approach using Gravitational and ACO Search Algorithm
Hybrid Task Scheduling Approach using Gravitational and ACO Search AlgorithmHybrid Task Scheduling Approach using Gravitational and ACO Search Algorithm
Hybrid Task Scheduling Approach using Gravitational and ACO Search Algorithm
IRJET Journal
 
A Study on Replication and Failover Cluster to Maximize System Uptime
A Study on Replication and Failover Cluster to Maximize System UptimeA Study on Replication and Failover Cluster to Maximize System Uptime
A Study on Replication and Failover Cluster to Maximize System Uptime
YogeshIJTSRD
 
A Low Rank Mechanism to Detect and Achieve Partially Completed Image Tags
A Low Rank Mechanism to Detect and Achieve Partially Completed Image TagsA Low Rank Mechanism to Detect and Achieve Partially Completed Image Tags
A Low Rank Mechanism to Detect and Achieve Partially Completed Image Tags
IRJET Journal
 
Efficient failure detection and consensus at extreme-scale systems
Efficient failure detection and consensus at extreme-scale  systemsEfficient failure detection and consensus at extreme-scale  systems
Efficient failure detection and consensus at extreme-scale systems
IJECEIAES
 
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
SBGC
 
Dotnet datamining ieee projects 2012 @ Seabirds ( Chennai, Pondicherry, Vello...
Dotnet datamining ieee projects 2012 @ Seabirds ( Chennai, Pondicherry, Vello...Dotnet datamining ieee projects 2012 @ Seabirds ( Chennai, Pondicherry, Vello...
Dotnet datamining ieee projects 2012 @ Seabirds ( Chennai, Pondicherry, Vello...
SBGC
 
CIS 532 Network Architecture and AnalysisStudent’s NameSubm.docx
CIS 532 Network Architecture and AnalysisStudent’s NameSubm.docxCIS 532 Network Architecture and AnalysisStudent’s NameSubm.docx
CIS 532 Network Architecture and AnalysisStudent’s NameSubm.docx
sleeperharwell
 
Ahmed Absi slides bigbwa
Ahmed Absi slides  bigbwaAhmed Absi slides  bigbwa
Ahmed Absi slides bigbwa
Absi Ahmed
 
Performance Analysis and Parallelization of CosineSimilarity of Documents
Performance Analysis and Parallelization of CosineSimilarity of DocumentsPerformance Analysis and Parallelization of CosineSimilarity of Documents
Performance Analysis and Parallelization of CosineSimilarity of Documents
IRJET Journal
 

Similar to Multiple Sequence Alignment Tool Using NCBI COBALT (20)

OpenACC and Hackathons Monthly Highlights: April 2023
OpenACC and Hackathons Monthly Highlights: April  2023OpenACC and Hackathons Monthly Highlights: April  2023
OpenACC and Hackathons Monthly Highlights: April 2023
 
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACHGPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
 
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug TriageSurvey on Software Data Reduction Techniques Accomplishing Bug Triage
Survey on Software Data Reduction Techniques Accomplishing Bug Triage
 
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming DataIRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
IRJET- E-MORES: Efficient Multiple Output Regression for Streaming Data
 
Poster (1)
Poster (1)Poster (1)
Poster (1)
 
IRJET- Big Data Processes and Analysis using Hadoop Framework
IRJET- Big Data Processes and Analysis using Hadoop FrameworkIRJET- Big Data Processes and Analysis using Hadoop Framework
IRJET- Big Data Processes and Analysis using Hadoop Framework
 
Networking project list for java and dotnet
Networking project list for java and dotnetNetworking project list for java and dotnet
Networking project list for java and dotnet
 
An effective adaptive approach for joining data in data
An effective adaptive approach for joining data in dataAn effective adaptive approach for joining data in data
An effective adaptive approach for joining data in data
 
An Efficient Approach for Requirement Traceability Integrated With Software R...
An Efficient Approach for Requirement Traceability Integrated With Software R...An Efficient Approach for Requirement Traceability Integrated With Software R...
An Efficient Approach for Requirement Traceability Integrated With Software R...
 
Spe165 t
Spe165 tSpe165 t
Spe165 t
 
An Efficient Approach for Requirement Traceability Integrated With Software ...
An Efficient Approach for Requirement Traceability Integrated  With Software ...An Efficient Approach for Requirement Traceability Integrated  With Software ...
An Efficient Approach for Requirement Traceability Integrated With Software ...
 
Hybrid Task Scheduling Approach using Gravitational and ACO Search Algorithm
Hybrid Task Scheduling Approach using Gravitational and ACO Search AlgorithmHybrid Task Scheduling Approach using Gravitational and ACO Search Algorithm
Hybrid Task Scheduling Approach using Gravitational and ACO Search Algorithm
 
A Study on Replication and Failover Cluster to Maximize System Uptime
A Study on Replication and Failover Cluster to Maximize System UptimeA Study on Replication and Failover Cluster to Maximize System Uptime
A Study on Replication and Failover Cluster to Maximize System Uptime
 
A Low Rank Mechanism to Detect and Achieve Partially Completed Image Tags
A Low Rank Mechanism to Detect and Achieve Partially Completed Image TagsA Low Rank Mechanism to Detect and Achieve Partially Completed Image Tags
A Low Rank Mechanism to Detect and Achieve Partially Completed Image Tags
 
Efficient failure detection and consensus at extreme-scale systems
Efficient failure detection and consensus at extreme-scale  systemsEfficient failure detection and consensus at extreme-scale  systems
Efficient failure detection and consensus at extreme-scale systems
 
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
2017 IEEE Projects 2017 For Cse ( Trichy, Chennai )
 
Dotnet datamining ieee projects 2012 @ Seabirds ( Chennai, Pondicherry, Vello...
Dotnet datamining ieee projects 2012 @ Seabirds ( Chennai, Pondicherry, Vello...Dotnet datamining ieee projects 2012 @ Seabirds ( Chennai, Pondicherry, Vello...
Dotnet datamining ieee projects 2012 @ Seabirds ( Chennai, Pondicherry, Vello...
 
CIS 532 Network Architecture and AnalysisStudent’s NameSubm.docx
CIS 532 Network Architecture and AnalysisStudent’s NameSubm.docxCIS 532 Network Architecture and AnalysisStudent’s NameSubm.docx
CIS 532 Network Architecture and AnalysisStudent’s NameSubm.docx
 
Ahmed Absi slides bigbwa
Ahmed Absi slides  bigbwaAhmed Absi slides  bigbwa
Ahmed Absi slides bigbwa
 
Performance Analysis and Parallelization of CosineSimilarity of Documents
Performance Analysis and Parallelization of CosineSimilarity of DocumentsPerformance Analysis and Parallelization of CosineSimilarity of Documents
Performance Analysis and Parallelization of CosineSimilarity of Documents
 

Recently uploaded

June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
Levi Shapiro
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Po-Chuan Chen
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
Ashokrao Mane college of Pharmacy Peth-Vadgaon
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
Jheel Barad
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
DhatriParmar
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
CarlosHernanMontoyab2
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
Peter Windle
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
RaedMohamed3
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
MysoreMuleSoftMeetup
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
Celine George
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
BhavyaRajput3
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
heathfieldcps1
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
Special education needs
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
siemaillard
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
JosvitaDsouza2
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
SACHIN R KONDAGURI
 

Recently uploaded (20)

June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...
 
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdfAdversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
 
Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.Biological Screening of Herbal Drugs in detailed.
Biological Screening of Herbal Drugs in detailed.
 
Instructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptxInstructions for Submissions thorugh G- Classroom.pptx
Instructions for Submissions thorugh G- Classroom.pptx
 
The Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptxThe Accursed House by Émile Gaboriau.pptx
The Accursed House by Émile Gaboriau.pptx
 
678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf678020731-Sumas-y-Restas-Para-Colorear.pdf
678020731-Sumas-y-Restas-Para-Colorear.pdf
 
Embracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic ImperativeEmbracing GenAI - A Strategic Imperative
Embracing GenAI - A Strategic Imperative
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
Palestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptxPalestine last event orientationfvgnh .pptx
Palestine last event orientationfvgnh .pptx
 
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
Mule 4.6 & Java 17 Upgrade | MuleSoft Mysore Meetup #46
 
How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17How to Make a Field invisible in Odoo 17
How to Make a Field invisible in Odoo 17
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCECLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
CLASS 11 CBSE B.St Project AIDS TO TRADE - INSURANCE
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
The basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptxThe basics of sentences session 5pptx.pptx
The basics of sentences session 5pptx.pptx
 
special B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdfspecial B.ed 2nd year old paper_20240531.pdf
special B.ed 2nd year old paper_20240531.pdf
 
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
aaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx1.4 modern child centered education - mahatma gandhi-2.pptx
1.4 modern child centered education - mahatma gandhi-2.pptx
 
"Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe..."Protectable subject matters, Protection in biotechnology, Protection of othe...
"Protectable subject matters, Protection in biotechnology, Protection of othe...
 

Multiple Sequence Alignment Tool Using NCBI COBALT

  • 1. MSAT MULTIPLE SEQUENCE ALIGNMENT TOOL BY GROUP 2 2/22/2018 1
  • 2. OVERVIEW  Sequence alignment  Types of sequence alignments  Multiple sequence alignment  Purpose of MSA  Types of MSA  Progressive alignment  Pros & Cons 2/22/2018 2
  • 3. SEQUENCE ALIGNMENT  In bioinformatics, a sequence alignments a way of arranging the sequences of DNA, RNA , or protein to identify regions of similarity that may be a consequence of functional, stru ctural, or evolutionary relationships between the sequences. 2/22/2018 3
  • 4. SEQUENCE ALIGNMENT  SEQUENCE ALIGNMENT Sequences often contain highly conserved regions  These regions can be used for initial alignments 2/22/2018 4
  • 5. TYPES OF SEQUENCE ALIGNMENTS Pair‐wise alignment  Dot matrix method  Dynamic programming  Word methods Multiple sequence alignment  Dynamic programming  Progressive methods  Iterative methods 2/22/2018 5
  • 6. MULTIPLE SEQUENCE ALIGNMENT  A multiple sequence alignment is tool that simultaneously aligns multiple protein sequences, automatically utilizes information about protein domains, and has a good compromise between speed and accuracy will have practical advantages over current tools  The principle is that multiple alignments are achieved by successive application of p airwise methods. 2/22/2018 6
  • 7. PURPOSE OF MSA  In order to characterize protein families, identify shared regions of homology in a multiple sequence alignment  Determination of the consensus sequence of several aligned sequences.  Consensus sequences can help to develop a sequence “finger print” which allo ws the identification of members of distantly related protein family (motifs)  MSA can help us to reveal biological facts about proteins, like analysis of the secondary/tertiary structure 2/22/2018 7
  • 9. TYPES OF MSA  Dynamic programming approach Computes an optimal alignment for a given score function. Because of its high ru nning time , it is not typically used in practice.  Progressive method This approach repeatedly aligns two sequences, two alignments, or a sequence with an alignment.  Iterative method Works similarly to progressive methods but repeatedly realigns the initial sequence s as well as adding new sequences to the growing MSA. 2/22/2018 9
  • 10. PROGRESSIVE ALIGNMENT  The most widely used approach  Builds up a final MSA by combining pairwise alignments beginning with the most similar pair and progressing to the most distantly r elated  Progressive alignment methods require two stages: First stage in which the relationships between the sequences are represented as a tree, called a guide tree ‐Second step in which the MSA is built by adding the sequences sequentially to the growing MSA according to the guide tree 2/22/2018 10
  • 11. USING COBALT NCBI  Constraint based alignment tool that implements a general framework for multiple alignment of protein sequences  COBALT finds a collection of pairwise constraints derived from database searches, sequence similarity and user input, combines these pairwise constraints, and then incorporates them into a progressive multiple alignment  COBALT has reasonable runtime performance and alignment accuracy comparable to or exceeding that of other tools for a broad range of problems 2/22/2018 11
  • 12. USING COBALT NCBI  COBALT has a general framework that uses progressive multiple alignment to combine pairwise constraints from different sources into a multiple alignment  When the same domain matches to multiple sequences, we can infer several potential pairwise constraints based on these domain matches  CDD ( Conserved Domains Database ) also contains auxiliary information that allows COBALT to create partial profiles for input sequences before progressive alignment begins, and this avoids computationally expensive procedures for building profiles 2/22/2018 12
  • 13. RUNTIME OF COBALT  The runtime performance of COBALT is highly data driven  COBALT is about five times faster than ProbCons  COBALT is included in the NCBI C++ Toolkit  Numerous auxiliary programs were written in C, C++ and Perl to automate testing and summarize results 2/22/2018 13
  • 14. AVAILABILITY  COBALT is included in the NCBI C++ toolkit. A Linux executable for COBALT, and CDD and PROSITE data used is available at: https://www.ncbi.nlm.nih.gov/tools/cobalt/re_cobalt.cgi  Contact: richa@helix.nih.gov 2/22/2018 14
  • 15. STEP 1 2/22/2018 15  Go to https://www.ncbi.nlm.nih.gov/
  • 16. STEP 2  The Swiss-Prot protein sequence for Schizosaccharomyces pombe Clr4 is O60016.2. 2/22/2018 16
  • 21. Your patience is greatly appreciated…. 2/22/2018 21
  • 25. Your patience is greatly appreciated…. 2/22/2018 25
  • 28. STEP 8  Notice that the above multiple alignment cant be edited “Edit and Resubmit” link at the top of the COBALT results to remove the undesired protein than search again. 2/22/2018 28
  • 31. PROS AND CONS OF PROGRESSIVE METHOD OF ALIGNMENT  PROS: Efficient enough to implement on a large scale for many (100s to 1000s) sequences.  Progressive alignment services are commonly available on publicly accessible web servers, so users need not locally install the applications of interest.  Most widely used method of multiple sequence alignment because of speed and accuracy. 2/22/2018 31
  • 32. CONS…….  Progressive alignments are not guaranteed to be globally optimal.  The primary problem is that when errors are made at any stage in growing the MSA, these errors are then propagated through to the final result.  Performance is also particularly bad when all of the sequences in the set are rather distantly related 2/22/2018 32