SlideShare a Scribd company logo
1 of 55
Phylogenetics and Sequence Analysis
Fall 2015
Kurt Wollenberg, PhD
Phylogenetics Specialist
Bioinformatics and Computational Biosciences Branch
Office of Cyber Infrastructure and Computational Biology
We Are BCBB!
2
 Group of 37
• Bioinformatics Software Developers
• Computational Biologists
• Project Management & Analysis Professionals
Contact BCBB…
 “NIH Users: Access a menu of BCBB services on the
NIAID Intranet:
• http://bioinformatics.niaid.nih.gov/
 Outside of NIH –
• search “BCBB” on the NIAID Public Internet Page:
www.niaid.nih.gov
– or – use this direct link
 Email us at:
• ScienceApps@niaid.nih.gov
3
Course Organization
• Building a clean sequence
• Collecting homologs
• Aligning your sequences
• Building trees
• Further analyses
4
Previously
• Hierarchical and genealogical data
• Comparative sequence analysis
• Generating clean sequence
• Trim vector contamination
• Trim low-quality ends
• Align fragment overlap to build contig
• Export contig (consensus)
5
Today…
6
Pairwise sequence alignment
• How does it work?
BLAST
• How does it work?
• The many flavors of BLAST
• Demo
Multiple Sequence Alignment
• How does it work?
• Demo
• Inspect and correct your MSA
7
and BLAST: Basic Local Alignment Search Tool
• Sequence Alignment: Assigning homology to
sites among a group of known sequences
• BLAST: Alignment of one sequence with many
unknown sequences
PAIRWISE ALIGNMENT
HOMOLOGY vs. ANALOGY
8
common ancestry convergence
Pairing of sites based on an assessment of homology
Homology assessed using Substitution Matrices
9
PAIRWISE ALIGNMENT
HBA_HUMAN GSAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKL
G+ +VK+HGKKV A+++++AH+D++ +++++LS+LH KL
HBB_HUMAN GNPKVKAHGKKVLGAFSDGLAHLDNLKGTFATLSELHCDKL
HBA_HUMAN GSAQVKGHGKKVADALTNAVAHV---D--DMPNALSALSDLHAHKL
++ ++++H+ KV + +A ++ +L+ L+++H+ K
LGB2_LUPLU NNPELQAHAGKVFKLVYEAAIQLQVTGVVVTDATLKNLGSVHVSKG
HBA_HUMAN GSAQVKGHGKKVADALTNAVAHVDDMPNALSALSD----LHAHKL
GS+ + G + +D L ++ H+ D+ A +AL D ++AH+
F11G11.2 GSGYLVGDSLTFVDLL--VAQHTADLLAANAALLDEFPQFKAHQE
10
PAIRWISE ALIGNMENT
Substitution Matrices
➡ Derived mathematically
➡ Derived from data
“A substitution matrix (even one derived by arbitrarily
assigning probabilities to pairs) is a statement of the
probability of observing these pairs in real alignment.”
11
PAIRWISE ALIGNMENT
• Single parameter - Jukes-Cantor
- Equal base frequencies
- Uniform rates of change
• Two parameter - Kimura
- Equal base probabilities
- Two rates of change
12
PAIRWISE ALIGNMENT
DNA Substitution Matrices
• More parameters - HKY
- Unequal base frequencies
- Two rates of change
• Fully parameterized - GTR
- Unequal base probabilities
- Six rates of change
PAIRWISE ALIGNMENT
DNA Substitution Matrices
13
14
( )
ĂŻ
ĂŽ
ĂŻ
Ă­
ĂŹ
š-
=+
=
-
-
jie
jie
tP
t
t
ij
m
m
4
4
4
1
4
1
4
3
4
1
PAIRWISE ALIGNMENT
Jukes-Cantor Substitution Probabilities
Îźt = 0.25
A C G T
A 0.5259 0.1580 0.1580 0.1580
C 0.1580 0.5259 0.1580 0.1580
G 0.1580 0.1580 0.5259 0.1580
T 0.1580 0.1580 0.1580 0.5259
15
PAIRWISE ALIGNMENT
Jukes-Cantor Substitution Probabilities
PAIRWISE ALIGNMENT
Kimura Two-Parameter Substitution Model
16
If the probability of transitions (A ⇔ G, C ⇔ T) is
different from the probability of transversions (A
⇔T, G ⇔T, A ⇔C, G ⇔C), then there are two
relative rate parameters expressed as the
transition/transversion rate ratio Îş
PAIRWISE ALIGNMENT
Kimura Two-Parameter Substitution Probabilities
17
( )
ĂŻ
ĂŻ
ĂŻ
ĂŽ
ĂŻĂŻ
ĂŻ
Ă­
ĂŹ
=++
š-+
š-
=
+--
+--
-
jiee
transitionjiee
ontransversijie
tP
tt
tt
t
ij
mkm
mkm
m
)1(24
)1(24
4
2
1
4
1
4
1
,
2
1
4
1
4
1
,
4
1
4
1
Îźt = 0.25 k = 2.0
A C G T
A 0.4535 0.1580 0.2304 0.1580
C 0.1580 0.4535 0.1580 0.2304
G 0.2304 0.1580 0.4535 0.1580
T 0.1580 0.2304 0.1580 0.4535
18
PAIRWISE ALIGNMENT
Kimura Two-Parameter Substitution Probabilities
PAIRWISE ALIGNMENT
HKY Substitution Probabilities
19
( )
( )
( )
( ) ( )
ĂŻ
ĂŻ
ĂŻ
ĂŻ
ĂŽ
ĂŻ
ĂŻ
ĂŻ
ĂŻ
Ă­
ĂŹ
š-
š
á
á
ø
Ăś
ç
ç
è
ĂŚ
P
+
á
á
ø
Ăś
ç
ç
è
ĂŚ
-
P
+
=
á
á
ø
Ăś
ç
ç
è
ĂŚ
P
-P
+
á
á
ø
Ăś
ç
ç
è
ĂŚ
-
P
+
=
-
--
--
ontransversijie
transitionjiee
jiee
tP
t
j
tA
j
jt
j
jj
tA
j
jjt
j
jj
ij
,1
,1
1
1
1
m
mm
mm
p
p
pp
p
pp
PAIRWISE ALIGNMENT
HKY Substitution Probabilities
20
( )11 -P+=
+=P
+=P
k
pp
pp
j
TCj
GAj
A
if j is a purine
if j is a pyrimidine
Jukes-Cantor
(One substitution type, equal nucleotide frequencies)
Independent nucl. freq. Two substitution types .
F81/TN82 Kimura 2-Parameter (K2P)
Two substitution types Indep. nucl. freq. Three substitution types .
HKY85/F84 Kimura 3 subst. type (K3ST)
Three substitution types Six substitution types .
Tamura-Nei (TrN) Symmetric (SYM)
Six substitution types Independent nucl. freq. .
General time-reversible (GTR)
Substitution Models
Protein Score Matrices
Similarity of Amino Acids
22From Esquivel RO, et al.. 2013. Advances in Quantum Mechanics, Chapter 27 InTech.
PAIRWISE ALIGNMENT
Protein Score Matrices
• Derived from empirical data
• Account for depth of relationship among the data
• Expressed as log-odds ratio:
- Logarithm of the ratio of the probabilities of two
residues being aligned due to homology versus
random chance
23
PAIRWISE ALIGNMENT
Protein Score (Substitution) Matrices
PAIRWISE ALIGNMENT
24
The log-odds ratio:
s(a,b) = log(pab/qaqb)
qa = frequency of residue a in the data
pab = probability that residues a and b have been
derived from a common ancestor
Protein Substitution Matrices
• PAM250: Based on phylogenies where all sequences
differ by no more than 15%.
• BLOSUM62: Based on clusters of sequences with
greater than 62% identical residues.
PAIRWISE ALIGNMENT
25
WYFVLIMKRHQEDNGAPTSC
W
Y
F
V
L
I
M
K
R
H
Q
E
D
N
G
A
P
T
S
C
170062543236774766528
10721124404442535330
912105425564545334
42422222222101012
6243322343423236
522222222312012
60021232312125
5301001211005
621110320104
63112210113
4221100115
431001005
42101005
2001014
511013
21112
6013
312
20
12
!!!!!!!!!!!!!!!!
!!!!!!!!!!!!!!!
!!!!!!!!!!!!!!
!!!!!!!!!!!
!!!!!!!!!!!!!
!!!!!!!!!!!!
!!!!!!!!!!!
!!!!
!!!!!!
!!!!!
!!!!
!!
!!
!!
!!
!
!
!
Protein Substitution Matrices
PAM250
26
WYFVLIMKRHQEDNGAPTSC
W
Y
F
V
L
I
M
K
R
H
Q
E
D
N
G
A
P
T
S
C
112132313322344234232
7311112221232323222
610003313333324222
41312332233302021
4222232343413121
413333333413121
51120232312121
5211110211103
501020212113
80011222213
5200211103
520211104
61121103
6022013
602203
41010
7113
511
41
9
-----------------
----------------
--------------
-----------
-------------
-------------
------------
-------
-------
-------
-----
-----
-----
---
---
-
---
-
-
BLOSUM62
27
Protein Substitution Matrices
WYFVLIMKRHQEDNGAPTSC
BW
PW
62112132313322344234232
250170062543236774766528
!!!!!!!!!!!!!!!!
!!!!!!!!!!!!!!!!
28
Protein Substitution Matrices
BLAST and Sequence Alignment
• Global alignment (Needleman-Wunsch)
• Assign homology across the entire sequence
• Clustal
• Local alignment (Smith-Waterman)
• Assign homology for subsequences
• MUSCLE and BLAST
• Good for aligning very divergent sequences
29
How do two sequences get “aligned”?
HEAGAWGHEE ⇔ PAWHEAE
30
SEQUENCE ALIGNMENT
Build a matrix of score values for all site pairs
H E A G A W G H E E
P 0 -1 1 0 1 -5 0 0 -1 -1
A -1 0 2 1 2 -6 1 -1 0 0
W -3 -7 -6 -7 -6 17 -7 -3 -7 -7
H 6 1 -1 -2 -1 -3 -2 6 1 1
E 1 4 0 0 0 -7 0 1 4 4
A -1 0 2 1 2 -6 1 -1 0 0
E 1 4 0 0 0 -7 0 1 4 4
H E A G A W G H E E
P -2 -1 -1 -2 -1 -4 -2 -2 -1 -1
A -2 -1 4 0 4 -3 0 -2 -1 -1
W -2 -3 -3 -2 -3 11 -2 -2 -3 -3
H 8 0 -2 -2 -2 -2 -2 8 0 0
E 0 5 -1 -2 -1 -3 -2 0 5 5
A -2 -1 4 0 4 -3 0 -2 -1 -1
E 0 5 -1 -2 -1 -3 -2 0 5 5
PAM250 BLOSUM62
31
What about gaps?
• Score penalty for opening
• Score penalty for extending
Penalties are log probabilities of a gap of a
specific length
SEQUENCE ALIGNMENT
32
Standard gap costs
Substitution Matrix Gap Costs (Open, Extend)
PAM30 (9,1)
PAM70 (10,1)
BLOSUM80 (10,1)
BLOSUM62 (10,1)
BLOSUM45 (15,2)
SEQUENCE ALIGNMENT
33
Dynamic Programming:
Calculate a matrix of alignment scores
H E A
P -2 -1 -1
A -2 -1 4
W -2 -3 -3
BLOSUM62
H E A
0 -8 -16 -24
P -8
A -16
W -24
-2 -17-9
-10
-18 -11
-3 -5
-6
SEQUENCE ALIGNMENT
34
Dynamic Programming
1) Calculate a full matrix
2) Traceback to get the Global Alignment
H E A G A W G H E E
0 -8 -16 -24 -32 -40 -48 -56 -64 -72 -80
P -8 -2 -9 -17 -25 -33 -41 -49 -57 -65 -73
A -16 -10 -3 -5 -13 -21 -29 -37 -45 -53 -61
W -24 -18 -11 -6 -7 -15 -10 -18 -26 -34 -41
H -32 -16 -18 -13 -8 -9 -17 -12 -10 -18 -26
E -40 -24 -11 -19 -15 -9 -12 -19 -12 -5 -13
A -48 -32 -19 -7 -15 -11 -12 -12 -20 -13 -6
E -58 -40 -27 -15 -9 -16 -14 -14 -12 -15 -8
H E A G A W G H E E
- - P - A W H E A E
-8
-13
-12
-10
-21
-25-17
-12
-16-80
SEQUENCE ALIGNMENT
35
Local Alignment
• Alignment of subsequences
• Good for aligning very divergent sequences
Score Calculation
• Minimum score is zero
• Traceback begins at the highest score
• Score = 0  End of subsequence
SEQUENCE ALIGNMENT
36
Local Alignment
H E A G A W G H E E
0 0 0 0 0 0 0 0 0 0 0
P 0 0 0 0 0 0 0 0 0 0 0
A 0 0 0 4 0 4 0 0 0 0 0
W 0 0 0 0 0 0 15 7 0 0 0
H 0 8 0 0 0 0 7 13 15 7 0
E 0 0 13 5 0 0 0 5 13 20 12
A 0 0 5 17 9 4 0 0 5 12 17
E 0 0 5 9 15 8 0 0 0 10 17
A W G H E
A W - H E
H E A G A W G H E e
p a w H E A e
p A W - H E a e
H E A G A W G H E e
p A W - H E a e
Repeat Match Overlap Match
20
15 7
4
15
17
13
8
SEQUENCE ALIGNMENT
37
Scoring alignments and expect values
Score := Value in the dynamic programming matrix where
the traceback began.
Expect (E) value := Number of matches expected due to
chance, with a score greater than S, based on a stochastic
sequence model.
P value := Probability of finding at least one match with
score ≥ S
P = 1-e-E(S)
SEQUENCE ALIGNMENT
BLAST
(Basic Local Alignment Search Tool)
• Create a list of query sequence “words”
• Word lengths: 11 nucleotides, 3 amino acids
• Create a list of neighborhood words
• Similar to query words and above a score threshold
• Search for matches in the database
• Extend matches
• Below threshold?
• Above threshold?
• Format and output maximally extended matches
38
How does BLAST work?
Discard!
Keep it!
39
How does BLAST work?
How does BLAST evaluate matches?
It uses (local) alignment scores
BLAST
(Basic Local Alignment Search Tool)
The Many Flavors of BLAST
• BLASTn and BLASTp
• short, nearly-exact match BLAST
• Translated BLAST
• BLASTx nt → aa ➯ protein db
• tBLASTn aa ➯ protein db ← DNA db
• tBLASTx nt → aa ➯ protein db ← DNA db
• PSI-BLAST (Position-Specific Iterated BLAST)
• PHI-BLAST (Pattern Hit Initiated BLAST)
• bl2seq
BLAST
40
short, nearly-exact match BLAST
• Increase Expect threshold
• Reduce word size (7 for nt, 2 for aa)
• Turn off low complexity filter
• Protein: Use a more stringent substitution matrix
41
BLAST
PSI-BLAST
(Position-Specific Iterated BLAST)
• Perform initial BLASTp search
• Generate a sequence profile from results
• BLASTp using the profile
• Iterate until no new sequences are found
• Convergence
42
BLAST
PHI-BLAST
(Pattern Hit Initiated BLAST)
[LIVMF]-G-E-x-[GAS]-[LIVM]-x(5,11)-R-[STAQ]-A-x-[LIVMA]-x-[STACV]
[ ] = Any of the residues within the brackets
- = spacer separating sites in the profile
x = Any residue
x(a,b) = Any residues a to b in length
VGERGLEEDKRKRSAWMQC
MGETALRRRKKEDEERTANVYT
FGEAAMPGGPHQSRSAFAWV
Sequence Profile
43
BLAST
Access to BLAST
• NCBI
• Your own computer
• NIAID HPC cluster
44
BLAST
Multiple Sequence Alignment
45
Multiple Sequence Alignment
The Progressive Alignment Algorithm
Unaligned
Sequences
Distance
Matrix
Aligned Sequences
MSA1
Kimura Distance
Matrix
Aligned Sequences
MSA2
Subtree Alignment
Profiles
Align Subtree Profiles
MSAsp
Better!
Not Better
From RC Edgar 2004, NAR 32: 1792-1797
UPGMA tree
UPGMA tree 2Delete edge from UPGMA
tree 2
• Clustal
• Your own computer
• Web Server
• NIAID HPC cluster
• MUSCLE
• Your own computer
• Web Server
• NIAID HPC cluster
• MAFFT
• Web Server
Programs
46
Multiple Sequence Alignment
NEVER
directly input the output of a MSA program into
an analysis program!
ALWAYS
inspect the alignment to improve it.
Multiple Sequence Alignment
47
Multiple Sequence Alignment Editors
• MacVector
• Commercial software
• MegAlign (Lasergene)
• Commercial software
• AliView
• Public domain
• GeneDoc
• Public domain
• BioEdit
• Public domain
Multiple Sequence Alignment
48
ClustalW
http://www.clustal.org/
Muscle
http://www.drive5.com/muscle/download3.6.html
MAFFT
http://mafft.cbrc.jp/alignment/server/
AliView
http://www.ormbunkar.se/aliview/
GeneDoc
http://www.nrbsc.org/downloads/
BioEdit
http://www.mbio.ncsu.edu/BioEdit/bioedit.html
Web Resources
49
• BLAST search for contig0001 homologs
• Download selected sequence records
• Align sequence records with Clustal2
50
Recapitulation
Seminar Follow-Up Site
 For access to past recordings, handouts, slides visit this site from the
NIH network:
http://collab.niaid.nih.gov/sites/research/SIG/Bioinformatics/
51
1. Select a
Subject Matter
View:
• Seminar Details
• Handout and
Reference Docs
• Relevant Links
• Seminar
Recording Links
2. Select a
Topic
Recommended Browsers:
• IE for Windows,
• Safari for Mac (Firefox on a
Mac is incompatible with
NIH Authentication
technology)
Login
• If prompted to log in use
“NIH” in front of your
username
52
Retrieving Slides/Handouts
This lecture
series
53
Retrieving Slides/Handouts
This lecture
These slides
54
Questions?
55
Next Lecture
Elmo
Grover
Cookie
Oscar Bert
Ernie
The Count
Herry
Tuesday, 10 November at 0930

More Related Content

What's hot

Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Vijay Hemmadi
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijayVijay Hemmadi
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission ToolsRishikaMaji
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignmentavrilcoghlan
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matricesAshwini
 
Protein identification and analysis on ExPASy server
Protein identification and analysis on ExPASy serverProtein identification and analysis on ExPASy server
Protein identification and analysis on ExPASy serverEkta Gupta
 
Cath
CathCath
CathRamya S
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)ShivaniShewale2
 
Dot matrix
Dot matrixDot matrix
Dot matrixTania Khan
 
Dynamic programming
Dynamic programming Dynamic programming
Dynamic programming Zohaib HUSSAIN
 
Protein 3 d structure prediction
Protein 3 d structure predictionProtein 3 d structure prediction
Protein 3 d structure predictionSamvartika Majumdar
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)ZoufishanY
 
Bioinfromatics - local alignment
Bioinfromatics - local alignmentBioinfromatics - local alignment
Bioinfromatics - local alignmentVivek Chandramohan
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignmentKubuldinho
 
Global and Local Sequence Alignment
Global and Local Sequence AlignmentGlobal and Local Sequence Alignment
Global and Local Sequence AlignmentAjayPatil210
 

What's hot (20)

Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins Secondary Structure Prediction of proteins
Secondary Structure Prediction of proteins
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 
(Expasy)
(Expasy)(Expasy)
(Expasy)
 
Sequence Submission Tools
Sequence Submission ToolsSequence Submission Tools
Sequence Submission Tools
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignment
 
Scoring matrices
Scoring matricesScoring matrices
Scoring matrices
 
Protein identification and analysis on ExPASy server
Protein identification and analysis on ExPASy serverProtein identification and analysis on ExPASy server
Protein identification and analysis on ExPASy server
 
Cath
CathCath
Cath
 
Protein information resource (PIR)
Protein information resource (PIR)Protein information resource (PIR)
Protein information resource (PIR)
 
Dot matrix
Dot matrixDot matrix
Dot matrix
 
protein data bank
protein data bankprotein data bank
protein data bank
 
Msa
MsaMsa
Msa
 
Dynamic programming
Dynamic programming Dynamic programming
Dynamic programming
 
Protein 3 d structure prediction
Protein 3 d structure predictionProtein 3 d structure prediction
Protein 3 d structure prediction
 
DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)DNA data bank of japan (DDBJ)
DNA data bank of japan (DDBJ)
 
Bioinfromatics - local alignment
Bioinfromatics - local alignmentBioinfromatics - local alignment
Bioinfromatics - local alignment
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
 
Prosite
PrositeProsite
Prosite
 
Global and Local Sequence Alignment
Global and Local Sequence AlignmentGlobal and Local Sequence Alignment
Global and Local Sequence Alignment
 

Viewers also liked

Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)Asiri Wijesinghe
 
sequence alignment
sequence alignmentsequence alignment
sequence alignmentammar kareem
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)Ariful Islam Sagar
 
BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)Sobia
 
BLAST [Basic Alignment Local Search Tool]
BLAST [Basic Alignment Local Search Tool]BLAST [Basic Alignment Local Search Tool]
BLAST [Basic Alignment Local Search Tool]BiotechOnline
 

Viewers also liked (20)

Phylogenetics: Tree building
Phylogenetics: Tree buildingPhylogenetics: Tree building
Phylogenetics: Tree building
 
Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)Basic Local Alignment Search Tool (BLAST)
Basic Local Alignment Search Tool (BLAST)
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
3D graphics using VMD
3D graphics using VMD3D graphics using VMD
3D graphics using VMD
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 
Pathogen phylogenetics using BEAST
Pathogen phylogenetics using BEASTPathogen phylogenetics using BEAST
Pathogen phylogenetics using BEAST
 
Introduction to Bayesian phylogenetics and BEAST
Introduction to Bayesian phylogenetics and BEASTIntroduction to Bayesian phylogenetics and BEAST
Introduction to Bayesian phylogenetics and BEAST
 
BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)BLAST(Basic Local Alignment Tool)
BLAST(Basic Local Alignment Tool)
 
Phylogenetics: Making publication-quality tree figures
Phylogenetics: Making publication-quality tree figuresPhylogenetics: Making publication-quality tree figures
Phylogenetics: Making publication-quality tree figures
 
Sequence assembly
Sequence assemblySequence assembly
Sequence assembly
 
BLAST [Basic Alignment Local Search Tool]
BLAST [Basic Alignment Local Search Tool]BLAST [Basic Alignment Local Search Tool]
BLAST [Basic Alignment Local Search Tool]
 
Blast fasta 4
Blast fasta 4Blast fasta 4
Blast fasta 4
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
BLAST
BLASTBLAST
BLAST
 
RNA-Seq
RNA-SeqRNA-Seq
RNA-Seq
 
ChIP-seq Theory
ChIP-seq TheoryChIP-seq Theory
ChIP-seq Theory
 
BEAST: Species phylogeny and phylogeographic analysis
BEAST: Species phylogeny and phylogeographic analysisBEAST: Species phylogeny and phylogeographic analysis
BEAST: Species phylogeny and phylogeographic analysis
 
Variant analysis and whole exome sequencing
Variant analysis and whole exome sequencingVariant analysis and whole exome sequencing
Variant analysis and whole exome sequencing
 
BEAST: Time-stamped data and population dynamics
BEAST: Time-stamped data and population dynamicsBEAST: Time-stamped data and population dynamics
BEAST: Time-stamped data and population dynamics
 
Tree building 2
Tree building 2Tree building 2
Tree building 2
 

Similar to BLAST and sequence alignment

Bioinformatics t5-databasesearching v2014
Bioinformatics t5-databasesearching v2014Bioinformatics t5-databasesearching v2014
Bioinformatics t5-databasesearching v2014Prof. Wim Van Criekinge
 
Wang labsummer2010
Wang labsummer2010Wang labsummer2010
Wang labsummer2010russodl
 
Heterogeneous Defect Prediction (

ESEC/FSE 2015)
Heterogeneous Defect Prediction (

ESEC/FSE 2015)Heterogeneous Defect Prediction (

ESEC/FSE 2015)
Heterogeneous Defect Prediction (

ESEC/FSE 2015)Sung Kim
 
Bioinformatics life sciences_v2015
Bioinformatics life sciences_v2015Bioinformatics life sciences_v2015
Bioinformatics life sciences_v2015Prof. Wim Van Criekinge
 
Artificial software diversity: automatic synthesis of program sosies
Artificial software diversity: automatic synthesis of program sosiesArtificial software diversity: automatic synthesis of program sosies
Artificial software diversity: automatic synthesis of program sosiesFoCAS Initiative
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadfalizain9604
 
2016 bioinformatics i_database_searching_wimvancriekinge
2016 bioinformatics i_database_searching_wimvancriekinge2016 bioinformatics i_database_searching_wimvancriekinge
2016 bioinformatics i_database_searching_wimvancriekingeProf. Wim Van Criekinge
 
RegressionProjectReport
RegressionProjectReportRegressionProjectReport
RegressionProjectReportHimanshu agarwal
 
Towards Evaluating Size Reduction Techniques for Software Model Checking
Towards Evaluating Size Reduction Techniques for Software Model CheckingTowards Evaluating Size Reduction Techniques for Software Model Checking
Towards Evaluating Size Reduction Techniques for Software Model CheckingAkos Hajdu
 
QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28Aritra Sarkar
 
Reducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology MappingReducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology Mappingsatrajit
 
R for data visualization and graphics
R for data visualization and graphicsR for data visualization and graphics
R for data visualization and graphicsRobert Kabacoff
 
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo InterlandiLazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo InterlandiDatabricks
 
20100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture0720100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture07Computer Science Club
 
Heuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchHeuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchGreg Makowski
 
Presentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticePresentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticezahid6
 
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...CHENNAKESAVA KADAPA
 

Similar to BLAST and sequence alignment (20)

Ch06 alignment
Ch06 alignmentCh06 alignment
Ch06 alignment
 
Bioinformatics t5-databasesearching v2014
Bioinformatics t5-databasesearching v2014Bioinformatics t5-databasesearching v2014
Bioinformatics t5-databasesearching v2014
 
Wang labsummer2010
Wang labsummer2010Wang labsummer2010
Wang labsummer2010
 
Heterogeneous Defect Prediction (

ESEC/FSE 2015)
Heterogeneous Defect Prediction (

ESEC/FSE 2015)Heterogeneous Defect Prediction (

ESEC/FSE 2015)
Heterogeneous Defect Prediction (

ESEC/FSE 2015)
 
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
MUMS: Transition & SPUQ Workshop - Gradient-Free Construction of Active Subsp...
 
Bioinformatics life sciences_v2015
Bioinformatics life sciences_v2015Bioinformatics life sciences_v2015
Bioinformatics life sciences_v2015
 
Artificial software diversity: automatic synthesis of program sosies
Artificial software diversity: automatic synthesis of program sosiesArtificial software diversity: automatic synthesis of program sosies
Artificial software diversity: automatic synthesis of program sosies
 
lecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadflecture4.ppt Sequence Alignmentaldf sdfsadf
lecture4.ppt Sequence Alignmentaldf sdfsadf
 
2016 bioinformatics i_database_searching_wimvancriekinge
2016 bioinformatics i_database_searching_wimvancriekinge2016 bioinformatics i_database_searching_wimvancriekinge
2016 bioinformatics i_database_searching_wimvancriekinge
 
RegressionProjectReport
RegressionProjectReportRegressionProjectReport
RegressionProjectReport
 
Towards Evaluating Size Reduction Techniques for Software Model Checking
Towards Evaluating Size Reduction Techniques for Software Model CheckingTowards Evaluating Size Reduction Techniques for Software Model Checking
Towards Evaluating Size Reduction Techniques for Software Model Checking
 
QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28QX Simulator and quantum programming - 2020-04-28
QX Simulator and quantum programming - 2020-04-28
 
Reducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology MappingReducing Structural Bias in Technology Mapping
Reducing Structural Bias in Technology Mapping
 
R for data visualization and graphics
R for data visualization and graphicsR for data visualization and graphics
R for data visualization and graphics
 
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo InterlandiLazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
Lazy Join Optimizations Without Upfront Statistics with Matteo Interlandi
 
20100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture0720100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture07
 
Heuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient searchHeuristic design of experiments w meta gradient search
Heuristic design of experiments w meta gradient search
 
Presentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informaticePresentation for blast algorithm bio-informatice
Presentation for blast algorithm bio-informatice
 
Blast Algorithm
Blast AlgorithmBlast Algorithm
Blast Algorithm
 
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
CutFEM on hierarchical B-Spline Cartesian grids with applications to fluid-st...
 

More from Bioinformatics and Computational Biosciences Branch

More from Bioinformatics and Computational Biosciences Branch (20)

Hong_Celine_ES_workshop.pptx
Hong_Celine_ES_workshop.pptxHong_Celine_ES_workshop.pptx
Hong_Celine_ES_workshop.pptx
 
Virus Sequence Alignment and Phylogenetic Analysis 2019
Virus Sequence Alignment and Phylogenetic Analysis 2019Virus Sequence Alignment and Phylogenetic Analysis 2019
Virus Sequence Alignment and Phylogenetic Analysis 2019
 
Nephele 2.0: How to get the most out of your Nephele results
Nephele 2.0: How to get the most out of your Nephele resultsNephele 2.0: How to get the most out of your Nephele results
Nephele 2.0: How to get the most out of your Nephele results
 
Introduction to METAGENOTE
Introduction to METAGENOTE Introduction to METAGENOTE
Introduction to METAGENOTE
 
Intro to homology modeling
Intro to homology modelingIntro to homology modeling
Intro to homology modeling
 
Protein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modelingProtein fold recognition and ab_initio modeling
Protein fold recognition and ab_initio modeling
 
Homology modeling: Modeller
Homology modeling: ModellerHomology modeling: Modeller
Homology modeling: Modeller
 
Protein docking
Protein dockingProtein docking
Protein docking
 
Protein function prediction
Protein function predictionProtein function prediction
Protein function prediction
 
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on RosettaProtein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
 
Biological networks
Biological networksBiological networks
Biological networks
 
UNIX Basics and Cluster Computing
UNIX Basics and Cluster ComputingUNIX Basics and Cluster Computing
UNIX Basics and Cluster Computing
 
Statistical applications in GraphPad Prism
Statistical applications in GraphPad PrismStatistical applications in GraphPad Prism
Statistical applications in GraphPad Prism
 
Intro to JMP for statistics
Intro to JMP for statisticsIntro to JMP for statistics
Intro to JMP for statistics
 
Categorical models
Categorical modelsCategorical models
Categorical models
 
Better graphics in R
Better graphics in RBetter graphics in R
Better graphics in R
 
Automating biostatistics workflows using R-based webtools
Automating biostatistics workflows using R-based webtoolsAutomating biostatistics workflows using R-based webtools
Automating biostatistics workflows using R-based webtools
 
Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)Overview of statistical tests: Data handling and data quality (Part II)
Overview of statistical tests: Data handling and data quality (Part II)
 
Overview of statistics: Statistical testing (Part I)
Overview of statistics: Statistical testing (Part I)Overview of statistics: Statistical testing (Part I)
Overview of statistics: Statistical testing (Part I)
 
GraphPad Prism: Curve fitting
GraphPad Prism: Curve fittingGraphPad Prism: Curve fitting
GraphPad Prism: Curve fitting
 

Recently uploaded

Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantadityabhardwaj282
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensorsonawaneprad
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxBerniceCayabyab1
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...lizamodels9
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentationtahreemzahra82
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)DHURKADEVIBASKAR
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |aasikanpl
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 

Recently uploaded (20)

Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Forest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are importantForest laws, Indian forest laws, why they are important
Forest laws, Indian forest laws, why they are important
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
Environmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial BiosensorEnvironmental Biotechnology Topic:- Microbial Biosensor
Environmental Biotechnology Topic:- Microbial Biosensor
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptxGenBio2 - Lesson 1 - Introduction to Genetics.pptx
GenBio2 - Lesson 1 - Introduction to Genetics.pptx
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
Best Call Girls In Sector 29 Gurgaon❤️8860477959 EscorTs Service In 24/7 Delh...
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
Harmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms PresentationHarmful and Useful Microorganisms Presentation
Harmful and Useful Microorganisms Presentation
 
Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)Recombinant DNA technology( Transgenic plant and animal)
Recombinant DNA technology( Transgenic plant and animal)
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 

BLAST and sequence alignment

Editor's Notes

  1. Algorithm explained in Felsenstein (2004) Chapter 16.