SlideShare a Scribd company logo
1 of 35
SEQUENCE ALIGNMENT P.S.CHANDRANAND
Objectives ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Homologous   refers to conclusion drawn from the data that the two genes or sequences have descended from a common ancestor   Homologous sequences are of two types   Orthologous   Homologous sequences in different species that arose from a common ancestral gene during speciation Parologous   Homologous sequences within a single species that arose by gene duplication
What is Alignment ? Explicit mapping between two or more sequences   To place one sequence over another in such a fashion so as to get maximum similarity SEQUENCE ALIGNMENT  STRUCTURAL  ALIGNMENT
[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object]
Similarity vs. homology ,[object Object],[object Object],[object Object]
Proteins of 100% identity  (Human & Xenopus Myoglobin) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],GLSDGEWQ Q VLNVWGKVEADI A GHGQEV LIRLF T GHPETLEKFDKFKHLKTE A EMKA SEDLKKHG TV VLTALGGILKKKGHHEAE L KPLAQSHATKHKIP I KYLEFIS DA II H VL H SKHPGDFGADAQGAM T KALELFR N D I A A K YKELGFQG Proteins with similarity  (H orse P02188  & Xenopus)
Evolutionary Basis ,[object Object],[object Object]
Basic Concept of Alignment ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object]
ALIGNMENT Pairwise alignment    Multiple alignment
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Why multiple sequence alignment   ? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The process of aligning sequences is a game involving playing off gaps and mismatches
[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Comparative Analysis of Alignment Techniques
Global vs. Local Alignment
A model for database searching score probabilities ,[object Object],[object Object]
Extreme Value Distribution Probability density function for the extreme value distribution resulting from parameter values    = 0 and    = 1, [ y  = 1 – exp(- e -x )], where     is the characteristic value and     is the decay constant.  y  = 1 – exp(- e -  ( x -  ) )
Extreme Value Distribution (EDV) You  know that an optimal alignment of two sequences is selected out of many suboptimal alignments, and that a database search is also about selecting the best alignment(s). This bodes well with the EDV which has a right tail that falls off more slowly than the left tail. Compared to using the normal distribution, when using the EDV an alignment has to score further away from the expected mean value to become a significant hit.  real data EDV approximation
Extreme Value Distribution The probability of a score  S  to be larger than a given value  x  can be calculated following the EDV as:  E-value: P ( S     x ) = 1 – exp(- e  -  ( x -  ) ) ,  where      =(ln  Kmn )/  , and  K  a constant that can be estimated from the background amino acid distribution and scoring matrix (see Altschul and Gish, 1996, for a collection of values for    and  K  over a set of widely used scoring matrices).
Extreme Value Distribution Using the equation for     (preceding slide), the probability for the raw alignment score  S  becomes  P ( S     x ) = 1 – exp(- Kmne -  x ). In practice, the probability  P ( S  x ) is estimated using the approximation 1 – exp(- e -x )    e -x , which is valid for large values of  x . This leads to a simplification of the equation for  P ( S  x ): P ( S    x )    e -  (x-  )  = Kmn e -  x . The lower the probability (E value) for a given threshold value x, the more significant the score  S .
Normalised sequence similarity Statistical significance ,[object Object],[object Object]
FASTP : Local Alignment Tool Sequence 1  F  L  W  R  T  W  S Sequence 2  S  W  K  T  W  T Method based on lookup tables Lipman & Pearson, Science (1985) vol 227,1435-41 ,[object Object],[object Object]
Construction of the Lookup Table   Position Number Residue  Seq 1  Seq2  Offset(p1-p2) F  1   -   - L  2   -   - W  3,6  2,5  1(3,2)  1(6,5)  4(6,2)  -2(3,5) R  4   -   - T  5  4,6 1(5,4)  - 1(5,6) S  7   1    6(7,1) K  -   3  - Pos no.  1  2  3  4  5  6  7 Sequence 1  F  L  W  R  T  W  S Sequence 2  S  W  K  T  W  T
Calculation of Offset Frequency Offset  Frequency   1  3   4  1 -1  1 -2  1    6  1 Final Local Alignment Pos no.   1  2  3  4  5  6  7 Sequence 1   F  L  W  R  T  W  S Sequence 2   -  S  W  K  T  W  T
Extreme Value Distribution Using the equation for     (preceding slide), the probability for the raw alignment score  S  becomes  P ( S     x ) = 1 – exp(- Kmne -  x ). In practice, the probability  P ( S  x ) is estimated using the approximation 1 – exp(- e -x )    e -x , which is valid for large values of  x . This leads to a simplification of the equation for  P ( S  x ): P ( S    x )    e -  (x-  )  = Kmn e -  x . The lower the probability (E value) for a given threshold value x, the more significant the score  S .
-Needleman-Wunsch (1970) provided first automatic method -Dynamic Programming to Find Global Alignment ,[object Object],[object Object],[object Object],NEEDLEMAN-WUNSCH Algorithm
Gaps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Gaps ,[object Object],[object Object],[object Object],[object Object],[object Object],AGGVLIQVG  AGGVLIIQVG AGGVL-IQVG   AGGVLIIQVG
Gaps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Summary An alignment just  reflects the  probable  evolutionary history  of the two genes as it is  presumed  that the homologous sequences have diverged from a common ancestral sequence through iterative molecular changes ,[object Object],[object Object],[object Object],[object Object],Two types of gap penalties Global alignment   Local alignment Two types of Alignment Linear gap penalty Affine gap penalty

More Related Content

What's hot

Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignment
avrilcoghlan
 
The Smith Waterman algorithm
The Smith Waterman algorithmThe Smith Waterman algorithm
The Smith Waterman algorithm
avrilcoghlan
 

What's hot (20)

Clustal
ClustalClustal
Clustal
 
Protein database
Protein databaseProtein database
Protein database
 
EMBL-EBI
EMBL-EBIEMBL-EBI
EMBL-EBI
 
Dynamic programming and pairwise sequence alignment
Dynamic programming and pairwise sequence alignmentDynamic programming and pairwise sequence alignment
Dynamic programming and pairwise sequence alignment
 
Multiple Sequence Alignment
Multiple Sequence AlignmentMultiple Sequence Alignment
Multiple Sequence Alignment
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignment
 
Upgma
UpgmaUpgma
Upgma
 
Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment   Clustal W - Multiple Sequence alignment
Clustal W - Multiple Sequence alignment
 
Tech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome BrowserTech Talk: UCSC Genome Browser
Tech Talk: UCSC Genome Browser
 
UniProt
UniProtUniProt
UniProt
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
PAM : Point Accepted Mutation
PAM : Point Accepted MutationPAM : Point Accepted Mutation
PAM : Point Accepted Mutation
 
Dot matrix Analysis Tools (Bioinformatics)
Dot matrix Analysis Tools (Bioinformatics)Dot matrix Analysis Tools (Bioinformatics)
Dot matrix Analysis Tools (Bioinformatics)
 
(Expasy)
(Expasy)(Expasy)
(Expasy)
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment The Needleman-Wunsch Algorithm for Sequence Alignment
The Needleman-Wunsch Algorithm for Sequence Alignment
 
Phylogenetic tree and it's types
Phylogenetic tree and it's typesPhylogenetic tree and it's types
Phylogenetic tree and it's types
 
Phylogenetic tree and its construction and phylogeny of
Phylogenetic tree and its construction and phylogeny ofPhylogenetic tree and its construction and phylogeny of
Phylogenetic tree and its construction and phylogeny of
 
Phylogenetic tree construction
Phylogenetic tree constructionPhylogenetic tree construction
Phylogenetic tree construction
 
The Smith Waterman algorithm
The Smith Waterman algorithmThe Smith Waterman algorithm
The Smith Waterman algorithm
 

Viewers also liked

Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformatics
avrilcoghlan
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
Abhishek Vatsa
 
RNA Secondary Structure Prediction
RNA Secondary Structure PredictionRNA Secondary Structure Prediction
RNA Secondary Structure Prediction
Sumin Byeon
 

Viewers also liked (20)

Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
Introduction to sequence alignment
Introduction to sequence alignmentIntroduction to sequence alignment
Introduction to sequence alignment
 
Parwati sihag
Parwati sihagParwati sihag
Parwati sihag
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
B.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentB.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignment
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)
 
BLAST
BLASTBLAST
BLAST
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
RNA secondary structure prediction
RNA secondary structure predictionRNA secondary structure prediction
RNA secondary structure prediction
 
Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformatics
 
BLAST and sequence alignment
BLAST and sequence alignmentBLAST and sequence alignment
BLAST and sequence alignment
 
Blast fasta 4
Blast fasta 4Blast fasta 4
Blast fasta 4
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Application of bioinformatics
Application of bioinformaticsApplication of bioinformatics
Application of bioinformatics
 
Application of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesApplication of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciences
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 
RNA Secondary Structure Prediction
RNA Secondary Structure PredictionRNA Secondary Structure Prediction
RNA Secondary Structure Prediction
 

Similar to Sequence alignment belgaum

B.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentB.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignment
Rai University
 
The derivation of ungapped global protein alignment score distributions - Part1
The derivation of ungapped global protein alignment score distributions - Part1The derivation of ungapped global protein alignment score distributions - Part1
The derivation of ungapped global protein alignment score distributions - Part1
Keiji Takamoto
 
20100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture0720100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture07
Computer Science Club
 
Blast fasta
Blast fastaBlast fasta
Blast fasta
yaghava
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Sanaym
 
Bioinformatica 10-11-2011-t5-database searching
Bioinformatica 10-11-2011-t5-database searchingBioinformatica 10-11-2011-t5-database searching
Bioinformatica 10-11-2011-t5-database searching
Prof. Wim Van Criekinge
 

Similar to Sequence alignment belgaum (20)

How the blast work
How the blast workHow the blast work
How the blast work
 
Seq alignment
Seq alignment Seq alignment
Seq alignment
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
 
Laboratory 1 sequence_alignments
Laboratory 1 sequence_alignmentsLaboratory 1 sequence_alignments
Laboratory 1 sequence_alignments
 
B.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentB.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignment
 
Bioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmmBioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmm
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptx
 
The derivation of ungapped global protein alignment score distributions - Part1
The derivation of ungapped global protein alignment score distributions - Part1The derivation of ungapped global protein alignment score distributions - Part1
The derivation of ungapped global protein alignment score distributions - Part1
 
Sequence alignment.pptx
Sequence alignment.pptxSequence alignment.pptx
Sequence alignment.pptx
 
Computation and System Biology Assignment Help
Computation and System Biology Assignment HelpComputation and System Biology Assignment Help
Computation and System Biology Assignment Help
 
Bioinformatica 20-10-2011-t3-scoring matrices
Bioinformatica 20-10-2011-t3-scoring matricesBioinformatica 20-10-2011-t3-scoring matrices
Bioinformatica 20-10-2011-t3-scoring matrices
 
5.4 mining sequence patterns in biological data
5.4 mining sequence patterns in biological data5.4 mining sequence patterns in biological data
5.4 mining sequence patterns in biological data
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
20100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture0720100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture07
 
Blast fasta
Blast fastaBlast fasta
Blast fasta
 
AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdf
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Prediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methodsPrediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methods
 
Bioinformatica 10-11-2011-t5-database searching
Bioinformatica 10-11-2011-t5-database searchingBioinformatica 10-11-2011-t5-database searching
Bioinformatica 10-11-2011-t5-database searching
 

More from National Institute of Biologics

Canonical structures for the hypervariable regions of immunoglobulins
Canonical structures for the hypervariable regions of immunoglobulinsCanonical structures for the hypervariable regions of immunoglobulins
Canonical structures for the hypervariable regions of immunoglobulins
National Institute of Biologics
 

More from National Institute of Biologics (20)

Waters protein therapeutics application proctocols
Waters protein therapeutics application proctocolsWaters protein therapeutics application proctocols
Waters protein therapeutics application proctocols
 
Potential aggregation prone regions in biotherapeutics
Potential aggregation prone regions in biotherapeuticsPotential aggregation prone regions in biotherapeutics
Potential aggregation prone regions in biotherapeutics
 
How the biologics landscape is evolving
How the biologics landscape is evolvingHow the biologics landscape is evolving
How the biologics landscape is evolving
 
Evaluation of antibody drugs quality safety
Evaluation of antibody drugs quality safetyEvaluation of antibody drugs quality safety
Evaluation of antibody drugs quality safety
 
Approved m abs_feb_2015
Approved m abs_feb_2015Approved m abs_feb_2015
Approved m abs_feb_2015
 
Translating next generation sequencing to practice
Translating next generation sequencing to practiceTranslating next generation sequencing to practice
Translating next generation sequencing to practice
 
From biomarkers to diagnostics –the road to success
From biomarkers to diagnostics –the road to successFrom biomarkers to diagnostics –the road to success
From biomarkers to diagnostics –the road to success
 
Defining your-target-product-profile in-vitro-diagnostic-products
Defining your-target-product-profile in-vitro-diagnostic-productsDefining your-target-product-profile in-vitro-diagnostic-products
Defining your-target-product-profile in-vitro-diagnostic-products
 
Accelerating development and approval of targeted cancer therapies
Accelerating development and approval of targeted cancer therapiesAccelerating development and approval of targeted cancer therapies
Accelerating development and approval of targeted cancer therapies
 
Canonical structures for the hypervariable regions of immunoglobulins
Canonical structures for the hypervariable regions of immunoglobulinsCanonical structures for the hypervariable regions of immunoglobulins
Canonical structures for the hypervariable regions of immunoglobulins
 
Canonical correlation
Canonical correlationCanonical correlation
Canonical correlation
 
Development trends for human monoclonal antibody therapeutics
Development trends for human monoclonal antibody therapeuticsDevelopment trends for human monoclonal antibody therapeutics
Development trends for human monoclonal antibody therapeutics
 
Therapeutic fc fusion proteins and peptides as successful alternatives to ant...
Therapeutic fc fusion proteins and peptides as successful alternatives to ant...Therapeutic fc fusion proteins and peptides as successful alternatives to ant...
Therapeutic fc fusion proteins and peptides as successful alternatives to ant...
 
Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...
Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...
Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...
 
Therapeutic antibodies for autoimmunity and inflammation
Therapeutic antibodies for autoimmunity and inflammationTherapeutic antibodies for autoimmunity and inflammation
Therapeutic antibodies for autoimmunity and inflammation
 
Introduction to current and future protein therapeutics - a protein engineeri...
Introduction to current and future protein therapeutics - a protein engineeri...Introduction to current and future protein therapeutics - a protein engineeri...
Introduction to current and future protein therapeutics - a protein engineeri...
 
Pharmaceutical monoclonal antibodies production - guidelines to cell engine...
Pharmaceutical monoclonal antibodies   production - guidelines to cell engine...Pharmaceutical monoclonal antibodies   production - guidelines to cell engine...
Pharmaceutical monoclonal antibodies production - guidelines to cell engine...
 
Intended use of reference products & who international standards or reference...
Intended use of reference products & who international standards or reference...Intended use of reference products & who international standards or reference...
Intended use of reference products & who international standards or reference...
 
How dissimilarly similar are biosimilars
How dissimilarly similar are biosimilarsHow dissimilarly similar are biosimilars
How dissimilarly similar are biosimilars
 
Evaluation of similar biotherapeutic products (SBP's) scientific principles ...
Evaluation of similar biotherapeutic products (SBP's)   scientific principles ...Evaluation of similar biotherapeutic products (SBP's)   scientific principles ...
Evaluation of similar biotherapeutic products (SBP's) scientific principles ...
 

Sequence alignment belgaum

  • 2.
  • 3.
  • 4. Homologous refers to conclusion drawn from the data that the two genes or sequences have descended from a common ancestor Homologous sequences are of two types Orthologous Homologous sequences in different species that arose from a common ancestral gene during speciation Parologous Homologous sequences within a single species that arose by gene duplication
  • 5. What is Alignment ? Explicit mapping between two or more sequences To place one sequence over another in such a fashion so as to get maximum similarity SEQUENCE ALIGNMENT STRUCTURAL ALIGNMENT
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15. ALIGNMENT Pairwise alignment Multiple alignment
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. Global vs. Local Alignment
  • 21.
  • 22. Extreme Value Distribution Probability density function for the extreme value distribution resulting from parameter values  = 0 and  = 1, [ y = 1 – exp(- e -x )], where  is the characteristic value and  is the decay constant. y = 1 – exp(- e -  ( x -  ) )
  • 23. Extreme Value Distribution (EDV) You know that an optimal alignment of two sequences is selected out of many suboptimal alignments, and that a database search is also about selecting the best alignment(s). This bodes well with the EDV which has a right tail that falls off more slowly than the left tail. Compared to using the normal distribution, when using the EDV an alignment has to score further away from the expected mean value to become a significant hit. real data EDV approximation
  • 24. Extreme Value Distribution The probability of a score S to be larger than a given value x can be calculated following the EDV as: E-value: P ( S  x ) = 1 – exp(- e -  ( x -  ) ) , where  =(ln Kmn )/  , and K a constant that can be estimated from the background amino acid distribution and scoring matrix (see Altschul and Gish, 1996, for a collection of values for  and K over a set of widely used scoring matrices).
  • 25. Extreme Value Distribution Using the equation for  (preceding slide), the probability for the raw alignment score S becomes P ( S  x ) = 1 – exp(- Kmne -  x ). In practice, the probability P ( S  x ) is estimated using the approximation 1 – exp(- e -x )  e -x , which is valid for large values of x . This leads to a simplification of the equation for P ( S  x ): P ( S  x )  e -  (x-  ) = Kmn e -  x . The lower the probability (E value) for a given threshold value x, the more significant the score S .
  • 26.
  • 27.
  • 28. Construction of the Lookup Table Position Number Residue Seq 1 Seq2 Offset(p1-p2) F 1 - - L 2 - - W 3,6 2,5 1(3,2) 1(6,5) 4(6,2) -2(3,5) R 4 - - T 5 4,6 1(5,4) - 1(5,6) S 7 1 6(7,1) K - 3 - Pos no. 1 2 3 4 5 6 7 Sequence 1 F L W R T W S Sequence 2 S W K T W T
  • 29. Calculation of Offset Frequency Offset Frequency 1 3 4 1 -1 1 -2 1 6 1 Final Local Alignment Pos no. 1 2 3 4 5 6 7 Sequence 1 F L W R T W S Sequence 2 - S W K T W T
  • 30. Extreme Value Distribution Using the equation for  (preceding slide), the probability for the raw alignment score S becomes P ( S  x ) = 1 – exp(- Kmne -  x ). In practice, the probability P ( S  x ) is estimated using the approximation 1 – exp(- e -x )  e -x , which is valid for large values of x . This leads to a simplification of the equation for P ( S  x ): P ( S  x )  e -  (x-  ) = Kmn e -  x . The lower the probability (E value) for a given threshold value x, the more significant the score S .
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.