SlideShare a Scribd company logo
1 of 35
SEQUENCE ALIGNMENT P.S.CHANDRANAND
Objectives ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Homologous   refers to conclusion drawn from the data that the two genes or sequences have descended from a common ancestor   Homologous sequences are of two types   Orthologous   Homologous sequences in different species that arose from a common ancestral gene during speciation Parologous   Homologous sequences within a single species that arose by gene duplication
What is Alignment ? Explicit mapping between two or more sequences   To place one sequence over another in such a fashion so as to get maximum similarity SEQUENCE ALIGNMENT  STRUCTURAL  ALIGNMENT
[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object]
Similarity vs. homology ,[object Object],[object Object],[object Object]
Proteins of 100% identity  (Human & Xenopus Myoglobin) ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],GLSDGEWQ Q VLNVWGKVEADI A GHGQEV LIRLF T GHPETLEKFDKFKHLKTE A EMKA SEDLKKHG TV VLTALGGILKKKGHHEAE L KPLAQSHATKHKIP I KYLEFIS DA II H VL H SKHPGDFGADAQGAM T KALELFR N D I A A K YKELGFQG Proteins with similarity  (H orse P02188  & Xenopus)
Evolutionary Basis ,[object Object],[object Object]
Basic Concept of Alignment ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
[object Object]
ALIGNMENT Pairwise alignment    Multiple alignment
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Why multiple sequence alignment   ? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],The process of aligning sequences is a game involving playing off gaps and mismatches
[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Comparative Analysis of Alignment Techniques
Global vs. Local Alignment
A model for database searching score probabilities ,[object Object],[object Object]
Extreme Value Distribution Probability density function for the extreme value distribution resulting from parameter values    = 0 and    = 1, [ y  = 1 – exp(- e -x )], where     is the characteristic value and     is the decay constant.  y  = 1 – exp(- e -  ( x -  ) )
Extreme Value Distribution (EDV) You  know that an optimal alignment of two sequences is selected out of many suboptimal alignments, and that a database search is also about selecting the best alignment(s). This bodes well with the EDV which has a right tail that falls off more slowly than the left tail. Compared to using the normal distribution, when using the EDV an alignment has to score further away from the expected mean value to become a significant hit.  real data EDV approximation
Extreme Value Distribution The probability of a score  S  to be larger than a given value  x  can be calculated following the EDV as:  E-value: P ( S     x ) = 1 – exp(- e  -  ( x -  ) ) ,  where      =(ln  Kmn )/  , and  K  a constant that can be estimated from the background amino acid distribution and scoring matrix (see Altschul and Gish, 1996, for a collection of values for    and  K  over a set of widely used scoring matrices).
Extreme Value Distribution Using the equation for     (preceding slide), the probability for the raw alignment score  S  becomes  P ( S     x ) = 1 – exp(- Kmne -  x ). In practice, the probability  P ( S  x ) is estimated using the approximation 1 – exp(- e -x )    e -x , which is valid for large values of  x . This leads to a simplification of the equation for  P ( S  x ): P ( S    x )    e -  (x-  )  = Kmn e -  x . The lower the probability (E value) for a given threshold value x, the more significant the score  S .
Normalised sequence similarity Statistical significance ,[object Object],[object Object]
FASTP : Local Alignment Tool Sequence 1  F  L  W  R  T  W  S Sequence 2  S  W  K  T  W  T Method based on lookup tables Lipman & Pearson, Science (1985) vol 227,1435-41 ,[object Object],[object Object]
Construction of the Lookup Table   Position Number Residue  Seq 1  Seq2  Offset(p1-p2) F  1   -   - L  2   -   - W  3,6  2,5  1(3,2)  1(6,5)  4(6,2)  -2(3,5) R  4   -   - T  5  4,6 1(5,4)  - 1(5,6) S  7   1    6(7,1) K  -   3  - Pos no.  1  2  3  4  5  6  7 Sequence 1  F  L  W  R  T  W  S Sequence 2  S  W  K  T  W  T
Calculation of Offset Frequency Offset  Frequency   1  3   4  1 -1  1 -2  1    6  1 Final Local Alignment Pos no.   1  2  3  4  5  6  7 Sequence 1   F  L  W  R  T  W  S Sequence 2   -  S  W  K  T  W  T
Extreme Value Distribution Using the equation for     (preceding slide), the probability for the raw alignment score  S  becomes  P ( S     x ) = 1 – exp(- Kmne -  x ). In practice, the probability  P ( S  x ) is estimated using the approximation 1 – exp(- e -x )    e -x , which is valid for large values of  x . This leads to a simplification of the equation for  P ( S  x ): P ( S    x )    e -  (x-  )  = Kmn e -  x . The lower the probability (E value) for a given threshold value x, the more significant the score  S .
-Needleman-Wunsch (1970) provided first automatic method -Dynamic Programming to Find Global Alignment ,[object Object],[object Object],[object Object],NEEDLEMAN-WUNSCH Algorithm
Gaps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Gaps ,[object Object],[object Object],[object Object],[object Object],[object Object],AGGVLIQVG  AGGVLIIQVG AGGVL-IQVG   AGGVLIIQVG
Gaps ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Summary An alignment just  reflects the  probable  evolutionary history  of the two genes as it is  presumed  that the homologous sequences have diverged from a common ancestral sequence through iterative molecular changes ,[object Object],[object Object],[object Object],[object Object],Two types of gap penalties Global alignment   Local alignment Two types of Alignment Linear gap penalty Affine gap penalty

More Related Content

What's hot (20)

Msa
MsaMsa
Msa
 
NCBI
NCBINCBI
NCBI
 
MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)MEGA (Molecular Evolutionary Genetics Analysis)
MEGA (Molecular Evolutionary Genetics Analysis)
 
Blast fasta
Blast fastaBlast fasta
Blast fasta
 
Dot matrix seminar
Dot matrix seminarDot matrix seminar
Dot matrix seminar
 
Sequence Alignment In Bioinformatics
Sequence Alignment In BioinformaticsSequence Alignment In Bioinformatics
Sequence Alignment In Bioinformatics
 
BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)BLAST (Basic local alignment search Tool)
BLAST (Basic local alignment search Tool)
 
222397 lecture 16 17
222397 lecture 16 17222397 lecture 16 17
222397 lecture 16 17
 
BITS: Basics of Sequence similarity
BITS: Basics of Sequence similarityBITS: Basics of Sequence similarity
BITS: Basics of Sequence similarity
 
Fasta
FastaFasta
Fasta
 
Sequence database
Sequence databaseSequence database
Sequence database
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Dynamic programming and pairwise sequence alignment
Dynamic programming and pairwise sequence alignmentDynamic programming and pairwise sequence alignment
Dynamic programming and pairwise sequence alignment
 
Dot matrix
Dot matrixDot matrix
Dot matrix
 
BLAST
BLASTBLAST
BLAST
 
protein sequence analysis
protein sequence analysisprotein sequence analysis
protein sequence analysis
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
Pairwise sequence alignment
Pairwise sequence alignmentPairwise sequence alignment
Pairwise sequence alignment
 
EMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology LaboratoryEMBL- European Molecular Biology Laboratory
EMBL- European Molecular Biology Laboratory
 

Viewers also liked

sequence alignment
sequence alignmentsequence alignment
sequence alignmentammar kareem
 
B.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentB.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentRai University
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-naveed ul mushtaq
 
Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Pritom Chaki
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignmentgoneshwar
 
RNA secondary structure prediction
RNA secondary structure predictionRNA secondary structure prediction
RNA secondary structure predictionMuhammed sadiq
 
Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformaticsavrilcoghlan
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformaticsHamid Ur-Rahman
 
Application of bioinformatics
Application of bioinformaticsApplication of bioinformatics
Application of bioinformaticsKamlesh Patade
 
Application of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesApplication of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesSobia
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformaticsAbhishek Vatsa
 
RNA Secondary Structure Prediction
RNA Secondary Structure PredictionRNA Secondary Structure Prediction
RNA Secondary Structure PredictionSumin Byeon
 
Bioinformatics
BioinformaticsBioinformatics
BioinformaticsJTADrexel
 
Local vs. Global Models for Effort Estimation and Defect Prediction
Local vs. Global Models for Effort Estimation and Defect Prediction Local vs. Global Models for Effort Estimation and Defect Prediction
Local vs. Global Models for Effort Estimation and Defect Prediction CS, NcState
 

Viewers also liked (20)

Parwati sihag
Parwati sihagParwati sihag
Parwati sihag
 
sequence alignment
sequence alignmentsequence alignment
sequence alignment
 
B.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentB.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignment
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
BLAST
BLASTBLAST
BLAST
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
RNA secondary structure prediction
RNA secondary structure predictionRNA secondary structure prediction
RNA secondary structure prediction
 
Dotplots for Bioinformatics
Dotplots for BioinformaticsDotplots for Bioinformatics
Dotplots for Bioinformatics
 
BLAST and sequence alignment
BLAST and sequence alignmentBLAST and sequence alignment
BLAST and sequence alignment
 
Blast fasta 4
Blast fasta 4Blast fasta 4
Blast fasta 4
 
blast bioinformatics
blast bioinformaticsblast bioinformatics
blast bioinformatics
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
Application of bioinformatics
Application of bioinformaticsApplication of bioinformatics
Application of bioinformatics
 
Application of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciencesApplication of Bioinformatics in different fields of sciences
Application of Bioinformatics in different fields of sciences
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 
RNA Secondary Structure Prediction
RNA Secondary Structure PredictionRNA Secondary Structure Prediction
RNA Secondary Structure Prediction
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
Local vs. Global Models for Effort Estimation and Defect Prediction
Local vs. Global Models for Effort Estimation and Defect Prediction Local vs. Global Models for Effort Estimation and Defect Prediction
Local vs. Global Models for Effort Estimation and Defect Prediction
 

Similar to Sequence alignment belgaum

How the blast work
How the blast workHow the blast work
How the blast workAtai Rabby
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)AnkitTiwari354
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENTMariya Raju
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its applicationprateek kumar
 
Laboratory 1 sequence_alignments
Laboratory 1 sequence_alignmentsLaboratory 1 sequence_alignments
Laboratory 1 sequence_alignmentsseham15
 
B.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentB.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentRai University
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptxArupKhakhlari1
 
The derivation of ungapped global protein alignment score distributions - Part1
The derivation of ungapped global protein alignment score distributions - Part1The derivation of ungapped global protein alignment score distributions - Part1
The derivation of ungapped global protein alignment score distributions - Part1Keiji Takamoto
 
Computation and System Biology Assignment Help
Computation and System Biology Assignment HelpComputation and System Biology Assignment Help
Computation and System Biology Assignment HelpNursing Assignment Help
 
Bioinformatica 20-10-2011-t3-scoring matrices
Bioinformatica 20-10-2011-t3-scoring matricesBioinformatica 20-10-2011-t3-scoring matrices
Bioinformatica 20-10-2011-t3-scoring matricesProf. Wim Van Criekinge
 
5.4 mining sequence patterns in biological data
5.4 mining sequence patterns in biological data5.4 mining sequence patterns in biological data
5.4 mining sequence patterns in biological dataKrish_ver2
 
20100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture0720100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture07Computer Science Club
 
AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfH K Yoon
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignmentSanaym
 
Prediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methodsPrediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methodsziggurat
 

Similar to Sequence alignment belgaum (20)

How the blast work
How the blast workHow the blast work
How the blast work
 
Seq alignment
Seq alignment Seq alignment
Seq alignment
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Microarray and its application
Microarray and its applicationMicroarray and its application
Microarray and its application
 
Laboratory 1 sequence_alignments
Laboratory 1 sequence_alignmentsLaboratory 1 sequence_alignments
Laboratory 1 sequence_alignments
 
B.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignmentB.sc biochem i bobi u 3.1 sequence alignment
B.sc biochem i bobi u 3.1 sequence alignment
 
Bioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmmBioinformatica 08-12-2011-t8-go-hmm
Bioinformatica 08-12-2011-t8-go-hmm
 
Bioinformatics
BioinformaticsBioinformatics
Bioinformatics
 
4. sequence alignment.pptx
4. sequence alignment.pptx4. sequence alignment.pptx
4. sequence alignment.pptx
 
The derivation of ungapped global protein alignment score distributions - Part1
The derivation of ungapped global protein alignment score distributions - Part1The derivation of ungapped global protein alignment score distributions - Part1
The derivation of ungapped global protein alignment score distributions - Part1
 
Sequence alignment.pptx
Sequence alignment.pptxSequence alignment.pptx
Sequence alignment.pptx
 
Computation and System Biology Assignment Help
Computation and System Biology Assignment HelpComputation and System Biology Assignment Help
Computation and System Biology Assignment Help
 
Bioinformatica 20-10-2011-t3-scoring matrices
Bioinformatica 20-10-2011-t3-scoring matricesBioinformatica 20-10-2011-t3-scoring matrices
Bioinformatica 20-10-2011-t3-scoring matrices
 
5.4 mining sequence patterns in biological data
5.4 mining sequence patterns in biological data5.4 mining sequence patterns in biological data
5.4 mining sequence patterns in biological data
 
Sequence Alignment
Sequence AlignmentSequence Alignment
Sequence Alignment
 
20100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture0720100515 bioinformatics kapushesky_lecture07
20100515 bioinformatics kapushesky_lecture07
 
AI 바이오 (4일차).pdf
AI 바이오 (4일차).pdfAI 바이오 (4일차).pdf
AI 바이오 (4일차).pdf
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Prediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methodsPrediction of transcription factor binding to DNA using rule induction methods
Prediction of transcription factor binding to DNA using rule induction methods
 

More from National Institute of Biologics

Defining your-target-product-profile in-vitro-diagnostic-products
Defining your-target-product-profile in-vitro-diagnostic-productsDefining your-target-product-profile in-vitro-diagnostic-products
Defining your-target-product-profile in-vitro-diagnostic-productsNational Institute of Biologics
 
Accelerating development and approval of targeted cancer therapies
Accelerating development and approval of targeted cancer therapiesAccelerating development and approval of targeted cancer therapies
Accelerating development and approval of targeted cancer therapiesNational Institute of Biologics
 
Canonical structures for the hypervariable regions of immunoglobulins
Canonical structures for the hypervariable regions of immunoglobulinsCanonical structures for the hypervariable regions of immunoglobulins
Canonical structures for the hypervariable regions of immunoglobulinsNational Institute of Biologics
 
Development trends for human monoclonal antibody therapeutics
Development trends for human monoclonal antibody therapeuticsDevelopment trends for human monoclonal antibody therapeutics
Development trends for human monoclonal antibody therapeuticsNational Institute of Biologics
 
Therapeutic fc fusion proteins and peptides as successful alternatives to ant...
Therapeutic fc fusion proteins and peptides as successful alternatives to ant...Therapeutic fc fusion proteins and peptides as successful alternatives to ant...
Therapeutic fc fusion proteins and peptides as successful alternatives to ant...National Institute of Biologics
 
Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...
Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...
Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...National Institute of Biologics
 
Therapeutic antibodies for autoimmunity and inflammation
Therapeutic antibodies for autoimmunity and inflammationTherapeutic antibodies for autoimmunity and inflammation
Therapeutic antibodies for autoimmunity and inflammationNational Institute of Biologics
 
Introduction to current and future protein therapeutics - a protein engineeri...
Introduction to current and future protein therapeutics - a protein engineeri...Introduction to current and future protein therapeutics - a protein engineeri...
Introduction to current and future protein therapeutics - a protein engineeri...National Institute of Biologics
 
Pharmaceutical monoclonal antibodies production - guidelines to cell engine...
Pharmaceutical monoclonal antibodies   production - guidelines to cell engine...Pharmaceutical monoclonal antibodies   production - guidelines to cell engine...
Pharmaceutical monoclonal antibodies production - guidelines to cell engine...National Institute of Biologics
 
Intended use of reference products & who international standards or reference...
Intended use of reference products & who international standards or reference...Intended use of reference products & who international standards or reference...
Intended use of reference products & who international standards or reference...National Institute of Biologics
 
Evaluation of similar biotherapeutic products (SBP's) scientific principles ...
Evaluation of similar biotherapeutic products (SBP's)   scientific principles ...Evaluation of similar biotherapeutic products (SBP's)   scientific principles ...
Evaluation of similar biotherapeutic products (SBP's) scientific principles ...National Institute of Biologics
 

More from National Institute of Biologics (20)

Waters protein therapeutics application proctocols
Waters protein therapeutics application proctocolsWaters protein therapeutics application proctocols
Waters protein therapeutics application proctocols
 
Potential aggregation prone regions in biotherapeutics
Potential aggregation prone regions in biotherapeuticsPotential aggregation prone regions in biotherapeutics
Potential aggregation prone regions in biotherapeutics
 
How the biologics landscape is evolving
How the biologics landscape is evolvingHow the biologics landscape is evolving
How the biologics landscape is evolving
 
Evaluation of antibody drugs quality safety
Evaluation of antibody drugs quality safetyEvaluation of antibody drugs quality safety
Evaluation of antibody drugs quality safety
 
Approved m abs_feb_2015
Approved m abs_feb_2015Approved m abs_feb_2015
Approved m abs_feb_2015
 
Translating next generation sequencing to practice
Translating next generation sequencing to practiceTranslating next generation sequencing to practice
Translating next generation sequencing to practice
 
From biomarkers to diagnostics –the road to success
From biomarkers to diagnostics –the road to successFrom biomarkers to diagnostics –the road to success
From biomarkers to diagnostics –the road to success
 
Defining your-target-product-profile in-vitro-diagnostic-products
Defining your-target-product-profile in-vitro-diagnostic-productsDefining your-target-product-profile in-vitro-diagnostic-products
Defining your-target-product-profile in-vitro-diagnostic-products
 
Accelerating development and approval of targeted cancer therapies
Accelerating development and approval of targeted cancer therapiesAccelerating development and approval of targeted cancer therapies
Accelerating development and approval of targeted cancer therapies
 
Canonical structures for the hypervariable regions of immunoglobulins
Canonical structures for the hypervariable regions of immunoglobulinsCanonical structures for the hypervariable regions of immunoglobulins
Canonical structures for the hypervariable regions of immunoglobulins
 
Canonical correlation
Canonical correlationCanonical correlation
Canonical correlation
 
Development trends for human monoclonal antibody therapeutics
Development trends for human monoclonal antibody therapeuticsDevelopment trends for human monoclonal antibody therapeutics
Development trends for human monoclonal antibody therapeutics
 
Therapeutic fc fusion proteins and peptides as successful alternatives to ant...
Therapeutic fc fusion proteins and peptides as successful alternatives to ant...Therapeutic fc fusion proteins and peptides as successful alternatives to ant...
Therapeutic fc fusion proteins and peptides as successful alternatives to ant...
 
Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...
Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...
Fc fusion proteins and fc rn - structural insights for longer-lasting and mor...
 
Therapeutic antibodies for autoimmunity and inflammation
Therapeutic antibodies for autoimmunity and inflammationTherapeutic antibodies for autoimmunity and inflammation
Therapeutic antibodies for autoimmunity and inflammation
 
Introduction to current and future protein therapeutics - a protein engineeri...
Introduction to current and future protein therapeutics - a protein engineeri...Introduction to current and future protein therapeutics - a protein engineeri...
Introduction to current and future protein therapeutics - a protein engineeri...
 
Pharmaceutical monoclonal antibodies production - guidelines to cell engine...
Pharmaceutical monoclonal antibodies   production - guidelines to cell engine...Pharmaceutical monoclonal antibodies   production - guidelines to cell engine...
Pharmaceutical monoclonal antibodies production - guidelines to cell engine...
 
Intended use of reference products & who international standards or reference...
Intended use of reference products & who international standards or reference...Intended use of reference products & who international standards or reference...
Intended use of reference products & who international standards or reference...
 
How dissimilarly similar are biosimilars
How dissimilarly similar are biosimilarsHow dissimilarly similar are biosimilars
How dissimilarly similar are biosimilars
 
Evaluation of similar biotherapeutic products (SBP's) scientific principles ...
Evaluation of similar biotherapeutic products (SBP's)   scientific principles ...Evaluation of similar biotherapeutic products (SBP's)   scientific principles ...
Evaluation of similar biotherapeutic products (SBP's) scientific principles ...
 

Sequence alignment belgaum

  • 2.
  • 3.
  • 4. Homologous refers to conclusion drawn from the data that the two genes or sequences have descended from a common ancestor Homologous sequences are of two types Orthologous Homologous sequences in different species that arose from a common ancestral gene during speciation Parologous Homologous sequences within a single species that arose by gene duplication
  • 5. What is Alignment ? Explicit mapping between two or more sequences To place one sequence over another in such a fashion so as to get maximum similarity SEQUENCE ALIGNMENT STRUCTURAL ALIGNMENT
  • 6.
  • 7.
  • 8.
  • 9.
  • 10.
  • 11.
  • 12.
  • 13.
  • 14.
  • 15. ALIGNMENT Pairwise alignment Multiple alignment
  • 16.
  • 17.
  • 18.
  • 19.
  • 20. Global vs. Local Alignment
  • 21.
  • 22. Extreme Value Distribution Probability density function for the extreme value distribution resulting from parameter values  = 0 and  = 1, [ y = 1 – exp(- e -x )], where  is the characteristic value and  is the decay constant. y = 1 – exp(- e -  ( x -  ) )
  • 23. Extreme Value Distribution (EDV) You know that an optimal alignment of two sequences is selected out of many suboptimal alignments, and that a database search is also about selecting the best alignment(s). This bodes well with the EDV which has a right tail that falls off more slowly than the left tail. Compared to using the normal distribution, when using the EDV an alignment has to score further away from the expected mean value to become a significant hit. real data EDV approximation
  • 24. Extreme Value Distribution The probability of a score S to be larger than a given value x can be calculated following the EDV as: E-value: P ( S  x ) = 1 – exp(- e -  ( x -  ) ) , where  =(ln Kmn )/  , and K a constant that can be estimated from the background amino acid distribution and scoring matrix (see Altschul and Gish, 1996, for a collection of values for  and K over a set of widely used scoring matrices).
  • 25. Extreme Value Distribution Using the equation for  (preceding slide), the probability for the raw alignment score S becomes P ( S  x ) = 1 – exp(- Kmne -  x ). In practice, the probability P ( S  x ) is estimated using the approximation 1 – exp(- e -x )  e -x , which is valid for large values of x . This leads to a simplification of the equation for P ( S  x ): P ( S  x )  e -  (x-  ) = Kmn e -  x . The lower the probability (E value) for a given threshold value x, the more significant the score S .
  • 26.
  • 27.
  • 28. Construction of the Lookup Table Position Number Residue Seq 1 Seq2 Offset(p1-p2) F 1 - - L 2 - - W 3,6 2,5 1(3,2) 1(6,5) 4(6,2) -2(3,5) R 4 - - T 5 4,6 1(5,4) - 1(5,6) S 7 1 6(7,1) K - 3 - Pos no. 1 2 3 4 5 6 7 Sequence 1 F L W R T W S Sequence 2 S W K T W T
  • 29. Calculation of Offset Frequency Offset Frequency 1 3 4 1 -1 1 -2 1 6 1 Final Local Alignment Pos no. 1 2 3 4 5 6 7 Sequence 1 F L W R T W S Sequence 2 - S W K T W T
  • 30. Extreme Value Distribution Using the equation for  (preceding slide), the probability for the raw alignment score S becomes P ( S  x ) = 1 – exp(- Kmne -  x ). In practice, the probability P ( S  x ) is estimated using the approximation 1 – exp(- e -x )  e -x , which is valid for large values of x . This leads to a simplification of the equation for P ( S  x ): P ( S  x )  e -  (x-  ) = Kmn e -  x . The lower the probability (E value) for a given threshold value x, the more significant the score S .
  • 31.
  • 32.
  • 33.
  • 34.
  • 35.