SlideShare a Scribd company logo
Secondary Structure
Prediction Of Protein
Protein
Sequence +
Structure
VIJAY
INRODUCTION

Primary structure (Amino acid sequence)
↓
Secondary structure (α-helix, β-sheet)
↓
Tertiary structure (Three-dimensional
structure formed by assembly of secondary
structures)
↓
Quaternary structure (Structure formed by
more than one polypeptide chains)
Secondary Structure
 Defined as the local conformation of protein backbone
 Primary Structure —folding— Secondary Structure
 a helix and b sheet
Secondary Structure
Regular Secondary
Structure
(a-helices, b-sheets)
Irregular
Secondary
Structure
(Tight turns,
Random coils,
bulges)
a helix
•common confirmation.
•spiral structure
•Tightly packed coiled polypeptide
backbone, with extending side chains
•Spontaneous
•stabilized by H-bonding between amide
hydrogens and carbonyl oxygens of peptide
bonds.
•R-groups lie on the exterior of the helix
and perpendicular to its axis.
•complete turn of helix —3.6 aminoacyl
residues with distance 0.54 nm
e.g. the keratins- entirely α-helical
Myoglobin- 80% helical
•Glycine and Proline , bulky amino acids,
charged amino acids favor disruption of the
helix.
b sheet
•β-sheets are composed of 2 or more different regions of
stretches of at least 5-10 amino acids.
•The folding and alignment of stretches of the polypeptide
backbone aside one another to form β-sheets is stabilized by
H-bonding between amide hydrogens and carbonyl oxygens
•the peptide backbone of the β sheet is highly extended.
•R groups of adjacent residues point in opposite directions.
• β-sheets are either parallel or antiparallel
b-sheet
(parallel, anti-parallel)
What is secondary
structure prediction?
 Given a protein sequence (primary structure)
 1st step in prediction of protein structure.
 Technique concerned with determination of secondary structure of
given polypeptide by locating the Coils Alpha Helix Beta Strands in
plypeptide
GHWIATRGQLIREAYEDYRHFSSECPFIP
 Predict its secondary structure content
(C=Coils H=Alpha Helix E=Beta Strands)
CEEEEECHHHHHHHHHHHCCCHHCCCCCC
Why secondary structure
prediction?
o secondary structure —tertiary structure prediction
o Protein function prediction
o Protein classification
o Predicting structural change
o detection and alignment of remote homology between proteins
o on detecting transmembrane regions, solvent-accessibleresidues,
and other important features of molecules
o Detection of hydrophobic region and hydrophilic region
Prediction methods
o Statistical method
o Chou-Fasman method, GOR I-IV
o Nearest neighbors
o NNSSP, SSPAL
o Neural network
o PHD, Psi-Pred, J-Pred
o Support vector machine (SVM)
o HMM
Chou-Fasman algorithm
 Chou and fasman in 1978
 It is based on assigning a set of prediction value to amino
acid residue in polypeptide and applying an algorithm to the
conformational parameter and positional frequency.
 conformational parameter for each amino acid is calculated
by considering the relative frequency of each 20 amino
acid in proteins
 By this C=Coils H=Alpha Helix E=Beta Strands are
determined
 Also called preference parameter
• A table of prediction value or preference parameter for each
of 20 amino acid in alpha helix ,beta plate and turn
already calculated and standardised.
• To obtain the prediction value the frequency of amino
acids( i) in structure is divided by of all residences in
protein (s)
• i/s
• The resulting structural parameter of
p(alpha),p(beta),p(turn)vary —0.5 to 1.5 for 20 amino acid
 Window is scanned to find a short sequence of
amino acid that has high probability to form one
type of structure
 When 4 out of 6 amino acid have high
probability >1.03 the – alpha helix
 3 out of 5 amino acid with probability >1.03-beta
RULES
ALGORITHM
o Note preference parameter for 20 aa in peptide
o Scan the window and identify the region where 4 out of
6 contiguous residue have p(alpha helix) >1.00
o Continue scanning in both the direction until the 4
contiguous residue that have an average p(alpha
helix)<1.00,end of helix
o If segment is longer than 5aa and p(alpha helix)>p(beta
sheet )-segment –completely alpha helix
o scan different segment and identify - alpha helix
 Identify the region where 3 out of 5 aa have the
value of p( beta sheet) >1.00 ,region is predicted
as beta sheet
 Continue scanning both the direction until 4
residue that have p( beta sheet) <1.00
 End of beta sheet
 average p( beta sheet) >105 and p( beta sheet)
>p(alpha helix) than consider complete segment
as b pleated sheet
 If any region is over lapping than consider it as
alpha helix if average p(alpha helix)>p(beta sheet )
 Or beta sheet if p(alpha helix)<p(beta sheet )
 To identify turn
 P(t)=f(j)f(j+1)f(j+2)f(j+3)
 J=residual number
result
 Accuracy: ~50%  ~60%
 helix alanine,glutamine,leucine,methionine
 Helix breaking proline and glycine
 Beta sheet isoleucine,valine,tyrosine
 Beta breaking proline,aspargine,glutamine
 Turn contains proline(30%),serine(14%),lysine,
aspargine(10%)
 Glycine(19%),aspartic acid
(`18%),serine(13%),tyrosine(11%)
 http://www.accelrys.com/product/gcg-wisconsin-
package/program-list.html
Out put of Chou-Fasman
GOR METHOD
• GOR(Garnier,Osguthorpe,Robson)1978
• Chou fasman method is based on assumption that each amino
acid individually influence the 2ry structure of sequence
• GOR is based on, amino acid flanking the central amino acid
will influence the 2ry structure
• Consider a peptide central amino acid
side amino acid
• It assume that amino acid up to 8 residue on sides will
influence the 2ry structure of central residue
• 4th version
• 64% accurate
ALGORITHUM
•It uses the sliding window of 17 amino acid
•The side amino acid sequence and alignment is determined to
predict secondary structure of central sequence
•Good for helix than sheet because beta sheet has more inter
sequence hydrogen bonding
•36.5% accurate for beta sheet
•input any amino acid sequence
•Output tells about secondary structure
NEAREST NEIGHBOUR
METHOD
o Based on ,short homologues sequences of amino acids
have the same secondary structure
o It predicts secondary structure of central homologues
segment by neighbour homologues sequences
o By using structural database find some secondary
structure of sequence which may be homologues to our
target sequence
o Naturally evolved proteins with 35% identical amino acid
sequence will have same secondary structure
o Find some sequence which may match with target
sequence
o Scoring matrix,MSA
“Singleton” score matrix
Helix Sheet Loop
Buried Inter Exposed Buried Inter Exposed Buried Inter Exposed
ALA -0.578 -0.119 -0.160 0.010 0.583 0.921 0.023 0.218 0.368
ARG 0.997 -0.507 -0.488 1.267 -0.345 -0.580 0.930 -0.005 -0.032
ASN 0.819 0.090 -0.007 0.844 0.221 0.046 0.030 -0.322 -0.487
ASP 1.050 0.172 -0.426 1.145 0.322 0.061 0.308 -0.224 -0.541
CYS -0.360 0.333 1.831 -0.671 0.003 1.216 -0.690 -0.225 1.216
GLN 1.047 -0.294 -0.939 1.452 0.139 -0.555 1.326 0.486 -0.244
GLU 0.670 -0.313 -0.721 0.999 0.031 -0.494 0.845 0.248 -0.144
GLY 0.414 0.932 0.969 0.177 0.565 0.989 -0.562 -0.299 -0.601
HIS 0.479 -0.223 0.136 0.306 -0.343 -0.014 0.019 -0.285 0.051
ILE -0.551 0.087 1.248 -0.875 -0.182 0.500 -0.166 0.384 1.336
LEU -0.744 -0.218 0.940 -0.411 0.179 0.900 -0.205 0.169 1.217
LYS 1.863 -0.045 -0.865 2.109 -0.017 -0.901 1.925 0.474 -0.498
MET -0.641 -0.183 0.779 -0.269 0.197 0.658 -0.228 0.113 0.714
PHE -0.491 0.057 1.364 -0.649 -0.200 0.776 -0.375 -0.001 1.251
PRO 1.090 0.705 0.236 1.249 0.695 0.145 -0.412 -0.491 -0.641
SER 0.350 0.260 -0.020 0.303 0.058 -0.075 -0.173 -0.210 -0.228
THR 0.291 0.215 0.304 0.156 -0.382 -0.584 -0.012 -0.103 -0.125
TRP -0.379 -0.363 1.178 -0.270 -0.477 0.682 -0.220 -0.099 1.267
TYR -0.111 -0.292 0.942 -0.267 -0.691 0.292 -0.015 -0.176 0.946
VAL -0.374 0.236 1.144 -0.912 -0.334 0.089 -0.030 0.309 0.998
Neural Network
Method
•Prediction is done by utilizing the
information of different
DATABASE
•Linear sequence  3D structure of
Polypeptide
Neural network
Input signals are summed
and turned into zero or one
3.
J1
J2
J3
J4
Feed-forward multilayer network
Input layer Hidden layer Output layer
neurons
Enter sequences
Compare Prediction to Reality
AdjustWeights
Neural network training
Simple Neural Network
With Hidden Layer
outi  f ij
2
J  f jk
1
Jk
  kin






j







Simple neural network
with hidden layer
A
C
D
E
F
G
H
I
K
L
M
N
P
Q
R
S
T
V
W
Y
.
H
E
L
D (L)
R (E)
Q (E)
G (E)
F (E)
V (E)
P (E)
A (H)
A (H)
Y (H)
V (E)
K (E)
K (E)
Neural network for
secondary structure
Summary
Introduction
What is secondary structure prediction
Why
Chou-Fasman method
GOR I-IV
Nearest neighbors
Neural network
 Suggested reading:
 Chapter 15 in “Current Topics in Computational Molecular
Biology, edited by Tao Jiang, Ying Xu, and Michael Zhang. MIT
Press. 2002.”
 Bioinformatics by Cynthia and per jambeck
 Bioinformatics by S.C.RASTOGI
 Bioinformatics By Andreas
 Optional reading:
 Review by Burkhard Rost:
http://cubic.bioc.columbia.edu/papers/2003_r
ev_dekker/paper.html
Reference
Secondary Structure Prediction of proteins

More Related Content

What's hot

Cath
CathCath
Cath
Ramya S
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
ALLIENU
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
Protein data bank
Protein data bankProtein data bank
Protein data bank
Alichy Sowmya
 
Role of bioinformatics of drug designing
Role of bioinformatics of drug designingRole of bioinformatics of drug designing
Role of bioinformatics of drug designing
Dr NEETHU ASOKAN
 
Kegg
KeggKegg
Kegg
msfbi1521
 
BLAST
BLASTBLAST
Fasta
FastaFasta
Homology modeling
Homology modelingHomology modeling
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
Hafiz Muhammad Zeeshan Raza
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
Ramya S
 
Phylogenetic analysis
Phylogenetic analysis Phylogenetic analysis
Phylogenetic analysis
Nitin Naik
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
karamveer prajapat
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
Mariya Raju
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
Yogesh Joshi
 
Ab Initio Protein Structure Prediction
Ab Initio Protein Structure PredictionAb Initio Protein Structure Prediction
Ab Initio Protein Structure Prediction
Arindam Ghosh
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
Sabahat Ali
 
Motif & Domain
Motif & DomainMotif & Domain
Motif & Domain
Anik Banik
 
Dot matrix
Dot matrixDot matrix
Dot matrix
Tania Khan
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
SHRIKANT YANKANCHI
 

What's hot (20)

Cath
CathCath
Cath
 
Blast and fasta
Blast and fastaBlast and fasta
Blast and fasta
 
PIR- Protein Information Resource
PIR- Protein Information ResourcePIR- Protein Information Resource
PIR- Protein Information Resource
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Role of bioinformatics of drug designing
Role of bioinformatics of drug designingRole of bioinformatics of drug designing
Role of bioinformatics of drug designing
 
Kegg
KeggKegg
Kegg
 
BLAST
BLASTBLAST
BLAST
 
Fasta
FastaFasta
Fasta
 
Homology modeling
Homology modelingHomology modeling
Homology modeling
 
Gen bank databases
Gen bank databasesGen bank databases
Gen bank databases
 
Multiple sequence alignment
Multiple sequence alignmentMultiple sequence alignment
Multiple sequence alignment
 
Phylogenetic analysis
Phylogenetic analysis Phylogenetic analysis
Phylogenetic analysis
 
methods for protein structure prediction
methods for protein structure predictionmethods for protein structure prediction
methods for protein structure prediction
 
MULTIPLE SEQUENCE ALIGNMENT
MULTIPLE  SEQUENCE  ALIGNMENTMULTIPLE  SEQUENCE  ALIGNMENT
MULTIPLE SEQUENCE ALIGNMENT
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Ab Initio Protein Structure Prediction
Ab Initio Protein Structure PredictionAb Initio Protein Structure Prediction
Ab Initio Protein Structure Prediction
 
Finding ORF
Finding ORFFinding ORF
Finding ORF
 
Motif & Domain
Motif & DomainMotif & Domain
Motif & Domain
 
Dot matrix
Dot matrixDot matrix
Dot matrix
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 

Viewers also liked

Protein structure: details
Protein structure: detailsProtein structure: details
Protein structure: detailsdamarisb
 
Homology modelling
Homology modellingHomology modelling
Homology modelling
Ayesha Choudhury
 
Homology modelling
Homology modellingHomology modelling
Homology modelling
Elda Nurafnie
 
Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)
Pritom Chaki
 
Primary and Secondary Structure of Protein
Primary and Secondary Structure of ProteinPrimary and Secondary Structure of Protein
Primary and Secondary Structure of ProteinCyra Mae Soreda
 
Protein
ProteinProtein
Protein
sharmin14
 
Protein structure
Protein structureProtein structure
Protein structurepaigep
 
Protein structure
Protein structureProtein structure
Protein structure
ranjani n
 
Chem 45 Biochemistry: Stoker Chapter 20 Proteins
Chem 45 Biochemistry: Stoker Chapter 20 ProteinsChem 45 Biochemistry: Stoker Chapter 20 Proteins
Chem 45 Biochemistry: Stoker Chapter 20 Proteins
Shaina Mavreen Villaroza
 
Biological databases
Biological databasesBiological databases
Biological databases
Sarfaraz Nasri
 
Biological databases
Biological databasesBiological databases
Biological databases
Prasanthperceptron
 
Protein Powerpoint
Protein PowerpointProtein Powerpoint
Protein Powerpointbenmoyer
 
Top 5 Deep Learning and AI Stories - October 6, 2017
Top 5 Deep Learning and AI Stories - October 6, 2017Top 5 Deep Learning and AI Stories - October 6, 2017
Top 5 Deep Learning and AI Stories - October 6, 2017
NVIDIA
 

Viewers also liked (15)

Protein Structure Prediction
Protein Structure PredictionProtein Structure Prediction
Protein Structure Prediction
 
Protein structure: details
Protein structure: detailsProtein structure: details
Protein structure: details
 
Homology modelling
Homology modellingHomology modelling
Homology modelling
 
Homology modelling
Homology modellingHomology modelling
Homology modelling
 
Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)Global and local alignment (bioinformatics)
Global and local alignment (bioinformatics)
 
Primary and Secondary Structure of Protein
Primary and Secondary Structure of ProteinPrimary and Secondary Structure of Protein
Primary and Secondary Structure of Protein
 
Protein
ProteinProtein
Protein
 
Protein structure
Protein structureProtein structure
Protein structure
 
Protein structure
Protein structureProtein structure
Protein structure
 
Chem 45 Biochemistry: Stoker Chapter 20 Proteins
Chem 45 Biochemistry: Stoker Chapter 20 ProteinsChem 45 Biochemistry: Stoker Chapter 20 Proteins
Chem 45 Biochemistry: Stoker Chapter 20 Proteins
 
PROTEINS
PROTEINSPROTEINS
PROTEINS
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Biological databases
Biological databasesBiological databases
Biological databases
 
Protein Powerpoint
Protein PowerpointProtein Powerpoint
Protein Powerpoint
 
Top 5 Deep Learning and AI Stories - October 6, 2017
Top 5 Deep Learning and AI Stories - October 6, 2017Top 5 Deep Learning and AI Stories - October 6, 2017
Top 5 Deep Learning and AI Stories - October 6, 2017
 

Similar to Secondary Structure Prediction of proteins

Bioinfo
Bioinfo Bioinfo
Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014
Prof. Wim Van Criekinge
 
Bioinformatics t7-protein structure-v2013_wim_vancriekinge
Bioinformatics t7-protein structure-v2013_wim_vancriekingeBioinformatics t7-protein structure-v2013_wim_vancriekinge
Bioinformatics t7-protein structure-v2013_wim_vancriekinge
Prof. Wim Van Criekinge
 
The Ramachandran plot
The Ramachandran plotThe Ramachandran plot
The Ramachandran plot
K Tanay
 
P7 2017 biopython3
P7 2017 biopython3P7 2017 biopython3
P7 2017 biopython3
Prof. Wim Van Criekinge
 
2016 bioinformatics i_proteins_wim_vancriekinge
2016 bioinformatics i_proteins_wim_vancriekinge2016 bioinformatics i_proteins_wim_vancriekinge
2016 bioinformatics i_proteins_wim_vancriekinge
Prof. Wim Van Criekinge
 
2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge
Prof. Wim Van Criekinge
 
Protein 3D structure and classification database
Protein 3D structure and classification database Protein 3D structure and classification database
Protein 3D structure and classification database nadeem akhter
 
Protein_structure_2022.pdf
Protein_structure_2022.pdfProtein_structure_2022.pdf
Protein_structure_2022.pdf
MuhammadSalman453340
 
2016 bioinformatics i_score_matrices_wim_vancriekinge
2016 bioinformatics i_score_matrices_wim_vancriekinge2016 bioinformatics i_score_matrices_wim_vancriekinge
2016 bioinformatics i_score_matrices_wim_vancriekinge
Prof. Wim Van Criekinge
 
Protein Structural predection
Protein Structural predectionProtein Structural predection
Protein Structural predection
Santu Chall
 
Principles of Protein Structure
Principles of Protein StructurePrinciples of Protein Structure
Principles of Protein Structure
Asheesh Pandey
 
Cs273 structure prediction
Cs273 structure predictionCs273 structure prediction
Cs273 structure prediction
University of Allahabad
 
2015 bioinformatics score_matrices_wim_vancriekinge
2015 bioinformatics score_matrices_wim_vancriekinge2015 bioinformatics score_matrices_wim_vancriekinge
2015 bioinformatics score_matrices_wim_vancriekinge
Prof. Wim Van Criekinge
 
Structure of protein By KK Sahu Sir
Structure of protein By KK Sahu SirStructure of protein By KK Sahu Sir
Structure of protein By KK Sahu Sir
KAUSHAL SAHU
 
Protein 3 dimensional structure and function
Protein 3 dimensional structure and functionProtein 3 dimensional structure and function
Protein 3 dimensional structure and function
Dr. Armaan Singh
 
Protein structure
Protein structure Protein structure
Protein structure
harini challapalli
 

Similar to Secondary Structure Prediction of proteins (20)

Bioinfo
Bioinfo Bioinfo
Bioinfo
 
Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014
 
Bioinformatics t7-protein structure-v2013_wim_vancriekinge
Bioinformatics t7-protein structure-v2013_wim_vancriekingeBioinformatics t7-protein structure-v2013_wim_vancriekinge
Bioinformatics t7-protein structure-v2013_wim_vancriekinge
 
The Ramachandran plot
The Ramachandran plotThe Ramachandran plot
The Ramachandran plot
 
P7 2017 biopython3
P7 2017 biopython3P7 2017 biopython3
P7 2017 biopython3
 
Bioinformatica 01-12-2011-t7-protein
Bioinformatica 01-12-2011-t7-proteinBioinformatica 01-12-2011-t7-protein
Bioinformatica 01-12-2011-t7-protein
 
2016 bioinformatics i_proteins_wim_vancriekinge
2016 bioinformatics i_proteins_wim_vancriekinge2016 bioinformatics i_proteins_wim_vancriekinge
2016 bioinformatics i_proteins_wim_vancriekinge
 
2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge2015 bioinformatics protein_structure_wimvancriekinge
2015 bioinformatics protein_structure_wimvancriekinge
 
Protein 3D structure and classification database
Protein 3D structure and classification database Protein 3D structure and classification database
Protein 3D structure and classification database
 
Protein_structure_2022.pdf
Protein_structure_2022.pdfProtein_structure_2022.pdf
Protein_structure_2022.pdf
 
2016 bioinformatics i_score_matrices_wim_vancriekinge
2016 bioinformatics i_score_matrices_wim_vancriekinge2016 bioinformatics i_score_matrices_wim_vancriekinge
2016 bioinformatics i_score_matrices_wim_vancriekinge
 
Protein Structural predection
Protein Structural predectionProtein Structural predection
Protein Structural predection
 
Principles of Protein Structure
Principles of Protein StructurePrinciples of Protein Structure
Principles of Protein Structure
 
Bioalgo 2012-03-massspec
Bioalgo 2012-03-massspecBioalgo 2012-03-massspec
Bioalgo 2012-03-massspec
 
Ch08 massspec
Ch08 massspecCh08 massspec
Ch08 massspec
 
Cs273 structure prediction
Cs273 structure predictionCs273 structure prediction
Cs273 structure prediction
 
2015 bioinformatics score_matrices_wim_vancriekinge
2015 bioinformatics score_matrices_wim_vancriekinge2015 bioinformatics score_matrices_wim_vancriekinge
2015 bioinformatics score_matrices_wim_vancriekinge
 
Structure of protein By KK Sahu Sir
Structure of protein By KK Sahu SirStructure of protein By KK Sahu Sir
Structure of protein By KK Sahu Sir
 
Protein 3 dimensional structure and function
Protein 3 dimensional structure and functionProtein 3 dimensional structure and function
Protein 3 dimensional structure and function
 
Protein structure
Protein structure Protein structure
Protein structure
 

More from Vijay Hemmadi

Hemoglobin estimation and Blood typing experiment and
Hemoglobin estimation and Blood typing experiment and Hemoglobin estimation and Blood typing experiment and
Hemoglobin estimation and Blood typing experiment and
Vijay Hemmadi
 
Determination of protein concentration by Bradford method.pptx
Determination of protein concentration by Bradford method.pptxDetermination of protein concentration by Bradford method.pptx
Determination of protein concentration by Bradford method.pptx
Vijay Hemmadi
 
Endangered species of india
Endangered species of india Endangered species of india
Endangered species of india
Vijay Hemmadi
 
Mining
MiningMining
Enzyme assays
Enzyme assaysEnzyme assays
Enzyme assays
Vijay Hemmadi
 
Liposomes-Classification, methods of preparation and application
Liposomes-Classification, methods of preparation and application Liposomes-Classification, methods of preparation and application
Liposomes-Classification, methods of preparation and application
Vijay Hemmadi
 
Natural disasters and its managment
Natural disasters and its managmentNatural disasters and its managment
Natural disasters and its managment
Vijay Hemmadi
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
Vijay Hemmadi
 
Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis
Vijay Hemmadi
 
Morphometric measurements,condition indexing and dissection of fish
Morphometric measurements,condition indexing and dissection of fishMorphometric measurements,condition indexing and dissection of fish
Morphometric measurements,condition indexing and dissection of fish
Vijay Hemmadi
 
Fish anatomy and physilogy
Fish anatomy and physilogy Fish anatomy and physilogy
Fish anatomy and physilogy
Vijay Hemmadi
 
How to identify the fake journals
How to identify the fake  journalsHow to identify the fake  journals
How to identify the fake journals
Vijay Hemmadi
 
Global warming 1
Global warming  1Global warming  1
Global warming 1
Vijay Hemmadi
 
Nucleic acid
Nucleic acidNucleic acid
Nucleic acid
Vijay Hemmadi
 
Environmental legislation
Environmental legislationEnvironmental legislation
Environmental legislation
Vijay Hemmadi
 
Air pollution act
Air pollution actAir pollution act
Air pollution act
Vijay Hemmadi
 
Comet assay
Comet assayComet assay
Comet assay
Vijay Hemmadi
 
Atomic absorption spectrometer
Atomic absorption spectrometerAtomic absorption spectrometer
Atomic absorption spectrometer
Vijay Hemmadi
 
metallothionein -Biomarker
metallothionein -Biomarkermetallothionein -Biomarker
metallothionein -Biomarker
Vijay Hemmadi
 

More from Vijay Hemmadi (20)

Hemoglobin estimation and Blood typing experiment and
Hemoglobin estimation and Blood typing experiment and Hemoglobin estimation and Blood typing experiment and
Hemoglobin estimation and Blood typing experiment and
 
Determination of protein concentration by Bradford method.pptx
Determination of protein concentration by Bradford method.pptxDetermination of protein concentration by Bradford method.pptx
Determination of protein concentration by Bradford method.pptx
 
Endangered species of india
Endangered species of india Endangered species of india
Endangered species of india
 
Mining
MiningMining
Mining
 
Enzyme assays
Enzyme assaysEnzyme assays
Enzyme assays
 
Liposomes-Classification, methods of preparation and application
Liposomes-Classification, methods of preparation and application Liposomes-Classification, methods of preparation and application
Liposomes-Classification, methods of preparation and application
 
Natural disasters and its managment
Natural disasters and its managmentNatural disasters and its managment
Natural disasters and its managment
 
Gene prediction methods vijay
Gene prediction methods  vijayGene prediction methods  vijay
Gene prediction methods vijay
 
Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis Introduction to probability distributions-Statistics and probability analysis
Introduction to probability distributions-Statistics and probability analysis
 
Morphometric measurements,condition indexing and dissection of fish
Morphometric measurements,condition indexing and dissection of fishMorphometric measurements,condition indexing and dissection of fish
Morphometric measurements,condition indexing and dissection of fish
 
Fish anatomy and physilogy
Fish anatomy and physilogy Fish anatomy and physilogy
Fish anatomy and physilogy
 
How to identify the fake journals
How to identify the fake  journalsHow to identify the fake  journals
How to identify the fake journals
 
Global warming 1
Global warming  1Global warming  1
Global warming 1
 
Nucleic acid
Nucleic acidNucleic acid
Nucleic acid
 
Environmental legislation
Environmental legislationEnvironmental legislation
Environmental legislation
 
Air pollution act
Air pollution actAir pollution act
Air pollution act
 
Air pollution act
Air pollution actAir pollution act
Air pollution act
 
Comet assay
Comet assayComet assay
Comet assay
 
Atomic absorption spectrometer
Atomic absorption spectrometerAtomic absorption spectrometer
Atomic absorption spectrometer
 
metallothionein -Biomarker
metallothionein -Biomarkermetallothionein -Biomarker
metallothionein -Biomarker
 

Recently uploaded

Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
muralinath2
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
ChetanK57
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
Sérgio Sacani
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
muralinath2
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
muralinath2
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
pablovgd
 

Recently uploaded (20)

Hemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptxHemoglobin metabolism_pathophysiology.pptx
Hemoglobin metabolism_pathophysiology.pptx
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATIONPRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
PRESENTATION ABOUT PRINCIPLE OF COSMATIC EVALUATION
 
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
THE IMPORTANCE OF MARTIAN ATMOSPHERE SAMPLE RETURN.
 
platelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptxplatelets- lifespan -Clot retraction-disorders.pptx
platelets- lifespan -Clot retraction-disorders.pptx
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
platelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptxplatelets_clotting_biogenesis.clot retractionpptx
platelets_clotting_biogenesis.clot retractionpptx
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 
NuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final versionNuGOweek 2024 Ghent - programme - final version
NuGOweek 2024 Ghent - programme - final version
 

Secondary Structure Prediction of proteins

  • 1. Secondary Structure Prediction Of Protein Protein Sequence + Structure VIJAY
  • 2. INRODUCTION  Primary structure (Amino acid sequence) ↓ Secondary structure (α-helix, β-sheet) ↓ Tertiary structure (Three-dimensional structure formed by assembly of secondary structures) ↓ Quaternary structure (Structure formed by more than one polypeptide chains)
  • 3. Secondary Structure  Defined as the local conformation of protein backbone  Primary Structure —folding— Secondary Structure  a helix and b sheet Secondary Structure Regular Secondary Structure (a-helices, b-sheets) Irregular Secondary Structure (Tight turns, Random coils, bulges)
  • 4.
  • 5. a helix •common confirmation. •spiral structure •Tightly packed coiled polypeptide backbone, with extending side chains •Spontaneous •stabilized by H-bonding between amide hydrogens and carbonyl oxygens of peptide bonds. •R-groups lie on the exterior of the helix and perpendicular to its axis. •complete turn of helix —3.6 aminoacyl residues with distance 0.54 nm e.g. the keratins- entirely α-helical Myoglobin- 80% helical
  • 6. •Glycine and Proline , bulky amino acids, charged amino acids favor disruption of the helix.
  • 7. b sheet •β-sheets are composed of 2 or more different regions of stretches of at least 5-10 amino acids. •The folding and alignment of stretches of the polypeptide backbone aside one another to form β-sheets is stabilized by H-bonding between amide hydrogens and carbonyl oxygens •the peptide backbone of the β sheet is highly extended. •R groups of adjacent residues point in opposite directions. • β-sheets are either parallel or antiparallel
  • 9.
  • 10. What is secondary structure prediction?  Given a protein sequence (primary structure)  1st step in prediction of protein structure.  Technique concerned with determination of secondary structure of given polypeptide by locating the Coils Alpha Helix Beta Strands in plypeptide GHWIATRGQLIREAYEDYRHFSSECPFIP  Predict its secondary structure content (C=Coils H=Alpha Helix E=Beta Strands) CEEEEECHHHHHHHHHHHCCCHHCCCCCC
  • 11. Why secondary structure prediction? o secondary structure —tertiary structure prediction o Protein function prediction o Protein classification o Predicting structural change o detection and alignment of remote homology between proteins o on detecting transmembrane regions, solvent-accessibleresidues, and other important features of molecules o Detection of hydrophobic region and hydrophilic region
  • 12. Prediction methods o Statistical method o Chou-Fasman method, GOR I-IV o Nearest neighbors o NNSSP, SSPAL o Neural network o PHD, Psi-Pred, J-Pred o Support vector machine (SVM) o HMM
  • 13. Chou-Fasman algorithm  Chou and fasman in 1978  It is based on assigning a set of prediction value to amino acid residue in polypeptide and applying an algorithm to the conformational parameter and positional frequency.  conformational parameter for each amino acid is calculated by considering the relative frequency of each 20 amino acid in proteins  By this C=Coils H=Alpha Helix E=Beta Strands are determined  Also called preference parameter
  • 14. • A table of prediction value or preference parameter for each of 20 amino acid in alpha helix ,beta plate and turn already calculated and standardised. • To obtain the prediction value the frequency of amino acids( i) in structure is divided by of all residences in protein (s) • i/s • The resulting structural parameter of p(alpha),p(beta),p(turn)vary —0.5 to 1.5 for 20 amino acid
  • 15.
  • 16.  Window is scanned to find a short sequence of amino acid that has high probability to form one type of structure  When 4 out of 6 amino acid have high probability >1.03 the – alpha helix  3 out of 5 amino acid with probability >1.03-beta RULES
  • 17. ALGORITHM o Note preference parameter for 20 aa in peptide o Scan the window and identify the region where 4 out of 6 contiguous residue have p(alpha helix) >1.00 o Continue scanning in both the direction until the 4 contiguous residue that have an average p(alpha helix)<1.00,end of helix o If segment is longer than 5aa and p(alpha helix)>p(beta sheet )-segment –completely alpha helix o scan different segment and identify - alpha helix
  • 18.  Identify the region where 3 out of 5 aa have the value of p( beta sheet) >1.00 ,region is predicted as beta sheet  Continue scanning both the direction until 4 residue that have p( beta sheet) <1.00  End of beta sheet  average p( beta sheet) >105 and p( beta sheet) >p(alpha helix) than consider complete segment as b pleated sheet
  • 19.  If any region is over lapping than consider it as alpha helix if average p(alpha helix)>p(beta sheet )  Or beta sheet if p(alpha helix)<p(beta sheet )  To identify turn  P(t)=f(j)f(j+1)f(j+2)f(j+3)  J=residual number
  • 20. result  Accuracy: ~50%  ~60%  helix alanine,glutamine,leucine,methionine  Helix breaking proline and glycine  Beta sheet isoleucine,valine,tyrosine  Beta breaking proline,aspargine,glutamine  Turn contains proline(30%),serine(14%),lysine, aspargine(10%)  Glycine(19%),aspartic acid (`18%),serine(13%),tyrosine(11%)  http://www.accelrys.com/product/gcg-wisconsin- package/program-list.html
  • 21. Out put of Chou-Fasman
  • 22.
  • 23. GOR METHOD • GOR(Garnier,Osguthorpe,Robson)1978 • Chou fasman method is based on assumption that each amino acid individually influence the 2ry structure of sequence • GOR is based on, amino acid flanking the central amino acid will influence the 2ry structure • Consider a peptide central amino acid side amino acid • It assume that amino acid up to 8 residue on sides will influence the 2ry structure of central residue • 4th version • 64% accurate
  • 24. ALGORITHUM •It uses the sliding window of 17 amino acid •The side amino acid sequence and alignment is determined to predict secondary structure of central sequence •Good for helix than sheet because beta sheet has more inter sequence hydrogen bonding •36.5% accurate for beta sheet •input any amino acid sequence •Output tells about secondary structure
  • 25.
  • 26. NEAREST NEIGHBOUR METHOD o Based on ,short homologues sequences of amino acids have the same secondary structure o It predicts secondary structure of central homologues segment by neighbour homologues sequences o By using structural database find some secondary structure of sequence which may be homologues to our target sequence o Naturally evolved proteins with 35% identical amino acid sequence will have same secondary structure o Find some sequence which may match with target sequence o Scoring matrix,MSA
  • 27. “Singleton” score matrix Helix Sheet Loop Buried Inter Exposed Buried Inter Exposed Buried Inter Exposed ALA -0.578 -0.119 -0.160 0.010 0.583 0.921 0.023 0.218 0.368 ARG 0.997 -0.507 -0.488 1.267 -0.345 -0.580 0.930 -0.005 -0.032 ASN 0.819 0.090 -0.007 0.844 0.221 0.046 0.030 -0.322 -0.487 ASP 1.050 0.172 -0.426 1.145 0.322 0.061 0.308 -0.224 -0.541 CYS -0.360 0.333 1.831 -0.671 0.003 1.216 -0.690 -0.225 1.216 GLN 1.047 -0.294 -0.939 1.452 0.139 -0.555 1.326 0.486 -0.244 GLU 0.670 -0.313 -0.721 0.999 0.031 -0.494 0.845 0.248 -0.144 GLY 0.414 0.932 0.969 0.177 0.565 0.989 -0.562 -0.299 -0.601 HIS 0.479 -0.223 0.136 0.306 -0.343 -0.014 0.019 -0.285 0.051 ILE -0.551 0.087 1.248 -0.875 -0.182 0.500 -0.166 0.384 1.336 LEU -0.744 -0.218 0.940 -0.411 0.179 0.900 -0.205 0.169 1.217 LYS 1.863 -0.045 -0.865 2.109 -0.017 -0.901 1.925 0.474 -0.498 MET -0.641 -0.183 0.779 -0.269 0.197 0.658 -0.228 0.113 0.714 PHE -0.491 0.057 1.364 -0.649 -0.200 0.776 -0.375 -0.001 1.251 PRO 1.090 0.705 0.236 1.249 0.695 0.145 -0.412 -0.491 -0.641 SER 0.350 0.260 -0.020 0.303 0.058 -0.075 -0.173 -0.210 -0.228 THR 0.291 0.215 0.304 0.156 -0.382 -0.584 -0.012 -0.103 -0.125 TRP -0.379 -0.363 1.178 -0.270 -0.477 0.682 -0.220 -0.099 1.267 TYR -0.111 -0.292 0.942 -0.267 -0.691 0.292 -0.015 -0.176 0.946 VAL -0.374 0.236 1.144 -0.912 -0.334 0.089 -0.030 0.309 0.998
  • 28.
  • 29. Neural Network Method •Prediction is done by utilizing the information of different DATABASE •Linear sequence  3D structure of Polypeptide
  • 30. Neural network Input signals are summed and turned into zero or one 3. J1 J2 J3 J4 Feed-forward multilayer network Input layer Hidden layer Output layer neurons
  • 31. Enter sequences Compare Prediction to Reality AdjustWeights Neural network training
  • 32. Simple Neural Network With Hidden Layer outi  f ij 2 J  f jk 1 Jk   kin       j        Simple neural network with hidden layer
  • 33. A C D E F G H I K L M N P Q R S T V W Y . H E L D (L) R (E) Q (E) G (E) F (E) V (E) P (E) A (H) A (H) Y (H) V (E) K (E) K (E) Neural network for secondary structure
  • 34. Summary Introduction What is secondary structure prediction Why Chou-Fasman method GOR I-IV Nearest neighbors Neural network
  • 35.
  • 36.
  • 37.  Suggested reading:  Chapter 15 in “Current Topics in Computational Molecular Biology, edited by Tao Jiang, Ying Xu, and Michael Zhang. MIT Press. 2002.”  Bioinformatics by Cynthia and per jambeck  Bioinformatics by S.C.RASTOGI  Bioinformatics By Andreas  Optional reading:  Review by Burkhard Rost: http://cubic.bioc.columbia.edu/papers/2003_r ev_dekker/paper.html Reference