SlideShare a Scribd company logo
1
Protein Structure, Structure
Classification and Prediction
Bioinformatics X3
January 2005
P. Johansson, D. Madsen
Dept.of Cell & Molecular Biology,
Uppsala University
2
Overview
• Introduction to proteins, structure & classification
• Protein Folding
• Experimental techniques for structure determination
• Structure prediction
3
4
Proteins
• Proteins play a crucial role in virtually all biological processes
with a broad range of functions.
• The activity of an enzyme or the function of a protein is
governed by the three-dimensional structure
5
20 amino acids - the building blocks
6
The Amino Acids
7
Hydrophilic or hydrophobic..?
• Virtually all soluble proteins feature a hydrophobic core
surrounded by a hydrophilic surface
• But, peptide backbone is inherently polar ?
• Solution ; neutralize potential H-donors & acceptors using
ordered secondary structure
8
Secondary Structure: a-helix
9
• 3.6 residues / turn
• Axial dipole moment
• Not Proline & Glycine
• Protein surfaces
Secondary Structure: a-helix
10
Secondary Structure: b-sheets
11
Secondary Structure: b-sheets
• Parallel or antiparallel
• Alternating side-chains
• No mixing
• Loops often have polar amino acids
12
Structural classification
• Databases
– SCOP, ’Structural Classification of Proteins’,
manual classification
– CATH, ’Class Architecture Topology Homology’, based
on the SSAP algorithm
– FSSP, ’Family of Structurally Similar Proteins’, based
on the DALI algorithm
– PClass, ’Protein Classification’
based on the LOCK and 3Dsearch algorithms
13
Structural classification, CATH
• Class, four types :
– Mainly a
– a / b structures
– Mainly b
– No secondary structure
• Arhitecture (fold)
• Topology (superfamily)
• Homology (family)
14
Structural classification..
15
Structural classification..
• Two types of algorithms
– Inter-Molecular, 3D, Rigid Body ; structural alignment in a
common coordinate system (hard) e.g. VAST, LOCK.. alg.
– Intra-Molecular, 2D, Internal Geometry ; structural
alignment using internal distances and angles e.g. DALI,
STRUCTURAL, SSAP.. alg.
16
Structural classification, SSAP
• SSAP, ‘Sequential Structure Alignment Program’
Basic idea ; The similarity between residue i in molecule A
and residue k in molecule B is characterised in terms of their
structural surroundings
This similarity can be quantified into a score, Sik
Based on this similarity score and some specified gap penalty,
dynamic programming is used to find the optimal structural
alignment
17
Structural classification, SSAP
The structural neighborhood of residue i in A compared
to residue k in B
i k
18
Structural classification, SSAP..
Distance between residue i & j in molecule A ; dA
i,j
Similarity for two pairs of residues, i j in A & k l in B ;
,
,
b
d
d
a
s B
kl
A
ij
kl
ij


 a,b constants
Similarity between residue i in A and residue k in B ;


 
 


n
n
m
B
m
k
k
A
m
i
i
k
i
b
d
d
a
S
,
,
,
Idea ; Si,k is big if the distances from residue i in A to the 2n
nearest neighbours are similar to the corresponding distances
around k in B
19
Structural classification, SSAP..
This works well for small structures and local structural
alignments - however, insertions and deletions cause problems
 unrelated distances
HSERAHVFIM..
GQ-VMAC-NW..
i=5
k=4
A :
B :
- The real algorithm uses Dynamic programming on two levels,
first to find which distances to compare  Sik, then to align the
structures using these scores
20
Experimental techniques for structure
determination
• X-ray Crystallography
• Nuclear Magnetic Resonance
spectroscopy (NMR)
• Electron Microscopy/Diffraction
• Free electron lasers ?
21
X-ray Crystallography
22
X-ray Crystallography..
• From small molecules to viruses
• Information about the positions of
individual atoms
• Limited information about
dynamics
• Requires crystals
23
24
NMR
• Limited to molecules up to ~50kDa
(good quality up to 30 kDa)
• Distances between pairs of
hydrogen atoms
• Lots of information about dynamics
• Requires soluble, non-aggregating
material
• Assignment problem
25
Electron Microscopy/ Diffraction
• Low to medium resolution
• Limited information about
dynamics
• Can use very small crystals
(nm range)
• Can be used for very large
molecules and complexes
26
27
Structure Prediction
GPSRYIV…
?
28
Protein Folding
• Different sequence  Different
structure
• Free energy difference small due
to large entropy decrease,
DG = DH - TDS
29
Structure Prediction
Why is structure prediction and especially ab
initio calculations hard..?
• Many degrees of freedom / residue
• Remote noncovalent interactions
• Nature does not go through all conformations
• Folding assisted by enzymes & chaperones
30
Ab initio calculations used
for smaller problems ;
• Calculation of affinity
• Enzymatic pathways
Molecular dynamics
31
Sequence Classification rev.
• Class : Secondary structure content
• Fold : Major structural similarity.
• Superfamily : Probable common
evolutionary origin.
• Family : Clear evolutionary relationship.
32
• Search sequence data banks for homologs
• Search methods e.g. BLAST, PSIBLAST,
FASTA…
• Homologue in PDB..?
Structure Prediction
IVTY…PGGG HYW…QHG
33
Multiple sequence / structure alignment
• Contains more information than a single sequence
for applications like homology modeling and
secondary structure prediction
• Gives location of conserved parts
and residues likely to be buried in
the protein core or exposed to solvent
Structure Prediction
34
HFD fingerprint
Multiple alignment example
35
• Statistical Analysis (old fashioned):
– For each amino acid type assign it’s ‘propensity’
to be in a helix, sheet, or coil.
• Limited accuracy ~55-60%.
• Random prediction ~38%.
MTLLALGINHKTAP...
CCEEEEEECCCCCC...
Secondary Structure Prediction
36
• Each residue is classified as:
– Ha/Hb, strong helix / strand former.
– ha/hb, weak helix / strand former.
– I, indifferent.
– ba/bb, weak helix/strand breaker.
– Ba/Bb, strong helix / strand breaker.
The Chou & Fasman Method
37
The Chou & Fasman Method..
• Score each residue:
– Ha/ha=1, Ia=0 or ½, Ba/ba=-1.
– Hb/hb=1, Ib=0 or ½, Bb/bb=-1.
• Helix nucleation:
– Score > 4 in a “window” of 6 residues.
• Strand nucleation:
– Score > 3 in a “window” of 5 residues.
• Propagate until score < 1 in a 4 residue “window”.
38
GPSRYIVTLANGK
Helix:
Strand
-1 -1 0 0 -1 1 1 0 1 1 -1 -1 1
-1 -1 -1 .5 1 1 1 1 1 0 0 -1 -1
-2 0 1 2 3 3 1
No nucl.
-1.5 .5 2.5 4.5 5 4 3 1 -1
-2.5 -.5 1.5 … 3 1 -1
Nucleation
Propagate
GPSRYIVTLANGK
Result
The Chou & Fasman Method..
39
• Neural networks (e.g. the PHD server):
– Input: a number of protein sequences +
secondary structure.
– Output: a trained network that predicts
secondary structure elements with ~70%
accuracy.
• Use many different methods and compare
(e.g. the JPred server)!
Modern methods
40
Summary
• The function of a protein is governed by its structure
• Different sequence  Different structure
• PDB, protein data bank
• Secondary structure prediction is hard, tertiary
structure prediction is even harder
• Use homologs whenever possible or different methods
to assess quality
41

More Related Content

Similar to 2005_lecture_01.ppt

BrentWathen.ppt
BrentWathen.pptBrentWathen.ppt
BrentWathen.ppt
SivaRamT4
 
Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modeling
pkchoudhury
 
Cs273 structure prediction
Cs273 structure predictionCs273 structure prediction
Cs273 structure prediction
University of Allahabad
 
Protein Structural predection
Protein Structural predectionProtein Structural predection
Protein Structural predection
Santu Chall
 
P. Joshi SBDD and docking.ppt
P. Joshi SBDD and docking.pptP. Joshi SBDD and docking.ppt
P. Joshi SBDD and docking.ppt
pranalpatilPranal
 
docking
docking docking
docking
prateek kumar
 
Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014
Prof. Wim Van Criekinge
 
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on RosettaProtein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
Bioinformatics and Computational Biosciences Branch
 
Dicroismo circular
Dicroismo circularDicroismo circular
Dicroismo circular
Inês Silva
 
Molecular docking
Molecular dockingMolecular docking
Molecular dockingpalliyath91
 
Proteins ccc 21_08_2012 modified
Proteins ccc 21_08_2012 modifiedProteins ccc 21_08_2012 modified
Proteins ccc 21_08_2012 modifiedlifsnu
 
molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...
GAUTAM KHUNE
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug design
Thanh Truong
 
Protein Crystallography of Lysozyme
Protein Crystallography of LysozymeProtein Crystallography of Lysozyme
Protein Crystallography of LysozymeGuillermo Llamas
 
QSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative StructureQSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative Structure
Saramita De Chakravarti
 
NMR Random Coil Index & Protein Dynamics
NMR Random Coil Index & Protein Dynamics NMR Random Coil Index & Protein Dynamics
NMR Random Coil Index & Protein Dynamics
Mark Berjanskii
 
Homology Modeling.pptx
Homology Modeling.pptxHomology Modeling.pptx
Homology Modeling.pptx
AmnaAkram29
 
Atindra-protein.pptx
Atindra-protein.pptxAtindra-protein.pptx
Atindra-protein.pptx
Dr. Atindra Pandey
 
DHC Microbiome Presentation 4-23-19.pptx
DHC Microbiome Presentation 4-23-19.pptxDHC Microbiome Presentation 4-23-19.pptx
DHC Microbiome Presentation 4-23-19.pptx
DivyanshGupta922023
 
Isoelectric Focusing.pptx
Isoelectric Focusing.pptxIsoelectric Focusing.pptx
Isoelectric Focusing.pptx
rimshaijaz6
 

Similar to 2005_lecture_01.ppt (20)

BrentWathen.ppt
BrentWathen.pptBrentWathen.ppt
BrentWathen.ppt
 
Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modeling
 
Cs273 structure prediction
Cs273 structure predictionCs273 structure prediction
Cs273 structure prediction
 
Protein Structural predection
Protein Structural predectionProtein Structural predection
Protein Structural predection
 
P. Joshi SBDD and docking.ppt
P. Joshi SBDD and docking.pptP. Joshi SBDD and docking.ppt
P. Joshi SBDD and docking.ppt
 
docking
docking docking
docking
 
Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014Bioinformatics t7-proteinstructure v2014
Bioinformatics t7-proteinstructure v2014
 
Protein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on RosettaProtein structure prediction with a focus on Rosetta
Protein structure prediction with a focus on Rosetta
 
Dicroismo circular
Dicroismo circularDicroismo circular
Dicroismo circular
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 
Proteins ccc 21_08_2012 modified
Proteins ccc 21_08_2012 modifiedProteins ccc 21_08_2012 modified
Proteins ccc 21_08_2012 modified
 
molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...molecular docking its types and de novo drug design and application and softw...
molecular docking its types and de novo drug design and application and softw...
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug design
 
Protein Crystallography of Lysozyme
Protein Crystallography of LysozymeProtein Crystallography of Lysozyme
Protein Crystallography of Lysozyme
 
QSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative StructureQSAR : Activity Relationships Quantitative Structure
QSAR : Activity Relationships Quantitative Structure
 
NMR Random Coil Index & Protein Dynamics
NMR Random Coil Index & Protein Dynamics NMR Random Coil Index & Protein Dynamics
NMR Random Coil Index & Protein Dynamics
 
Homology Modeling.pptx
Homology Modeling.pptxHomology Modeling.pptx
Homology Modeling.pptx
 
Atindra-protein.pptx
Atindra-protein.pptxAtindra-protein.pptx
Atindra-protein.pptx
 
DHC Microbiome Presentation 4-23-19.pptx
DHC Microbiome Presentation 4-23-19.pptxDHC Microbiome Presentation 4-23-19.pptx
DHC Microbiome Presentation 4-23-19.pptx
 
Isoelectric Focusing.pptx
Isoelectric Focusing.pptxIsoelectric Focusing.pptx
Isoelectric Focusing.pptx
 

More from KhalidBassiouny1

05lipids2009.ppt
05lipids2009.ppt05lipids2009.ppt
05lipids2009.ppt
KhalidBassiouny1
 
2-lipid.ppt
2-lipid.ppt2-lipid.ppt
2-lipid.ppt
KhalidBassiouny1
 
2-12-2012-2.ppt
2-12-2012-2.ppt2-12-2012-2.ppt
2-12-2012-2.ppt
KhalidBassiouny1
 
2-12-2012.ppt
2-12-2012.ppt2-12-2012.ppt
2-12-2012.ppt
KhalidBassiouny1
 
PeptidesandProteins.ppt
PeptidesandProteins.pptPeptidesandProteins.ppt
PeptidesandProteins.ppt
KhalidBassiouny1
 
PBIO4500 Protein Therapeutics.ppt
PBIO4500 Protein Therapeutics.pptPBIO4500 Protein Therapeutics.ppt
PBIO4500 Protein Therapeutics.ppt
KhalidBassiouny1
 
Nut_Chapter_6_lecture.ppt
Nut_Chapter_6_lecture.pptNut_Chapter_6_lecture.ppt
Nut_Chapter_6_lecture.ppt
KhalidBassiouny1
 
Lh6Ch03aAAs.ppt
Lh6Ch03aAAs.pptLh6Ch03aAAs.ppt
Lh6Ch03aAAs.ppt
KhalidBassiouny1
 
Lecture 14 2013.ppt
Lecture 14 2013.pptLecture 14 2013.ppt
Lecture 14 2013.ppt
KhalidBassiouny1
 
KSD_L8_CMB06.ppt
KSD_L8_CMB06.pptKSD_L8_CMB06.ppt
KSD_L8_CMB06.ppt
KhalidBassiouny1
 
HSPs.ppt
HSPs.pptHSPs.ppt
enzymes.ppt
enzymes.pptenzymes.ppt
enzymes.ppt
KhalidBassiouny1
 
Classification_of_protein_structure.ppt
Classification_of_protein_structure.pptClassification_of_protein_structure.ppt
Classification_of_protein_structure.ppt
KhalidBassiouny1
 
chapter6.ppt
chapter6.pptchapter6.ppt
chapter6.ppt
KhalidBassiouny1
 
class_L02.ppt
class_L02.pptclass_L02.ppt
class_L02.ppt
KhalidBassiouny1
 
centoni (2).ppt
centoni (2).pptcentoni (2).ppt
centoni (2).ppt
KhalidBassiouny1
 
Background.ppt
Background.pptBackground.ppt
Background.ppt
KhalidBassiouny1
 
Amino_Acids_and_Proteins.ppt
Amino_Acids_and_Proteins.pptAmino_Acids_and_Proteins.ppt
Amino_Acids_and_Proteins.ppt
KhalidBassiouny1
 
142 proteinsoutline.ppt
142 proteinsoutline.ppt142 proteinsoutline.ppt
142 proteinsoutline.ppt
KhalidBassiouny1
 
27-protease.ppt
27-protease.ppt27-protease.ppt
27-protease.ppt
KhalidBassiouny1
 

More from KhalidBassiouny1 (20)

05lipids2009.ppt
05lipids2009.ppt05lipids2009.ppt
05lipids2009.ppt
 
2-lipid.ppt
2-lipid.ppt2-lipid.ppt
2-lipid.ppt
 
2-12-2012-2.ppt
2-12-2012-2.ppt2-12-2012-2.ppt
2-12-2012-2.ppt
 
2-12-2012.ppt
2-12-2012.ppt2-12-2012.ppt
2-12-2012.ppt
 
PeptidesandProteins.ppt
PeptidesandProteins.pptPeptidesandProteins.ppt
PeptidesandProteins.ppt
 
PBIO4500 Protein Therapeutics.ppt
PBIO4500 Protein Therapeutics.pptPBIO4500 Protein Therapeutics.ppt
PBIO4500 Protein Therapeutics.ppt
 
Nut_Chapter_6_lecture.ppt
Nut_Chapter_6_lecture.pptNut_Chapter_6_lecture.ppt
Nut_Chapter_6_lecture.ppt
 
Lh6Ch03aAAs.ppt
Lh6Ch03aAAs.pptLh6Ch03aAAs.ppt
Lh6Ch03aAAs.ppt
 
Lecture 14 2013.ppt
Lecture 14 2013.pptLecture 14 2013.ppt
Lecture 14 2013.ppt
 
KSD_L8_CMB06.ppt
KSD_L8_CMB06.pptKSD_L8_CMB06.ppt
KSD_L8_CMB06.ppt
 
HSPs.ppt
HSPs.pptHSPs.ppt
HSPs.ppt
 
enzymes.ppt
enzymes.pptenzymes.ppt
enzymes.ppt
 
Classification_of_protein_structure.ppt
Classification_of_protein_structure.pptClassification_of_protein_structure.ppt
Classification_of_protein_structure.ppt
 
chapter6.ppt
chapter6.pptchapter6.ppt
chapter6.ppt
 
class_L02.ppt
class_L02.pptclass_L02.ppt
class_L02.ppt
 
centoni (2).ppt
centoni (2).pptcentoni (2).ppt
centoni (2).ppt
 
Background.ppt
Background.pptBackground.ppt
Background.ppt
 
Amino_Acids_and_Proteins.ppt
Amino_Acids_and_Proteins.pptAmino_Acids_and_Proteins.ppt
Amino_Acids_and_Proteins.ppt
 
142 proteinsoutline.ppt
142 proteinsoutline.ppt142 proteinsoutline.ppt
142 proteinsoutline.ppt
 
27-protease.ppt
27-protease.ppt27-protease.ppt
27-protease.ppt
 

Recently uploaded

Ang Chong Yi Navigating Singaporean Flavors: A Journey from Cultural Heritage...
Ang Chong Yi Navigating Singaporean Flavors: A Journey from Cultural Heritage...Ang Chong Yi Navigating Singaporean Flavors: A Journey from Cultural Heritage...
Ang Chong Yi Navigating Singaporean Flavors: A Journey from Cultural Heritage...
Ang Chong Yi
 
Kitchen Audit at restaurant as per FSSAI act
Kitchen Audit at restaurant as per FSSAI actKitchen Audit at restaurant as per FSSAI act
Kitchen Audit at restaurant as per FSSAI act
MuthuMK13
 
Food and beverage service Restaurant Services notes V1.pptx
Food and beverage service Restaurant Services notes V1.pptxFood and beverage service Restaurant Services notes V1.pptx
Food and beverage service Restaurant Services notes V1.pptx
mangenatendaishe
 
一比一原版IC毕业证帝国理工大学毕业证成绩单如何办理
一比一原版IC毕业证帝国理工大学毕业证成绩单如何办理一比一原版IC毕业证帝国理工大学毕业证成绩单如何办理
一比一原版IC毕业证帝国理工大学毕业证成绩单如何办理
saseh1
 
Roti Bank Delhi: Nourishing Lives, One Meal at a Time
Roti Bank Delhi: Nourishing Lives, One Meal at a TimeRoti Bank Delhi: Nourishing Lives, One Meal at a Time
Roti Bank Delhi: Nourishing Lives, One Meal at a Time
Roti Bank
 
一比一原版UVM毕业证佛蒙特大学毕业证成绩单如何办理
一比一原版UVM毕业证佛蒙特大学毕业证成绩单如何办理一比一原版UVM毕业证佛蒙特大学毕业证成绩单如何办理
一比一原版UVM毕业证佛蒙特大学毕业证成绩单如何办理
zaquoa
 
MS Wine Day 2024 Arapitsas Advancements in Wine Metabolomics Research
MS Wine Day 2024 Arapitsas Advancements in Wine Metabolomics ResearchMS Wine Day 2024 Arapitsas Advancements in Wine Metabolomics Research
MS Wine Day 2024 Arapitsas Advancements in Wine Metabolomics Research
Panagiotis Arapitsas
 
Best Chicken Mandi in Ghaziabad near me.
Best Chicken Mandi in Ghaziabad near me.Best Chicken Mandi in Ghaziabad near me.
Best Chicken Mandi in Ghaziabad near me.
tasteofmiddleeast07
 
一比一原版UMN毕业证明尼苏达大学毕业证成绩单如何办理
一比一原版UMN毕业证明尼苏达大学毕业证成绩单如何办理一比一原版UMN毕业证明尼苏达大学毕业证成绩单如何办理
一比一原版UMN毕业证明尼苏达大学毕业证成绩单如何办理
zaquoa
 
Roti Bank Hyderabad: A Beacon of Hope and Nourishment
Roti Bank Hyderabad: A Beacon of Hope and NourishmentRoti Bank Hyderabad: A Beacon of Hope and Nourishment
Roti Bank Hyderabad: A Beacon of Hope and Nourishment
Roti Bank
 

Recently uploaded (10)

Ang Chong Yi Navigating Singaporean Flavors: A Journey from Cultural Heritage...
Ang Chong Yi Navigating Singaporean Flavors: A Journey from Cultural Heritage...Ang Chong Yi Navigating Singaporean Flavors: A Journey from Cultural Heritage...
Ang Chong Yi Navigating Singaporean Flavors: A Journey from Cultural Heritage...
 
Kitchen Audit at restaurant as per FSSAI act
Kitchen Audit at restaurant as per FSSAI actKitchen Audit at restaurant as per FSSAI act
Kitchen Audit at restaurant as per FSSAI act
 
Food and beverage service Restaurant Services notes V1.pptx
Food and beverage service Restaurant Services notes V1.pptxFood and beverage service Restaurant Services notes V1.pptx
Food and beverage service Restaurant Services notes V1.pptx
 
一比一原版IC毕业证帝国理工大学毕业证成绩单如何办理
一比一原版IC毕业证帝国理工大学毕业证成绩单如何办理一比一原版IC毕业证帝国理工大学毕业证成绩单如何办理
一比一原版IC毕业证帝国理工大学毕业证成绩单如何办理
 
Roti Bank Delhi: Nourishing Lives, One Meal at a Time
Roti Bank Delhi: Nourishing Lives, One Meal at a TimeRoti Bank Delhi: Nourishing Lives, One Meal at a Time
Roti Bank Delhi: Nourishing Lives, One Meal at a Time
 
一比一原版UVM毕业证佛蒙特大学毕业证成绩单如何办理
一比一原版UVM毕业证佛蒙特大学毕业证成绩单如何办理一比一原版UVM毕业证佛蒙特大学毕业证成绩单如何办理
一比一原版UVM毕业证佛蒙特大学毕业证成绩单如何办理
 
MS Wine Day 2024 Arapitsas Advancements in Wine Metabolomics Research
MS Wine Day 2024 Arapitsas Advancements in Wine Metabolomics ResearchMS Wine Day 2024 Arapitsas Advancements in Wine Metabolomics Research
MS Wine Day 2024 Arapitsas Advancements in Wine Metabolomics Research
 
Best Chicken Mandi in Ghaziabad near me.
Best Chicken Mandi in Ghaziabad near me.Best Chicken Mandi in Ghaziabad near me.
Best Chicken Mandi in Ghaziabad near me.
 
一比一原版UMN毕业证明尼苏达大学毕业证成绩单如何办理
一比一原版UMN毕业证明尼苏达大学毕业证成绩单如何办理一比一原版UMN毕业证明尼苏达大学毕业证成绩单如何办理
一比一原版UMN毕业证明尼苏达大学毕业证成绩单如何办理
 
Roti Bank Hyderabad: A Beacon of Hope and Nourishment
Roti Bank Hyderabad: A Beacon of Hope and NourishmentRoti Bank Hyderabad: A Beacon of Hope and Nourishment
Roti Bank Hyderabad: A Beacon of Hope and Nourishment
 

2005_lecture_01.ppt

  • 1. 1 Protein Structure, Structure Classification and Prediction Bioinformatics X3 January 2005 P. Johansson, D. Madsen Dept.of Cell & Molecular Biology, Uppsala University
  • 2. 2 Overview • Introduction to proteins, structure & classification • Protein Folding • Experimental techniques for structure determination • Structure prediction
  • 3. 3
  • 4. 4 Proteins • Proteins play a crucial role in virtually all biological processes with a broad range of functions. • The activity of an enzyme or the function of a protein is governed by the three-dimensional structure
  • 5. 5 20 amino acids - the building blocks
  • 7. 7 Hydrophilic or hydrophobic..? • Virtually all soluble proteins feature a hydrophobic core surrounded by a hydrophilic surface • But, peptide backbone is inherently polar ? • Solution ; neutralize potential H-donors & acceptors using ordered secondary structure
  • 9. 9 • 3.6 residues / turn • Axial dipole moment • Not Proline & Glycine • Protein surfaces Secondary Structure: a-helix
  • 11. 11 Secondary Structure: b-sheets • Parallel or antiparallel • Alternating side-chains • No mixing • Loops often have polar amino acids
  • 12. 12 Structural classification • Databases – SCOP, ’Structural Classification of Proteins’, manual classification – CATH, ’Class Architecture Topology Homology’, based on the SSAP algorithm – FSSP, ’Family of Structurally Similar Proteins’, based on the DALI algorithm – PClass, ’Protein Classification’ based on the LOCK and 3Dsearch algorithms
  • 13. 13 Structural classification, CATH • Class, four types : – Mainly a – a / b structures – Mainly b – No secondary structure • Arhitecture (fold) • Topology (superfamily) • Homology (family)
  • 15. 15 Structural classification.. • Two types of algorithms – Inter-Molecular, 3D, Rigid Body ; structural alignment in a common coordinate system (hard) e.g. VAST, LOCK.. alg. – Intra-Molecular, 2D, Internal Geometry ; structural alignment using internal distances and angles e.g. DALI, STRUCTURAL, SSAP.. alg.
  • 16. 16 Structural classification, SSAP • SSAP, ‘Sequential Structure Alignment Program’ Basic idea ; The similarity between residue i in molecule A and residue k in molecule B is characterised in terms of their structural surroundings This similarity can be quantified into a score, Sik Based on this similarity score and some specified gap penalty, dynamic programming is used to find the optimal structural alignment
  • 17. 17 Structural classification, SSAP The structural neighborhood of residue i in A compared to residue k in B i k
  • 18. 18 Structural classification, SSAP.. Distance between residue i & j in molecule A ; dA i,j Similarity for two pairs of residues, i j in A & k l in B ; , , b d d a s B kl A ij kl ij    a,b constants Similarity between residue i in A and residue k in B ;         n n m B m k k A m i i k i b d d a S , , , Idea ; Si,k is big if the distances from residue i in A to the 2n nearest neighbours are similar to the corresponding distances around k in B
  • 19. 19 Structural classification, SSAP.. This works well for small structures and local structural alignments - however, insertions and deletions cause problems  unrelated distances HSERAHVFIM.. GQ-VMAC-NW.. i=5 k=4 A : B : - The real algorithm uses Dynamic programming on two levels, first to find which distances to compare  Sik, then to align the structures using these scores
  • 20. 20 Experimental techniques for structure determination • X-ray Crystallography • Nuclear Magnetic Resonance spectroscopy (NMR) • Electron Microscopy/Diffraction • Free electron lasers ?
  • 22. 22 X-ray Crystallography.. • From small molecules to viruses • Information about the positions of individual atoms • Limited information about dynamics • Requires crystals
  • 23. 23
  • 24. 24 NMR • Limited to molecules up to ~50kDa (good quality up to 30 kDa) • Distances between pairs of hydrogen atoms • Lots of information about dynamics • Requires soluble, non-aggregating material • Assignment problem
  • 25. 25 Electron Microscopy/ Diffraction • Low to medium resolution • Limited information about dynamics • Can use very small crystals (nm range) • Can be used for very large molecules and complexes
  • 26. 26
  • 28. 28 Protein Folding • Different sequence  Different structure • Free energy difference small due to large entropy decrease, DG = DH - TDS
  • 29. 29 Structure Prediction Why is structure prediction and especially ab initio calculations hard..? • Many degrees of freedom / residue • Remote noncovalent interactions • Nature does not go through all conformations • Folding assisted by enzymes & chaperones
  • 30. 30 Ab initio calculations used for smaller problems ; • Calculation of affinity • Enzymatic pathways Molecular dynamics
  • 31. 31 Sequence Classification rev. • Class : Secondary structure content • Fold : Major structural similarity. • Superfamily : Probable common evolutionary origin. • Family : Clear evolutionary relationship.
  • 32. 32 • Search sequence data banks for homologs • Search methods e.g. BLAST, PSIBLAST, FASTA… • Homologue in PDB..? Structure Prediction IVTY…PGGG HYW…QHG
  • 33. 33 Multiple sequence / structure alignment • Contains more information than a single sequence for applications like homology modeling and secondary structure prediction • Gives location of conserved parts and residues likely to be buried in the protein core or exposed to solvent Structure Prediction
  • 35. 35 • Statistical Analysis (old fashioned): – For each amino acid type assign it’s ‘propensity’ to be in a helix, sheet, or coil. • Limited accuracy ~55-60%. • Random prediction ~38%. MTLLALGINHKTAP... CCEEEEEECCCCCC... Secondary Structure Prediction
  • 36. 36 • Each residue is classified as: – Ha/Hb, strong helix / strand former. – ha/hb, weak helix / strand former. – I, indifferent. – ba/bb, weak helix/strand breaker. – Ba/Bb, strong helix / strand breaker. The Chou & Fasman Method
  • 37. 37 The Chou & Fasman Method.. • Score each residue: – Ha/ha=1, Ia=0 or ½, Ba/ba=-1. – Hb/hb=1, Ib=0 or ½, Bb/bb=-1. • Helix nucleation: – Score > 4 in a “window” of 6 residues. • Strand nucleation: – Score > 3 in a “window” of 5 residues. • Propagate until score < 1 in a 4 residue “window”.
  • 38. 38 GPSRYIVTLANGK Helix: Strand -1 -1 0 0 -1 1 1 0 1 1 -1 -1 1 -1 -1 -1 .5 1 1 1 1 1 0 0 -1 -1 -2 0 1 2 3 3 1 No nucl. -1.5 .5 2.5 4.5 5 4 3 1 -1 -2.5 -.5 1.5 … 3 1 -1 Nucleation Propagate GPSRYIVTLANGK Result The Chou & Fasman Method..
  • 39. 39 • Neural networks (e.g. the PHD server): – Input: a number of protein sequences + secondary structure. – Output: a trained network that predicts secondary structure elements with ~70% accuracy. • Use many different methods and compare (e.g. the JPred server)! Modern methods
  • 40. 40 Summary • The function of a protein is governed by its structure • Different sequence  Different structure • PDB, protein data bank • Secondary structure prediction is hard, tertiary structure prediction is even harder • Use homologs whenever possible or different methods to assess quality
  • 41. 41