SlideShare a Scribd company logo
1
Central University of Bihar
BIS 553: protein modelling and simulation
Denovo structure prediction
Submitted to:- Submitted by:-
Dr. Durg Vijay Singh Shweta Kumari
Roll no- 21
2 nd semester
Central University of Bihar
Patna
2
CONTENT
Sl. No Topic Page No.
1 Introduction 3
2 Need of Ab inito prediction 4
3 Challenges 4
4 Principle of ab inito method 4
5 Denovo Structure Prediction V/S Template
Based Structure Prediction
4-5
6 Successful De Novo Modeling Requirements 6
7 Results from ab­initio 7
8 Domain prediction 7-8
9 Advantages of This Method 8-9
10 Complexity of abinitio methods 9
11 Ab initio methods have recently received 
increased attention in the prediction of loops
10
12 Protein folding and de novo protein design for 
biotechnological applications
10
13 Limitations of De novo Prediction Methods 11
14 CASPs 11-12
15 Application of Denovo structure prediction 12
16 List of de novo protein structure prediction 
software
12-13
17 References 14
3
Introduction:
Predicting the 3D structure without any “prior
knowledge”
•Predicting protein 3D structures from the amino acid sequence still remains
as an unsolved problem after five decades of efforts. If the target protein has a
homologue already solved, the task is relatively easy and high-resolution
models can be built by copying the framework of the solved structure.
•However, such a modelling procedure does not help answer the question of
how and why a protein adopts its specific structure. If structure homologues
(occasionally analogues) do not exist, or exist but cannot be identified, models
have to be constructed from scratch. This procedure, called ab initio
modelling.
•Ab initio modelling is essential for a complete solution to the protein
structure prediction problem; it can also help us understand the
physicochemical principle of how proteins fold in nature.
Thus,” In computational biology, de novo protein structure prediction refers to an
algorithmic process by which protein tertiary structure is predicted from its
amino acid primary sequence.”
4
Need of Ab inito prediction:
• First, in some cases, even a remotely related structural homologue may not be 
available.
• Second, new structure continue to be discovered which could not have been 
identified by methods which rely on comparison to known structure.
• Third, knowledge­based methods have been criticized for predicting protein 
protein structures without having to obtain a fundamental understanding of the 
mechanisms and driving forces of structure formation. Ab initio methods, in 
contrast, base their predictions on physical models for these mechanisms
Challenges:
• Energy functions that can  reliable   discriminat e   native and non­native  
structures.
• Enormous amount of computations.
Principle of ab inito method:
•It is based on the ‘thermodynamic hypothesis’, which states that the native 
structure of a protein is the one for which the free energy achieves the global 
minimum.
•ANFINSEN (1973) showed that all the information necessary for a protein to fold 
to the native state residue in the protein sequence.
•In the absence of large kinetic barriers in the force energy landscape, Anfinsen's 
result and those of large numbers of researchers in the intervening year suggest that 
the native confoermations of most proteins are the lowest free energy conformation 
for their sequences.
Denovo Structure Prediction V/S Template Based
Structure Prediction:
•De novo protein structure modeling is distinguished from Template-based
modeling (TBM) by the fact that no solved homolog to the protein of interest is
known, making efforts to predict protein structure from amino acid sequence
exceedingly difficult.
5
Sl.
No.
Homology
modelling
Fold Reconition De novo
prediction
1 templete based
modelling
templete based remote
homology modelling
templete free
modelling
2 applicable to sequence
having >= 30% homogy
on PDB database
applicable to sequence
having <20% homogy on
PDB database
applicable to any
sequence does not
having homologue
on PDB
3 length of sequence is not
limited
greater than 150AA
not applicable
4 limited search space search space greater than
homology modelling
very large search
space
5 more accurate structure generate less accurate
structure than homology
generate least
accurate structure
6 model quality at atomic
level
model quality at fold level model quality at
atomic level
7 computationally less
expensive
computationally more
expensive than homology
modelling
computationally
most expensive
8 applicable in Drug
designing, virtual
screening, designing site
directed mutagenisis,
characterization of active
site
applicable in Prediction of
protein family, functional
characterization by folding
assignment
applicable in
genome annotation,
domain prediction
and structural
genomics initiatives
6
Template­based methods begin with a sequence, predict the secondary structure, and 
attempt to find a template structure and/or fragments from existing structures in the 
PDB that will fold similar to the target sequence.
source: http://isw3.naist.jp/IS/Bio-Info-Unit/gogroup/study/study-en.html
“Denovo structure prediction method mean it
doesnot relay on homology between the query
sequence and a sequence in the Protein Data
Bank(PDB) to create a template for structure
prediction”
Successful De Novo Modeling Requirements:
•De novo conformation predictors usually function by producing candidate 
conformations (decoys) and then choosing amongst them based on their 
thermodynamic stability and energy state. Most successful predictors will have the 
following three factors in common:
1.  An accurate energy function that corresponds the most thermodynamically 
stable state to the native structure of a protein
2.  An efficient search method capable of quickly identify low­energy states 
through conformational search
3. The ability to select native­like models from a collection of decoy structures.
7
Results from ab­initio:
•Average error 5 Å -Average error 5 Å - 10 Å10 Å
•Function cannot beFunction cannot be predictedpredicted
••Long simulationsLong simulations
fig:fig: Some protein from ESome protein from E.coli.coli predicted at 7.6 Åpredicted at 7.6 Å
(CASP3, H.Scheraga)(CASP3, H.Scheraga)
Domain prediction:
•Domain prediction is a critical pre resquisite to the structur prediction “As the size of
the protein increases, its conformational space also increases.”
•Current denovo methods are limited to protein domain of 150 amino acid domain
residue for alpha-beta protein.
•80 residue for beta folds and 150 for alpha fold only.
•To overcome this two approaches can apply-
1. Increase the size range of denovo structure prediction.
2. Dividing protein into domains prior to attempting two protein structure
prediction.
•"A domain is generally define as a portion of protein that folds independently of the
rest of the protein."
•So dividing a query sequence into their smallest component domain prior to folding
is straight forward way to increase the size of the predictio.
•For many proteins domains division can be easily found while several domain
remains beyond our ability to correctly detached them.
•The determination of domain, family membership and its boundries for multidomain
protein is a vital step in structure annotation/ prediction.
•In brief, most domain protein partial methods relay on hierarchy searching for
domains in query sequence with collection of primary sequence methods, domains
library search and matches to structural domains in the PDB.
8
Advantages of This Method:
•The method is fully automated, and the methodology is the same regardless of the
existing homology between the query protein and the proteins in the structural
database. Thus, it can be easily applied to the structural annotation on a genomic
scale.
•A large success rate, which is competitive with other methods (a large fraction of
correct and accurate predictions), could be expected for the following types of
proteins.
The most advanced ab­initio method is fragment assembly
•Consists by breaking up the sequence in small subsegments of 3 to 9 residues  and 
generating structure for these segments based on a large library of known fragments.
•Decoys are generated from all possible combinations of fragments.
•An energy minimization process is applied to all decoys.
9
•Decoys are clustered and the final models are selected from the center of the largest
clusters.
Complexity of abinitio methods Can be easily explained by the understanding 
conformational complexity
Complexity of    abinitio    methods:
•3100 = 5 × 1047 conformations
•Fastest motions 10- 15 sec so sampling all
•conformations would take 5 × 1032 sec
•60 × 60 × 24 × 365 = 31536000 = 3.1536 x 107
•seconds in a year
•Sampling all conformations will take 1.6 × 1025 years
•years, much longer than the age of the universe
10
Ab initio methods have recently received increased 
attention in the prediction of loops:
•Loops exhibit greater structural variability than Beta-sheets and Alpha helices.
•Loop structure therefore is considerably more difficult to predict than the structure
of the geometrically highly regular Beta-sheets and Alpha helices.
•Loops are often exposed to the surface of proteins and contribute to active and binding sites. 
Consequently, loops are crucial for protein function.
Protein folding and    de novo    protein design for 
biotechnological applications:
Advances and challenges in the fields of protein structure prediction and de novo
protein design focusing on the interplay necessary for success. schematically shows
the roadmap and key challenges in protein structure prediction and de novo protein
design. The past few years have shown impressive applications of computational
structure prediction and design to biotechnology, spanning peptide or antibody
therapeutics, novel biocatalysts, and self-assembling nanomaterials.
Fig: Roadmap of key challenges in understanding how to predict protein sequence to structure to
function and design. Structure prediction begins with a primary amino acid sequence
11
Table. Summary of recent successful computational de novo designed and
redesigned systems and their biotechnological applications
source: http://www.sciencedirect.com/science/article/pii/S0167779913002266#
Limitations of De novo Prediction Methods:
•Pure ab­initio modelling is still very costly  and ineffective  but hybrid 
homology/ab­ initio methods  such as fragment  assembly have better performance
•A major limitation of de novo protein prediction methods is the extraordinary 
amount of computer time required to successfully solve for the native confirmation of
a protein.
•Distributed methods, such as Rosetta@home, have attempted to ameliorate this by 
recruiting individuals who then volunteer idle home computer time in order to 
process data.
•Even these methods face challenges, however. For example, a distributed method 
was utilized by a team of researchers at the University of Washington and the 
Howard Hughes Medical Institute to predict the tertiary structure of the protein 
T0283 from its amino acid sequence. In a blind test comparing the accuracy of this 
distributed technique with the experimentally confirmed structure deposited within 
the Protein Databank (PDB), the predictor produced excellent agreement with the 
deposited structure.
•However, the time and number of computers required for this feat was enormous – 
almost two years and approximately 70,000 home computers, respectively.
“One method proposed to overcome such limitations involves the use of Markov 
models (see Markov chain Monte Carlo). One possibility is that such models could
be constructed in order to assist with free energy computation and protein 
structure prediction, perhaps by refining computational simulations”
CASPs:
•“Progress for all variants of computational protein structure prediction methods is 
assessed in the biannual, community wide Critical Assessment of Protein Structure 
Prediction (CASP) experiments.
12
•To assess the current status of protein structure prediction, John Moult proposed the 
CASP (Critical Assessment of Techniques for Protein Structure Prediction) 
community­wide protein structure prediction experiment.
•The idea is that experimentalists who are about to determine protein structures make 
the sequences of the proteins available and then the protein structure prediction 
community makes predictions that are then assessed by independent reviewers.
•Attendees tested recently developed ab initio protein structure predictions methods
during the CASP3 exercises, conducted in December 1998 in Asilomar, California.
•Among the best performing ab initio methods was the Rosetta method developed by 
David Baker and coworkers.
Application of Denovo structure prediction:
Genome functional annotation and structure genomics initiate two areas of
research where ab initio protein structured prediction could take important
contributions.
1. Genome annotation:
a. The annotation of open reading frames lacking detectable sequence homology to protein of
known function represents a promising applicable for ab initio model.
Low resolution ab initio predicted structure and functional relationships between proteins not
apparent from sequence similarity alone.
Note:- This concept is well illustrated by some example of prediction from CASP4.
b. Ab initio structure could be probed for the presense of residue adopting conserved geometric
motifs (eg. Serin protease catalysis traids).
2) structural genomics initiatives:
a) an initio structure prediction can help guide target selection by focussing experimental structure
determination on those proteins likely to adopt novel folds or to be of particular biological
importance
b) an initio technique do not face the limitations which comes in homology modelling applied on
genomic scale ( need for at least one homologue of known structure with good coverage).
Thus, may be a valuable adjunct to homology methods, filling in structural gaps and
producing much more complete set of model.
13
List of de novo protein structure prediction
software:
Name Method Description Link
EVfold
Evolutionary couplings calculated from correlated
mutations in a protein family, used to predict 3D
structure from sequences alone and to predict
functional residues from coupling strengths. Predicts
both globular and transmembrane proteins.
Webserver
http://evfold
.org/evfold-
web/evfold.
do
QUARK Monte Carlo fragment assembly
On-line server for
protein modeling (best
for ab initio folding in
CASP9)
http://zhang
lab.ccmb.m
ed.umich.ed
u/QUARK/
NovaFold Combination of threading and ab initio folding
Commercial protein
structure prediction
application
http://www.
dnastar.com
/t-products-
NovaFold.a
spx
I-TASSER Threading fragment structure reassembly
On-line server for
protein modeling
http://zhang
lab.ccmb.m
ed.umich.ed
u/I-
TASSER/
Selvita Protein
Modeling Platform
Package of tools for protein modeling
Interactive webserver
and standalone program
including: CABS ab
initio modeling
http://www.
selvita.com/
selvita-
protein-
modeling-
platform.ht
ml
ROBETTA
Rosetta homology modeling and ab initio fragment
assembly with Ginzu domain prediction
Webserver
http://www.
robetta.org/
Rosetta@home
Distributed-computing implementation of Rosetta
algorithm
Downloadable program
http://boinc.
bakerlab.org
/rosetta/
CABS Reduced modeling tool Downloadable program
CABS-FOLD
Server for de novo modeling, can also use alternative
templates (consensus modeling).
Webserver
http://bioco
mp.chem.u
w.edu.pl/C
ABSfold/
Bhageerath
A computational protocol for modeling and
predicting protein structures at the atomic level.
Webserver
http://www.
scfbio-
iitd.res.in/b
hageerath/in
dex.jsp
Abalone Molecular Dynamics folding Program
PEP-FOLD
De novo approach, based on a HMM structural
alphabet
On-line server for
peptide structure
prediction
http://bioser
v.rpbs.univ-
paris-
diderot.fr/se
rvices/PEP-
FOLD/
14
References:
1.Structural Bioinformatics, 2nd
edition (Wiely-Blackwell)
2.Introduction to Bioinfoematics by Zhumur Ghosh
3. http://spin.niddk.nih.gov/bax/software/CSROSETTA/
4 http://zhanglab.ccmb.med.umich.edu/papers/2009_8.pdf
5.http://www.ercim.eu/publication/Ercim_News/enw43/ber
nasconi.html
6.http://zhanglab.ccmb.med.umich.edu/papers/2009_4.pdf
7. http://www2.denizyuret.com/ref/simons/baker-rosetta.pdf
8.http://en.wikipedia.org/wiki/De_novo_protein_structure_
prediction

More Related Content

What's hot

Homology modeling of proteins (ppt)
Homology modeling of proteins (ppt)Homology modeling of proteins (ppt)
Homology modeling of proteins (ppt)
Melvin Alex
 
Protein database
Protein databaseProtein database
Protein database
Khalid Hakeem
 
Motif andpatterndatabase
Motif andpatterndatabaseMotif andpatterndatabase
Motif andpatterndatabase
Sucheta Tripathy
 
Homology modelling
Homology modellingHomology modelling
Homology modelling
Ayesha Choudhury
 
Protein Structure Prediction
Protein Structure PredictionProtein Structure Prediction
Protein Structure Prediction
Balachandramohan Bcm
 
Ab Initio Protein Structure Prediction
Ab Initio Protein Structure PredictionAb Initio Protein Structure Prediction
Ab Initio Protein Structure Prediction
Arindam Ghosh
 
Structure analysis of protein
Structure analysis of proteinStructure analysis of protein
Structure analysis of protein
KAUSHAL SAHU
 
Protein Threading
Protein ThreadingProtein Threading
Protein Threading
SANJANA PANDEY
 
Bioinformatic in drug designing
Bioinformatic in drug designingBioinformatic in drug designing
Bioinformatic in drug designing
Salman Khan
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
Siva Dharshini R
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
SHRIKANT YANKANCHI
 
Est database
Est databaseEst database
Est database
Amit Ruchi Yadav
 
Protein structure analysis
Protein structure analysis Protein structure analysis
Protein structure analysis
Anfal Izaldeen AL KATEEB
 
Ramachandran Plot
Ramachandran PlotRamachandran Plot
Ramachandran Plot
Nishanth S
 
Motif & Domain
Motif & DomainMotif & Domain
Motif & Domain
Anik Banik
 
Clustal
ClustalClustal
Clustal
Benittabenny
 
Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)
Sai Ram
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
SATHIYA NARAYANAN
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
Yogesh Joshi
 
Structural proteomics
Structural proteomicsStructural proteomics
Structural proteomics
Michiko Matsuo
 

What's hot (20)

Homology modeling of proteins (ppt)
Homology modeling of proteins (ppt)Homology modeling of proteins (ppt)
Homology modeling of proteins (ppt)
 
Protein database
Protein databaseProtein database
Protein database
 
Motif andpatterndatabase
Motif andpatterndatabaseMotif andpatterndatabase
Motif andpatterndatabase
 
Homology modelling
Homology modellingHomology modelling
Homology modelling
 
Protein Structure Prediction
Protein Structure PredictionProtein Structure Prediction
Protein Structure Prediction
 
Ab Initio Protein Structure Prediction
Ab Initio Protein Structure PredictionAb Initio Protein Structure Prediction
Ab Initio Protein Structure Prediction
 
Structure analysis of protein
Structure analysis of proteinStructure analysis of protein
Structure analysis of protein
 
Protein Threading
Protein ThreadingProtein Threading
Protein Threading
 
Bioinformatic in drug designing
Bioinformatic in drug designingBioinformatic in drug designing
Bioinformatic in drug designing
 
Secondary protein structure prediction
Secondary protein structure predictionSecondary protein structure prediction
Secondary protein structure prediction
 
Protein protein interactions
Protein protein interactionsProtein protein interactions
Protein protein interactions
 
Est database
Est databaseEst database
Est database
 
Protein structure analysis
Protein structure analysis Protein structure analysis
Protein structure analysis
 
Ramachandran Plot
Ramachandran PlotRamachandran Plot
Ramachandran Plot
 
Motif & Domain
Motif & DomainMotif & Domain
Motif & Domain
 
Clustal
ClustalClustal
Clustal
 
Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)Protein-Protein Interactions (PPIs)
Protein-Protein Interactions (PPIs)
 
Protein Databases
Protein DatabasesProtein Databases
Protein Databases
 
Protein data bank
Protein data bankProtein data bank
Protein data bank
 
Structural proteomics
Structural proteomicsStructural proteomics
Structural proteomics
 

Viewers also liked

Shweta ppt I1FFL
Shweta ppt I1FFLShweta ppt I1FFL
Shweta ppt I1FFL
ShwetA Kumari
 
1
11
Speaking out mena.def
Speaking out mena.defSpeaking out mena.def
Speaking out mena.def
kuma-parlons-en
 
Paper dhh
Paper dhhPaper dhh
Paper dhh
Dam Van Tien
 
Be475 mid term j-b
Be475 mid term j-bBe475 mid term j-b
Be475 mid term j-b
jketchu
 
พื้นฐาน Adobe Flash CS3 ตอนที่2
พื้นฐาน Adobe Flash CS3 ตอนที่2พื้นฐาน Adobe Flash CS3 ตอนที่2
พื้นฐาน Adobe Flash CS3 ตอนที่2
Thanawat Boontan
 
พื้นฐาน Adobe Flash CS3 ตอนที่5
พื้นฐาน Adobe Flash CS3 ตอนที่5พื้นฐาน Adobe Flash CS3 ตอนที่5
พื้นฐาน Adobe Flash CS3 ตอนที่5
Thanawat Boontan
 
L1 handout
L1 handoutL1 handout
Kurikulum 2013
Kurikulum 2013Kurikulum 2013
Kurikulum 2013
Vennylendia Tasya
 
Ikeday
IkedayIkeday
Ikeday
Dina Feller
 
Genre research.
Genre research.Genre research.
Genre research.
edwincameron
 
Sm11415 20140807
Sm11415 20140807Sm11415 20140807
Sm11415 20140807
Shawn Invento
 
Corrected draft programme dec 2014
Corrected draft programme dec 2014Corrected draft programme dec 2014
Corrected draft programme dec 2014
rahulsne
 
Démarrer une session de formation
Démarrer une session de formationDémarrer une session de formation
Démarrer une session de formation
kuma-parlons-en
 
Puget
PugetPuget
Puget
jketchu
 
X ray
X ray X ray
Wings power point1
Wings power point1Wings power point1
Wings power point1
mah9473
 
Be475 final j b version 3
Be475 final j b version 3Be475 final j b version 3
Be475 final j b version 3
jketchu
 
Burak gundogdu
Burak gundogduBurak gundogdu
Burak gundogdu
Burak Gündoğdu
 
Puget Puget sound restoration 424 final
Puget Puget sound restoration 424 finalPuget Puget sound restoration 424 final
Puget Puget sound restoration 424 final
jketchu
 

Viewers also liked (20)

Shweta ppt I1FFL
Shweta ppt I1FFLShweta ppt I1FFL
Shweta ppt I1FFL
 
1
11
1
 
Speaking out mena.def
Speaking out mena.defSpeaking out mena.def
Speaking out mena.def
 
Paper dhh
Paper dhhPaper dhh
Paper dhh
 
Be475 mid term j-b
Be475 mid term j-bBe475 mid term j-b
Be475 mid term j-b
 
พื้นฐาน Adobe Flash CS3 ตอนที่2
พื้นฐาน Adobe Flash CS3 ตอนที่2พื้นฐาน Adobe Flash CS3 ตอนที่2
พื้นฐาน Adobe Flash CS3 ตอนที่2
 
พื้นฐาน Adobe Flash CS3 ตอนที่5
พื้นฐาน Adobe Flash CS3 ตอนที่5พื้นฐาน Adobe Flash CS3 ตอนที่5
พื้นฐาน Adobe Flash CS3 ตอนที่5
 
L1 handout
L1 handoutL1 handout
L1 handout
 
Kurikulum 2013
Kurikulum 2013Kurikulum 2013
Kurikulum 2013
 
Ikeday
IkedayIkeday
Ikeday
 
Genre research.
Genre research.Genre research.
Genre research.
 
Sm11415 20140807
Sm11415 20140807Sm11415 20140807
Sm11415 20140807
 
Corrected draft programme dec 2014
Corrected draft programme dec 2014Corrected draft programme dec 2014
Corrected draft programme dec 2014
 
Démarrer une session de formation
Démarrer une session de formationDémarrer une session de formation
Démarrer une session de formation
 
Puget
PugetPuget
Puget
 
X ray
X ray X ray
X ray
 
Wings power point1
Wings power point1Wings power point1
Wings power point1
 
Be475 final j b version 3
Be475 final j b version 3Be475 final j b version 3
Be475 final j b version 3
 
Burak gundogdu
Burak gundogduBurak gundogdu
Burak gundogdu
 
Puget Puget sound restoration 424 final
Puget Puget sound restoration 424 finalPuget Puget sound restoration 424 final
Puget Puget sound restoration 424 final
 

Similar to De novo str_prediction

protein design, principles and examples.pptx
protein design, principles and examples.pptxprotein design, principles and examples.pptx
protein design, principles and examples.pptx
GopiChand121
 
protein Modeling Abi.pptx
protein Modeling Abi.pptxprotein Modeling Abi.pptx
protein Modeling Abi.pptx
MuhammadRizwan863722
 
Protien Structure Prediction
Protien Structure PredictionProtien Structure Prediction
Protien Structure Prediction
SelimReza76
 
threading and homology modelling methods
threading and homology modelling methodsthreading and homology modelling methods
threading and homology modelling methods
mohammed muzammil
 
Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptx
ashharnomani
 
L1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptxL1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptx
kigaruantony
 
Computational Prediction of Protein Structure.pptx
Computational Prediction of Protein Structure.pptxComputational Prediction of Protein Structure.pptx
Computational Prediction of Protein Structure.pptx
ShraddhaRaut43
 
Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modeling
pkchoudhury
 
Modelling Proteins By Computational Structural Biology
Modelling Proteins By Computational Structural BiologyModelling Proteins By Computational Structural Biology
Modelling Proteins By Computational Structural Biology
Antonio E. Serrano
 
cloudapplications.pptx
cloudapplications.pptxcloudapplications.pptx
cloudapplications.pptx
Dr. Thippeswamy S.
 
Cloud applications - Protein Structure Predication and gene expression data...
Cloud applications - Protein Structure Predication  and  gene expression data...Cloud applications - Protein Structure Predication  and  gene expression data...
Cloud applications - Protein Structure Predication and gene expression data...
Pushpendra Singh Dangi
 
HOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAYHOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAY
Shikha Popali
 
58.Comparative modelling of cellulase from Aspergillus terreus
58.Comparative modelling of cellulase from Aspergillus terreus58.Comparative modelling of cellulase from Aspergillus terreus
58.Comparative modelling of cellulase from Aspergillus terreus
Annadurai B
 
In silico structure prediction
In silico structure predictionIn silico structure prediction
In silico structure prediction
Subin E K
 
Protein struc pred-Ab initio and other methods as a short introduction.ppt
Protein struc pred-Ab initio and other methods as a short introduction.pptProtein struc pred-Ab initio and other methods as a short introduction.ppt
Protein struc pred-Ab initio and other methods as a short introduction.ppt
60BT119YAZHINIK
 
Homology modeling: Modeller
Homology modeling: ModellerHomology modeling: Modeller
Comparative Protein Structure Modeling and itsApplications
Comparative Protein Structure Modeling and itsApplicationsComparative Protein Structure Modeling and itsApplications
Comparative Protein Structure Modeling and itsApplications
LynellBull52
 
Class powerpoint.ppt
Class powerpoint.pptClass powerpoint.ppt
Class powerpoint.ppt
MaryamFazeli7
 
Drug discovery presentation
Drug discovery presentationDrug discovery presentation
Drug discovery presentation
Theertha Raveendran
 
protein structure prediction presentation .pptx
protein structure prediction presentation .pptxprotein structure prediction presentation .pptx
protein structure prediction presentation .pptx
abdulahad563527
 

Similar to De novo str_prediction (20)

protein design, principles and examples.pptx
protein design, principles and examples.pptxprotein design, principles and examples.pptx
protein design, principles and examples.pptx
 
protein Modeling Abi.pptx
protein Modeling Abi.pptxprotein Modeling Abi.pptx
protein Modeling Abi.pptx
 
Protien Structure Prediction
Protien Structure PredictionProtien Structure Prediction
Protien Structure Prediction
 
threading and homology modelling methods
threading and homology modelling methodsthreading and homology modelling methods
threading and homology modelling methods
 
Computational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptxComputational Prediction Of Protein-1.pptx
Computational Prediction Of Protein-1.pptx
 
L1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptxL1Protein_Structure_Analysis.pptx
L1Protein_Structure_Analysis.pptx
 
Computational Prediction of Protein Structure.pptx
Computational Prediction of Protein Structure.pptxComputational Prediction of Protein Structure.pptx
Computational Prediction of Protein Structure.pptx
 
Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modeling
 
Modelling Proteins By Computational Structural Biology
Modelling Proteins By Computational Structural BiologyModelling Proteins By Computational Structural Biology
Modelling Proteins By Computational Structural Biology
 
cloudapplications.pptx
cloudapplications.pptxcloudapplications.pptx
cloudapplications.pptx
 
Cloud applications - Protein Structure Predication and gene expression data...
Cloud applications - Protein Structure Predication  and  gene expression data...Cloud applications - Protein Structure Predication  and  gene expression data...
Cloud applications - Protein Structure Predication and gene expression data...
 
HOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAYHOMOLOGY MODELING IN EASIER WAY
HOMOLOGY MODELING IN EASIER WAY
 
58.Comparative modelling of cellulase from Aspergillus terreus
58.Comparative modelling of cellulase from Aspergillus terreus58.Comparative modelling of cellulase from Aspergillus terreus
58.Comparative modelling of cellulase from Aspergillus terreus
 
In silico structure prediction
In silico structure predictionIn silico structure prediction
In silico structure prediction
 
Protein struc pred-Ab initio and other methods as a short introduction.ppt
Protein struc pred-Ab initio and other methods as a short introduction.pptProtein struc pred-Ab initio and other methods as a short introduction.ppt
Protein struc pred-Ab initio and other methods as a short introduction.ppt
 
Homology modeling: Modeller
Homology modeling: ModellerHomology modeling: Modeller
Homology modeling: Modeller
 
Comparative Protein Structure Modeling and itsApplications
Comparative Protein Structure Modeling and itsApplicationsComparative Protein Structure Modeling and itsApplications
Comparative Protein Structure Modeling and itsApplications
 
Class powerpoint.ppt
Class powerpoint.pptClass powerpoint.ppt
Class powerpoint.ppt
 
Drug discovery presentation
Drug discovery presentationDrug discovery presentation
Drug discovery presentation
 
protein structure prediction presentation .pptx
protein structure prediction presentation .pptxprotein structure prediction presentation .pptx
protein structure prediction presentation .pptx
 

More from ShwetA Kumari

Combanitorial approach for drug discovery
Combanitorial approach for drug discoveryCombanitorial approach for drug discovery
Combanitorial approach for drug discovery
ShwetA Kumari
 
Final shweta protein kinase cascade
Final shweta protein kinase cascadeFinal shweta protein kinase cascade
Final shweta protein kinase cascade
ShwetA Kumari
 
Data mining
Data miningData mining
Data mining
ShwetA Kumari
 
Distance measure between two biological sequences
Distance  measure between  two biological  sequences Distance  measure between  two biological  sequences
Distance measure between two biological sequences
ShwetA Kumari
 
modelling assignment
modelling assignmentmodelling assignment
modelling assignment
ShwetA Kumari
 
paper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular proteinpaper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular protein
ShwetA Kumari
 
Aminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and PyrrolysineAminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and Pyrrolysine
ShwetA Kumari
 
dot plot analysis
dot plot analysisdot plot analysis
dot plot analysis
ShwetA Kumari
 
protein kinase cascade
protein kinase cascadeprotein kinase cascade
protein kinase cascade
ShwetA Kumari
 
Evolution
EvolutionEvolution
Evolution
ShwetA Kumari
 
Natural selection
Natural selection Natural selection
Natural selection
ShwetA Kumari
 
Genetic drift
Genetic drift Genetic drift
Genetic drift
ShwetA Kumari
 

More from ShwetA Kumari (12)

Combanitorial approach for drug discovery
Combanitorial approach for drug discoveryCombanitorial approach for drug discovery
Combanitorial approach for drug discovery
 
Final shweta protein kinase cascade
Final shweta protein kinase cascadeFinal shweta protein kinase cascade
Final shweta protein kinase cascade
 
Data mining
Data miningData mining
Data mining
 
Distance measure between two biological sequences
Distance  measure between  two biological  sequences Distance  measure between  two biological  sequences
Distance measure between two biological sequences
 
modelling assignment
modelling assignmentmodelling assignment
modelling assignment
 
paper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular proteinpaper ppt 2ndry str in globular protein
paper ppt 2ndry str in globular protein
 
Aminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and PyrrolysineAminiacid Selenocysteine and Pyrrolysine
Aminiacid Selenocysteine and Pyrrolysine
 
dot plot analysis
dot plot analysisdot plot analysis
dot plot analysis
 
protein kinase cascade
protein kinase cascadeprotein kinase cascade
protein kinase cascade
 
Evolution
EvolutionEvolution
Evolution
 
Natural selection
Natural selection Natural selection
Natural selection
 
Genetic drift
Genetic drift Genetic drift
Genetic drift
 

Recently uploaded

Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
International Food Policy Research Institute- South Asia Office
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
MaheshaNanjegowda
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
Leonel Morgado
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
İsa Badur
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
University of Maribor
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
Areesha Ahmad
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
RDhivya6
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
Scintica Instrumentation
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
Texas Alliance of Groundwater Districts
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Selcen Ozturkcan
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
LengamoLAppostilic
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 

Recently uploaded (20)

Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
Basics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different formsBasics of crystallography, crystal systems, classes and different forms
Basics of crystallography, crystal systems, classes and different forms
 
Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...Authoring a personal GPT for your research and practice: How we created the Q...
Authoring a personal GPT for your research and practice: How we created the Q...
 
aziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobelaziz sancar nobel prize winner: from mardin to nobel
aziz sancar nobel prize winner: from mardin to nobel
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
Randomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNERandomised Optimisation Algorithms in DAPHNE
Randomised Optimisation Algorithms in DAPHNE
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
 
23PH301 - Optics - Optical Lenses.pptx
23PH301 - Optics  -  Optical Lenses.pptx23PH301 - Optics  -  Optical Lenses.pptx
23PH301 - Optics - Optical Lenses.pptx
 
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
(June 12, 2024) Webinar: Development of PET theranostics targeting the molecu...
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdfMending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
Mending Clothing to Support Sustainable Fashion_CIMaR 2024.pdf
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 

De novo str_prediction

  • 1. 1 Central University of Bihar BIS 553: protein modelling and simulation Denovo structure prediction Submitted to:- Submitted by:- Dr. Durg Vijay Singh Shweta Kumari Roll no- 21 2 nd semester Central University of Bihar Patna
  • 2. 2 CONTENT Sl. No Topic Page No. 1 Introduction 3 2 Need of Ab inito prediction 4 3 Challenges 4 4 Principle of ab inito method 4 5 Denovo Structure Prediction V/S Template Based Structure Prediction 4-5 6 Successful De Novo Modeling Requirements 6 7 Results from ab­initio 7 8 Domain prediction 7-8 9 Advantages of This Method 8-9 10 Complexity of abinitio methods 9 11 Ab initio methods have recently received  increased attention in the prediction of loops 10 12 Protein folding and de novo protein design for  biotechnological applications 10 13 Limitations of De novo Prediction Methods 11 14 CASPs 11-12 15 Application of Denovo structure prediction 12 16 List of de novo protein structure prediction  software 12-13 17 References 14
  • 3. 3 Introduction: Predicting the 3D structure without any “prior knowledge” •Predicting protein 3D structures from the amino acid sequence still remains as an unsolved problem after five decades of efforts. If the target protein has a homologue already solved, the task is relatively easy and high-resolution models can be built by copying the framework of the solved structure. •However, such a modelling procedure does not help answer the question of how and why a protein adopts its specific structure. If structure homologues (occasionally analogues) do not exist, or exist but cannot be identified, models have to be constructed from scratch. This procedure, called ab initio modelling. •Ab initio modelling is essential for a complete solution to the protein structure prediction problem; it can also help us understand the physicochemical principle of how proteins fold in nature. Thus,” In computational biology, de novo protein structure prediction refers to an algorithmic process by which protein tertiary structure is predicted from its amino acid primary sequence.”
  • 4. 4 Need of Ab inito prediction: • First, in some cases, even a remotely related structural homologue may not be  available. • Second, new structure continue to be discovered which could not have been  identified by methods which rely on comparison to known structure. • Third, knowledge­based methods have been criticized for predicting protein  protein structures without having to obtain a fundamental understanding of the  mechanisms and driving forces of structure formation. Ab initio methods, in  contrast, base their predictions on physical models for these mechanisms Challenges: • Energy functions that can  reliable   discriminat e   native and non­native   structures. • Enormous amount of computations. Principle of ab inito method: •It is based on the ‘thermodynamic hypothesis’, which states that the native  structure of a protein is the one for which the free energy achieves the global  minimum. •ANFINSEN (1973) showed that all the information necessary for a protein to fold  to the native state residue in the protein sequence. •In the absence of large kinetic barriers in the force energy landscape, Anfinsen's  result and those of large numbers of researchers in the intervening year suggest that  the native confoermations of most proteins are the lowest free energy conformation  for their sequences. Denovo Structure Prediction V/S Template Based Structure Prediction: •De novo protein structure modeling is distinguished from Template-based modeling (TBM) by the fact that no solved homolog to the protein of interest is known, making efforts to predict protein structure from amino acid sequence exceedingly difficult.
  • 5. 5 Sl. No. Homology modelling Fold Reconition De novo prediction 1 templete based modelling templete based remote homology modelling templete free modelling 2 applicable to sequence having >= 30% homogy on PDB database applicable to sequence having <20% homogy on PDB database applicable to any sequence does not having homologue on PDB 3 length of sequence is not limited greater than 150AA not applicable 4 limited search space search space greater than homology modelling very large search space 5 more accurate structure generate less accurate structure than homology generate least accurate structure 6 model quality at atomic level model quality at fold level model quality at atomic level 7 computationally less expensive computationally more expensive than homology modelling computationally most expensive 8 applicable in Drug designing, virtual screening, designing site directed mutagenisis, characterization of active site applicable in Prediction of protein family, functional characterization by folding assignment applicable in genome annotation, domain prediction and structural genomics initiatives
  • 6. 6 Template­based methods begin with a sequence, predict the secondary structure, and  attempt to find a template structure and/or fragments from existing structures in the  PDB that will fold similar to the target sequence. source: http://isw3.naist.jp/IS/Bio-Info-Unit/gogroup/study/study-en.html “Denovo structure prediction method mean it doesnot relay on homology between the query sequence and a sequence in the Protein Data Bank(PDB) to create a template for structure prediction” Successful De Novo Modeling Requirements: •De novo conformation predictors usually function by producing candidate  conformations (decoys) and then choosing amongst them based on their  thermodynamic stability and energy state. Most successful predictors will have the  following three factors in common: 1.  An accurate energy function that corresponds the most thermodynamically  stable state to the native structure of a protein 2.  An efficient search method capable of quickly identify low­energy states  through conformational search 3. The ability to select native­like models from a collection of decoy structures.
  • 7. 7 Results from ab­initio: •Average error 5 Å -Average error 5 Å - 10 Å10 Å •Function cannot beFunction cannot be predictedpredicted ••Long simulationsLong simulations fig:fig: Some protein from ESome protein from E.coli.coli predicted at 7.6 Åpredicted at 7.6 Å (CASP3, H.Scheraga)(CASP3, H.Scheraga) Domain prediction: •Domain prediction is a critical pre resquisite to the structur prediction “As the size of the protein increases, its conformational space also increases.” •Current denovo methods are limited to protein domain of 150 amino acid domain residue for alpha-beta protein. •80 residue for beta folds and 150 for alpha fold only. •To overcome this two approaches can apply- 1. Increase the size range of denovo structure prediction. 2. Dividing protein into domains prior to attempting two protein structure prediction. •"A domain is generally define as a portion of protein that folds independently of the rest of the protein." •So dividing a query sequence into their smallest component domain prior to folding is straight forward way to increase the size of the predictio. •For many proteins domains division can be easily found while several domain remains beyond our ability to correctly detached them. •The determination of domain, family membership and its boundries for multidomain protein is a vital step in structure annotation/ prediction. •In brief, most domain protein partial methods relay on hierarchy searching for domains in query sequence with collection of primary sequence methods, domains library search and matches to structural domains in the PDB.
  • 8. 8 Advantages of This Method: •The method is fully automated, and the methodology is the same regardless of the existing homology between the query protein and the proteins in the structural database. Thus, it can be easily applied to the structural annotation on a genomic scale. •A large success rate, which is competitive with other methods (a large fraction of correct and accurate predictions), could be expected for the following types of proteins. The most advanced ab­initio method is fragment assembly •Consists by breaking up the sequence in small subsegments of 3 to 9 residues  and  generating structure for these segments based on a large library of known fragments. •Decoys are generated from all possible combinations of fragments. •An energy minimization process is applied to all decoys.
  • 9. 9 •Decoys are clustered and the final models are selected from the center of the largest clusters. Complexity of abinitio methods Can be easily explained by the understanding  conformational complexity Complexity of    abinitio    methods: •3100 = 5 × 1047 conformations •Fastest motions 10- 15 sec so sampling all •conformations would take 5 × 1032 sec •60 × 60 × 24 × 365 = 31536000 = 3.1536 x 107 •seconds in a year •Sampling all conformations will take 1.6 × 1025 years •years, much longer than the age of the universe
  • 10. 10 Ab initio methods have recently received increased  attention in the prediction of loops: •Loops exhibit greater structural variability than Beta-sheets and Alpha helices. •Loop structure therefore is considerably more difficult to predict than the structure of the geometrically highly regular Beta-sheets and Alpha helices. •Loops are often exposed to the surface of proteins and contribute to active and binding sites.  Consequently, loops are crucial for protein function. Protein folding and    de novo    protein design for  biotechnological applications: Advances and challenges in the fields of protein structure prediction and de novo protein design focusing on the interplay necessary for success. schematically shows the roadmap and key challenges in protein structure prediction and de novo protein design. The past few years have shown impressive applications of computational structure prediction and design to biotechnology, spanning peptide or antibody therapeutics, novel biocatalysts, and self-assembling nanomaterials. Fig: Roadmap of key challenges in understanding how to predict protein sequence to structure to function and design. Structure prediction begins with a primary amino acid sequence
  • 11. 11 Table. Summary of recent successful computational de novo designed and redesigned systems and their biotechnological applications source: http://www.sciencedirect.com/science/article/pii/S0167779913002266# Limitations of De novo Prediction Methods: •Pure ab­initio modelling is still very costly  and ineffective  but hybrid  homology/ab­ initio methods  such as fragment  assembly have better performance •A major limitation of de novo protein prediction methods is the extraordinary  amount of computer time required to successfully solve for the native confirmation of a protein. •Distributed methods, such as Rosetta@home, have attempted to ameliorate this by  recruiting individuals who then volunteer idle home computer time in order to  process data. •Even these methods face challenges, however. For example, a distributed method  was utilized by a team of researchers at the University of Washington and the  Howard Hughes Medical Institute to predict the tertiary structure of the protein  T0283 from its amino acid sequence. In a blind test comparing the accuracy of this  distributed technique with the experimentally confirmed structure deposited within  the Protein Databank (PDB), the predictor produced excellent agreement with the  deposited structure. •However, the time and number of computers required for this feat was enormous –  almost two years and approximately 70,000 home computers, respectively. “One method proposed to overcome such limitations involves the use of Markov  models (see Markov chain Monte Carlo). One possibility is that such models could be constructed in order to assist with free energy computation and protein  structure prediction, perhaps by refining computational simulations” CASPs: •“Progress for all variants of computational protein structure prediction methods is  assessed in the biannual, community wide Critical Assessment of Protein Structure  Prediction (CASP) experiments.
  • 12. 12 •To assess the current status of protein structure prediction, John Moult proposed the  CASP (Critical Assessment of Techniques for Protein Structure Prediction)  community­wide protein structure prediction experiment. •The idea is that experimentalists who are about to determine protein structures make  the sequences of the proteins available and then the protein structure prediction  community makes predictions that are then assessed by independent reviewers. •Attendees tested recently developed ab initio protein structure predictions methods during the CASP3 exercises, conducted in December 1998 in Asilomar, California. •Among the best performing ab initio methods was the Rosetta method developed by  David Baker and coworkers. Application of Denovo structure prediction: Genome functional annotation and structure genomics initiate two areas of research where ab initio protein structured prediction could take important contributions. 1. Genome annotation: a. The annotation of open reading frames lacking detectable sequence homology to protein of known function represents a promising applicable for ab initio model. Low resolution ab initio predicted structure and functional relationships between proteins not apparent from sequence similarity alone. Note:- This concept is well illustrated by some example of prediction from CASP4. b. Ab initio structure could be probed for the presense of residue adopting conserved geometric motifs (eg. Serin protease catalysis traids). 2) structural genomics initiatives: a) an initio structure prediction can help guide target selection by focussing experimental structure determination on those proteins likely to adopt novel folds or to be of particular biological importance b) an initio technique do not face the limitations which comes in homology modelling applied on genomic scale ( need for at least one homologue of known structure with good coverage). Thus, may be a valuable adjunct to homology methods, filling in structural gaps and producing much more complete set of model.
  • 13. 13 List of de novo protein structure prediction software: Name Method Description Link EVfold Evolutionary couplings calculated from correlated mutations in a protein family, used to predict 3D structure from sequences alone and to predict functional residues from coupling strengths. Predicts both globular and transmembrane proteins. Webserver http://evfold .org/evfold- web/evfold. do QUARK Monte Carlo fragment assembly On-line server for protein modeling (best for ab initio folding in CASP9) http://zhang lab.ccmb.m ed.umich.ed u/QUARK/ NovaFold Combination of threading and ab initio folding Commercial protein structure prediction application http://www. dnastar.com /t-products- NovaFold.a spx I-TASSER Threading fragment structure reassembly On-line server for protein modeling http://zhang lab.ccmb.m ed.umich.ed u/I- TASSER/ Selvita Protein Modeling Platform Package of tools for protein modeling Interactive webserver and standalone program including: CABS ab initio modeling http://www. selvita.com/ selvita- protein- modeling- platform.ht ml ROBETTA Rosetta homology modeling and ab initio fragment assembly with Ginzu domain prediction Webserver http://www. robetta.org/ Rosetta@home Distributed-computing implementation of Rosetta algorithm Downloadable program http://boinc. bakerlab.org /rosetta/ CABS Reduced modeling tool Downloadable program CABS-FOLD Server for de novo modeling, can also use alternative templates (consensus modeling). Webserver http://bioco mp.chem.u w.edu.pl/C ABSfold/ Bhageerath A computational protocol for modeling and predicting protein structures at the atomic level. Webserver http://www. scfbio- iitd.res.in/b hageerath/in dex.jsp Abalone Molecular Dynamics folding Program PEP-FOLD De novo approach, based on a HMM structural alphabet On-line server for peptide structure prediction http://bioser v.rpbs.univ- paris- diderot.fr/se rvices/PEP- FOLD/
  • 14. 14 References: 1.Structural Bioinformatics, 2nd edition (Wiely-Blackwell) 2.Introduction to Bioinfoematics by Zhumur Ghosh 3. http://spin.niddk.nih.gov/bax/software/CSROSETTA/ 4 http://zhanglab.ccmb.med.umich.edu/papers/2009_8.pdf 5.http://www.ercim.eu/publication/Ercim_News/enw43/ber nasconi.html 6.http://zhanglab.ccmb.med.umich.edu/papers/2009_4.pdf 7. http://www2.denizyuret.com/ref/simons/baker-rosetta.pdf 8.http://en.wikipedia.org/wiki/De_novo_protein_structure_ prediction