SlideShare a Scribd company logo
1 of 38
Human and Server CAPRI Protein Docking
Prediction Using LZerD with Combined
Scoring Functions
Daisuke Kihara
Department of Biological Sciences
Department of Computer Science
Purdue University, Indiana, USA
1
http://kiharalab.org
CAPRI Round 30 Results
2(Lensink et al., CAPRI30 group paper, 2016)
Overview of Protein Docking
Prediction Using LZerD in CAPRI
3
Re-ranking with
scoring functions
HHPred SparksX
MUFold
TASSER
Phyre2
TASSERlite
MultiCom
Single Chain
Modeling
PRESCO
Sub-unit models
LZerD
~50,000
docking models
Clustering,
RMSD < 5 Å
10 models
MD relaxation Submit
LZerD(Local 3D Zernike descriptor-based Docking program)
4
normal
vector
3DZernike
descriptor
6Å
Interface area
(Venkatraman, Yang, Sael, & Kihara,
BMC Bioinformatics, 2009)
(Lizard)
3D Zernike Descriptors (3DZD)
 An extension of
spherical harmonics
based descriptors
 A 3D object can be
represented by a
series of orthogonal
functions, thus
practically
represented by a
series of coefficients
as a feature vector
 Compact
 Rotation invariant
5
A surface representation of 1ew0A (A) is reconstructed from its 3D Zernike
invariants of the order 5, 10, 15, 20, and 25 (B-F). (Sael & Kihara, 2009)
),()(),,( ϕϑϕϑ m
lnl
m
nl YrRrZ =
),( ϕϑm
lY )(rRnl
),,( ϕϑrZ m
nl
: Spherical harmonics, : radial functions
polynomials in Cartesian coordinates
∫ ≤
=Ω
14
3
.)()(
x
xxx dZf m
nl
m
nl πZernike moments:
Zernike Descriptor:
2
)( m
nl
lm
lm
nlF Ω= ∑
=
−=
Protein Residue Environment SCOre
(PRESCO)
6
within a sphere of 6 or 8 Å
along the main-chain
Center
(Kim & Kihara, Proteins 2014)
Finding Similar Side-Chain Depth
Environment (SDE) from a database
7
Structure
Database
2536 proteins
500 lowest
RMSD
fragments of 9
side-chain
centroids;
Superimposed
with the query
fragment
Select SDE
with the same
number of
side-chain
centroids in
the sphere of
8.0Å
Query SDE
Compute RMSD
of residue-
depth for
corresponding
side-chain
centroids
Sort by depth RMSD
to the query
surface
CASP11 Free Modeling Category
Ranking (Model 1)
8
(http://www.predictioncenter.org/casp11/zscores_final.cgi?formula=assessors)
(Kim & Kihara, Proteins 2015)
DFIRE, GOAP, ITScore Scoring
Functions
 DFIRE (Yaoqi Zhou): statistical distance-
dependent atom contact potential using the
finite ideal-gas reference state
 GOAP (Jeff Skolnick): DFIRE * orientation
dependent term
 ITScore (Xiaoqin Zou):iteratively refined
statistical distance-dependent atom contact
potential
9
The BindML Algorithm
10(La D, & Kihara D, Proteins 2012)
Generating Substitution Models
iPFAM (505 Families)
Model Model
11
iPfam Dataset Benchmark
ROC based on 449
Protein Complexes
12
BindML
Webserver
13
http://kiharalab.org/bindml
(Wei Q, La D, & Kihara D,
Methods in Mol.Biol. In press 2016)
T79 (Round 30)

(Interface 2) Kihara: 3 hits; LZerD: 1 hit

Homodimer

LZerD runs:
 No-interface prediction
 With BindML-consPPISP prediction

LZerD selection strategy:
 Consensus of ITScore and GOAP
 5 from no-interface, 5 from BindML-consPPISP

Kihara selection strategy:
 Manual combination of ITScore, GOAP, DFIRE,
and PRESCO
 10 from no-interface
14
T79 Subunit Model Quality
Chain A
RMSD: 4.0 Å
Chain B
RMSD: 4.0 Å
native
model
15
T79 Human Selected Model
fnat 0.16, L-RMSD 14.1Å, i-RMSD 3.8 Å
native
model
16
T79 Interface Prediction
Method Precision Recall F-Score
BindML 0 0 NA
Cons-PPISP 0.10 0.18 0.12
17
T79 Scores (no-interface prediction)
18
ITScoreGOAP DFIRE
LRMSDfnatiRMSD
T79 Score Comparison
19ITScoreGOAP DFIRE
ITScoreGOAPDFIRE
T79 PRESCO scores
20
lRMSD
PRESCO PRESCO
With Inteface Prediction Without Interface Prediction
T79 Score performance summary
Run Score RFH Hits in top 10
nointerface ITScore 1 (62) 3
nointerface GOAP 1 (72) 3
nointerface DFIRE 1 (111) 5
BindML-
consPPISP
all - -
RFH: rank of first acceptable (medium) hit
21
T91 (Round 30)

Kihara: 8 hits; LZerD: 2 hits

Homodimer

LZerD runs:
 No-interface prediction (with our monomer model)
 With BindML+consPPISP interface prediction
 Zhang1 CASP server model, no-interface prediction

Server selection strategy
 10 from no-interface

Human selection strategy
 Consensus of ITScore, GOAP, PRESCO, and visual
inspection
 5 from no-interface, 5 from Zhang1
22
T91 Subunit Models
Chain C
Our model: RMSD 6.0 Å
Zhang: RMSD 4.9 Å
native
Our model
Zhang1
Chain D
Our model RMSD 6.5 Å
Zhang: RMSD 5.7 Å
23
T91 Human Selected Model
model
native
fnat 0.33, L-RMSD 9.0 Å, I-RMSD 4.2 Å
24
T91 Interface Prediction
Method Precision Recall F-Score
BindML 0.64 0.20 0.30
Cons-PPISP 0.50 0.28 0.36
25
T91 Score (no interface prediction)
26
ITScoreGOAP DFIRE
LRMSDfnatiRMSD
T91 Scores (With Interface prediction)
27
ITScoreGOAP DFIRE
LRMSDfnatiRMSD
T91 Scores (Zhang models)
28ITScoreGOAP DFIRE
LRMSDfnatiRMSD
T91 Zhang1 Score Comparison
29
ITScoreGOAP DFIRE
ITScoreGOAPDFIRE
T91 PRESCO Scores
Without Interface PredictionDocking with Zhang models
PRESCO PRESCO
LRMSD
Top 5 models selected from each
30
T91 Score Performance Summary
Run Score RFH Hits in top 10
nointerface ITScore 2 2
nointerface GOAP 2 1
nointerface DFIRE 1 2
interface ITScore 1042 0
interface GOAP 165 0
interface DFIRE 116 0
zhang1 ITScore 1 (4) 5
zhang1 GOAP 2 (16) 5
zhang1 DFIRE 1 (6) 6
RFH: rank of first acceptable (medium) hit
31
T96 (Round 31)

Heterodimer

Predictor hits: 0 (5 by other groups)

Scorer hits: human 1, server 0 (1 by other
group)
 Human: 6 selected by PRESCO, 4 selected from
with predicted interface, ITScore, GOAP, DFIRE

No PDB file for the native structure available:
metrics computed using two scorer hits
(average L-RMSD/I-RMSD, max fnat)
32
T96 scorer hits
Chain B
S39.M03 (Haliloglu)
fnat 0.22
L-RMSD 5.68 Å
I-RMSD 2.44 Å
Chain A
Chain B
S31.M06 (Kihara)
fnat 0.32
L-RMSD 7.99 Å
I-RMSD 2.67 Å
33
T96 interface prediction
Chain Method Precision Recall F-score
A BindML 0.15 0.2 0.17
Cons-PPISP 0 0 NA
B BindML 0.12 0.11 0.12
Cons-PPISP* NA NA NA
*Cons-PPISP predictions were only for the N-terminal tail; visual
inspection suggests that N-terminal tail is not a likely a binding site, so
these predictions were not used.
34
T96 Scorer-Models Scores
35
ITScoreGOAP DFIRE
lRMSDfnatiRMSD
T96 Score Performance Summary
Score RFH Hits in top 10
ITScore 529 0
GOAP 6 1
DFIRE 125 0
RFH: rank of first acceptable hit
• The hit for GOAP/DFIRE is the same model picked by PRESCO
36
Summary
 Our docking prediction procedure runs LZerD,
and decoys were selected by combining DFIRE,
ITScore, GOAP, and PRESCO. Binding sites were
predicted by BindML and cons-PPISP.
 On the examples shown, PRESCO’s performance
was not as spectacular as we expected from its
performance on single chain str. prediction.
 DFIRE, ITScore, GOAP showed similar,
reasonably good performance.
 Scoring functions performance depends on
subunit model quality.
 The way to use BindML prediction needs to be
improved. 37
Lab Members
38
@kiharalab
Lenna
Peterson
Hyung-
Rae Kim

More Related Content

Similar to DaisukeKihara

Kiharalab Bioinformatics Projects 2019
Kiharalab Bioinformatics Projects 2019Kiharalab Bioinformatics Projects 2019
Kiharalab Bioinformatics Projects 2019Purdue University
 
Igarss1792_v2.ppt
Igarss1792_v2.pptIgarss1792_v2.ppt
Igarss1792_v2.pptgrssieee
 
Structure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein InteractionsStructure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein InteractionsPurdue University
 
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...EL-Hachemi Guerrout
 
Fpga design of clutter generator for radar testing
Fpga design of clutter generator for radar testingFpga design of clutter generator for radar testing
Fpga design of clutter generator for radar testingcsijjournal
 
Fpga Design Of Clutter Generator For Radar Testing
Fpga Design Of Clutter Generator For Radar TestingFpga Design Of Clutter Generator For Radar Testing
Fpga Design Of Clutter Generator For Radar Testingcsijjournal
 
Adaptive blind multiuser detection under impulsive noise using principal comp...
Adaptive blind multiuser detection under impulsive noise using principal comp...Adaptive blind multiuser detection under impulsive noise using principal comp...
Adaptive blind multiuser detection under impulsive noise using principal comp...csandit
 
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...csandit
 
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...cscpconf
 
Protocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural NetworkProtocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural NetworkWaqas Tariq
 
Model-counting Approaches For Nonlinear Numerical Constraints
Model-counting Approaches For Nonlinear Numerical ConstraintsModel-counting Approaches For Nonlinear Numerical Constraints
Model-counting Approaches For Nonlinear Numerical ConstraintsQuoc-Sang Phan
 
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...Gota Morota
 
Reduction of PAPR and Efficient detection ordering scheme for MIMO Transmissi...
Reduction of PAPR and Efficient detection ordering scheme for MIMO Transmissi...Reduction of PAPR and Efficient detection ordering scheme for MIMO Transmissi...
Reduction of PAPR and Efficient detection ordering scheme for MIMO Transmissi...IJERA Editor
 
R for data visualization and graphics
R for data visualization and graphicsR for data visualization and graphics
R for data visualization and graphicsRobert Kabacoff
 
Udacity-Didi Challenge Finalists
Udacity-Didi Challenge FinalistsUdacity-Didi Challenge Finalists
Udacity-Didi Challenge FinalistsDavid Silver
 

Similar to DaisukeKihara (20)

Kiharalab Bioinformatics Projects 2019
Kiharalab Bioinformatics Projects 2019Kiharalab Bioinformatics Projects 2019
Kiharalab Bioinformatics Projects 2019
 
Igarss1792_v2.ppt
Igarss1792_v2.pptIgarss1792_v2.ppt
Igarss1792_v2.ppt
 
main_presentation
main_presentationmain_presentation
main_presentation
 
Structure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein InteractionsStructure Modeling of Disordered Protein Interactions
Structure Modeling of Disordered Protein Interactions
 
Fullprof Refinement
Fullprof RefinementFullprof Refinement
Fullprof Refinement
 
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...Medical Image Segmentation Using Hidden Markov Random Field  A Distributed Ap...
Medical Image Segmentation Using Hidden Markov Random Field A Distributed Ap...
 
Fpga design of clutter generator for radar testing
Fpga design of clutter generator for radar testingFpga design of clutter generator for radar testing
Fpga design of clutter generator for radar testing
 
Fpga Design Of Clutter Generator For Radar Testing
Fpga Design Of Clutter Generator For Radar TestingFpga Design Of Clutter Generator For Radar Testing
Fpga Design Of Clutter Generator For Radar Testing
 
Adaptive blind multiuser detection under impulsive noise using principal comp...
Adaptive blind multiuser detection under impulsive noise using principal comp...Adaptive blind multiuser detection under impulsive noise using principal comp...
Adaptive blind multiuser detection under impulsive noise using principal comp...
 
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
 
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
ADAPTIVE BLIND MULTIUSER DETECTION UNDER IMPULSIVE NOISE USING PRINCIPAL COMP...
 
Protection mode
Protection modeProtection mode
Protection mode
 
Protocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural NetworkProtocol Type Based Intrusion Detection Using RBF Neural Network
Protocol Type Based Intrusion Detection Using RBF Neural Network
 
Model-counting Approaches For Nonlinear Numerical Constraints
Model-counting Approaches For Nonlinear Numerical ConstraintsModel-counting Approaches For Nonlinear Numerical Constraints
Model-counting Approaches For Nonlinear Numerical Constraints
 
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
Application of Bayesian and Sparse Network Models for Assessing Linkage Diseq...
 
Reduction of PAPR and Efficient detection ordering scheme for MIMO Transmissi...
Reduction of PAPR and Efficient detection ordering scheme for MIMO Transmissi...Reduction of PAPR and Efficient detection ordering scheme for MIMO Transmissi...
Reduction of PAPR and Efficient detection ordering scheme for MIMO Transmissi...
 
Lgm saarbrucken
Lgm saarbruckenLgm saarbrucken
Lgm saarbrucken
 
R for data visualization and graphics
R for data visualization and graphicsR for data visualization and graphics
R for data visualization and graphics
 
Udacity-Didi Challenge Finalists
Udacity-Didi Challenge FinalistsUdacity-Didi Challenge Finalists
Udacity-Didi Challenge Finalists
 
Noise Immune and Area Optimized Serial Interface for FPGA based Industrial In...
Noise Immune and Area Optimized Serial Interface for FPGA based Industrial In...Noise Immune and Area Optimized Serial Interface for FPGA based Industrial In...
Noise Immune and Area Optimized Serial Interface for FPGA based Industrial In...
 

Recently uploaded

Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencySheetal Arora
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSarthak Sekhar Mondal
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsSumit Kumar yadav
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticssakshisoni2385
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfSumit Kumar yadav
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfrohankumarsinghrore1
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRDelhi Call girls
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxgindu3009
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfSumit Kumar yadav
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPirithiRaju
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfSumit Kumar yadav
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.Nitya salvi
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)Areesha Ahmad
 

Recently uploaded (20)

Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceuticsPulmonary drug delivery system M.pharm -2nd sem P'ceutics
Pulmonary drug delivery system M.pharm -2nd sem P'ceutics
 
Chemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdfChemistry 4th semester series (krishna).pdf
Chemistry 4th semester series (krishna).pdf
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Botany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdfBotany 4th semester file By Sumit Kumar yadav.pdf
Botany 4th semester file By Sumit Kumar yadav.pdf
 
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdfPests of cotton_Sucking_Pests_Dr.UPR.pdf
Pests of cotton_Sucking_Pests_Dr.UPR.pdf
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
Botany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdfBotany 4th semester series (krishna).pdf
Botany 4th semester series (krishna).pdf
 
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
❤Jammu Kashmir Call Girls 8617697112 Personal Whatsapp Number 💦✅.
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)GBSN - Microbiology (Unit 2)
GBSN - Microbiology (Unit 2)
 

DaisukeKihara

  • 1. Human and Server CAPRI Protein Docking Prediction Using LZerD with Combined Scoring Functions Daisuke Kihara Department of Biological Sciences Department of Computer Science Purdue University, Indiana, USA 1 http://kiharalab.org
  • 2. CAPRI Round 30 Results 2(Lensink et al., CAPRI30 group paper, 2016)
  • 3. Overview of Protein Docking Prediction Using LZerD in CAPRI 3 Re-ranking with scoring functions HHPred SparksX MUFold TASSER Phyre2 TASSERlite MultiCom Single Chain Modeling PRESCO Sub-unit models LZerD ~50,000 docking models Clustering, RMSD < 5 Å 10 models MD relaxation Submit
  • 4. LZerD(Local 3D Zernike descriptor-based Docking program) 4 normal vector 3DZernike descriptor 6Å Interface area (Venkatraman, Yang, Sael, & Kihara, BMC Bioinformatics, 2009) (Lizard)
  • 5. 3D Zernike Descriptors (3DZD)  An extension of spherical harmonics based descriptors  A 3D object can be represented by a series of orthogonal functions, thus practically represented by a series of coefficients as a feature vector  Compact  Rotation invariant 5 A surface representation of 1ew0A (A) is reconstructed from its 3D Zernike invariants of the order 5, 10, 15, 20, and 25 (B-F). (Sael & Kihara, 2009) ),()(),,( ϕϑϕϑ m lnl m nl YrRrZ = ),( ϕϑm lY )(rRnl ),,( ϕϑrZ m nl : Spherical harmonics, : radial functions polynomials in Cartesian coordinates ∫ ≤ =Ω 14 3 .)()( x xxx dZf m nl m nl πZernike moments: Zernike Descriptor: 2 )( m nl lm lm nlF Ω= ∑ = −=
  • 6. Protein Residue Environment SCOre (PRESCO) 6 within a sphere of 6 or 8 Å along the main-chain Center (Kim & Kihara, Proteins 2014)
  • 7. Finding Similar Side-Chain Depth Environment (SDE) from a database 7 Structure Database 2536 proteins 500 lowest RMSD fragments of 9 side-chain centroids; Superimposed with the query fragment Select SDE with the same number of side-chain centroids in the sphere of 8.0Å Query SDE Compute RMSD of residue- depth for corresponding side-chain centroids Sort by depth RMSD to the query surface
  • 8. CASP11 Free Modeling Category Ranking (Model 1) 8 (http://www.predictioncenter.org/casp11/zscores_final.cgi?formula=assessors) (Kim & Kihara, Proteins 2015)
  • 9. DFIRE, GOAP, ITScore Scoring Functions  DFIRE (Yaoqi Zhou): statistical distance- dependent atom contact potential using the finite ideal-gas reference state  GOAP (Jeff Skolnick): DFIRE * orientation dependent term  ITScore (Xiaoqin Zou):iteratively refined statistical distance-dependent atom contact potential 9
  • 10. The BindML Algorithm 10(La D, & Kihara D, Proteins 2012)
  • 11. Generating Substitution Models iPFAM (505 Families) Model Model 11
  • 12. iPfam Dataset Benchmark ROC based on 449 Protein Complexes 12
  • 13. BindML Webserver 13 http://kiharalab.org/bindml (Wei Q, La D, & Kihara D, Methods in Mol.Biol. In press 2016)
  • 14. T79 (Round 30)  (Interface 2) Kihara: 3 hits; LZerD: 1 hit  Homodimer  LZerD runs:  No-interface prediction  With BindML-consPPISP prediction  LZerD selection strategy:  Consensus of ITScore and GOAP  5 from no-interface, 5 from BindML-consPPISP  Kihara selection strategy:  Manual combination of ITScore, GOAP, DFIRE, and PRESCO  10 from no-interface 14
  • 15. T79 Subunit Model Quality Chain A RMSD: 4.0 Å Chain B RMSD: 4.0 Å native model 15
  • 16. T79 Human Selected Model fnat 0.16, L-RMSD 14.1Å, i-RMSD 3.8 Å native model 16
  • 17. T79 Interface Prediction Method Precision Recall F-Score BindML 0 0 NA Cons-PPISP 0.10 0.18 0.12 17
  • 18. T79 Scores (no-interface prediction) 18 ITScoreGOAP DFIRE LRMSDfnatiRMSD
  • 19. T79 Score Comparison 19ITScoreGOAP DFIRE ITScoreGOAPDFIRE
  • 20. T79 PRESCO scores 20 lRMSD PRESCO PRESCO With Inteface Prediction Without Interface Prediction
  • 21. T79 Score performance summary Run Score RFH Hits in top 10 nointerface ITScore 1 (62) 3 nointerface GOAP 1 (72) 3 nointerface DFIRE 1 (111) 5 BindML- consPPISP all - - RFH: rank of first acceptable (medium) hit 21
  • 22. T91 (Round 30)  Kihara: 8 hits; LZerD: 2 hits  Homodimer  LZerD runs:  No-interface prediction (with our monomer model)  With BindML+consPPISP interface prediction  Zhang1 CASP server model, no-interface prediction  Server selection strategy  10 from no-interface  Human selection strategy  Consensus of ITScore, GOAP, PRESCO, and visual inspection  5 from no-interface, 5 from Zhang1 22
  • 23. T91 Subunit Models Chain C Our model: RMSD 6.0 Å Zhang: RMSD 4.9 Å native Our model Zhang1 Chain D Our model RMSD 6.5 Å Zhang: RMSD 5.7 Å 23
  • 24. T91 Human Selected Model model native fnat 0.33, L-RMSD 9.0 Å, I-RMSD 4.2 Å 24
  • 25. T91 Interface Prediction Method Precision Recall F-Score BindML 0.64 0.20 0.30 Cons-PPISP 0.50 0.28 0.36 25
  • 26. T91 Score (no interface prediction) 26 ITScoreGOAP DFIRE LRMSDfnatiRMSD
  • 27. T91 Scores (With Interface prediction) 27 ITScoreGOAP DFIRE LRMSDfnatiRMSD
  • 28. T91 Scores (Zhang models) 28ITScoreGOAP DFIRE LRMSDfnatiRMSD
  • 29. T91 Zhang1 Score Comparison 29 ITScoreGOAP DFIRE ITScoreGOAPDFIRE
  • 30. T91 PRESCO Scores Without Interface PredictionDocking with Zhang models PRESCO PRESCO LRMSD Top 5 models selected from each 30
  • 31. T91 Score Performance Summary Run Score RFH Hits in top 10 nointerface ITScore 2 2 nointerface GOAP 2 1 nointerface DFIRE 1 2 interface ITScore 1042 0 interface GOAP 165 0 interface DFIRE 116 0 zhang1 ITScore 1 (4) 5 zhang1 GOAP 2 (16) 5 zhang1 DFIRE 1 (6) 6 RFH: rank of first acceptable (medium) hit 31
  • 32. T96 (Round 31)  Heterodimer  Predictor hits: 0 (5 by other groups)  Scorer hits: human 1, server 0 (1 by other group)  Human: 6 selected by PRESCO, 4 selected from with predicted interface, ITScore, GOAP, DFIRE  No PDB file for the native structure available: metrics computed using two scorer hits (average L-RMSD/I-RMSD, max fnat) 32
  • 33. T96 scorer hits Chain B S39.M03 (Haliloglu) fnat 0.22 L-RMSD 5.68 Å I-RMSD 2.44 Å Chain A Chain B S31.M06 (Kihara) fnat 0.32 L-RMSD 7.99 Å I-RMSD 2.67 Å 33
  • 34. T96 interface prediction Chain Method Precision Recall F-score A BindML 0.15 0.2 0.17 Cons-PPISP 0 0 NA B BindML 0.12 0.11 0.12 Cons-PPISP* NA NA NA *Cons-PPISP predictions were only for the N-terminal tail; visual inspection suggests that N-terminal tail is not a likely a binding site, so these predictions were not used. 34
  • 36. T96 Score Performance Summary Score RFH Hits in top 10 ITScore 529 0 GOAP 6 1 DFIRE 125 0 RFH: rank of first acceptable hit • The hit for GOAP/DFIRE is the same model picked by PRESCO 36
  • 37. Summary  Our docking prediction procedure runs LZerD, and decoys were selected by combining DFIRE, ITScore, GOAP, and PRESCO. Binding sites were predicted by BindML and cons-PPISP.  On the examples shown, PRESCO’s performance was not as spectacular as we expected from its performance on single chain str. prediction.  DFIRE, ITScore, GOAP showed similar, reasonably good performance.  Scoring functions performance depends on subunit model quality.  The way to use BindML prediction needs to be improved. 37

Editor's Notes

  1. nointerface: our model, no interface restrictioninterface: our model, interface restriction zhang1: zhang1 model, no interface restriction For the server pick, 10 picks are from &amp;quot;nointerface&amp;quot; (our model). The server models were not released until after the server prediction was due.  For the human pick, the 5 using our model and the 5 using zhang1 model are both without interface restriction. We had noticed systematically worse energy scores for interface restricted docking. Based on my analysis of BindML predictions, I think this can be partially mitigated by using a permissive BindML prediction.
  2. For T96 scorer round, we had one hit for human (out of 2 total hits), and it was picked by PRESCO. Unfortunately, when I evaluated the scorer models using the best scorer model, our model is categorized as &amp;quot;incorrect&amp;quot; (it fails by all cutoffs, fnat, lrmsd, and irmsd).  This complex is between an arc-shaped alpha repeat protein (chain A) and eGFP (chain B). Visual inspection shows that in our pick vs the other hit, eGFP has rolled a small amount relative to the alpha repeat protein, so I would guess that the true position is somewhere in between the two hits. I could make an approximate L-RMSD for all the scorer models using the average of the L-RMSD against our hit and the other hit, but there is no obvious way to compute an average fnat.  The other problem is that for t96 and t97 scorer, Kim did not send the numerical scores, he just sent a list of the top 6 models.