SlideShare a Scribd company logo
dkihara@purdue.edu 1
Bioinformatics Projects in
Kiharalab 2019
Daisuke Kihara
Professor
Department of Biological Sciences
Department of Computer Science
Purdue University, USA
dkihara@purdue.edu
http://kiharalab.org
Active Projects
 Protein-protein docking prediction
 Protein structure prediction
 Protein structure modeling for cryo-
EM
 Protein structure/EM map database
search
 Virtual drug screening
 Protein function prediction
 Moonlighting proteins
2
LZerD
3
normal
vector
3DZernike
descriptor
6Å
(Venkatraman, Yang, Sael, & Kihara, BMC Bioinformatics, 10: 407, 2009)
Available at:
http://kiharalab.org/proteindocking/
3D Zernike Descriptors (3DZD)
 An extension of
spherical harmonics
based descriptors
 A 3D object can be
represented by a
series of orthogonal
functions, thus
practically
represented by a
series of coefficients
as a feature vector
 Compact
 Rotation invariant
4
A surface representation of 1ew0A (A) is reconstructed from its 3D Zernike
invariants of the order 5, 10, 15, 20, and 25 (B-F). (Sael & Kihara, 2009)
),()(),,(  m
lnl
m
nl YrRrZ 
),( m
lY )(rRnl
),,( rZ m
nl
: Spherical harmonics, : radial functions
polynomials in Cartesian coordinates
 

14
3
.)()(
x
xxx dZf m
nl
m
nl Zernike moments:
Zernike Descriptor:
2
)( m
nl
lm
lm
nlF  


Multi-LZerD
5
Esquivel-Rodríguez, J et al.
Proteins, 2012)
Shape-
based score
Physics-
based score
Multiple Docking Examples
6
RMSD: 0.53Å
RMSD: 0.73Å RMSD: 1.09Å
RMSD: 1.57Å
Assembly Order Prediction for
Protein Complexes
7
Urease accessary complex:
Dimer of UreF/UreH dimer is formed first, which binds to G dimer
Peterson LX, Togawa Y, Esquivel-Rodriguez J, Terashi G, Christoffer C, et al. (2018)
Modeling the assembly order of multimeric heteroprotein complexes. PLOS Computational
The number of votes for assembly pathways of
4hi0 across generations of the genetic
algorithm
8
Peterson LX, Togawa Y, Esquivel-Rodriguez J, Terashi G, Christoffer C, et al. (2018) Modeling the assembly order of
multimeric heteroprotein complexes. PLOS Computational Biology 14(1): e1005937.
https://doi.org/10.1371/journal.pcbi.1005937
http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005937
IDR-LZerD Overview
Disordered
sequence
Protein
structure
Docked fragments
Docked
fragment
combinatio
ns
DIEGLVELLNRVQSSGAHDQR
DIEGLVELL QSSGAHDQR
ELLNRVQSS
Refined
models
3
1
2
4
Predicted
fragment
structures
1. Predict multiple (30)
conformations for each 9
residue-window based
on sequence propensity
2. Dock fragments
3. Combine docked
fragments to create full
model
4. Refine
9
(Peterson, Roy, Christoffer, Terashi, Kihara, PLOS Comp. Biol. 2017)
Combining Docked Fragments
4,500 4,500 4,500
Seq.
window A
Seq.
window B
Seq.
window C
Docked
fragments
10
Eliminate pairs based on
distance and angle cutoffs:
Example of Unbound Predictions
11
PDB ID 1jya
69 AA
Rank 2
L-RMSD 9.9 Å
I-RMSD 5.4 Å
PDB ID 4ah2
20 AA
Rank 1
L-RMSD 5.8 Å
I-RMSD 2.4 Å
PDB ID 1ijj
25 AA
Rank 9
L-RMSD 5.1 Å
I-RMSD 2.5 Å
PDB ID 1l3e
44 AA
Rank 3
L-RMSD 7.4 Å
I-RMSD 6.3 Å
Model
Native
MAINMAST: De novo Structure Modeling for
medium (~4 Å) Resolution Maps
EM Map
Tracing
C⍺ model
EMD-6555 Porcine circovirus 2
Resolution: 2.9 Å
De-novo modeling: does not require known protein
structures.
Fully automated: No need for visual inspection and human
intervention.
12
(G. Terashi & D. Kihara, Nature Communications, 2018)
MAINMAST: (MAINchain Model trAcing from
Spanning Tree)
The longest path (red) does not
always cover the whole protein
structure.
13
EM map
Find local dense points
(LDPs) by Mean Shift
Connect All points by
Minimum Spanning
Tree
Refine Tree Structure
Thread sequence on
the longest path
C⍺ models ranked by threading score
Rapid & Seamless
Screening of Similar/
Complementary 3D
Molecules
14
d
15http://kiharalab.org/3d-surfer/
(La et al., Bioinformatics 2009)
(Xiong et al., Methods in Mol Biol, 2013)
16http://kiharalab.org/em-surfer/
(Esquivel-Rodriguez et al.,
BMC Bioinformatics 2015)
Patch-Surfer 2.0: Local Patch-Based
Pocket Comparison Method
17
(Sael & Kihara, Proteins, 2012)
(Zhu, Xiong, & Kihara, Bioinformatics, 2015)
Patch-Surfer Retrieval Results for
Flexible Ligands: FAD and NAD
18
flavin adenine
dinucleotide (FAD) FAD
Nicotinamide adenine
dinucleotide(NAD)
1cqx 1jr8 1e8g 1k87 1mi3 1s7g
Patch-based: 3rd
Global pocket: 31st
Patch-based: 1st
Global pocket: 18th
Patch-based: 2nd
Global pocket: 16th
Combined Computational &
Experimental Approach
19(Zeng et al., J Proteome Res, 2016)
PL-PatchSurfer2: Local Surface-
Based Virtual Screening
Shin, Christoffer, Wang, & Kihara, J Chem Inf. Model (2016) Shin, & Kihara, Methods in Mol. Biol. (2018)
Sequence-Based Function Prediction
(Jain & Kihara, Bioinformatics, 2018)
22
Features Considered:
• GO annotations (GO)
• PPI network (PPI)
• gene expression profiles
(GI)
• phylogenetic profiles (PE)
• genetic interactions (GI)
• disordered protein regions
(DOR)
• graph properties of PPI
DextMP: Finding Moonlighting Proteins
from Literature Text Information
23
Text Level
Prediction
Protein Level
Prediction
(Khan, Bhuiyan, & Kihara, Bioinformatics 2017)
iGFP: Group Function Prediction from
Functional Relevance Networks with
Conditional Random Field
24(Khan, Jain, Rawi, Bensmail, & Kihara, Bioinformatics 2018)
Research Group
http://kiharalab.org@kiharalab 25

More Related Content

Similar to Kiharalab Bioinformatics Projects 2019

Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum Computation
KAMAL CHOUDHARY
 
CBE_Symposium_Poster_Aparajita - sjp
CBE_Symposium_Poster_Aparajita - sjpCBE_Symposium_Poster_Aparajita - sjp
CBE_Symposium_Poster_Aparajita - sjp
Aparajita Dasgupta
 
Development of algorithm for identification of maligant growth in cancer usin...
Development of algorithm for identification of maligant growth in cancer usin...Development of algorithm for identification of maligant growth in cancer usin...
Development of algorithm for identification of maligant growth in cancer usin...
IJECEIAES
 
Data Mining Protein Structures' Topological Properties to Enhance Contact Ma...
Data Mining Protein Structures' Topological Properties  to Enhance Contact Ma...Data Mining Protein Structures' Topological Properties  to Enhance Contact Ma...
Data Mining Protein Structures' Topological Properties to Enhance Contact Ma...
jaumebp
 
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINEPROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
ijsc
 
Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine  Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine
ijsc
 
Bits protein structure
Bits protein structureBits protein structure
Bits protein structure
BITS
 
The Infobiotics Contact Map predictor at CASP9
The Infobiotics Contact Map predictor at CASP9The Infobiotics Contact Map predictor at CASP9
The Infobiotics Contact Map predictor at CASP9
jaumebp
 
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
Purdue University
 
COMPRESSION BASED FACE RECOGNITION USING DWT AND SVM
COMPRESSION BASED FACE RECOGNITION USING DWT AND SVMCOMPRESSION BASED FACE RECOGNITION USING DWT AND SVM
COMPRESSION BASED FACE RECOGNITION USING DWT AND SVM
sipij
 
Swaati pro sa web
Swaati pro sa webSwaati pro sa web
Swaati pro sa web
Swati Kumari
 
Machine Learning for Molecules
Machine Learning for MoleculesMachine Learning for Molecules
Machine Learning for Molecules
Ichigaku Takigawa
 
Protein secondary structure prediction by a neural network architecture with...
Protein secondary structure prediction by a neural network  architecture with...Protein secondary structure prediction by a neural network  architecture with...
Protein secondary structure prediction by a neural network architecture with...
IJECEIAES
 
Presentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali ShahPresentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali Shah
guest5de83e
 
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
ijbbjournal
 
Towards the comparative analysis of genomic variants with Jalview
Towards the comparative analysis of genomic variants with JalviewTowards the comparative analysis of genomic variants with Jalview
Towards the comparative analysis of genomic variants with Jalview
Jim Procter
 
Lgm saarbrucken
Lgm saarbruckenLgm saarbrucken
Lgm saarbrucken
Yasuo Tabei
 
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
Cresset
 
sp2016ren
sp2016rensp2016ren
sp2016ren
Danny Vo
 
Powerpoint
PowerpointPowerpoint
Powerpoint
butest
 

Similar to Kiharalab Bioinformatics Projects 2019 (20)

Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum Computation
 
CBE_Symposium_Poster_Aparajita - sjp
CBE_Symposium_Poster_Aparajita - sjpCBE_Symposium_Poster_Aparajita - sjp
CBE_Symposium_Poster_Aparajita - sjp
 
Development of algorithm for identification of maligant growth in cancer usin...
Development of algorithm for identification of maligant growth in cancer usin...Development of algorithm for identification of maligant growth in cancer usin...
Development of algorithm for identification of maligant growth in cancer usin...
 
Data Mining Protein Structures' Topological Properties to Enhance Contact Ma...
Data Mining Protein Structures' Topological Properties  to Enhance Contact Ma...Data Mining Protein Structures' Topological Properties  to Enhance Contact Ma...
Data Mining Protein Structures' Topological Properties to Enhance Contact Ma...
 
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINEPROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
PROTEIN STRUCTURE PREDICTION USING SUPPORT VECTOR MACHINE
 
Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine  Protein Structure Prediction Using Support Vector Machine
Protein Structure Prediction Using Support Vector Machine
 
Bits protein structure
Bits protein structureBits protein structure
Bits protein structure
 
The Infobiotics Contact Map predictor at CASP9
The Infobiotics Contact Map predictor at CASP9The Infobiotics Contact Map predictor at CASP9
The Infobiotics Contact Map predictor at CASP9
 
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
Protein docking by LZerD, KiharaLab at CAPRI meeting 2016
 
COMPRESSION BASED FACE RECOGNITION USING DWT AND SVM
COMPRESSION BASED FACE RECOGNITION USING DWT AND SVMCOMPRESSION BASED FACE RECOGNITION USING DWT AND SVM
COMPRESSION BASED FACE RECOGNITION USING DWT AND SVM
 
Swaati pro sa web
Swaati pro sa webSwaati pro sa web
Swaati pro sa web
 
Machine Learning for Molecules
Machine Learning for MoleculesMachine Learning for Molecules
Machine Learning for Molecules
 
Protein secondary structure prediction by a neural network architecture with...
Protein secondary structure prediction by a neural network  architecture with...Protein secondary structure prediction by a neural network  architecture with...
Protein secondary structure prediction by a neural network architecture with...
 
Presentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali ShahPresentation 2007 Journal Club Azhar Ali Shah
Presentation 2007 Journal Club Azhar Ali Shah
 
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
Structural Studies of Aspartic Endopeptidase pep2 from Neosartorya Fisherica ...
 
Towards the comparative analysis of genomic variants with Jalview
Towards the comparative analysis of genomic variants with JalviewTowards the comparative analysis of genomic variants with Jalview
Towards the comparative analysis of genomic variants with Jalview
 
Lgm saarbrucken
Lgm saarbruckenLgm saarbrucken
Lgm saarbrucken
 
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
Mark Mackey, Cresset, 'Meet Molecular Architect, A new product for understand...
 
sp2016ren
sp2016rensp2016ren
sp2016ren
 
Powerpoint
PowerpointPowerpoint
Powerpoint
 

More from Purdue University

CASP14 Data Assisted Modeling (KIharalab)
CASP14 Data Assisted Modeling (KIharalab)CASP14 Data Assisted Modeling (KIharalab)
CASP14 Data Assisted Modeling (KIharalab)
Purdue University
 
Predicting Assembly Order of Multimeric Protein Complexes
Predicting Assembly Order of Multimeric Protein ComplexesPredicting Assembly Order of Multimeric Protein Complexes
Predicting Assembly Order of Multimeric Protein Complexes
Purdue University
 
DextMP: Text mining for finding moonlighting proteins
DextMP: Text mining for finding moonlighting proteinsDextMP: Text mining for finding moonlighting proteins
DextMP: Text mining for finding moonlighting proteins
Purdue University
 
Discovery of Ligand-Protein Interactome
Discovery of Ligand-Protein InteractomeDiscovery of Ligand-Protein Interactome
Discovery of Ligand-Protein Interactome
Purdue University
 
Kihara Lab protein structure prediction performance in CASP11
Kihara Lab protein structure prediction performance in CASP11Kihara Lab protein structure prediction performance in CASP11
Kihara Lab protein structure prediction performance in CASP11
Purdue University
 
Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016
Purdue University
 
Flexscore: Ensemble-based evaluation for protein Structure models
Flexscore: Ensemble-based evaluation for protein Structure modelsFlexscore: Ensemble-based evaluation for protein Structure models
Flexscore: Ensemble-based evaluation for protein Structure models
Purdue University
 

More from Purdue University (7)

CASP14 Data Assisted Modeling (KIharalab)
CASP14 Data Assisted Modeling (KIharalab)CASP14 Data Assisted Modeling (KIharalab)
CASP14 Data Assisted Modeling (KIharalab)
 
Predicting Assembly Order of Multimeric Protein Complexes
Predicting Assembly Order of Multimeric Protein ComplexesPredicting Assembly Order of Multimeric Protein Complexes
Predicting Assembly Order of Multimeric Protein Complexes
 
DextMP: Text mining for finding moonlighting proteins
DextMP: Text mining for finding moonlighting proteinsDextMP: Text mining for finding moonlighting proteins
DextMP: Text mining for finding moonlighting proteins
 
Discovery of Ligand-Protein Interactome
Discovery of Ligand-Protein InteractomeDiscovery of Ligand-Protein Interactome
Discovery of Ligand-Protein Interactome
 
Kihara Lab protein structure prediction performance in CASP11
Kihara Lab protein structure prediction performance in CASP11Kihara Lab protein structure prediction performance in CASP11
Kihara Lab protein structure prediction performance in CASP11
 
Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016Kihara Bioinformatics Lab Research Summary 2016
Kihara Bioinformatics Lab Research Summary 2016
 
Flexscore: Ensemble-based evaluation for protein Structure models
Flexscore: Ensemble-based evaluation for protein Structure modelsFlexscore: Ensemble-based evaluation for protein Structure models
Flexscore: Ensemble-based evaluation for protein Structure models
 

Recently uploaded

Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
Vandana Devesh Sharma
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
Shashank Shekhar Pandey
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
Carl Bergstrom
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
Advanced-Concepts-Team
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
Sérgio Sacani
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
cathode ray oscilloscope and its applications
cathode ray oscilloscope and its applicationscathode ray oscilloscope and its applications
cathode ray oscilloscope and its applications
sandertein
 
fermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptxfermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptx
ananya23nair
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
frank0071
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
vluwdy49
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
PirithiRaju
 
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptxLEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
yourprojectpartner05
 
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Sérgio Sacani
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Leonel Morgado
 
Alternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart AgricultureAlternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart Agriculture
International Food Policy Research Institute- South Asia Office
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
International Food Policy Research Institute- South Asia Office
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
hozt8xgk
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
Sciences of Europe
 
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills MN
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
Areesha Ahmad
 

Recently uploaded (20)

Compexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titrationCompexometric titration/Chelatorphy titration/chelating titration
Compexometric titration/Chelatorphy titration/chelating titration
 
HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1HOW DO ORGANISMS REPRODUCE?reproduction part 1
HOW DO ORGANISMS REPRODUCE?reproduction part 1
 
The cost of acquiring information by natural selection
The cost of acquiring information by natural selectionThe cost of acquiring information by natural selection
The cost of acquiring information by natural selection
 
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
ESA/ACT Science Coffee: Diego Blas - Gravitational wave detection with orbita...
 
The debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically youngThe debris of the ‘last major merger’ is dynamically young
The debris of the ‘last major merger’ is dynamically young
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
cathode ray oscilloscope and its applications
cathode ray oscilloscope and its applicationscathode ray oscilloscope and its applications
cathode ray oscilloscope and its applications
 
fermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptxfermented food science of sauerkraut.pptx
fermented food science of sauerkraut.pptx
 
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
Juaristi, Jon. - El canon espanol. El legado de la cultura española a la civi...
 
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
在线办理(salfor毕业证书)索尔福德大学毕业证毕业完成信一模一样
 
Methods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdfMethods of grain storage Structures in India.pdf
Methods of grain storage Structures in India.pdf
 
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptxLEARNING TO LIVE WITH LAWS OF MOTION .pptx
LEARNING TO LIVE WITH LAWS OF MOTION .pptx
 
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
Candidate young stellar objects in the S-cluster: Kinematic analysis of a sub...
 
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
Describing and Interpreting an Immersive Learning Case with the Immersion Cub...
 
Alternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart AgricultureAlternate Wetting and Drying - Climate Smart Agriculture
Alternate Wetting and Drying - Climate Smart Agriculture
 
Direct Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart AgricultureDirect Seeded Rice - Climate Smart Agriculture
Direct Seeded Rice - Climate Smart Agriculture
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
 
Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)Sciences of Europe journal No 142 (2024)
Sciences of Europe journal No 142 (2024)
 
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
Travis Hills of MN is Making Clean Water Accessible to All Through High Flux ...
 
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of ProteinsGBSN - Biochemistry (Unit 6) Chemistry of Proteins
GBSN - Biochemistry (Unit 6) Chemistry of Proteins
 

Kiharalab Bioinformatics Projects 2019

  • 1. dkihara@purdue.edu 1 Bioinformatics Projects in Kiharalab 2019 Daisuke Kihara Professor Department of Biological Sciences Department of Computer Science Purdue University, USA dkihara@purdue.edu http://kiharalab.org
  • 2. Active Projects  Protein-protein docking prediction  Protein structure prediction  Protein structure modeling for cryo- EM  Protein structure/EM map database search  Virtual drug screening  Protein function prediction  Moonlighting proteins 2
  • 3. LZerD 3 normal vector 3DZernike descriptor 6Å (Venkatraman, Yang, Sael, & Kihara, BMC Bioinformatics, 10: 407, 2009) Available at: http://kiharalab.org/proteindocking/
  • 4. 3D Zernike Descriptors (3DZD)  An extension of spherical harmonics based descriptors  A 3D object can be represented by a series of orthogonal functions, thus practically represented by a series of coefficients as a feature vector  Compact  Rotation invariant 4 A surface representation of 1ew0A (A) is reconstructed from its 3D Zernike invariants of the order 5, 10, 15, 20, and 25 (B-F). (Sael & Kihara, 2009) ),()(),,(  m lnl m nl YrRrZ  ),( m lY )(rRnl ),,( rZ m nl : Spherical harmonics, : radial functions polynomials in Cartesian coordinates    14 3 .)()( x xxx dZf m nl m nl Zernike moments: Zernike Descriptor: 2 )( m nl lm lm nlF    
  • 5. Multi-LZerD 5 Esquivel-Rodríguez, J et al. Proteins, 2012) Shape- based score Physics- based score
  • 6. Multiple Docking Examples 6 RMSD: 0.53Å RMSD: 0.73Å RMSD: 1.09Å RMSD: 1.57Å
  • 7. Assembly Order Prediction for Protein Complexes 7 Urease accessary complex: Dimer of UreF/UreH dimer is formed first, which binds to G dimer Peterson LX, Togawa Y, Esquivel-Rodriguez J, Terashi G, Christoffer C, et al. (2018) Modeling the assembly order of multimeric heteroprotein complexes. PLOS Computational
  • 8. The number of votes for assembly pathways of 4hi0 across generations of the genetic algorithm 8 Peterson LX, Togawa Y, Esquivel-Rodriguez J, Terashi G, Christoffer C, et al. (2018) Modeling the assembly order of multimeric heteroprotein complexes. PLOS Computational Biology 14(1): e1005937. https://doi.org/10.1371/journal.pcbi.1005937 http://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1005937
  • 9. IDR-LZerD Overview Disordered sequence Protein structure Docked fragments Docked fragment combinatio ns DIEGLVELLNRVQSSGAHDQR DIEGLVELL QSSGAHDQR ELLNRVQSS Refined models 3 1 2 4 Predicted fragment structures 1. Predict multiple (30) conformations for each 9 residue-window based on sequence propensity 2. Dock fragments 3. Combine docked fragments to create full model 4. Refine 9 (Peterson, Roy, Christoffer, Terashi, Kihara, PLOS Comp. Biol. 2017)
  • 10. Combining Docked Fragments 4,500 4,500 4,500 Seq. window A Seq. window B Seq. window C Docked fragments 10 Eliminate pairs based on distance and angle cutoffs:
  • 11. Example of Unbound Predictions 11 PDB ID 1jya 69 AA Rank 2 L-RMSD 9.9 Å I-RMSD 5.4 Å PDB ID 4ah2 20 AA Rank 1 L-RMSD 5.8 Å I-RMSD 2.4 Å PDB ID 1ijj 25 AA Rank 9 L-RMSD 5.1 Å I-RMSD 2.5 Å PDB ID 1l3e 44 AA Rank 3 L-RMSD 7.4 Å I-RMSD 6.3 Å Model Native
  • 12. MAINMAST: De novo Structure Modeling for medium (~4 Å) Resolution Maps EM Map Tracing C⍺ model EMD-6555 Porcine circovirus 2 Resolution: 2.9 Å De-novo modeling: does not require known protein structures. Fully automated: No need for visual inspection and human intervention. 12 (G. Terashi & D. Kihara, Nature Communications, 2018)
  • 13. MAINMAST: (MAINchain Model trAcing from Spanning Tree) The longest path (red) does not always cover the whole protein structure. 13 EM map Find local dense points (LDPs) by Mean Shift Connect All points by Minimum Spanning Tree Refine Tree Structure Thread sequence on the longest path C⍺ models ranked by threading score
  • 14. Rapid & Seamless Screening of Similar/ Complementary 3D Molecules 14
  • 15. d 15http://kiharalab.org/3d-surfer/ (La et al., Bioinformatics 2009) (Xiong et al., Methods in Mol Biol, 2013)
  • 17. Patch-Surfer 2.0: Local Patch-Based Pocket Comparison Method 17 (Sael & Kihara, Proteins, 2012) (Zhu, Xiong, & Kihara, Bioinformatics, 2015)
  • 18. Patch-Surfer Retrieval Results for Flexible Ligands: FAD and NAD 18 flavin adenine dinucleotide (FAD) FAD Nicotinamide adenine dinucleotide(NAD) 1cqx 1jr8 1e8g 1k87 1mi3 1s7g Patch-based: 3rd Global pocket: 31st Patch-based: 1st Global pocket: 18th Patch-based: 2nd Global pocket: 16th
  • 19. Combined Computational & Experimental Approach 19(Zeng et al., J Proteome Res, 2016)
  • 20. PL-PatchSurfer2: Local Surface- Based Virtual Screening Shin, Christoffer, Wang, & Kihara, J Chem Inf. Model (2016) Shin, & Kihara, Methods in Mol. Biol. (2018)
  • 21. Sequence-Based Function Prediction (Jain & Kihara, Bioinformatics, 2018)
  • 22. 22 Features Considered: • GO annotations (GO) • PPI network (PPI) • gene expression profiles (GI) • phylogenetic profiles (PE) • genetic interactions (GI) • disordered protein regions (DOR) • graph properties of PPI
  • 23. DextMP: Finding Moonlighting Proteins from Literature Text Information 23 Text Level Prediction Protein Level Prediction (Khan, Bhuiyan, & Kihara, Bioinformatics 2017)
  • 24. iGFP: Group Function Prediction from Functional Relevance Networks with Conditional Random Field 24(Khan, Jain, Rawi, Bensmail, & Kihara, Bioinformatics 2018)