ESQC 2013 Poster - Anders Christensen

Inferential protein structure determination using
chemical shifts derived from quantum mechanics
Anders S. Christensen, Troels E. Linnet, Mikael Borg, Wouter Boomsma, Kresten
Lindorff-Larsen, Thomas Hamelryck, Jan H. Jensen
Hybrid RHF/MP2 geometry optimizations in the
Effective Fragment Molecular Orbital Method
Anders S. Christensen, Casper Steinmann, Dmitri G. Fedorov, Jan H. Jensen
andersx@nano.ku.dk
EFMO-RHF:MP2 geometry
optimizations
Energy of a molecular system in the effective fragment
potential:
EEFMO
=
I
E0
I +
RIJ≤Rresdim
I>J


∆E0
IJ − EPOL
IJ



+
RIJ>Rresdim
I>J
EES
+ EPOL
tot . (1)
In the Frozen Domain approximation:
EEFMO
= E0
F + E0
A + E0
F/A + EPOL
tot , (2)
Figure 1: The geometry of the F layer is frozen during
optimization. The active layer A is optimized the RHF
level, except for the S fragment at MP2.
This gives the EFMO-RHF:MP2 energy:
EEFMO−RHF:MP2
= E0,RHF
F + E0,RHF
A + E0,RHF
F/A (3)
+EPOL
tot + E0,MP2
S∈A , (4)
with the gradient(implemented in GAMESS):
∂EEFMO-RHF:MP2
∂xA
=
∂EEFMO
∂xA
+
∂EMP2
S∈A
∂xA
(5)
Results and discussions
OH
O
O O-
O
O-
O
O-
O
O
O-
HO
OH
O
O
O-
-
O
O
1
4
2
3 3
2
1
4
1
4
3
2
1 2 3
Figure 2: Claisen rearrangement of prephanate to choris-
mate.
Table 1: Electronic energy barrier for the conversion
of prephanate to chorismate in Chorismate Mutase and
the corresponding reaction coordinate for the transition
state using EFMO-RHF:MP2 or ONIOM-RHF:MP2 (from
Steinmann et al).
Method MP2 basis R(TS) Barrier
[kcal/mol]
EFMO 6-31G(d) -0.17 Å 20.95
EFMO cc-pVDZ -0.43 Å 19.21
EFMO cc-pVTZ -0.43 Å 18.34
ONIOM 6-31G(d) 0.13 Å 22.24
ONIOM cc-pVDZ -0.36 Å 19.75
ONIOM cc-pVTZ 0.13 Å 21.79
ONIOM cc-pVQZ 0.13 Å 21.68
Acknowlegements
Collaboration:
Jan H. Jensen (KU/Chem), Casper Steinmann (SDU),
Dimitri G. Fedorov (IAST/Japan)
Inferential protein structure
determination
Calculating protein chemical shifts
Protein chemical shifts can be approximated as:
δ = ∆δBB(φ, ψ, {χn}) + ∆δHB + ∆δRC (6)
where δBB(φ, ψ, {χn}) is a term that depends on the tor-
sion angles, ∆δHB is a sum that depends on the local
hydrogen bonding network, and ∆δRC describes perturba-
tion from ring-currents in nearby aromatic residues. The
individual terms are parametrized from nearly 1,000,000
QM calculations and implemented in the open-source
ProCS program (part of the PHAISTOS frame work).
Sampling protein structures
Protein structures are sampled from a Bayesian posterior
distribution:
p (X| {δexp
i } , I) =
p ({δexp
i } |X, I) p (X|I)
p ({δexp
i } , I)
(7)
which relates the sampled structure, X to a set of mea-
sured experimental chemical shifts, {δexp
i }, and other prior
information, I (e.g. amino acid sequence, etc).
Calculating likelihood
The agreement between a structure X and a set of mea-
sured experimental chemical shifts, {δexp
i }, is calculated
using a Gaussian error model:
p ({δexp
i } |X, {σi}) =
n
i=1











1
2πσ2
i
exp −
(∆δi)2
2σ2
i











, (8)
where ∆δi is the difference between experimental chemical
shifts for the i’th nucleus and the chemical shifts calcu-
lated using Eq. 6. σi denotes the uncertainty of the pre-
dicted value of δi. Our program supports the QM based
ProCS model and the empirical CamShift model.
p (X|I) can be evaluated using a molecular mechanics
force field energy, EFF:
p (X|I) = exp −
EFF
kBT
(9)
Our program supports the OPLS/AA-L force field with a
GB/SA solvent model, and the corse-grained PROFASI
force field.
Results and discussions
Folding of Protein G (56 residues) takes less than a day
with the developed code.
Figure 3: Protein G, from unfolded to folded state. The
blue structure is the experimental NMR structure (PDB:
2OED), and green are snapshots from sampling using the
OPLS-AA/L force field with a GB/SA solvent model and
chemical shifts from CamShift.
Average hydrogen bond length in three proteins
Figure 4: Structures are sampled using the OPLS-AA/L
force field with a GB/SA solvent model and amide proton
chemical shifts from either ProCS or CamShift. Structures
sampled with ProCS show better agreement with exper-
imental data than structures sampled using CamShift or
no chemical shift restraints.
Acknowlegements
Funding:
Novo Nordisk STAR Program
Collaboration and students:
Jan H. Jensen (KU/Chem), Thomas Hamelryck
(KU/Binf), Jens Breinholt (Novo), Kresten Lindorff-
Larsen (KU/NMR), Stephan P. A. Sauer (KU/Chem),
Anders Larsen (KU/Nano), Maher Channir (KU/Nano),
Lars Brathol (KU/Nano), Wouter Boomsma (KU/Binf),
Simon Olsson (KU/Binf), Mikael Borg (KU/Binf)
References
[1] Steinmann C, Fedorov D G, Jensen J H (2010) Effective
fragment molecular orbital method: A merger of the effective
fragment potential and fragment molecular orbital methods. J
Phys Chem A 114:8705-8712.
[2] Hybrid RHF/MP2 geometry optimizations with the Effective
Fragment Molecular Orbital Method (2013) Christensen A S,
Steinmann C, Fedorov D G, Jensen JH arXiv preprint
arXiv:1305.0676
[3] Steinmann C, Fedorov D G, Jensen J H (2013) Mapping
Enzymatic Catalysis using the Effective Fragment Molecular
Orbital Method: Towards all ab initio Biochemistry. PLoS ONE
8(4): e60602.
[4] Christensen A S, Linnet T E, Borg M, Boomsma W,
Lindorf-Larsen K, Hamelryk T, Jensen J H (2013) Protein
structure validation and refinement using amide proton
chemical shifts derived from quantum mechanics PLOS ONE,
accepted arXiv:1305.2164
[5] Boomsma W, et al. (2013) PHAISTOS: A Framework for
Markov Chain Monte Carlo Simulation and Inference of Protein
Structure J Comp Chem (in press, DOI: 10.1002/jcc.23292).
(QR link to this poster)
Blog: http://combichem.blogspot.com/
ESQC, September 2013, Italy

ESQC 2013 Poster - Anders Christensen

Recommended

Recommended

More Related Content

What's hot

What's hot (20)

Similar to ESQC 2013 Poster - Anders Christensen

Similar to ESQC 2013 Poster - Anders Christensen (20)

ESQC 2013 Poster - Anders Christensen