Computational Studies of Proteins and Nucleic AcidsOn pKa Calculations in RNA and the Use of Structure to Improve Sequence AlignmentsMay 13, 2008 – Christopher L. Tang, ICRC 816, Columbia University
Electrostatics and RNAMetal ion binding sites in23S ribosomal RNAMisra & Draper, 2002
Ribozyme Function is Highly Varied(b)(d)(c)(a)(h)(g)(f)(e)Ribozymes use only nucleic acid groups to perform catalysisRibozymes. (a) 50S ribosome (b) Ribonuclease P, (c) Group I intron, (d) glms, (e) Diels-Alderase, (f) Hepatitis delta virus, (g) Hairpin, (h) Hammerhead
ribozymedelta antigenThe Hepatitis Delta VirusReplication of HDV genome employs catalysis by two related ribozymesHDV genomic RNAcomplement  ribozymeI.   HDV genome (G) replicatesGcomplement strand cleavedreligationCIII.   complement (C) replicatesgenome strand cleavedMM Lai, 1995
HDVR function involves hydrolysis of an internal phosphodiester bondPRODUCTREACTANT:+H2’:5’+CCHShih & Been, Ann Rev. Bioch., 2002
Cleavage site of HDVRwithin 10 ÅKeet al., 2004
2.8 ÅHDVR ACTIVE SITEPRODUCT5’OH(Nakano & Bevilacqua, 2000)
HDVR active site structure is suggestive of catalytic mechanismPRODUCTPRODUCTREACTANTU-1U-1Not observedC+75C75 G 1 G 1+CCHShih & Been, Ann Rev. Bioch., 2002
PRODUCT2’5’— O – H :O –—BASE CATALYZEDACID CATALYZEDSummary of mechanism_H+2’5’— O: –:H – O —+CCH(Nakano & Bevilacqua, 2000)
2’5’OHO–— O – H :O –—BASE CATALYZEDACID CATALYZEDH+2’5’— O: –:C75H – O —+CCHpH
Not observed !~6~7kobs+ free OHO–pHObserved reaction rate dependence on pH(Perrottaet al., 2007)???C75
Solution pKas of nucleotides Ade and Cyt nucleotides are poor basesAde  Ade (+)(3.8)Cyt Cyt(+)(4.3)+H(pKaref)Izattet al., 1970pH1478910111213654321+CCH% ionized
Framework for calculating pKa shift I: Reference and Structure pKas IsolatednucleotideH+H+––pKaref++–++–––++++DpKaStructure++++–––––––++++––pKastruct––++––++++++++–++
Framework for calculating pKa shift II: Effects due to Salt–––H+H+––
Potentials can be solved using classical equations of continuum electrostaticsPoisson-Boltzmann equationpairwise interactionsmobile ion interactionswater interactions (solvation)eext
Framework for calculating pKa shift III: Free energyBADpKa =  (DG AB (struct) – DG AB (ref)) / 2.3kBT
+H+CCH
pKa calculations using MCCE (Alexov & Gunner)Calculating pKas in RNA with Multiple Titratable groups0002N states001B010011…100AC+1=111+1+1The potential at any single site (A) depends on the protonation state of all other sites, (B and C) This problem is solved using the Multiple Conformation Continuum Electrostatics (MCCE) method to identify and sample low energy states
Origin of the Error using the Linear Approximation for Highly Charged SystemsLINEAR PBENONLINEAR PBE++++LINEAR CASENONLINEAR CASE0102
Correction of the Error due to Nonlinearity is a Function that Depends on Protonation StateDG ABNL (struct) = DG ABLPB (struct) + DG AB (corr APPROX)Tang, Alexov, Pyle & Honig 2007
Partial charges were derived for A+ & C+Tang, Alexov, Pyle & Honig 2007
RESULTSTang, Alexov, Pyle & Honig 2007
Calculated and experimental pKas are consistentBranch Point Helix(12 conf)A135.5A76.1pKaA7A8A13A16A17A18A25BPHSmith & Niconowicz, 1998Tang, Alexov, Pyle & Honig 2007
Calculated and experimental pKas are consistentA16Lead-Dependent Ribozyme (25 conf)3.8A173.83.5A18pKaA84.3A256.5A7A8A13A16A17A18A25Salt dependence of A25 pKain leadzymeLeadzymeBPHTang, Alexov, Pyle & Honig 2007Legault & Pardi, 1997
HDVR exhibits a deep cleft proximal to C75 and the scissile bondP1P2P3C75P1.1P4U1A2.8 ÅNakano et al., 2001G1C75
pKa shifts of Adenosines and Cytidines in HDVR DpKaC75C412.8 ÅG1C75Tang, Alexov, Pyle & Honig 2007
The pKa of C75 is elevated above its solution pKaDesolvation opposes the shift but the potential acting on C75 due to phosphate charges is much stronger Interactions with Phos. only:	– 9.2 kcal/molInteractions with solvent:	+ 1.5 kcal/mol– 7.7 kcal/mol
pKa shifts of Adenosines and Cytidines in HDVR C41DpKaC75C412.8 ÅG1C75Tang, Alexov, Pyle & Honig 2007
G73C44A43C+41Modifying the C41 nucleotide interaction can abrogate pH-dependent HDV activityA73U44C41A43Nakano et al., 2007
Bell-shaped pH curve: Revisitedbasekobs+ free C41 !C75pHkobs
G28G12C26C13A12A25C10C8C41 is part of a commonly found structural motifIn all cases where the motif is found, the folded RNA is stable at higher pH than expected given the solution pKa of CG73C44A43C41BWYVPEMV	calc pKaHDVR	10.6BWYV	13.7PEMV	10.6Determined by NMR at pH = 6Crystallized at pH = 7, RTpH at Tm ½ = 7pH at Tm ½ = 7Tang et al. 2007*pH ½ (Su, Rich et al., 1998)(Nixon, Giedrocet al., 2000; 2002)
A1005+ : C963A45+ : U115How common are pKa-shifted nucleotides?Nucleotides probably protonated in H. marismortui 50S ribosomal subunit (pdb: 1ffk)identified using a hydrogen bond distance rule (acc-acc dist. < 3.0 Å):Results (3.0 Å cutoff)C1026+ : G940C1069+ : phos
ConclusionsEstablished a method for calculating pKas of nucleotides of RNAA version of MCCE 2.4 can now perform calculations on RNANucleotides with shifted pKasActive sitesConserved motifspKa shifts can be understood in terms of structureSome geometric rules can be established for finding pKa-shifted nucleotides50S ribosomal subunit, H. morismortui
Barry Honig
Acknowledgements  Emil Alexov, Anna Marie Pyle, Lei Xie, BurkhardRost, Larry  Shapiro, Diana Murray, An-Suei Yang, Marilyn Gunner  Lucy Forrest, ShoshanaPosy, Markus Fischer,   Remo Rohs, Donald Petrey, Jiang Zhu, Jeremie Vendome,KelyNorel, Brian Chen, Rachel Kolodny, Trevor Siggers,  Cinque Soto, Andy Kuziemko, Sean West, Peng Liu,   Andrew Kernytsky, AvnerSchlessinger, Henry Bigelow,  Michael, Christine, Kaz, Dariusz, Claudia, Marco, Sven, Raj… and many other members of the Honig and Rost  labs, andfriends of us, both past and present.Yifan Song and Junjun Mao for their help with MCCE 2.4  My roommates Chintan and Sean, for putting up with my lapses on the household chores  Dr. Ron Liem, Katie Rosa, ZaiaSivo, Fred Loweff, and the Integrated and Biochemistry officesBarry Honig
John Tang1942-1988
P15’ O:<3.0 ÅC75G1P1.1C41
tRNAmRNAribosometRNA 	> 1958 (Hoagland, et. al)mRNA 	> 1961 (Brenner, Jacob, Meselson)
Framework for calculating pKa shift IV: Electrostatic potentialBut f in solution must involve:Pr+PPairwise interactionsSolvent interactionsMobile ions
Given the chemical reaction: Define:  Ka = acid dissociation constantDefine:  Then: from the Law of Mass Action: Then:Similarlyfor bases: and:
Derivation of DpKaGiven the reactions:BB+ (unfolded); andBB+ (folded);What is DpKa?DpKa=pKa(folded) – pKa(unfolded)DG BB+ (unfolded) 	= – 2.3 kT ( pKa (unfolded) – pH )DG BB+ (folded) 	= – 2.3 kT ( pKa (folded) – pH )DG BB+ (folded) – DG BB+ (unfolded) 		= – 2.3 kT ( pKa (folded) – pKa (unfolded) )DpKa = 1/(2.3 kT) DDG BB+ (folded – unfolded)1 DpK unit = 1.36 kcal/mol = 2.3kT.
Calculated and experimental pKas are consistentpKaA8A7A13A16A17A18A25Salt dependence of A25 pKain leadzyme1Lead-dependent2ribozymeBPH1Smith & Niconowicz, 1998Legault & Pardi, 1997
A+CA+- C pair, leadzyme pH 5.5C+CC+- C pair, DNA pH 6Protonated nucleotides in RNA structuresUC
Branch Point Helix(12 conf)A13A10C14A7C15A6A17C3C20C21
+H+CH?
Poisson-Boltzmann equationTwo forms for Function F (j):Linear: suitable for low charge systems like proteins, low saltNonlinear: necessary for high charge systems that require salt
Mutagenesis and rescue experimentsNakano et al., 2007 and Perrottaet al., 2007baseC75baseC41+ free C75
320K 347K
G73C44*A43C41Nucleotides in analogous structures have similar pKas BWYV Y-knotHDV ribozymeHDVRBWYVNixon & GiedrocNixon et al.
G28G12C13C26G73C44A12A25C10C8A43C41C14G74G123.8 ÅG733.9 ÅC8C41U72C11C10C9G59Analogous structures are stabilized by conserved interactions13.7(9.3-10.3)10.2 (9.9)10.6BWYV-yPEMV-yHDVRU29G28C10A27U9A262.9 Å
Biology of HDV and function of HDV ribozymeHepatitis delta virus
Obligate satellite virus of Hepatitis B virus
Circular ss-RNA genome, unique in animal viruses & reminiscent of plant satellite viruses.
Genome replicates by rolling circle mechanism
HDV ribozyme
Two forms, present on genomic and anti-genomic genome
Self-cleaving activity is essential for HDV replication cycle – trims multimeric strands into monomeric unitsGenome structure of HDVMMC Lai, The molecular biology of the Hepatitis Delta Virus, 1995
Rosenstein & Been, 1991genomic strandHepatitis Delta Virus (HDV)and its Ribozymes688compliment strand (antigenomic)904I.   genomic strand (G) replicatesGribozyme (gen) cleaves &	compliment circularizesAGIII.   compliment (AG) replicatesribozyme (anti) cleaves &	genome circularizes…
HDVRApplication in a system where pKas have not been directly measuredHepatitis Delta Virus ribozymeCritical for the replication of delta virus RNA genomeStructure12 crystal structures of the precursor & product exist – Ferre-D’Amare & Doudna; Ke et al.2.3-3.5 Å resolutionNested, double Y-knot topology5 major helical sectionsBiochemistryA77, A78, C75 (J4/2)C41 (J1.1/4 – P4)pH-dependent reaction kineticsSalt-dependent reaction kinetics
Structural roles for ionized nucleotides in RNAExample: BWYV H-type Y-knotFirst crystallized by Su et al., 1999 to 1.6 ÅResponsible for a -1 frame-shifting event that controls expression of alternative gene products in various virusesTertiary structure is well-stabilized, beyond typical H-type knotsMeasured stability exceeds that expected by considering only 2° structureFormation of 3° structure contributes more H-bonds than the formation 2° structureFolding of the Y-knot studied by Nixon & GiedrocpH-dependent (transition at pH 6.8)Attributed to C8-G12-C26-A25 interactionSubstitution of C8 with U reduces stabilityNixon & Giedroc, 2000
Why is this RNA substructure conserved?GC+CThree properties of this substructure:Rotation per turn deviates from 25° to almost 90°Base stacking is preserved through both sides of C+GC plane6 hydrogen-bonds, probably most stabile isostere with this structure
O5’C75G1U20O3’,O1PC22O2PC21pKa calculations identify active site residues**A78A77C75G1(C21,C22)
Mechanism Proposed: Ke et al., Nature 2004
Independent tests confirm partial charges derived for nucleotides are consistent w/ exp.New charges for A and C are similar to those in standard FFsCalculated hydration free energy yielded good agreement w/ exp.Hydration free energy depends on cavitation + solvationExperimental hydration free energies: (rms error depends on r-scaling)DGtr(vw)(9-methyladenine), DGtr(vcf)(9-methyladenine), DGtr(vcf)(1-methyluracil)(v=vacuum e=1, w=water e=80, cf=chloroform e=4.8) – Wolfenden & colleagues, 1998Used for pKa calculationsSitkoff, Sharp & Honig
Nonlinear contribution is mostly dependent on total charge of the RNAn = # of protonsCorrection for nonlinearity: E(Coulomb) + E(Solvation) + E(Ions) + ENL(n) = Etotal
Structural elements in RNAUCBase pair structureDouble HelixHairpin TurnSecondary structure
622 RNA-only structures in the RCSB1132 ribozyme structuresUUCC1PDB statistics page:http://www.rcsb.org
P2A78U-1A77C75J4/2G1C22C21P1.1G39G38C75 is located in the active site of HDVR
A cluster of phosphates surrounds the catalytic C75
In biology…perturbed pKas are observed in distinct & diverse biological processes
Splice site recognition: branch-point helix motif(RNA binding of viral matrix proteins)
Bulged DNA duplexes and triple helices
Structural: tRNA(Lys)
Functional: Frameshifting pseudoknots
Shifts codon reading frames: BWYV, PEMV, PLRVtRNA(Lys)(Durant & Davis, 1999)Beet Western Yellows Viruspseudoknot (Su et al., 1999)
How much versatility in just 4 bases?UUAetc…UUHoogsteenWatson CrickCOther Pairs ObservedCC
Functional rolesStructural roles2.8 ÅC75↑Hypothetical mechanism for catalysis & product state of HDVR ↑BWYV pseudoknot &HDV ribozyme RNA with same local structure↓Vanadate-stabilized transition/product states of HR obs. at 2.4 Å↓G73C44*A43C41
SolvationPairwise interactionsMobile IonspKa shifts depend on electrostatic properties we can calculate~ water ~
++LINEAR:  fsalt = k1 * (2q/r)         = k1 * 2 * (q/r)NONLINEAR:  fsalt = k2 * sinh (2q/r) ≠ k2 * 2 * sinh (q/r)
3’3’3’5’5’
~32°+HUU+CCCH
RNApolymeraseribosome

Computational molecular biophysics of ribonucleic acids and ribozymes

  • 1.
    Computational Studies ofProteins and Nucleic AcidsOn pKa Calculations in RNA and the Use of Structure to Improve Sequence AlignmentsMay 13, 2008 – Christopher L. Tang, ICRC 816, Columbia University
  • 2.
    Electrostatics and RNAMetalion binding sites in23S ribosomal RNAMisra & Draper, 2002
  • 3.
    Ribozyme Function isHighly Varied(b)(d)(c)(a)(h)(g)(f)(e)Ribozymes use only nucleic acid groups to perform catalysisRibozymes. (a) 50S ribosome (b) Ribonuclease P, (c) Group I intron, (d) glms, (e) Diels-Alderase, (f) Hepatitis delta virus, (g) Hairpin, (h) Hammerhead
  • 4.
    ribozymedelta antigenThe HepatitisDelta VirusReplication of HDV genome employs catalysis by two related ribozymesHDV genomic RNAcomplement ribozymeI. HDV genome (G) replicatesGcomplement strand cleavedreligationCIII. complement (C) replicatesgenome strand cleavedMM Lai, 1995
  • 5.
    HDVR function involveshydrolysis of an internal phosphodiester bondPRODUCTREACTANT:+H2’:5’+CCHShih & Been, Ann Rev. Bioch., 2002
  • 6.
    Cleavage site ofHDVRwithin 10 ÅKeet al., 2004
  • 7.
    2.8 ÅHDVR ACTIVESITEPRODUCT5’OH(Nakano & Bevilacqua, 2000)
  • 8.
    HDVR active sitestructure is suggestive of catalytic mechanismPRODUCTPRODUCTREACTANTU-1U-1Not observedC+75C75 G 1 G 1+CCHShih & Been, Ann Rev. Bioch., 2002
  • 9.
    PRODUCT2’5’— O –H :O –—BASE CATALYZEDACID CATALYZEDSummary of mechanism_H+2’5’— O: –:H – O —+CCH(Nakano & Bevilacqua, 2000)
  • 10.
    2’5’OHO–— O –H :O –—BASE CATALYZEDACID CATALYZEDH+2’5’— O: –:C75H – O —+CCHpH
  • 11.
    Not observed !~6~7kobs+free OHO–pHObserved reaction rate dependence on pH(Perrottaet al., 2007)???C75
  • 12.
    Solution pKas ofnucleotides Ade and Cyt nucleotides are poor basesAde  Ade (+)(3.8)Cyt Cyt(+)(4.3)+H(pKaref)Izattet al., 1970pH1478910111213654321+CCH% ionized
  • 13.
    Framework for calculatingpKa shift I: Reference and Structure pKas IsolatednucleotideH+H+––pKaref++–++–––++++DpKaStructure++++–––––––++++––pKastruct––++––++++++++–++
  • 14.
    Framework for calculatingpKa shift II: Effects due to Salt–––H+H+––
  • 15.
    Potentials can besolved using classical equations of continuum electrostaticsPoisson-Boltzmann equationpairwise interactionsmobile ion interactionswater interactions (solvation)eext
  • 16.
    Framework for calculatingpKa shift III: Free energyBADpKa = (DG AB (struct) – DG AB (ref)) / 2.3kBT
  • 17.
  • 18.
    pKa calculations usingMCCE (Alexov & Gunner)Calculating pKas in RNA with Multiple Titratable groups0002N states001B010011…100AC+1=111+1+1The potential at any single site (A) depends on the protonation state of all other sites, (B and C) This problem is solved using the Multiple Conformation Continuum Electrostatics (MCCE) method to identify and sample low energy states
  • 19.
    Origin of theError using the Linear Approximation for Highly Charged SystemsLINEAR PBENONLINEAR PBE++++LINEAR CASENONLINEAR CASE0102
  • 20.
    Correction of theError due to Nonlinearity is a Function that Depends on Protonation StateDG ABNL (struct) = DG ABLPB (struct) + DG AB (corr APPROX)Tang, Alexov, Pyle & Honig 2007
  • 21.
    Partial charges werederived for A+ & C+Tang, Alexov, Pyle & Honig 2007
  • 22.
  • 23.
    Calculated and experimentalpKas are consistentBranch Point Helix(12 conf)A135.5A76.1pKaA7A8A13A16A17A18A25BPHSmith & Niconowicz, 1998Tang, Alexov, Pyle & Honig 2007
  • 24.
    Calculated and experimentalpKas are consistentA16Lead-Dependent Ribozyme (25 conf)3.8A173.83.5A18pKaA84.3A256.5A7A8A13A16A17A18A25Salt dependence of A25 pKain leadzymeLeadzymeBPHTang, Alexov, Pyle & Honig 2007Legault & Pardi, 1997
  • 25.
    HDVR exhibits adeep cleft proximal to C75 and the scissile bondP1P2P3C75P1.1P4U1A2.8 ÅNakano et al., 2001G1C75
  • 26.
    pKa shifts ofAdenosines and Cytidines in HDVR DpKaC75C412.8 ÅG1C75Tang, Alexov, Pyle & Honig 2007
  • 27.
    The pKa ofC75 is elevated above its solution pKaDesolvation opposes the shift but the potential acting on C75 due to phosphate charges is much stronger Interactions with Phos. only: – 9.2 kcal/molInteractions with solvent: + 1.5 kcal/mol– 7.7 kcal/mol
  • 28.
    pKa shifts ofAdenosines and Cytidines in HDVR C41DpKaC75C412.8 ÅG1C75Tang, Alexov, Pyle & Honig 2007
  • 29.
    G73C44A43C+41Modifying the C41nucleotide interaction can abrogate pH-dependent HDV activityA73U44C41A43Nakano et al., 2007
  • 30.
    Bell-shaped pH curve:Revisitedbasekobs+ free C41 !C75pHkobs
  • 31.
    G28G12C26C13A12A25C10C8C41 is partof a commonly found structural motifIn all cases where the motif is found, the folded RNA is stable at higher pH than expected given the solution pKa of CG73C44A43C41BWYVPEMV calc pKaHDVR 10.6BWYV 13.7PEMV 10.6Determined by NMR at pH = 6Crystallized at pH = 7, RTpH at Tm ½ = 7pH at Tm ½ = 7Tang et al. 2007*pH ½ (Su, Rich et al., 1998)(Nixon, Giedrocet al., 2000; 2002)
  • 32.
    A1005+ : C963A45+: U115How common are pKa-shifted nucleotides?Nucleotides probably protonated in H. marismortui 50S ribosomal subunit (pdb: 1ffk)identified using a hydrogen bond distance rule (acc-acc dist. < 3.0 Å):Results (3.0 Å cutoff)C1026+ : G940C1069+ : phos
  • 33.
    ConclusionsEstablished a methodfor calculating pKas of nucleotides of RNAA version of MCCE 2.4 can now perform calculations on RNANucleotides with shifted pKasActive sitesConserved motifspKa shifts can be understood in terms of structureSome geometric rules can be established for finding pKa-shifted nucleotides50S ribosomal subunit, H. morismortui
  • 34.
  • 35.
    Acknowledgements EmilAlexov, Anna Marie Pyle, Lei Xie, BurkhardRost, Larry Shapiro, Diana Murray, An-Suei Yang, Marilyn Gunner Lucy Forrest, ShoshanaPosy, Markus Fischer, Remo Rohs, Donald Petrey, Jiang Zhu, Jeremie Vendome,KelyNorel, Brian Chen, Rachel Kolodny, Trevor Siggers, Cinque Soto, Andy Kuziemko, Sean West, Peng Liu, Andrew Kernytsky, AvnerSchlessinger, Henry Bigelow, Michael, Christine, Kaz, Dariusz, Claudia, Marco, Sven, Raj… and many other members of the Honig and Rost labs, andfriends of us, both past and present.Yifan Song and Junjun Mao for their help with MCCE 2.4 My roommates Chintan and Sean, for putting up with my lapses on the household chores Dr. Ron Liem, Katie Rosa, ZaiaSivo, Fred Loweff, and the Integrated and Biochemistry officesBarry Honig
  • 37.
  • 39.
  • 40.
    tRNAmRNAribosometRNA > 1958(Hoagland, et. al)mRNA > 1961 (Brenner, Jacob, Meselson)
  • 41.
    Framework for calculatingpKa shift IV: Electrostatic potentialBut f in solution must involve:Pr+PPairwise interactionsSolvent interactionsMobile ions
  • 42.
    Given the chemicalreaction: Define: Ka = acid dissociation constantDefine: Then: from the Law of Mass Action: Then:Similarlyfor bases: and:
  • 43.
    Derivation of DpKaGiventhe reactions:BB+ (unfolded); andBB+ (folded);What is DpKa?DpKa=pKa(folded) – pKa(unfolded)DG BB+ (unfolded) = – 2.3 kT ( pKa (unfolded) – pH )DG BB+ (folded) = – 2.3 kT ( pKa (folded) – pH )DG BB+ (folded) – DG BB+ (unfolded) = – 2.3 kT ( pKa (folded) – pKa (unfolded) )DpKa = 1/(2.3 kT) DDG BB+ (folded – unfolded)1 DpK unit = 1.36 kcal/mol = 2.3kT.
  • 46.
    Calculated and experimentalpKas are consistentpKaA8A7A13A16A17A18A25Salt dependence of A25 pKain leadzyme1Lead-dependent2ribozymeBPH1Smith & Niconowicz, 1998Legault & Pardi, 1997
  • 48.
    A+CA+- C pair,leadzyme pH 5.5C+CC+- C pair, DNA pH 6Protonated nucleotides in RNA structuresUC
  • 49.
    Branch Point Helix(12conf)A13A10C14A7C15A6A17C3C20C21
  • 50.
  • 52.
    Poisson-Boltzmann equationTwo formsfor Function F (j):Linear: suitable for low charge systems like proteins, low saltNonlinear: necessary for high charge systems that require salt
  • 53.
    Mutagenesis and rescueexperimentsNakano et al., 2007 and Perrottaet al., 2007baseC75baseC41+ free C75
  • 54.
  • 55.
    G73C44*A43C41Nucleotides in analogousstructures have similar pKas BWYV Y-knotHDV ribozymeHDVRBWYVNixon & GiedrocNixon et al.
  • 56.
    G28G12C13C26G73C44A12A25C10C8A43C41C14G74G123.8 ÅG733.9 ÅC8C41U72C11C10C9G59Analogousstructures are stabilized by conserved interactions13.7(9.3-10.3)10.2 (9.9)10.6BWYV-yPEMV-yHDVRU29G28C10A27U9A262.9 Å
  • 57.
    Biology of HDVand function of HDV ribozymeHepatitis delta virus
  • 58.
    Obligate satellite virusof Hepatitis B virus
  • 59.
    Circular ss-RNA genome,unique in animal viruses & reminiscent of plant satellite viruses.
  • 60.
    Genome replicates byrolling circle mechanism
  • 61.
  • 62.
    Two forms, presenton genomic and anti-genomic genome
  • 63.
    Self-cleaving activity isessential for HDV replication cycle – trims multimeric strands into monomeric unitsGenome structure of HDVMMC Lai, The molecular biology of the Hepatitis Delta Virus, 1995
  • 65.
    Rosenstein & Been,1991genomic strandHepatitis Delta Virus (HDV)and its Ribozymes688compliment strand (antigenomic)904I. genomic strand (G) replicatesGribozyme (gen) cleaves & compliment circularizesAGIII. compliment (AG) replicatesribozyme (anti) cleaves & genome circularizes…
  • 66.
    HDVRApplication in asystem where pKas have not been directly measuredHepatitis Delta Virus ribozymeCritical for the replication of delta virus RNA genomeStructure12 crystal structures of the precursor & product exist – Ferre-D’Amare & Doudna; Ke et al.2.3-3.5 Å resolutionNested, double Y-knot topology5 major helical sectionsBiochemistryA77, A78, C75 (J4/2)C41 (J1.1/4 – P4)pH-dependent reaction kineticsSalt-dependent reaction kinetics
  • 67.
    Structural roles forionized nucleotides in RNAExample: BWYV H-type Y-knotFirst crystallized by Su et al., 1999 to 1.6 ÅResponsible for a -1 frame-shifting event that controls expression of alternative gene products in various virusesTertiary structure is well-stabilized, beyond typical H-type knotsMeasured stability exceeds that expected by considering only 2° structureFormation of 3° structure contributes more H-bonds than the formation 2° structureFolding of the Y-knot studied by Nixon & GiedrocpH-dependent (transition at pH 6.8)Attributed to C8-G12-C26-A25 interactionSubstitution of C8 with U reduces stabilityNixon & Giedroc, 2000
  • 68.
    Why is thisRNA substructure conserved?GC+CThree properties of this substructure:Rotation per turn deviates from 25° to almost 90°Base stacking is preserved through both sides of C+GC plane6 hydrogen-bonds, probably most stabile isostere with this structure
  • 69.
    O5’C75G1U20O3’,O1PC22O2PC21pKa calculations identifyactive site residues**A78A77C75G1(C21,C22)
  • 70.
    Mechanism Proposed: Keet al., Nature 2004
  • 71.
    Independent tests confirmpartial charges derived for nucleotides are consistent w/ exp.New charges for A and C are similar to those in standard FFsCalculated hydration free energy yielded good agreement w/ exp.Hydration free energy depends on cavitation + solvationExperimental hydration free energies: (rms error depends on r-scaling)DGtr(vw)(9-methyladenine), DGtr(vcf)(9-methyladenine), DGtr(vcf)(1-methyluracil)(v=vacuum e=1, w=water e=80, cf=chloroform e=4.8) – Wolfenden & colleagues, 1998Used for pKa calculationsSitkoff, Sharp & Honig
  • 72.
    Nonlinear contribution ismostly dependent on total charge of the RNAn = # of protonsCorrection for nonlinearity: E(Coulomb) + E(Solvation) + E(Ions) + ENL(n) = Etotal
  • 75.
    Structural elements inRNAUCBase pair structureDouble HelixHairpin TurnSecondary structure
  • 76.
    622 RNA-only structuresin the RCSB1132 ribozyme structuresUUCC1PDB statistics page:http://www.rcsb.org
  • 78.
  • 79.
    A cluster ofphosphates surrounds the catalytic C75
  • 80.
    In biology…perturbed pKasare observed in distinct & diverse biological processes
  • 81.
    Splice site recognition:branch-point helix motif(RNA binding of viral matrix proteins)
  • 82.
    Bulged DNA duplexesand triple helices
  • 83.
  • 84.
  • 85.
    Shifts codon readingframes: BWYV, PEMV, PLRVtRNA(Lys)(Durant & Davis, 1999)Beet Western Yellows Viruspseudoknot (Su et al., 1999)
  • 86.
    How much versatilityin just 4 bases?UUAetc…UUHoogsteenWatson CrickCOther Pairs ObservedCC
  • 87.
    Functional rolesStructural roles2.8ÅC75↑Hypothetical mechanism for catalysis & product state of HDVR ↑BWYV pseudoknot &HDV ribozyme RNA with same local structure↓Vanadate-stabilized transition/product states of HR obs. at 2.4 Å↓G73C44*A43C41
  • 88.
    SolvationPairwise interactionsMobile IonspKashifts depend on electrostatic properties we can calculate~ water ~
  • 89.
    ++LINEAR: fsalt= k1 * (2q/r) = k1 * 2 * (q/r)NONLINEAR: fsalt = k2 * sinh (2q/r) ≠ k2 * 2 * sinh (q/r)
  • 91.
  • 92.
  • 93.
  • 94.
  • 95.
  • 96.
    RNA catalysispolymerase> 1986(Zaug, Cech)Group I intron(Stahley & Strobel, 2005)
  • 97.
  • 98.
    NH’C = OOg1CbH”CaCgNOg2H”’O= COb’CaCbNOb”Standard MCCE (protein-based calculation)Multiple Conformation Continuum ElectrostaticsEach polar or charged residue is treated as a set of rotamersCharges on the backbone & nonpolar aa. are treated as fixedMany possible conformations due to the choice of rotamers at each position (‘microstate’)A microstate energy is defined for each possible conformationComponents of microstate energy solved using linear PB equationTakes advantage of linearity to compute microstate energy1 Alexov & Gunner, 1997
  • 99.
    O3’PO1PO2PO5’O1’A or AH+O3’H–O2’PO1PO2PO5’O1’Cor CH+O3’H–O2’New MCCE (RNA-based calculation)Instead of amino acids, nucleotidesSimple modelAdenosine (A) and cytidine (C) each can have two rotamers, ionized (AH+,CH+) or neutral (A,C)Adenosine: pKaref = 3.8Cytidine: pKaref = 4.3Deal with nonlinearityUse a nonlinear correction to microstate energy (NL-MCCE)Need to estimate partial charges for AH+ and CH+Use an AMBER-like approach
  • 100.
  • 101.
    Enzymatic activity dependson the structural organization of the active siteHis 57Ser 195Asp 102Structural conservation of the active sites in trypsin, chymotrypsin & elastaseCatalytic chemistry depends on the arrangement of the active site residuesArrangement of His 57, Asp 102 & Ser 195 in the active site of serine proteases
  • 102.
    Why might pKacalculations be interesting to the casual biologist?An example using the protein transferrinDifferences between the open and closed forms of transferrin result in dramatic changes to their theoretical titration curvesTheoretical titration curves predicted by THEMATICS(Murga, Ondrechen & Ringe 2008)
  • 103.
    pKas calculations forcatalytic residues are generally accurate in proteinsE.g. for LysozymeExperimental (expt) pKas :Glu 35 : 6.2Asp 52 : 3.7 Calculated (range) pKas :Glu 35 : 5.6 (4.3-7.3)Asp 52 : 3.8 (1.8-6.6) Nielsen et al., 2003
  • 104.
    tRNA ~100ntribosome30S subunit ~3500nt
  • 105.
    Honig LabProtein structurepredictionMarc Fasnacht, Lucy Forrest, Andy Kuziemko, Rachel Kolodny, Peng Liu, Donald Petrey, Cinque Soto, Chris Tang, KelyNorel, Sean West, Li Xie, Jiang Zhu Remo Rohs & others in the Honig & Pyle labsProtein-DNA recognitionRemo Rohs, Tonia Silkov, Alona SosinskyShana Posy & Jeremie Vendome (cadherins), Mickey Kosloff (GPCRs)Columbia, Integrated / Biochemistry & BiophysicsZaia Sivo, Katie Rosa, Ron Liem, Stacey Warren, Rachel Hernandez, Ed Johnson

Editor's Notes

  • #2 RNA stands for ribonucleic acid
  • #4 A particular kind of RNA we’d like to study is the ribozymeCatalyze enzymatic activitySpecific shapesActive sites[glms?]In particular Ribozymes (d), (f) and (h) undergo general acid-base catalysisDeinococcusradiodurans 50S
  • #5 HDVsuperinfects Hepatitis B
  • #14 isolated
  • #16 … so we can put this all together to calculate pKas. But several challenges may complicate our desire to do this…
  • #19 Why is this complex? Microstates…
  • #22 http://images.google.com/imgres?imgurl=http://www.beilstein-institut.de/bozen2000/proceedings/clark/figure1.gif&imgrefurl=http://www.beilstein-institut.de/bozen2000/proceedings/clark/clark.htm&h=288&w=599&sz=40&hl=en&start=42&um=1&tbnid=ssl0rgEVUbSXxM:&tbnh=65&tbnw=135&prev=/images%3Fq%3Dkollman%2Bnucleotide%26start%3D40%26ndsp%3D20%26um%3D1%26hl%3Den%26client%3Dfirefox-a%26rls%3Dorg.mozilla:en-US:official%26sa%3DNPartial charges were generated by fitting atom-centered charges to electrostatic potentials (ESP) derived ab initio using the B3LYP/6-31g* level of theory and using the program Gaussian 98 (gaussian.com). Nine calculations were performed: one for each of six ribonucleosides (A, A+, C, C+, G and U), and one for each of three conformations of dimethyl-phosphate (gauche-gauche, gauche-trans and trans-trans). The partial charges on ribose atoms C5’, H5’1, H5’2, C4’, H4’, O4’, C3’, H3’, C2’, O2’, HO2 were made equivalent in all six ribonucleosides by averaging the corresponding partial charges for each atom. Excess charges were redistributed over the atoms C1’, H1’ and N1/9 (nitrogen involved in the glycosidic bond) to ensure the net charge per nucleotide was integral. A single set of partial charges was obtained for the phosphate atoms P, O1P, O2P, O3’, O5’ by averaging the corresponding partial charges in the three conformers. The protons in each pair, H5’1/H5’2 (ribose), H21/H22 (guanosine), H41/H42 (cytidine), H61/H62 (adenosine), were made equivalent by redistributing the partial charge evenly between the two protons. The overall redistribution of charge resulting from this procedure was very small. Partial charges for all remaining nucleobase atoms were not modified. United atoms were created for all RNA hydroxyl groups, O2’/HO2, O3T/H3T (3’ terminus), O5T/H5T (5’ terminus), by summing the partial charge on the oxygen and hydrogen and placing the sum at the coordinates of the oxygen.This procedure produced partial charges that were not significantly different from those of AMBER 94 or ChARMM 27. (See Supplemental Table 2; atom names and nucleotide structures are given in Supplemental Figure 3.)
  • #26 Biochemistry,40 (40), 12022 -12038, 2001. 10.1021/bi011253n S0006-2960(01)01253-3 Web Release Date: September 12, 2001 Copyright © 2001 American Chemical SocietyMechanistic Characterization of the HDV Genomic Ribozyme: Assessing the Catalytic and Structural Contributions of Divalent Metal Ions within a Multichannel Reaction Mechanism Shu-ichi Nakano,David J. Proctor, andPhilip C. Bevilacqua*Department of Chemistry, The Pennsylvania State University, University Park, Pennsylvania 16802 Structural Models and Programs. Electrostatic potentialcalculations were carried out using numerical solutions tothe nonlinear Poisson-Boltzmann (NLPB) equation. Calculationswere carried out using a modified form of DelPhithat utilizes a finite difference multigridding algorithm (18,19). This approach was recently used in reports of NLPBcalculations on RNA (20-22). Structural coordinates wereobtained from the Protein Data Bank (PDB) for the hepatitisdelta virus ribozyme (PDB entry 1cx0) (5), fragment I of5S rRNA (PDB entry 364d) (23), and the P4-P6 domain ofthe Tetrahymena group I intron (PDB entry 1gid) (24).Hydrogen atoms were added using REDUCE (25), and boundmetal ions and water molecules were omitted from the NLPBcalculations.In general, calculations were performed in a fashion similarto that published previously (20). Each atom was placed ina medium with a low dielectric constant (m ) 2) definedby the solvent accessible surface-enclosed volume, whichwas obtained using a probe radius of 1.4 Å. The solvent wastreated as a continuum with a dielectric of 80, containing a1:1 electrolyte. A 2.0 Å ion exclusion radius (Stern layer)was added to the surface of the RNA to approximate ahydrated sodium ion. A salt concentration of 0.15 M wasused in the calculations, except for the P4-P6 domain, whichhad to be calculated with a salt concentration of 0.25 M forthe solution to converge (20). Atomic radii and partialcharges were defined using the cvff91 parameter set fromDISCOVER (MSI). Calculations were performed using a 65 65 65 cubic lattice, and potentials were calculated usinga two-step focusing procedure (26). Initial potentials wereapproximated analytically at lattice points on the boundaryof the grid using the Debye-Hu¨ckel equation (27), andsolutions were obtained for 0, 0.05, 0.10, and 0.15 M saltusing boundary conditions interpolated from the previousstep. Potentials were iterated to a convergence of <1 10-4kT/e. Three-dimensional structures and electrostaticpotentials were rendered using GRASP (28). Figures aredisplayed using scaled potentials ranging linearly from-40kT/e to 40kT/e to accommodate the large negative chargeof the ribozyme.
  • #32 437d crystallized to (1.6 Å) resolution
  • #41 RNA is centralDiagram of Central DogmaGenetic information encoded on DNA, passed on to future generations by replicationIn order to express the code, The message must be transferred to another molecule called RNAIn particular messenger RNA (in proc called transcription)Which, through their interactions with transfer RNA and (A cellular machine called) the ribosome,The message can be translated into proteinsHoagland, M. B., Stephenson, M. L., Scott, J. F., Hecht, L. I.& Zamecnik, P. C. A soluble ribonucleic acid intermediatein protein synthesis. J. Biol. Chem. 231, 241–257 (1958)
  • #54 Biochemistry – simplify !!
  • #71 RCSB statisticshttp://www.rcsb.org/pdb/static.do?p=general_information/pdb_statistics/index.html&
  • #76 RCSB statisticshttp://www.rcsb.org/pdb/static.do?p=general_information/pdb_statistics/index.html&
  • #79 Figure 4. Vanadate is selectively bound by enzymes that catalyze phosphate transesterification. (a) Detail of the active site of the tyrosyl-DNA phosphodiesterase Tdp1 in complex with a DNA-protein adduct and vanadate.[37] The vanadate makes two apical coordinations to the nucleophile and leaving groups (amino acids with asterisk). An equatorial ligand is contributed by the DNA. Each of the two remaining equatorial oxygens hydrogen bonds to a lysine and an asparagine side chain of the enzyme (one of the asparagines is omitted from the figure for clarity). (b) Detail of the active site of the hairpin ribozyme-vanadate complex.[39] The three purine residues in blue (from stem A) and red (from stem B) make a total of five hydrogen bonds to the vanadate. It is likely that two active site residues carry positive charges as a result of protonation (see text). Figures 1-4 were prepared with RIBBONS.[52]Figure 5. Mechanisms of a protein and an RNA enzyme that catalyze the same RNA cleavage reaction. (a) Ribonuclease A employs general acid-base catalysis using two histidines with near-neutral pKa, and also stabilizes the transition state with the positively charged side chain of a lysine.[4] Numbers in parentheses indicate the magnitude of loss of catalytic activity upon mutation of the histidines (to alanine) or the lysine (to cysteine). (b) The hairpin ribozyme active site with the four purine nucleobases implicated in catalysis. Dashed lines indicate hydrogen bonds made between the ribozyme and the vanadate transition-state mimic. Numbers in parentheses indicate the magnitude of loss of catalytic activity upon deletion of the nucleobase. Characterization of an A38 abasic ribozyme has not been reported.
  • #82 http://images.google.com/imgres?imgurl=http://www.beilstein-institut.de/bozen2000/proceedings/clark/figure1.gif&imgrefurl=http://www.beilstein-institut.de/bozen2000/proceedings/clark/clark.htm&h=288&w=599&sz=40&hl=en&start=42&um=1&tbnid=ssl0rgEVUbSXxM:&tbnh=65&tbnw=135&prev=/images%3Fq%3Dkollman%2Bnucleotide%26start%3D40%26ndsp%3D20%26um%3D1%26hl%3Den%26client%3Dfirefox-a%26rls%3Dorg.mozilla:en-US:official%26sa%3DN
  • #94 Unlike messenger RNA, transfer RNA and ribosomal RNA must fold into specific structures in order for them perform the functions they are called upon to do.These are in fact a small subset of RNAs that must fold into regular structure in order to function.They range in size from 20 ribonucleotides to well over 3000.Shown here are tRNA … and the ribosome …Their shapes are important for function, but so too are their physical propertiesIn particular, RNAs are made of a backbone of phosphate groups that make the RNA highly charged.As a consequence