Energy Minimization Using Gromacs


Published on

Published in: Education, Technology, Business
1 Comment
No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Energy Minimization Using Gromacs

  1. 1. <ul><li>MOLECULAR DYNAMICS SIMULATION </li></ul>Energy Minimization using GROMACS
  2. 2. Intro <ul><li>Proteins are complex, high molecular mass, organic compounds that consist of amino acids joined by peptide bonds. </li></ul><ul><li>Proteins are essential to the structure and function of all living cells. </li></ul><ul><li>Many proteins are enzymes or subunits of enzymes. Other proteins play structural or mechanical roles, such as those that form the structures and joints of cytoskeleton. Still more functions of proteins include the storage and transport of various ligands. </li></ul><ul><li>In nutrition, protein serves as a source of amino acid for organisms that do not synthesize those amino acids natively. </li></ul><ul><li>There are also some more proteins that are pigmented in nature such as melanin. </li></ul>
  3. 3. Simulating a Protein <ul><li>For the generation of a representative equilibrium two methods are available: </li></ul><ul><ul><li>Monte Carlo simulations and </li></ul></ul><ul><ul><li>Molecular Dynamics simulations. </li></ul></ul><ul><li>For the generation of non-equilibrium ensembles and for the analysis of dynamic events, the second method is appropriate. </li></ul><ul><li>While Monte Carlo simulations are more simple than MD (as they do not require the computation of forces), they do not yield significantly better statistics than MD in a given amount of computer time. </li></ul><ul><li>Therefore Molecular Dynamics is the more universal technique for energy minimization. </li></ul><ul><li>Reason to perform an energy minimization is the removal of all kinetic energy from the system: if several ’snapshots’ from dynamic simulations is compared, energy minimization reduces the thermal noise in the structures and unstability, so that they can be compared better. </li></ul>
  4. 4. Molecular Dynamics Simulation <ul><li>A form of computer simulation, wherein atoms and molecules are allowed to interact for a period of time under some laws of physics. </li></ul><ul><li>Calculates the time dependent behaviour of a molecular system. </li></ul><ul><li>Simulations have provided detailed information on the fluctuations and conformational changes of proteins and nucleic acids. </li></ul><ul><li>Used to investigate the structure, dynamics and thermodynamics of biological molecules and their complexes. </li></ul><ul><li>One of the principal tools in the theoritical study of biological molecules. </li></ul>
  5. 5. Significance of MD Simulation <ul><li>MD simulations permit the study of complex, dynamic processes that occur in biological systems. These include, for example; </li></ul><ul><ul><li>Protein Stability </li></ul></ul><ul><ul><li>Conformational changes </li></ul></ul><ul><ul><li>Protein folding </li></ul></ul><ul><ul><li>Molecular recognition: proteins, DNA, membranes, complexes </li></ul></ul><ul><ul><li>Ion transport in biological systems </li></ul></ul><ul><ul><li>Drug Design </li></ul></ul><ul><ul><li>Refinement of the structure determined through X-ray Crystallography and NMR Spectroscopy. </li></ul></ul>
  6. 6. Performing MD Simulation <ul><li>To run a simulation several things are needed like: </li></ul><ul><ul><li>A file containing the coordinates for all atoms </li></ul></ul><ul><ul><li>Information on the interactions (bond angles, charges, Vander Waals) </li></ul></ul><ul><ul><li>Parameters to control the simulation </li></ul></ul><ul><ul><li>Processors to run the simulation </li></ul></ul><ul><li>The PDB file contains the coordinates for all atoms and is the input structure file for MD Simulation. The interactions are listed in the topology (.top) file and the input parameters are put into a .mdp file. </li></ul>
  7. 7. GROMACS <ul><li>GROMACS (Groningen Machine for Chemical Simulations) is a molecular dynamics simulation package originally developed in the University of Groningen. </li></ul><ul><li>GROMACS is an engine to perform molecular dynamics simulations and energy minimization. </li></ul><ul><li>A high performance Molecular Dynamics Program. </li></ul><ul><li>It is reasonably well optimized for low memory usage. </li></ul><ul><li>The highly optimized code makes GROMACS the fastest program for molecular simulations. </li></ul><ul><li>The support of different force fields and the open source (GPL) character make GROMACS very flexible. </li></ul><ul><li>It is a high performance research tool which can be run on almost every platform. </li></ul><ul><li>This program is free software; it can be redistributed and/or modified under the terms of the GNU General Public License as published by the Free Software Foundation.(A free software licence grants, to the recipients, freedoms in the form of permissions to modify or distribute copyrighted work) </li></ul>
  8. 8. Why GROMACS? <ul><li>GROMACS is preferred over other Molecular Dynamics Simulation Software packages because ; </li></ul><ul><ul><li>It is a versatile package to perform molecular dynamics. </li></ul></ul><ul><ul><li>It is user friendly, with topologies, parameter files, and error messages written in clear text format. </li></ul></ul><ul><ul><li>It provides extremely high performance compared to all other programs. </li></ul></ul><ul><ul><li>There is a lot of consistency checking. </li></ul></ul><ul><ul><li>There is no scripting language. </li></ul></ul><ul><ul><li>As the simulation proceeds, GROMACS continuously tells how far it has gone, and at what time and date it is expected to be finished. </li></ul></ul><ul><ul><li>Can write coordinates using compression, which provides a very compact way of storing trajectory data. </li></ul></ul><ul><ul><li>Comes with a large selection of flexible tools for trajectory analysis. </li></ul></ul><ul><ul><li>GROMACS can be run in parallel, using standard MPI (Message Passing Interface). </li></ul></ul>
  9. 9. GROMACS File Formats <ul><li>.pdb </li></ul><ul><li>.top </li></ul><ul><li>.itp </li></ul><ul><li>.rtp </li></ul><ul><li>.ndx </li></ul><ul><li>.xvg </li></ul><ul><li>.mdp </li></ul><ul><li>.tpr </li></ul><ul><li>.edr </li></ul><ul><li>.log </li></ul><ul><li>.trr </li></ul><ul><li>.xtc </li></ul>
  10. 10. Before Performing MD Simulation <ul><li>Molecular Dynamics studies depend on the quality of the available experimental data. To generate an accurate minimized structure, information about its primary, secondary and tertiary structure is needed. </li></ul><ul><li>There are several publically available databases that contain primary, secondary and tertiary structural on proteins. Two particularly important databases are: </li></ul><ul><ul><li>Protein Data Bank (PDB) at </li></ul></ul><ul><ul><li>This is the primary world wide archive of structural data of biological macromolecules. </li></ul></ul><ul><ul><li>Expert Protein Analysis System (ExPASy) at </li></ul></ul><ul><ul><li>This molecular biology server is dedicated to the analysis of protein sequences and structures. </li></ul></ul>
  11. 11. The process Running Simulation using GROMACS
  12. 12. Commands <ul><li>Many commands are there for performing different tasks, here only mentioned the commands used for simulations. </li></ul><ul><li>Just type ‘–h’ to know about all commands </li></ul>Simulating the protein using GROMACS involves these commands (steps)
  13. 13. GROMACS Flow Chart <ul><li>Several steps of energy minimization may be necessary, these consist of cycles: </li></ul><ul><li>grompp -> mdrun. </li></ul>
  14. 14. <ul><li>Generating Topologies and Coordinates </li></ul><ul><li>Running a Simulation </li></ul><ul><li>pdb2gmx: converts pdb files to topology and coordinate files </li></ul><ul><li>x2top: generates a primitive topology from coordinates </li></ul><ul><li>editconf: edits the box and writes subgroups </li></ul><ul><li>genbox: solvates a system </li></ul><ul><li>genion: generates mono atomic ions on energetically favorable positions </li></ul><ul><li>genconf: multiplies a conformation in 'random' orientations </li></ul><ul><li>genrestr: generates position restraints or distance restraints for index groups </li></ul><ul><li>protonate: protonates structures </li></ul><ul><li>grompp: makes a run input file </li></ul><ul><li>tpbconv: makes a run input file for restarting a crashed run </li></ul><ul><li>mdrun: performs a simulation </li></ul>Commands
  15. 15. Commands (Cont.) <ul><li>Many other commands are available for different tasks like=> </li></ul><ul><ul><li>Viewing trajectories </li></ul></ul><ul><ul><li>Processing energies </li></ul></ul><ul><ul><li>Converting files </li></ul></ul><ul><ul><li>Tools </li></ul></ul><ul><ul><li>Distances between structures </li></ul></ul><ul><ul><li>Distances in structures over time </li></ul></ul><ul><ul><li>Mass distribution properties over time </li></ul></ul><ul><ul><li>Analyzing bonded interactions </li></ul></ul><ul><ul><li>Structural properties </li></ul></ul><ul><ul><li>Kinetic properties </li></ul></ul><ul><ul><li>Electrostatic properties </li></ul></ul><ul><ul><li>Protein specific analysis </li></ul></ul><ul><ul><li>Interfaces </li></ul></ul><ul><ul><li>Covariance analysis </li></ul></ul><ul><ul><li>Normal modes </li></ul></ul>
  16. 16. A little more details of; <ul><li>pdb2gmx </li></ul><ul><li>editconf </li></ul><ul><li>genbox </li></ul><ul><li>grompp </li></ul><ul><li>mdrun </li></ul>
  17. 17. Changing the File Format <ul><li>The pdb2gmx command converts the pdb file to a gromacs file and writes the topology. </li></ul><ul><ul><li>pdb2gmx –ignh –ff [] –f [] –o [] –p [] –water spce </li></ul></ul><ul><ul><ul><li>-ignh = ignore hydrogen atoms </li></ul></ul></ul><ul><ul><ul><li>-ff = select the forcefield </li></ul></ul></ul><ul><ul><ul><li>-f = reads the pdb file </li></ul></ul></ul><ul><ul><ul><li>-o = specify new pdb output file </li></ul></ul></ul><ul><ul><ul><li>-p = output for topology file </li></ul></ul></ul><ul><ul><ul><li>-water = uses the SPC/E water model </li></ul></ul></ul><ul><li>A Force Field is used to minimize the bond stretching energy of the molecule. </li></ul>
  18. 18. Setting up the Box for Simulations <ul><li>editconf converts generic structure format to .gro, .g96 or .pdb. </li></ul><ul><ul><li>editconf -bt [] –f [] –o [] –d [] </li></ul></ul><ul><ul><ul><li>-bt = mention the box type eg cubic </li></ul></ul></ul><ul><ul><ul><li>-f = reads the pdb file </li></ul></ul></ul><ul><ul><ul><li>-o = output </li></ul></ul></ul><ul><ul><ul><li>-d = sets the dimension of the box </li></ul></ul></ul><ul><li>It specifies which type of box has to be used. </li></ul><ul><li>The dimensions of the box has to be set based upon setting the box edge approx 1.5 nm from the molecule(s) periphery. </li></ul><ul><li>The box can be modified with options provided. </li></ul><ul><li>The box type can be set : triclinic is a triclinic box, cubic is a rectangular box with all sides equal dodecahedron represents a rhombic dodecahedron and octahedron is a truncated octahedron. </li></ul>
  19. 19. Adding Solvent to the Box <ul><li>genbox generates a box of solvent. </li></ul><ul><ul><li>genbox –cp [] –cs [] –o [] –p [] </li></ul></ul><ul><ul><ul><li>-cp = pdb input file </li></ul></ul></ul><ul><ul><ul><li>-cs = specify water model file </li></ul></ul></ul><ul><ul><ul><li>-o = output </li></ul></ul></ul><ul><ul><ul><li>-p = topology input file </li></ul></ul></ul><ul><li>Adds residues at specific positions if required. </li></ul><ul><li>The genbox command generates the solvent box based upon the dimensions/box type that was specified using editconf. </li></ul>
  20. 20. Processing the file with Preprocessors <ul><li>grompp is the pre-processor program (the gromacs pre-processor), which setups the run for input into mdrun command. </li></ul><ul><li>The pre-processor reads a molecular topology file, checks the validity of the file, expands the topology from a molecular description to an atomic description. </li></ul><ul><li>Steps performed are ; </li></ul><ul><ul><li>st for steepest decents </li></ul></ul><ul><ul><li>cg for conjugate gradient </li></ul></ul><ul><ul><li>md for molecular dynamics </li></ul></ul>
  21. 21. Running Molecular Dynamics <ul><li>Molecular Dynamics Simulation is run using mdrun. </li></ul><ul><li>The mdrun program is the main computational chemistry engine of MD Simulations. </li></ul><ul><li>mdrun produces at least three output file, plus one log file per node. </li></ul><ul><ul><li>The trajectory file, containing coordinates, velocities and optionally forces. </li></ul></ul><ul><ul><li>The structure file, containing the coordinates and velocities of the last step. </li></ul></ul><ul><ul><li>The energy file, contains the information about the energies, the temperature and pressure. </li></ul></ul>
  22. 22. Visualizing of Plots and Minimized Structure <ul><li>Visualization of the trajectory file using various tools. </li></ul><ul><li>Generation and visualization of the plots for pressure, temperature, kinetic energy, etc. </li></ul><ul><li>Finding out the structure that has the minimal energy fluctuations. </li></ul><ul><li>Visualising this minimized structure using graphical tools. </li></ul>
  23. 23. Visualization of Plots <ul><li>The generated plots can be visualized using XMGRACE. </li></ul><ul><li>xmgrace is a tool to make twodimensional plots of numerical data. </li></ul><ul><li>It runs under various flavors of Unix with X11 and M*tif (LessTif or Motif). </li></ul><ul><li>It also runs under VMS, OS/2, and Windows (95/98/NT/2000/XP). </li></ul>
  24. 24. Sample Plots <ul><li>For Potential Energy </li></ul><ul><li>For Computing Root Mean Square Displacement </li></ul>
  25. 25. Visualizing Minimized Structures <ul><li>DS Visualizer </li></ul><ul><li>Rasmol </li></ul><ul><li>Pymol </li></ul><ul><li>SPDBV </li></ul><ul><li>VMD </li></ul>
  26. 26. Sample Structures <ul><li>MINIMIZED STRUCTURES OBTAINED FROM g_covar (Covariance Analysis) </li></ul>
  27. 27. Significance of Minimized Structure <ul><li>Helps to find the minimal energy conformations. </li></ul><ul><li>Helps in better understanding of the nature of complex polymers. </li></ul><ul><li>Helps in better understanding of the mechanism of polymerization of proteins. </li></ul>
  28. 28. Rajendra K. Labala Scientist-1 Thank You