Bio Linux


Published on

Published in: Technology
1 Like
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

Bio Linux

  1. 1. LIVE BIOTOOLS Through”VIGYAAN” Palani Kannan. K AU-KBC Research Centre
  2. 2. Bio Linux <ul><li>It is a Linux system customized for bioinformatics analysis and development work. </li></ul><ul><li>Most of them are open source. </li></ul>
  3. 3. Classification based on Usage <ul><li>Package Repositories </li></ul><ul><li>It is a stored form of Operating Packages in bioinformatics applications. It is available as whole operating system supported by Linux environment with bioinformatics applications and tools. </li></ul><ul><li>Eg. </li></ul>
  4. 4. Classification based on Usage <ul><li>Live DVD or CD </li></ul><ul><li>It is a temporarily mountable operating environment for bioinformatics applications. The tools and applications are mounted along with the operating environment. </li></ul><ul><li>Eg. </li></ul>
  5. 5. Need of Bio Linux <ul><li>Provision of complete systems </li></ul><ul><li>Provision of bioinformatics software repositories </li></ul><ul><li>Addition of bioinformatics packages to standard distributions </li></ul><ul><li>Live DVD/CDs with bioinformatics software added </li></ul><ul><li>Community building and support systems </li></ul>
  6. 6. LIVE Vigyaan <ul><li>Live environment </li></ul><ul><li>Based on Linux environment (Knoppix) </li></ul><ul><li>Tools and applications of computational biology and Computations Chemistry. </li></ul><ul><li>Provides tools required to compile and install other applications </li></ul>
  7. 7. Vigyaan Demo… <ul><li>Artemis, Genome viewer : It loads a genome for viewing. In this demo a local copy of genome is loaded, however Artemis can also load sequences from the Internet. </li></ul><ul><li>ClustalX, Sequence alignment : It calls clustalx with a filename as argument. The filename is PPIases, which contains sequence of 10 proteins in FASTA format. To perform sequence alignment from top menu select: Alignment -> Do Complete Alignment. On the pop-up box click on ALIGN. </li></ul><ul><li>Ghemical, MD (small molecule): This demo loads sample.mm1gp. To add hydrogens click mouse right button on black area and from the pop-up menu select:Build -> Hydrogens -> Add. To optimize the structure from the pop-menu select: Compute -> Geometry Optimization. Click OK. To perform molecular dynamics: Compute -> Molecular Dynamics. </li></ul>
  8. 8. Vigyaan Demo… <ul><li>GROMACS, MD (peptide in water): This demo prepares and performs molecular dynamics for a peptide in water. </li></ul><ul><li>Jmol, Quantum chemistry calculation visualization : This demo loads an output file from GAMESS-US run (Hessian calculation for proton between 2 water molecules). Use Extras -> Vibrate to view different normal modes. </li></ul><ul><li>Open Babel, Structure file format conversion : This demo converts a protein structure from PDB format to XYZ format using command babel. </li></ul>
  9. 9. Vigyaan Demo… <ul><li>NJPlot, Phylogenetic tree creation : This demo create a phylogenetic tree based on a sample input file. </li></ul><ul><li>PSI3, Quantum chemistry calculation : This demo runs a quantum chemistry calculation using PSI3 package. The calculation performed is SCF optimization of CH4 with DZP basis set. </li></ul><ul><li>PyMOL, Structure visualization and high-quality image rendering : This demo loads a sample project with a protein and 3 conformations of a peptide in the active-site. To change the display use options from S (show), H (hide), L (label) and color menu. To render a high-quality image use ray option in the main menu. To save image use from main menu: File -> Save Image. </li></ul>
  10. 10. Vigyaan Demo… <ul><li>Rasmol, Structure visualization: This demo loads a protein for visualization. You can use mouse to rotate the molecule. Display and colors can be changed from top menu. </li></ul><ul><li>Raster3D, High quality image rendering (for biomolecules) : This demo creates a high quality image of a protein active-site. </li></ul><ul><li>TINKER, Molecular modeling : This demo creates a TINKER file from PDB file for a protein, performs a single point energy calculation and computes molecular volume and surface area. </li></ul>
  11. 12. GROMACS Groningen Machine for Chemical Simulations
  12. 13. Advantage <ul><li>Though designed to calculate bonded interactions, GROMACS is extremely fast at calculating non-bonded interactions; </li></ul><ul><li>GROMACS provides extremely high performance compared to other programs </li></ul><ul><li>GROMACS comes with a large selection of flexible tools for trajectory analysis; </li></ul><ul><li>GROMACS is user-friendly, with topologies and parameter files written in clear text format </li></ul>
  13. 14. molecular dynamics (MD) a computer simulation technique where the time evolution of a set of interacting atoms is followed by integrating their equations of motion In molecular dynamics we follow the laws of classical mechanics, and most notably Newton's law: F=ma Here, m is the atom mass, a its acceleration, and F the force acting upon it, due to the interactions with other atoms.
  14. 15. <ul><li>pdb : </li></ul><ul><ul><li>Protein data bank format </li></ul></ul><ul><li>gro: </li></ul><ul><ul><li>Gromacs format (atom co-ordinates) </li></ul></ul><ul><li>Information in the columns, from left to right: </li></ul><ul><li>residue number,residue name,atom name,atom number </li></ul><ul><li>x, y, and z position, in nm, x, y, and z velocity, in nm/ps </li></ul><ul><li>itp: </li></ul><ul><ul><li>atom topologies (charges, mass, radii, etc) </li></ul></ul><ul><li>top: </li></ul><ul><ul><li>topology file contains all the force field parameters force fields, number of molecules, water, etc </li></ul></ul>Input Files
  15. 16. Creating Input Files <ul><li>Converting pdb to gro: </li></ul><ul><li>Pdb2gmx </li></ul><ul><ul><li>reads in a pdb file and allows the user to chose a forcefield </li></ul></ul><ul><ul><li>reads some database files to make special bonds (i.e. Cys-Cys) adds hydrogen's to the protein </li></ul></ul><ul><ul><li>generates a coordinate file in Gromacs (Gromos) format (*.gro) and a topology file in Gromacs format (*.top). </li></ul></ul><ul><li>Editconf </li></ul><ul><ul><li>converts gromacs files (*.gro) back to pdb files (*.pdb) allows user to setup the box: </li></ul></ul><ul><ul><li>the user can define the type of box (i.e. cubic, triclinic, octahedron) set the dimensions of the box edges relative to the molecule (-d 0.7 will set the box edges 0.7 nm from the molecule)center the molecule in the box </li></ul></ul>
  16. 17. <ul><li>mdp: </li></ul><ul><ul><li>molecular dynamics simulation parameters allows the user to set up specific parameters for all the calculations that Gromacs performs. </li></ul></ul><ul><li>tpr: </li></ul><ul><ul><li>contains the starting structure of the simulation, the molecular topology file and all the simulation parameters; binary format (all of the above) </li></ul></ul><ul><li>em.mdp file : </li></ul><ul><ul><li>sets the parameters for running energy minimizations; allows you to specify the integrator (steepest descent or conjugate gradients), the number of iterations, frequency to update the neighbor list, constraints, etc. </li></ul></ul><ul><li>md.mdp file : </li></ul><ul><ul><li>sets the parameters for running the molecular dynamics program; </li></ul></ul>Input Files
  17. 18. <ul><li>Solvation of the System </li></ul><ul><li>Genbox </li></ul><ul><li>solvates the box based on the dimensions specified using editconf </li></ul><ul><li>solvates the given protein in the specified solvent (by default SPC- Simple Point Charge water) </li></ul><ul><li>water molecules are removed from the box if the distance between any atom of the solute and the solvent is less than the sum of the VanderWaals radii of both atoms (the radii are read from the database vdwradii.dat) </li></ul><ul><li>Electroneutrality: </li></ul><ul><li>check the total charge. If net charge is not zero, then add counter ions to get neutral system. (select random water molecules and replace with ion). Eg: H2O -> Cl- (remove H1, H2, and rename O as Cl-) or use genion (which does the same thing). </li></ul>
  18. 19. <ul><li>Grompp </li></ul><ul><ul><li>reads a molecular topology file (*.top) and checks the validity of the file </li></ul></ul><ul><ul><li>expands the topology from a molecular description to an atomic description (*.tpr) </li></ul></ul><ul><ul><li>it reads the parameter file (*.mdp), the coordinate file (*.gro) and the topology file (*.top) </li></ul></ul><ul><ul><li>it ouputs a *.tpr file for input into the MD program mdrun </li></ul></ul><ul><ul><li>since *.tpr is a binary file, it can not be read with ‘more’ but it may be read using gmxdump, which prints out the input file in readable format (it also prints out the contents of a *.trr file) </li></ul></ul><ul><li>Mdrun </li></ul><ul><ul><li>performs the Molecular Dynamics simulation </li></ul></ul><ul><ul><li>can also perform Brownian Dynamics, Langevin Dynamics, and Conjugate Gradient or Steepest Descents energy minimization </li></ul></ul><ul><ul><li>reads the *.tpr file, creates neighborlists from that file and calculates the forces. </li></ul></ul><ul><ul><li>globally sums up the forces and updates the positions and velocities. </li></ul></ul><ul><ul><li>outputs at least three types of files: </li></ul></ul><ul><li>(1) trajectory file (*.trr): contains coordinates, velocities, and forces </li></ul><ul><li>(2) structure file (*.gro):contains coordinates and velocities of the last step </li></ul><ul><li>(3) energy file (*.edr): contains energies, temperatures, pressures </li></ul>
  19. 20. <ul><li>gmxcheck : </li></ul><ul><li>gmxcheck reads a trajectory (*.trr) or an energy file (*.edr) and prints out useful information in them. </li></ul><ul><li>g_energy : </li></ul><ul><li>extracts energy components or distance restraint data from an energy file into a *.xvg file (may be read using Xmgr or Grace). </li></ul><ul><li>trjconv : </li></ul><ul><li>allows compression of trajectory file into a *.xtc file that can be analyzed using ngmx </li></ul><ul><li>ngmx : </li></ul><ul><li>-Gromacs trajectory viewer </li></ul><ul><li>- plots a 3-D structure of the molecule </li></ul><ul><li>- allows rotation, scaling, translation, labels on atoms, animation of trajectories, etc. </li></ul>
  20. 21. Output files: <ul><li>trr: contains the trajectory data for the simulation; binary format. </li></ul><ul><li>It contains all the coordinates, velocities, forces and energies as was indicated the mdp file. </li></ul><ul><li>xtc: portable format for trajectories which stores the information about the trajectories of the atoms in a compact manner (it only contains cartesian coordinates). </li></ul><ul><li>edr: portable file that contains the energies </li></ul><ul><li>log: CPU time, MFLOP, etc. </li></ul>
  21. 22. PSI3 is a program system and development platform for ab initio molecular electronic structure computations. The PSI3 suite of quantum chemical programs is designed for efficient, high-accuracy calculations of properties of small to medium-sized molecules. It’s capabilities include a variety of Hartree-Fock, coupled cluster, complete-active-space self-consistent-field, and multi-reference configuration interaction models. Molecular point-group symmetry is utilized throughout to maximize efficiency. Non-standard computations are possible using a customizable input format. PSI3 can perform ab initio computations employing basis sets of up to 32768 contracted Gaussian-type functions of virtually arbitrary orbital quantum number. PSI3 can recognize and exploit the largest Abelian subgroup of the point group describing the full symmetry of the molecule.
  22. 23. It includes mature programming interfaces for parsing user input, accessing commonly used data such as basis-set information or molecular orbital coefficients, and retrieving and storing binary data especially multi-index quantities such as electron repulsion integrals. This platform is useful for the rapid implementation of both standard quantum chemical methods, as well as the development of new models. Features that have already been implemented include Hartree-Fock, multiconfigurational self-consistent-field, second-order Møller-Plesset perturbation theory, coupled cluster, and configuration interaction wave functions. Distinctive capabilities include the ability to employ Gaussian basis functions with arbitrary angular momentum levels; linear R12 second-order perturbation theory; coupled cluster frequency-dependent response properties, including dipole polarizabilities and optical rotation; and diagonal Born-Oppenheimer corrections with correlated wave functions.
  23. 24. Thank You