SlideShare a Scribd company logo
Delivered by Publishing Technology to: Sankar Basu
IP: 116.193.141.16 On: Thu, 19 Dec 2013 19:48:07
Copyright: American Scientific Publishers
COMMUNICATION
Copyright © 2013 American Scientific Publishers
All rights reserved
Printed in the United States of America
Journal of
Bioinformatics and Intelligent Control
Vol. 2, 321–323, 2013
SARAMA: A Standalone Suite of Programs for the
Complementarity Plot—A Graphical Structure
Validation Tool for Proteins
Sankar Basu1 ∗
, Dhananjay Bhattacharyya2
, and Rahul Banerjee1 ∗
1
Crystallography and Molecular Biology Division, Saha Institute of Nuclear Physics, Kolkata 700064, India
2
Computational Science Division, Saha Institute of Nuclear Physics, Kolkata 700064, India
Structure validation is a crucial component not only in protein crystallography but also in model
quality estimation in homology modeling, protein design and de-novo structure prediction. Two key
attributes of a correctly determined atomic model are optimal packing between side-chains and
absence of destabilizing unbalanced electric fields within the interior of a protein molecule. The
complementarity plot (CP) combines them in a single unified measure. CP has now been compiled
into a user friendly validation package and made available as a standalone suite of programs in the
public domain (http://www.saha.ac.in/biop/www/sarama.html). The application of CP in the detection
of wrong rotamer assignment has been surveyed.
Keywords: Complementarity, Packing and Electrostatics, Structure Validation.
We report the free availability of a standalone suite of pro-
grams (Sarama) for the Complementarity Plot (Linux Plat-
form) with detailed features and documentation available
at the website: http://www.saha.ac.in/biop/www/sarama.
html. The basic methodology has already been reported.1
Briefly, the Complementarity Plot (CP) estimates the shape
and electrostatic complementarity of interior residues of a
globular protein and is a sensitive indicator of their har-
mony or disharmony with regard to the short and long
range forces sustaining the native fold. A correctly deter-
mined natively folded protein structure should have opti-
mal packing between its buried side-chains and absence of
destabilizing unbalanced electric fields within the interior
of the molecule. CP has already been demonstrated to be
effective in detecting local regions of suboptimal packing
or electrostatics which were found to be highly correlated
to coordinate errors. CP has now been compiled into an
user friendly validation package which should be an use-
ful addition in the already existing repertoire of structure
validation tools. A set of scores have now been included
in the methodology which gives an estimate of the prob-
abilities associated with the distribution of points in the
plot and the propensities of specific residues to different
degrees solvent exposure.
As has been reported previously1
CP requires the sur-
face (Ssc
m ) and electrostatic (Esc
m complementarity to be
computed for buried residues. In this regard, the extent of
∗
Authors to whom correspondence should be addressed.
burial (Bur) of every amino acid residue with respect to
the solvent was estimated by the ratio of the solvent acces-
sible areas (probe radius: 1.4 Å)2
of the residue (X) in the
polypeptide chain to that of an identical residue in a Gly-
X-Gly peptide fragment, in a fully extended conforma-
tion. Only those residues with the burial ratio Bur ≤ 0 30
were henceforth considered for the complementarity plot.
The van der Waals surface was calculated for the entire
polypeptide chain, sampled at 10 dots/Å2 3
and surface
(Ssc
m and electrostatic (Esc
m complementarities calculated
for buried or partially buried side-chains.1 3
For surface complementarity Ssc
m , only side-chain sur-
face points of buried residues (target) were considered and
their nearest neighboring surface points identified from
the rest of the polypeptide chain (within a distance of
3.5 Å). Surface points essentially being area elements are
characterized by their positions (x y z) and the direc-
tion cosines (dl dm dn) of their normals. Then, adapted
from Lawrence and Colman,4
the following expression was
calculated:
S a b = na ·nb ·exp −w d2
ab (1)
where na and nb are two unit normal vectors, correspond-
ing to the dot surface point a (located on the side chain
surface of the target residue) and b (the dot point near-
est to a, within 3.5 Å) respectively, with dab the distance
between them and w, a scaling constant set to 0.5. Ssc
m
was defined as the median of the distribution {S a b }
J. Bioinf. Intell. Control 2013, Vol. 2, No. 4 2326-7496/2013/2/321/003 doi:10.1166/jbic.2013.1059 321
Delivered by Publishing Technology to: Sankar Basu
IP: 116.193.141.16 On: Thu, 19 Dec 2013 19:48:07
Copyright: American Scientific Publishers
COMMUNICATION
SARAMA: A Standalone Suite of Programs for the CP—A Graphical Structure Validation Tool for Proteins Basu et al.
calculated over all the dot surface points of the side-chain
target residue.
For electrostatic complementarity (Esc
m ), the electrostatic
potential of the molecular surface was estimated using
the finite difference Poisson-Boltzmann method as imple-
mented in DelPhi.5
The potential on the side-chain sur-
face points of a buried residue was then computed twice,1
first, due to all atoms of the target residue and second as
a function of all atoms from the rest of the polypeptide
chain (excluding the target). Thus, each surface point was
tagged with two values of electrostatic potential. Follow-
ing McCoy et al.,6
negative of the Pearson’s correlation
coefficient between these two sets of potential values over
the side-chain dot surface points of the target residue was
defined as Esc
m
Esc
m = −
N
i=1 i − ¯ i − ¯
N
i=1 i − ¯ 2 N
i=1 i − ¯ 2 1/2
(2)
where, for a given residue consisting of a total of N side-
chain dot surface points, i is the potential on its ith
point realized due to its own atoms and i , due to the
rest of the protein atoms, ¯ and ¯ are the mean potentials
of i , i = 1 N and i , i = 1 N respectively.
The plot of Ssc
m on the X-axis and Esc
m on the Y -axis
(spanning −1 to 1 in both axes) constitutes the ‘Com-
plementarity Plot’ (CP), which is actually divided into
three plots based on the burial ranges: 0 00 ≤ Bur ≤ 0 05
(CP1), 0 05 < Bur ≤ 0 15 (CP2) and 0 15 < Bur ≤ 0 30
(CP3). Initially, all the buried residues from a training
database (DB2) consisting of 400 highly resolved protein
crystal structures1
were plotted in the CPs, which had been
divided into square-grids (of width 0 05 × 0 05), and the
center of every square grid was assigned an initial prob-
ability (Pgrid equal to the number of points in the grid
divided by the total number of points in the plot. The
probability of a residue to occupy a specific position in
the plot was then estimated by bilinear interpolation from
the probability values of its four nearest neighboring vox-
els. Each CP was contoured based on the initial proba-
bility values (Pgrid ≥ 0 005 for the first contour level and
Pgrid ≥ 0 002 for the second) thus dividing the plot into
three distinct regions. The cumulative probability of locat-
ing a point within the second (outer) contour for the three
plots were 91%, 90%, 88% respectively whereas for the
first (inner) contour, the probability gradually dropped with
increasing solvent exposure (82%, 76%, 71%). Inspired by
the Ramachandran Plot, the region within the first con-
tour was termed ‘probable,’ between the first and second
contour, ‘less probable’ and outside the second contour,
‘improbable’ (Fig. 1).
In such a plot residues with low Ssc
m and Esc
m (<0.2 for
both) are easily identified. The methodology has already
been shown to detect errors in side-chain conformers
in obsolete structures w.r.t. their upgraded counterparts.1
Such side-chains were found to have suboptimal pack-
ing and/or electrostatics and thus predominantly lie in the
Fig. 1. CP1: The complementarity plot for the 1st burial bin. ‘Probable,’
‘less probable’ and improbable’ regions of the plot are colored in purple,
mauve and sky-blue respectively.
improbable region of the plots. An example of such a cal-
culation is given below.
110 pairs of obsolete and their corresponding upgraded
counterparts were assembled from the PDB (ftp://ftp.
wwpdb.org/pub/pdb/data/status/obsolete.dat). In order to
ensure that the upgraded structure was genuinely bet-
ter than its obsoleted counterpart, only those pairs were
selected wherein the improvement in resolution and
R-factor were better than 0.2 Å and 0.02 respectively.
222 deeply buried residues (0 0 ≤ Bur ≤ 0 05) from the
upgraded structures were identified which were origi-
nally found to be located in the probable region of CP1,
and whose counterparts in the corresponding obsolete
structures differed by more than 40 (involving 1 and 2
though belonging to another valid rotamer combination.7
They were then replaced by their corresponding coun-
terparts from the obsolete structures. Subsequent to the
replacement, 45% of the points were relocated in the
improbable region of the plot, 16% were found in the less
probable region whereas 39% were retained in the prob-
able region (Fig. 2). Deviations from the expected dis-
tributions (DB2) were estimated by means of 2
(df =
3–1, probable, less probable, improbable; 2
0 05 = 5 991)
subsequent to the replacement which was found to be
397.63. Thus, CP could have applications when deal-
ing with low-resolution data where automated side-chain
rebuilding methods generally do not work very efficiently.
The Complementarity Plot as a validation technique is
probabilistic in nature and can be utilized either over the
full chain, or on any distribution of points. Further, this is
the only validation procedure which combines both packing
322 J. Bioinf. Intell. Control 2, 321–323, 2013
Delivered by Publishing Technology to: Sankar Basu
IP: 116.193.141.16 On: Thu, 19 Dec 2013 19:48:07
Copyright: American Scientific Publishers
COMMUNICATION
Basu et al. SARAMA: A Standalone Suite of Programs for the CP—A Graphical Structure Validation Tool for Proteins
Fig. 2. Distributions (in CP1) for residues with native side-chain conformers from the upgraded structures and replaced by rotamers from corresponding
obsolete counterparts. (A) Distribution of residues with native side-chains all falling into the probable regions of CP1 and (B) distribution subsequent
to the replacement.
and electrostatics in a single unified measure and displays
graphically (apart from actually listing) residues with faulty
packing and/or electrostatics. Thus, CP should be a useful
addition in the already existing repertoire of structure val-
idation tools. The output of the program gives Ssc
m and Esc
m
of buried residues which can also be used for a wide range
of other applications e.g., fold recognition, analysis of
side-chain packing, detection of unbalanced partial charges
within protein interiors, protein design and modeling.
The website contains detailed documentation of the
different thresholds for successful validation for all the
CP-scores for a given atomic model. The model might
be experimentally or computationally derived but should
definitely contain coordinates of (geometrically fixed)
hydrogen atoms consistent with the format of REDUCE.8
The suite has been successfully tested on Redhat Enter-
prise and open Suse linux platforms with PERL and Fortran
compilers f90, f95, gfortran or ifort. DELPHI5
must be pre-
installed and running under the command: delphi_static.
References and Notes
1. S. Basu, D. Bhattacharyya, and R. Banerjee, Self-complementarity
within proteins: Bridging the gap between binding and folding.
Biophys. J. 102, 2605 (2012).
2. B. Lee and F. M. Richards, The interpretation of protein structures:
Estimation of static accessibility. J. Mol. Biol. 55, 379 (1971).
3. R. Banerjee, M. Sen, D. Bhattacharyya, and P. Saha, The Jigsaw puz-
zle model: Search for conformational specificity in protein interiors.
J. Mol. Biol. 333, 211 (2003).
4. M. C. Lawrence and P. M. Colman, Shape complementarity at pro-
tein/protein interfaces. J. Mol. Biol. 234, 946 (1993).
5. A. Nichollos and B. Honig, A rapid finite difference algorithm, utiliz-
ing successive over-relaxation to solve the Poisson-Boltzmann equa-
tion. J. Comput. Chem. 12, 435 (1991).
6. A. J. McCoy, V. C. Epa, and P. M. Colman, Electrostatic complemen-
tarity at protein/protein interfaces. J. Mol. Biol. 268, 570 (1997).
7. M. S. Shapovalov and R. L. Dunbrack, Jr, A smoothed backbone-
dependent rotamer library for proteins derived from adaptive kernel
density estimates and regressions. Structure 19, 844 (2011).
8. J. M. Word, S. C. Lovell, J. S. Richardson, and D. C. Richardson,
Asparagine and glutamine: Using hydrogen atom contacts in the
choice of side-chain amide orientation. J. Mol. Biol. 285, 1735 (1999).
Received: 10 August 2013. Accepted: 3 September 2013.
J. Bioinf. Intell. Control 2, 321–323, 2013 323

More Related Content

What's hot

Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...researchinventy
 
Variation of Fundamental Constants
Variation of Fundamental ConstantsVariation of Fundamental Constants
Variation of Fundamental Constants
Los Alamos National Laboratory
 
BIOS 203 Lecture 3: Classical molecular dynamics
BIOS 203 Lecture 3: Classical molecular dynamicsBIOS 203 Lecture 3: Classical molecular dynamics
BIOS 203 Lecture 3: Classical molecular dynamicsbios203
 
StephaneValladier - SpecialistPaper - NDEtheory--v2.6
StephaneValladier - SpecialistPaper - NDEtheory--v2.6StephaneValladier - SpecialistPaper - NDEtheory--v2.6
StephaneValladier - SpecialistPaper - NDEtheory--v2.6Stephane Valladier
 
A Global Approach with Cutoff Exponential Function, Mathematically Well Defin...
A Global Approach with Cutoff Exponential Function, Mathematically Well Defin...A Global Approach with Cutoff Exponential Function, Mathematically Well Defin...
A Global Approach with Cutoff Exponential Function, Mathematically Well Defin...
Miltão Ribeiro
 
AFMM Manual
AFMM ManualAFMM Manual
AFMM Manual
Anax Fotopoulos
 
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...Sérgio Sacani
 
Dynamics of Twointeracting Electronsinthree-Dimensional Lattice
Dynamics of Twointeracting Electronsinthree-Dimensional LatticeDynamics of Twointeracting Electronsinthree-Dimensional Lattice
Dynamics of Twointeracting Electronsinthree-Dimensional Lattice
IOSR Journals
 
Hall 2006 problems-encounteredfromuserayleighdamping
Hall 2006 problems-encounteredfromuserayleighdampingHall 2006 problems-encounteredfromuserayleighdamping
Hall 2006 problems-encounteredfromuserayleighdamping
Juan Camacho
 
Sergey seriy thomas fermi-dirac theory
Sergey seriy thomas fermi-dirac theorySergey seriy thomas fermi-dirac theory
Sergey seriy thomas fermi-dirac theorySergey Seriy
 
Calculando o tensor de condutividade em materiais topológicos
Calculando o tensor de condutividade em materiais topológicosCalculando o tensor de condutividade em materiais topológicos
Calculando o tensor de condutividade em materiais topológicos
Vtonetto
 
7.local and global minima
7.local and global minima7.local and global minima
7.local and global minima
Abhijeet Kadam
 
Harmonic Minimization using PSO of meld μgird-MLCI
Harmonic Minimization using PSO of meld μgird-MLCIHarmonic Minimization using PSO of meld μgird-MLCI
Harmonic Minimization using PSO of meld μgird-MLCI
IJERA Editor
 
Metric Projections to Identify Critical Points in Electric Power Systems
Metric Projections to Identify Critical Points in Electric Power SystemsMetric Projections to Identify Critical Points in Electric Power Systems
Metric Projections to Identify Critical Points in Electric Power Systems
theijes
 
Dynamic simulation of three-phase nine-level multilevel inverter with switchi...
Dynamic simulation of three-phase nine-level multilevel inverter with switchi...Dynamic simulation of three-phase nine-level multilevel inverter with switchi...
Dynamic simulation of three-phase nine-level multilevel inverter with switchi...
International Journal of Power Electronics and Drive Systems
 
Parameters for Classical Force Fields, E. Tajkhorshid
Parameters for Classical Force Fields, E. TajkhorshidParameters for Classical Force Fields, E. Tajkhorshid
Parameters for Classical Force Fields, E. TajkhorshidTCBG
 

What's hot (20)

Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...Research Inventy : International Journal of Engineering and Science is publis...
Research Inventy : International Journal of Engineering and Science is publis...
 
Variation of Fundamental Constants
Variation of Fundamental ConstantsVariation of Fundamental Constants
Variation of Fundamental Constants
 
BIOS 203 Lecture 3: Classical molecular dynamics
BIOS 203 Lecture 3: Classical molecular dynamicsBIOS 203 Lecture 3: Classical molecular dynamics
BIOS 203 Lecture 3: Classical molecular dynamics
 
StephaneValladier - SpecialistPaper - NDEtheory--v2.6
StephaneValladier - SpecialistPaper - NDEtheory--v2.6StephaneValladier - SpecialistPaper - NDEtheory--v2.6
StephaneValladier - SpecialistPaper - NDEtheory--v2.6
 
A Global Approach with Cutoff Exponential Function, Mathematically Well Defin...
A Global Approach with Cutoff Exponential Function, Mathematically Well Defin...A Global Approach with Cutoff Exponential Function, Mathematically Well Defin...
A Global Approach with Cutoff Exponential Function, Mathematically Well Defin...
 
AFMM Manual
AFMM ManualAFMM Manual
AFMM Manual
 
1310.1421v3
1310.1421v31310.1421v3
1310.1421v3
 
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
 
Dynamics of Twointeracting Electronsinthree-Dimensional Lattice
Dynamics of Twointeracting Electronsinthree-Dimensional LatticeDynamics of Twointeracting Electronsinthree-Dimensional Lattice
Dynamics of Twointeracting Electronsinthree-Dimensional Lattice
 
Hall 2006 problems-encounteredfromuserayleighdamping
Hall 2006 problems-encounteredfromuserayleighdampingHall 2006 problems-encounteredfromuserayleighdamping
Hall 2006 problems-encounteredfromuserayleighdamping
 
Sergey seriy thomas fermi-dirac theory
Sergey seriy thomas fermi-dirac theorySergey seriy thomas fermi-dirac theory
Sergey seriy thomas fermi-dirac theory
 
Calculando o tensor de condutividade em materiais topológicos
Calculando o tensor de condutividade em materiais topológicosCalculando o tensor de condutividade em materiais topológicos
Calculando o tensor de condutividade em materiais topológicos
 
7.local and global minima
7.local and global minima7.local and global minima
7.local and global minima
 
Harmonic Minimization using PSO of meld μgird-MLCI
Harmonic Minimization using PSO of meld μgird-MLCIHarmonic Minimization using PSO of meld μgird-MLCI
Harmonic Minimization using PSO of meld μgird-MLCI
 
Metric Projections to Identify Critical Points in Electric Power Systems
Metric Projections to Identify Critical Points in Electric Power SystemsMetric Projections to Identify Critical Points in Electric Power Systems
Metric Projections to Identify Critical Points in Electric Power Systems
 
Dynamic simulation of three-phase nine-level multilevel inverter with switchi...
Dynamic simulation of three-phase nine-level multilevel inverter with switchi...Dynamic simulation of three-phase nine-level multilevel inverter with switchi...
Dynamic simulation of three-phase nine-level multilevel inverter with switchi...
 
F0523740
F0523740F0523740
F0523740
 
SciReport 2015
SciReport 2015SciReport 2015
SciReport 2015
 
paper
paperpaper
paper
 
Parameters for Classical Force Fields, E. Tajkhorshid
Parameters for Classical Force Fields, E. TajkhorshidParameters for Classical Force Fields, E. Tajkhorshid
Parameters for Classical Force Fields, E. Tajkhorshid
 

Viewers also liked

IJBB-51-3-188-200
IJBB-51-3-188-200IJBB-51-3-188-200
IJBB-51-3-188-200sankar basu
 
A chave para desfrutarmos dos cuidados do pai um tempo para avançarmos na p...
A chave para desfrutarmos dos cuidados do pai   um tempo para avançarmos na p...A chave para desfrutarmos dos cuidados do pai   um tempo para avançarmos na p...
A chave para desfrutarmos dos cuidados do pai um tempo para avançarmos na p...RAMO ESTENDIDO
 
D09 carrefour folleto bolsas
D09 carrefour folleto bolsasD09 carrefour folleto bolsas
D09 carrefour folleto bolsastecnorlc
 
Conalep estado de mexico
Conalep estado de mexicoConalep estado de mexico
Conalep estado de mexico
CEPELLEN
 
203 - Πείραμα αγωγής ρεύματος από διάλυμα NaCl
203 - Πείραμα αγωγής ρεύματος από διάλυμα NaCl203 - Πείραμα αγωγής ρεύματος από διάλυμα NaCl
203 - Πείραμα αγωγής ρεύματος από διάλυμα NaCl
Stathis Gourzis
 
Our rights
Our rightsOur rights
Our rights
nazhasultana
 
2. expansión en serie de fourier de una función en un intervalo infinito
2. expansión en serie de fourier de una función en un intervalo infinito2. expansión en serie de fourier de una función en un intervalo infinito
2. expansión en serie de fourier de una función en un intervalo infinito
grower miranda mamani
 
Transferência de riquezas: as riquezas a serviço do Reino de Deus.
Transferência de riquezas: as riquezas a serviço do Reino de Deus.Transferência de riquezas: as riquezas a serviço do Reino de Deus.
Transferência de riquezas: as riquezas a serviço do Reino de Deus.RAMO ESTENDIDO
 
Compreensão profética do mês hebraico de Sivan.
Compreensão profética do mês hebraico de Sivan.Compreensão profética do mês hebraico de Sivan.
Compreensão profética do mês hebraico de Sivan.RAMO ESTENDIDO
 
A prática da coragem: um fundamento chave para o avanço.
A prática da coragem: um fundamento chave para o avanço.A prática da coragem: um fundamento chave para o avanço.
A prática da coragem: um fundamento chave para o avanço.RAMO ESTENDIDO
 
Aula 7º ano - Origem da vida na Terra
Aula 7º ano - Origem da vida na TerraAula 7º ano - Origem da vida na Terra
Aula 7º ano - Origem da vida na TerraLeonardo Kaplan
 
Siga o caminho de Deus e ele te surpreenderá, um tempo para ser surpreendido ...
Siga o caminho de Deus e ele te surpreenderá, um tempo para ser surpreendido ...Siga o caminho de Deus e ele te surpreenderá, um tempo para ser surpreendido ...
Siga o caminho de Deus e ele te surpreenderá, um tempo para ser surpreendido ...
RAMO ESTENDIDO
 
P4 ec exponenciales y radicales 1 solución
P4 ec exponenciales y radicales 1 soluciónP4 ec exponenciales y radicales 1 solución
P4 ec exponenciales y radicales 1 solución
lutv223
 
Rotterdam
RotterdamRotterdam
Rotterdam
xavier laborda
 
Early Childhood Development and Girls HEART reading pack
Early Childhood Development and Girls HEART reading packEarly Childhood Development and Girls HEART reading pack
Early Childhood Development and Girls HEART reading pack
Laura Bolton
 
Blogkonzepte - Blogger:sessions
Blogkonzepte - Blogger:sessionsBlogkonzepte - Blogger:sessions
Blogkonzepte - Blogger:sessions
Mara Giese
 
Sindrome del-edificio-enfermo
Sindrome del-edificio-enfermoSindrome del-edificio-enfermo
Sindrome del-edificio-enfermo
Julio Lovon
 

Viewers also liked (19)

IJBB-51-3-188-200
IJBB-51-3-188-200IJBB-51-3-188-200
IJBB-51-3-188-200
 
A chave para desfrutarmos dos cuidados do pai um tempo para avançarmos na p...
A chave para desfrutarmos dos cuidados do pai   um tempo para avançarmos na p...A chave para desfrutarmos dos cuidados do pai   um tempo para avançarmos na p...
A chave para desfrutarmos dos cuidados do pai um tempo para avançarmos na p...
 
D09 carrefour folleto bolsas
D09 carrefour folleto bolsasD09 carrefour folleto bolsas
D09 carrefour folleto bolsas
 
Doc1
Doc1Doc1
Doc1
 
Conalep estado de mexico
Conalep estado de mexicoConalep estado de mexico
Conalep estado de mexico
 
203 - Πείραμα αγωγής ρεύματος από διάλυμα NaCl
203 - Πείραμα αγωγής ρεύματος από διάλυμα NaCl203 - Πείραμα αγωγής ρεύματος από διάλυμα NaCl
203 - Πείραμα αγωγής ρεύματος από διάλυμα NaCl
 
Our rights
Our rightsOur rights
Our rights
 
2. expansión en serie de fourier de una función en un intervalo infinito
2. expansión en serie de fourier de una función en un intervalo infinito2. expansión en serie de fourier de una función en un intervalo infinito
2. expansión en serie de fourier de una función en un intervalo infinito
 
Transferência de riquezas: as riquezas a serviço do Reino de Deus.
Transferência de riquezas: as riquezas a serviço do Reino de Deus.Transferência de riquezas: as riquezas a serviço do Reino de Deus.
Transferência de riquezas: as riquezas a serviço do Reino de Deus.
 
Compreensão profética do mês hebraico de Sivan.
Compreensão profética do mês hebraico de Sivan.Compreensão profética do mês hebraico de Sivan.
Compreensão profética do mês hebraico de Sivan.
 
A prática da coragem: um fundamento chave para o avanço.
A prática da coragem: um fundamento chave para o avanço.A prática da coragem: um fundamento chave para o avanço.
A prática da coragem: um fundamento chave para o avanço.
 
Aula 7º ano - Origem da vida na Terra
Aula 7º ano - Origem da vida na TerraAula 7º ano - Origem da vida na Terra
Aula 7º ano - Origem da vida na Terra
 
Siga o caminho de Deus e ele te surpreenderá, um tempo para ser surpreendido ...
Siga o caminho de Deus e ele te surpreenderá, um tempo para ser surpreendido ...Siga o caminho de Deus e ele te surpreenderá, um tempo para ser surpreendido ...
Siga o caminho de Deus e ele te surpreenderá, um tempo para ser surpreendido ...
 
P4 ec exponenciales y radicales 1 solución
P4 ec exponenciales y radicales 1 soluciónP4 ec exponenciales y radicales 1 solución
P4 ec exponenciales y radicales 1 solución
 
Rotterdam
RotterdamRotterdam
Rotterdam
 
Early Childhood Development and Girls HEART reading pack
Early Childhood Development and Girls HEART reading packEarly Childhood Development and Girls HEART reading pack
Early Childhood Development and Girls HEART reading pack
 
Blogkonzepte - Blogger:sessions
Blogkonzepte - Blogger:sessionsBlogkonzepte - Blogger:sessions
Blogkonzepte - Blogger:sessions
 
Sindrome del-edificio-enfermo
Sindrome del-edificio-enfermoSindrome del-edificio-enfermo
Sindrome del-edificio-enfermo
 
polDNAmelt-jbic
polDNAmelt-jbicpolDNAmelt-jbic
polDNAmelt-jbic
 

Similar to sarama-jbic

Structural, electronic, elastic, optical and thermodynamical properties of zi...
Structural, electronic, elastic, optical and thermodynamical properties of zi...Structural, electronic, elastic, optical and thermodynamical properties of zi...
Structural, electronic, elastic, optical and thermodynamical properties of zi...
Alexander Decker
 
Topology of charge density from pseudopotential density functional theory cal...
Topology of charge density from pseudopotential density functional theory cal...Topology of charge density from pseudopotential density functional theory cal...
Topology of charge density from pseudopotential density functional theory cal...
Alexander Decker
 
acs.jpca.9b08723.pdf
acs.jpca.9b08723.pdfacs.jpca.9b08723.pdf
acs.jpca.9b08723.pdf
ashwanikushwaha15
 
Investigation of Steady-State Carrier Distribution in CNT Porins in Neuronal ...
Investigation of Steady-State Carrier Distribution in CNT Porins in Neuronal ...Investigation of Steady-State Carrier Distribution in CNT Porins in Neuronal ...
Investigation of Steady-State Carrier Distribution in CNT Porins in Neuronal ...
Kyle Poe
 
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
Electron Density Derived Descriptors in Drug Discovery and Protein ModelingElectron Density Derived Descriptors in Drug Discovery and Protein Modeling
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
N. Sukumar
 
Radiation patterns account of a circular microstrip antenna loaded two annular
Radiation patterns account of a circular microstrip antenna  loaded two annularRadiation patterns account of a circular microstrip antenna  loaded two annular
Radiation patterns account of a circular microstrip antenna loaded two annular
wailGodaymi1
 
EVALUATING STRUCTURAL, OPTICAL & ELECTRICAL CHARACTERIZATION OF ZINC CHALCOGE...
EVALUATING STRUCTURAL, OPTICAL & ELECTRICAL CHARACTERIZATION OF ZINC CHALCOGE...EVALUATING STRUCTURAL, OPTICAL & ELECTRICAL CHARACTERIZATION OF ZINC CHALCOGE...
EVALUATING STRUCTURAL, OPTICAL & ELECTRICAL CHARACTERIZATION OF ZINC CHALCOGE...
Editor IJCATR
 
minas
minasminas
Active learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classificationActive learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classification
Pioneer Natural Resources
 
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...GOASA
 
PERFORMANCE ASSESSMENT OF CHAOTIC SEQUENCE DERIVED FROM BIFURCATION DEPENDENT...
PERFORMANCE ASSESSMENT OF CHAOTIC SEQUENCE DERIVED FROM BIFURCATION DEPENDENT...PERFORMANCE ASSESSMENT OF CHAOTIC SEQUENCE DERIVED FROM BIFURCATION DEPENDENT...
PERFORMANCE ASSESSMENT OF CHAOTIC SEQUENCE DERIVED FROM BIFURCATION DEPENDENT...
IJCNCJournal
 
ENERGY_BAND_THEORY.pdf for physics students
ENERGY_BAND_THEORY.pdf for physics studentsENERGY_BAND_THEORY.pdf for physics students
ENERGY_BAND_THEORY.pdf for physics students
divyanshp1211
 
My_Journal_Paper
My_Journal_PaperMy_Journal_Paper
My_Journal_PaperJess Wang
 

Similar to sarama-jbic (20)

Structural, electronic, elastic, optical and thermodynamical properties of zi...
Structural, electronic, elastic, optical and thermodynamical properties of zi...Structural, electronic, elastic, optical and thermodynamical properties of zi...
Structural, electronic, elastic, optical and thermodynamical properties of zi...
 
Topology of charge density from pseudopotential density functional theory cal...
Topology of charge density from pseudopotential density functional theory cal...Topology of charge density from pseudopotential density functional theory cal...
Topology of charge density from pseudopotential density functional theory cal...
 
acs.jpca.9b08723.pdf
acs.jpca.9b08723.pdfacs.jpca.9b08723.pdf
acs.jpca.9b08723.pdf
 
1204.5674v1
1204.5674v11204.5674v1
1204.5674v1
 
Investigation of Steady-State Carrier Distribution in CNT Porins in Neuronal ...
Investigation of Steady-State Carrier Distribution in CNT Porins in Neuronal ...Investigation of Steady-State Carrier Distribution in CNT Porins in Neuronal ...
Investigation of Steady-State Carrier Distribution in CNT Porins in Neuronal ...
 
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
Electron Density Derived Descriptors in Drug Discovery and Protein ModelingElectron Density Derived Descriptors in Drug Discovery and Protein Modeling
Electron Density Derived Descriptors in Drug Discovery and Protein Modeling
 
Report
ReportReport
Report
 
Radiation patterns account of a circular microstrip antenna loaded two annular
Radiation patterns account of a circular microstrip antenna  loaded two annularRadiation patterns account of a circular microstrip antenna  loaded two annular
Radiation patterns account of a circular microstrip antenna loaded two annular
 
EVALUATING STRUCTURAL, OPTICAL & ELECTRICAL CHARACTERIZATION OF ZINC CHALCOGE...
EVALUATING STRUCTURAL, OPTICAL & ELECTRICAL CHARACTERIZATION OF ZINC CHALCOGE...EVALUATING STRUCTURAL, OPTICAL & ELECTRICAL CHARACTERIZATION OF ZINC CHALCOGE...
EVALUATING STRUCTURAL, OPTICAL & ELECTRICAL CHARACTERIZATION OF ZINC CHALCOGE...
 
Paper
PaperPaper
Paper
 
jkps.66.323
jkps.66.323jkps.66.323
jkps.66.323
 
minas
minasminas
minas
 
Active learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classificationActive learning algorithms in seismic facies classification
Active learning algorithms in seismic facies classification
 
PhysRevSTAB.5.102001
PhysRevSTAB.5.102001PhysRevSTAB.5.102001
PhysRevSTAB.5.102001
 
Ah03302090215
Ah03302090215Ah03302090215
Ah03302090215
 
400
400400
400
 
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
Detection of an_unidentified_emission_line_in_the_stacked_xray_spectrum_of_ga...
 
PERFORMANCE ASSESSMENT OF CHAOTIC SEQUENCE DERIVED FROM BIFURCATION DEPENDENT...
PERFORMANCE ASSESSMENT OF CHAOTIC SEQUENCE DERIVED FROM BIFURCATION DEPENDENT...PERFORMANCE ASSESSMENT OF CHAOTIC SEQUENCE DERIVED FROM BIFURCATION DEPENDENT...
PERFORMANCE ASSESSMENT OF CHAOTIC SEQUENCE DERIVED FROM BIFURCATION DEPENDENT...
 
ENERGY_BAND_THEORY.pdf for physics students
ENERGY_BAND_THEORY.pdf for physics studentsENERGY_BAND_THEORY.pdf for physics students
ENERGY_BAND_THEORY.pdf for physics students
 
My_Journal_Paper
My_Journal_PaperMy_Journal_Paper
My_Journal_Paper
 

sarama-jbic

  • 1. Delivered by Publishing Technology to: Sankar Basu IP: 116.193.141.16 On: Thu, 19 Dec 2013 19:48:07 Copyright: American Scientific Publishers COMMUNICATION Copyright © 2013 American Scientific Publishers All rights reserved Printed in the United States of America Journal of Bioinformatics and Intelligent Control Vol. 2, 321–323, 2013 SARAMA: A Standalone Suite of Programs for the Complementarity Plot—A Graphical Structure Validation Tool for Proteins Sankar Basu1 ∗ , Dhananjay Bhattacharyya2 , and Rahul Banerjee1 ∗ 1 Crystallography and Molecular Biology Division, Saha Institute of Nuclear Physics, Kolkata 700064, India 2 Computational Science Division, Saha Institute of Nuclear Physics, Kolkata 700064, India Structure validation is a crucial component not only in protein crystallography but also in model quality estimation in homology modeling, protein design and de-novo structure prediction. Two key attributes of a correctly determined atomic model are optimal packing between side-chains and absence of destabilizing unbalanced electric fields within the interior of a protein molecule. The complementarity plot (CP) combines them in a single unified measure. CP has now been compiled into a user friendly validation package and made available as a standalone suite of programs in the public domain (http://www.saha.ac.in/biop/www/sarama.html). The application of CP in the detection of wrong rotamer assignment has been surveyed. Keywords: Complementarity, Packing and Electrostatics, Structure Validation. We report the free availability of a standalone suite of pro- grams (Sarama) for the Complementarity Plot (Linux Plat- form) with detailed features and documentation available at the website: http://www.saha.ac.in/biop/www/sarama. html. The basic methodology has already been reported.1 Briefly, the Complementarity Plot (CP) estimates the shape and electrostatic complementarity of interior residues of a globular protein and is a sensitive indicator of their har- mony or disharmony with regard to the short and long range forces sustaining the native fold. A correctly deter- mined natively folded protein structure should have opti- mal packing between its buried side-chains and absence of destabilizing unbalanced electric fields within the interior of the molecule. CP has already been demonstrated to be effective in detecting local regions of suboptimal packing or electrostatics which were found to be highly correlated to coordinate errors. CP has now been compiled into an user friendly validation package which should be an use- ful addition in the already existing repertoire of structure validation tools. A set of scores have now been included in the methodology which gives an estimate of the prob- abilities associated with the distribution of points in the plot and the propensities of specific residues to different degrees solvent exposure. As has been reported previously1 CP requires the sur- face (Ssc m ) and electrostatic (Esc m complementarity to be computed for buried residues. In this regard, the extent of ∗ Authors to whom correspondence should be addressed. burial (Bur) of every amino acid residue with respect to the solvent was estimated by the ratio of the solvent acces- sible areas (probe radius: 1.4 Å)2 of the residue (X) in the polypeptide chain to that of an identical residue in a Gly- X-Gly peptide fragment, in a fully extended conforma- tion. Only those residues with the burial ratio Bur ≤ 0 30 were henceforth considered for the complementarity plot. The van der Waals surface was calculated for the entire polypeptide chain, sampled at 10 dots/Å2 3 and surface (Ssc m and electrostatic (Esc m complementarities calculated for buried or partially buried side-chains.1 3 For surface complementarity Ssc m , only side-chain sur- face points of buried residues (target) were considered and their nearest neighboring surface points identified from the rest of the polypeptide chain (within a distance of 3.5 Å). Surface points essentially being area elements are characterized by their positions (x y z) and the direc- tion cosines (dl dm dn) of their normals. Then, adapted from Lawrence and Colman,4 the following expression was calculated: S a b = na ·nb ·exp −w d2 ab (1) where na and nb are two unit normal vectors, correspond- ing to the dot surface point a (located on the side chain surface of the target residue) and b (the dot point near- est to a, within 3.5 Å) respectively, with dab the distance between them and w, a scaling constant set to 0.5. Ssc m was defined as the median of the distribution {S a b } J. Bioinf. Intell. Control 2013, Vol. 2, No. 4 2326-7496/2013/2/321/003 doi:10.1166/jbic.2013.1059 321
  • 2. Delivered by Publishing Technology to: Sankar Basu IP: 116.193.141.16 On: Thu, 19 Dec 2013 19:48:07 Copyright: American Scientific Publishers COMMUNICATION SARAMA: A Standalone Suite of Programs for the CP—A Graphical Structure Validation Tool for Proteins Basu et al. calculated over all the dot surface points of the side-chain target residue. For electrostatic complementarity (Esc m ), the electrostatic potential of the molecular surface was estimated using the finite difference Poisson-Boltzmann method as imple- mented in DelPhi.5 The potential on the side-chain sur- face points of a buried residue was then computed twice,1 first, due to all atoms of the target residue and second as a function of all atoms from the rest of the polypeptide chain (excluding the target). Thus, each surface point was tagged with two values of electrostatic potential. Follow- ing McCoy et al.,6 negative of the Pearson’s correlation coefficient between these two sets of potential values over the side-chain dot surface points of the target residue was defined as Esc m Esc m = − N i=1 i − ¯ i − ¯ N i=1 i − ¯ 2 N i=1 i − ¯ 2 1/2 (2) where, for a given residue consisting of a total of N side- chain dot surface points, i is the potential on its ith point realized due to its own atoms and i , due to the rest of the protein atoms, ¯ and ¯ are the mean potentials of i , i = 1 N and i , i = 1 N respectively. The plot of Ssc m on the X-axis and Esc m on the Y -axis (spanning −1 to 1 in both axes) constitutes the ‘Com- plementarity Plot’ (CP), which is actually divided into three plots based on the burial ranges: 0 00 ≤ Bur ≤ 0 05 (CP1), 0 05 < Bur ≤ 0 15 (CP2) and 0 15 < Bur ≤ 0 30 (CP3). Initially, all the buried residues from a training database (DB2) consisting of 400 highly resolved protein crystal structures1 were plotted in the CPs, which had been divided into square-grids (of width 0 05 × 0 05), and the center of every square grid was assigned an initial prob- ability (Pgrid equal to the number of points in the grid divided by the total number of points in the plot. The probability of a residue to occupy a specific position in the plot was then estimated by bilinear interpolation from the probability values of its four nearest neighboring vox- els. Each CP was contoured based on the initial proba- bility values (Pgrid ≥ 0 005 for the first contour level and Pgrid ≥ 0 002 for the second) thus dividing the plot into three distinct regions. The cumulative probability of locat- ing a point within the second (outer) contour for the three plots were 91%, 90%, 88% respectively whereas for the first (inner) contour, the probability gradually dropped with increasing solvent exposure (82%, 76%, 71%). Inspired by the Ramachandran Plot, the region within the first con- tour was termed ‘probable,’ between the first and second contour, ‘less probable’ and outside the second contour, ‘improbable’ (Fig. 1). In such a plot residues with low Ssc m and Esc m (<0.2 for both) are easily identified. The methodology has already been shown to detect errors in side-chain conformers in obsolete structures w.r.t. their upgraded counterparts.1 Such side-chains were found to have suboptimal pack- ing and/or electrostatics and thus predominantly lie in the Fig. 1. CP1: The complementarity plot for the 1st burial bin. ‘Probable,’ ‘less probable’ and improbable’ regions of the plot are colored in purple, mauve and sky-blue respectively. improbable region of the plots. An example of such a cal- culation is given below. 110 pairs of obsolete and their corresponding upgraded counterparts were assembled from the PDB (ftp://ftp. wwpdb.org/pub/pdb/data/status/obsolete.dat). In order to ensure that the upgraded structure was genuinely bet- ter than its obsoleted counterpart, only those pairs were selected wherein the improvement in resolution and R-factor were better than 0.2 Å and 0.02 respectively. 222 deeply buried residues (0 0 ≤ Bur ≤ 0 05) from the upgraded structures were identified which were origi- nally found to be located in the probable region of CP1, and whose counterparts in the corresponding obsolete structures differed by more than 40 (involving 1 and 2 though belonging to another valid rotamer combination.7 They were then replaced by their corresponding coun- terparts from the obsolete structures. Subsequent to the replacement, 45% of the points were relocated in the improbable region of the plot, 16% were found in the less probable region whereas 39% were retained in the prob- able region (Fig. 2). Deviations from the expected dis- tributions (DB2) were estimated by means of 2 (df = 3–1, probable, less probable, improbable; 2 0 05 = 5 991) subsequent to the replacement which was found to be 397.63. Thus, CP could have applications when deal- ing with low-resolution data where automated side-chain rebuilding methods generally do not work very efficiently. The Complementarity Plot as a validation technique is probabilistic in nature and can be utilized either over the full chain, or on any distribution of points. Further, this is the only validation procedure which combines both packing 322 J. Bioinf. Intell. Control 2, 321–323, 2013
  • 3. Delivered by Publishing Technology to: Sankar Basu IP: 116.193.141.16 On: Thu, 19 Dec 2013 19:48:07 Copyright: American Scientific Publishers COMMUNICATION Basu et al. SARAMA: A Standalone Suite of Programs for the CP—A Graphical Structure Validation Tool for Proteins Fig. 2. Distributions (in CP1) for residues with native side-chain conformers from the upgraded structures and replaced by rotamers from corresponding obsolete counterparts. (A) Distribution of residues with native side-chains all falling into the probable regions of CP1 and (B) distribution subsequent to the replacement. and electrostatics in a single unified measure and displays graphically (apart from actually listing) residues with faulty packing and/or electrostatics. Thus, CP should be a useful addition in the already existing repertoire of structure val- idation tools. The output of the program gives Ssc m and Esc m of buried residues which can also be used for a wide range of other applications e.g., fold recognition, analysis of side-chain packing, detection of unbalanced partial charges within protein interiors, protein design and modeling. The website contains detailed documentation of the different thresholds for successful validation for all the CP-scores for a given atomic model. The model might be experimentally or computationally derived but should definitely contain coordinates of (geometrically fixed) hydrogen atoms consistent with the format of REDUCE.8 The suite has been successfully tested on Redhat Enter- prise and open Suse linux platforms with PERL and Fortran compilers f90, f95, gfortran or ifort. DELPHI5 must be pre- installed and running under the command: delphi_static. References and Notes 1. S. Basu, D. Bhattacharyya, and R. Banerjee, Self-complementarity within proteins: Bridging the gap between binding and folding. Biophys. J. 102, 2605 (2012). 2. B. Lee and F. M. Richards, The interpretation of protein structures: Estimation of static accessibility. J. Mol. Biol. 55, 379 (1971). 3. R. Banerjee, M. Sen, D. Bhattacharyya, and P. Saha, The Jigsaw puz- zle model: Search for conformational specificity in protein interiors. J. Mol. Biol. 333, 211 (2003). 4. M. C. Lawrence and P. M. Colman, Shape complementarity at pro- tein/protein interfaces. J. Mol. Biol. 234, 946 (1993). 5. A. Nichollos and B. Honig, A rapid finite difference algorithm, utiliz- ing successive over-relaxation to solve the Poisson-Boltzmann equa- tion. J. Comput. Chem. 12, 435 (1991). 6. A. J. McCoy, V. C. Epa, and P. M. Colman, Electrostatic complemen- tarity at protein/protein interfaces. J. Mol. Biol. 268, 570 (1997). 7. M. S. Shapovalov and R. L. Dunbrack, Jr, A smoothed backbone- dependent rotamer library for proteins derived from adaptive kernel density estimates and regressions. Structure 19, 844 (2011). 8. J. M. Word, S. C. Lovell, J. S. Richardson, and D. C. Richardson, Asparagine and glutamine: Using hydrogen atom contacts in the choice of side-chain amide orientation. J. Mol. Biol. 285, 1735 (1999). Received: 10 August 2013. Accepted: 3 September 2013. J. Bioinf. Intell. Control 2, 321–323, 2013 323