SlideShare a Scribd company logo
BIONEXT-DSS CONSTRUCTION
Four distinct elements are compiled for each test case:
- The reference site (bin 0) belongs to a protein of pharma-
ceutical interest and is systematically selected as the binding
site of a ligand of pharmaceutical interest according to the
Linpisky rules5. Many of the small ions, sugars, solvent or
bulky HEME cofactors that can be found in Kahraman1 and
Hoffmann2 datasets were for instance ruled out.
- The positive sites were selected as sites containing the same
ligand or a ligand very similar to that of the reference.
- Similarity assessment (Binning) : each positive site was
manually aligned with its reference site using PyMol6. Che-
mical groups of positive sites matching those of the reference
site were materialized by spheres (see Fig.1) and a visual
assessment of the structural similarity of each positive site
to its reference was then performed based on the number
and relevance of such spheres. Although this classification
is very subjective, we tried to conform to certain rules: Bin 1
contains sites very similar to the reference (up to the overall
fold); Bin 2 includes sites with enough fluctuation whilst the
global pattern is still retained; Bin 3 contains sites where the
similarity is still visible yet getting blurred. In higher bins, few
local similarities are still observed whilst the global pattern
can be considered dissimilar.
- Noise : we believe that control of negative structures is as
important as control over positive structures when it comes to
algorithms evaluation. Therefore, compared to previous works,
we provide “noise” structures instead of using positives from
other test cases. We achieved this by clustering structures
in the Protein Data Bank (PDB) to 30% similarity. Structures
quality was moreover taken into account (only structures with
a resolution under 3 Å and chains containing at least 80
amino acids were retained). Construction of a dataset of
true negative sites is difficult. Therefore we provide instead
a list of “noise” structures built so as to be as exempt as
possible of sites similar to a positive site. We ensured this by
removing structures sharing Interpro annotation7 with any of
the positive structures.
FIT FOR FITTING:
BIONEXT-DSS, A DATASET OF SIMILARS SITES
/ RESOURCE
SUMMARY
Predicting novel binding pockets for a given medicine is a challenging task with numerous pharmaceutical
applications. Based on the principle that “similar binding sites share similar ligands”4
, many binding pocket pre-
diction algorithms are based on the evaluation and retrieval of pockets being similar to a known binding site.
Twodistinctyetconnectedproblematicscanhencebedescribed:(i)theabilitytoretrieveknownsitesforagivenligandindistinct
proteins,and(ii)theretrievalofsimilarsites.Nowadays,manyprotocolsanddatasetsexistfortheevaluationofligandpocketpre-
diction,mainlybasedonpioneeringworksfromKahraman1
andHoffmann2
.However,toourknowledge,nodatasetexistfor
thespecificevaluationofsimilaritybasedalgorithmprobablybecausetheverynotionof“sitesimilarity”eludesastraightdefinition.
Forthesereasonswecompiled“Bionext-DSS”,adatasetofsimilarbindingsiteswhichcontains35testcasesconnectedtoa
pharmaceuticalapplication.Eachcaseincludesareferencesite,aswellasseveralpositivesitesfurtherclassifiedin6binsac-
cordingtotheirsimilarityleveltothereference.Wemoreoverprovidealistofnoisestructurescarefullyselectedsoastobeused
asnegativecontrolsinalgorithmsevaluation.Ourdatasetwouldbeusedtobenchourapproach(BioBind3
)toseveralothers.
The Bionext-DSS dataset is freely available on our servers (https://goo.gl/n5rAOp).
Fig.1
DSS: 35 innovative testcases
to define site similarity in a
pharmaceutical context.
«
© BIONEXT 2017. Technology developed in Strasbourg, France. Bionext may be registered trademarks or service marks of Bionext registered in many jurisdictions worldwide. This document is
current as of the initial date of publication and may be changed by Bionext at any time.
HTTPS:/ /BIOSIGHT.BIO NEXT.COM
01
Dataset
Benchmark
Site
similarity
Bin classification according to eye-scored site similarity. The overall fold of the structure of two captopril binding proteins from test case are clearly dissimilar (cartoon repre-
sentation). Nevertheless, when sites of both structures are superimposed (residues in line and ligands in stick, in the middle) atoms exhibit some similarity (materialized with
spheres : nitrogen and oxygen) which denotes a Bin 2 according to our classification.
4DPR ( Bin 0 ) 4C2P ( Bin 2 )
DSS can be used for the two aforementioned problematics :
- In order to evaluate the effectiveness of similarity retrieval,
one can use the reference site as query and probe the
ability to retrieve positive sites versus noise through the
measurement of the area under the ROC curve (AUC).
- In order to evaluate the algorithms ability to retrieve all
known pockets of a given ligand, one can use alternatively
all positive sites of a test case as a query and calculate
an average AUC.
As a complement, the structural alignment of positive sites
with the reference provided in DSS can be used to dis-
criminate valid prediction of positive sites.
FACTS AND COMPARISONS
For comparisons we applied our method for the evalua-
tion of site similarity to the sites of the Kahraman and
Hoffmann datasets.
As can be seen in Table 1, DSS contains a larger amount
of test cases as well as a larger amount of positive sites.
Fig. 2 moreover reveals that DSS contains a bigger pro-
portion of trivial similarities (bins 1 and 2), making our
dataset more suited to evaluate algorithms on their ability
to detect similarity. Moreover, the amount of difficult cases
(bins 3 to 5) in DSS is comparable to that in Kahraman
and Hoffmann, which makes our dataset also fit for the
assessment of the more classical problematics of ligand
binding pocket retrieval.
With this dataset, we propose an experimental definition
of similarity compliant to the intuition of a pharmaceuti-
cally trained bioinformatician. To our knowledge, this is
the first effort in that direction, and we believe this work
is necessary in order to tune similarity-based algorithms.
Although our classification in similarity bins is necessarily
subjective, experiments conducted with our in-house al-
gorithm BioBind as well as with ProBis8 show that lower
bins were retrieved first, thus confirming an agreement
between our intuitive definition of similarity and these
algorithms.
© BIONEXT 2017. Technology developed in Strasbourg, France. Bionext may be registered trademarks or service marks of Bionext registered in many jurisdictions worldwide. This document is
current as of the initial date of publication and may be changed by Bionext at any time.
HTTPS:/ /BIOSIGHT.BIO NEXT.COM
ABOUT BIONEXT
BIONEXTisaFrenchbioinformaticscompanyfounded
in 2009 and dedicated to the better understanding
and treatment of various diseases. As such, BIONEXT
focuses on pharmaceutical problematics with a ma-
jor interest in guiding the prediction of biochemical
compounds for pharmaceuticals targets. BIONEXT
aims to accelerate and improve the effectiveness
of drug development and biological research by
developing programs and software-based tools to
tackle problematics of prime importance in the phar-
maceutical field, such as drug profile assessment or
suggestion of drug repurposing. Our first application
BioBind is based on the now accepted principle
that similar receptors bind similar ligands.
W W W.BIO NEXT.COM
POSSIBLE USES OF BIONEXT-DSS
Dataset
Positives Sites
Test cases
Bionext-DSS Kahraman Hoffman
222 64 97
35 7 10
Table 1: Number of test cases and positive sites for the studied datasets.
Fig. 2: Percentage of positive sites per bin for each dataset.
Numbers on the top of each bar refer to the numbers of positive sites.
Fig.3: Application of DSS to BioBind and ProBis algorithms.
Dataset
Benchmark
Site
similarity
REFERENCES
1. KAHRAMAN, A. ET AL. (2007). SHAPE VARIATIO N IN PROTEIN BINDIN G PO CKETS AND THEIR
LIGANDS. JOURNAL OF MOLECULAR BIOLO GY, 368(1), 283–301.
2. HOFFMAN N, B. ET AL. (2010). A NEW PROTEIN BINDIN G PO CKET SIMILARITY MEASURE BASED
O N C OMPARISO N OF CLOUDS OF ATOMS IN 3D: APPLICATIO N TO LIGAND PREDICTIO N. BMC
BIOINFORMATICS, 11, 99.
3. BIOBIND: ALG ORITHM PATENTED BY BIO NEXT TO C OMPARE AND CLASSIFY PHYSIC O-CHEMICAL
SURFACES. CF OTHER BIO NEXT C OMMUNICATIO NS FOR MORE INFORMATIO N.
4. KLABUNDE, T. (2007). CHEMO GEN OMIC APPROACHES TO DRUG DISCOVERY: SIMILAR RECEPTORS
BIND SIMILAR LIGANDS. BR J PHARMAC OL, 152(1), 5–7
5. LIPINSKI, C. A (2004). LEAD- AND DRUG-LIKE C OMPOUNDS: THE RULE-OF-FIVE REVOLUTIO N.
DRUG DISC OV TODAY, DEC(4), 337-41
6. DELAN O W. (2002). THE PYMOL MOLECULAR GRAPHICS SYSTEM. SAN CARLOS,CA,USA:
DELAN O SCIENTIFIC LLC. HTTP: / /PYMOL.SOURCEFORGE.NET .
7. ROBERT D.FIN N. (2016) INTERPRO IN 2017 - BEYO ND PROTEIN FAMILY AND DOMAIN
AN N OTATIO NS. NUCLEIC ACIDS RES (2017), 45 (D1), D190-D199.
8. KO N C, J. ET AL. (2015) PROBIS-CHARMMIN G: WEB INTERFACE FOR PREDICTIO N AND
OPTIMIZATIO N OF LIGANDS IN PROTEIN BINDING SITES. J. CHEM. INF. MODEL., 55, 2308-2314.
02

More Related Content

What's hot

Target oriented generic fingerprint-based molecular representation
Target oriented generic fingerprint-based molecular representationTarget oriented generic fingerprint-based molecular representation
Target oriented generic fingerprint-based molecular representation
csandit
 
Computational Drug Design
Computational Drug DesignComputational Drug Design
Computational Drug Design
baoilleach
 
Ligand based drug design
Ligand based drug designLigand based drug design
Ligand based drug design
Satyendra Yadav
 
In Silico methods for ADMET prediction of new molecules
 In Silico methods for ADMET prediction of new molecules In Silico methods for ADMET prediction of new molecules
In Silico methods for ADMET prediction of new molecules
MadhuraDatar
 
Structure based and ligand based drug designing
Structure based and ligand based drug designingStructure based and ligand based drug designing
Structure based and ligand based drug designing
Dr Vysakh Mohan M
 
Structure based drug design- kiranmayi
Structure based drug design- kiranmayiStructure based drug design- kiranmayi
Structure based drug design- kiranmayi
KiranmayiKnv
 

What's hot (6)

Target oriented generic fingerprint-based molecular representation
Target oriented generic fingerprint-based molecular representationTarget oriented generic fingerprint-based molecular representation
Target oriented generic fingerprint-based molecular representation
 
Computational Drug Design
Computational Drug DesignComputational Drug Design
Computational Drug Design
 
Ligand based drug design
Ligand based drug designLigand based drug design
Ligand based drug design
 
In Silico methods for ADMET prediction of new molecules
 In Silico methods for ADMET prediction of new molecules In Silico methods for ADMET prediction of new molecules
In Silico methods for ADMET prediction of new molecules
 
Structure based and ligand based drug designing
Structure based and ligand based drug designingStructure based and ligand based drug designing
Structure based and ligand based drug designing
 
Structure based drug design- kiranmayi
Structure based drug design- kiranmayiStructure based drug design- kiranmayi
Structure based drug design- kiranmayi
 

Similar to Fit for fitting: Bionext-DSS, a dataset of similars sites

Validation of Clomipramine interactions identified by BioBind against experim...
Validation of Clomipramine interactions identified by BioBind against experim...Validation of Clomipramine interactions identified by BioBind against experim...
Validation of Clomipramine interactions identified by BioBind against experim...
Marie-Julie Denelle
 
Chemistry Reserach as a Social Machine
 Chemistry Reserach as a Social Machine Chemistry Reserach as a Social Machine
Chemistry Reserach as a Social Machine
Jeremy Frey
 
ENHANCED POPULATION BASED ANT COLONY FOR THE 3D HYDROPHOBIC POLAR PROTEIN STR...
ENHANCED POPULATION BASED ANT COLONY FOR THE 3D HYDROPHOBIC POLAR PROTEIN STR...ENHANCED POPULATION BASED ANT COLONY FOR THE 3D HYDROPHOBIC POLAR PROTEIN STR...
ENHANCED POPULATION BASED ANT COLONY FOR THE 3D HYDROPHOBIC POLAR PROTEIN STR...
ijbbjournal
 
Pep Talk San Diego 011311
Pep Talk San Diego 011311Pep Talk San Diego 011311
Pep Talk San Diego 011311
Philip Bourne
 
Homology modeling
Homology modelingHomology modeling
PaulZRXie_Nucl. Acids Res.-2013
PaulZRXie_Nucl. Acids Res.-2013PaulZRXie_Nucl. Acids Res.-2013
PaulZRXie_Nucl. Acids Res.-2013
Zhongru (Paul) Xie
 
E0362430
E0362430E0362430
E0362430
IOSR Journals
 
Determining stable ligand orientation
Determining stable ligand orientationDetermining stable ligand orientation
Determining stable ligand orientation
ijaia
 
Cadd and molecular modeling for M.Pharm
Cadd and molecular modeling for M.PharmCadd and molecular modeling for M.Pharm
Cadd and molecular modeling for M.Pharm
Shikha Popali
 
HMM’S INTERPOLATION OF PROTIENS FOR PROFILE ANALYSIS
HMM’S INTERPOLATION OF PROTIENS FOR PROFILE ANALYSISHMM’S INTERPOLATION OF PROTIENS FOR PROFILE ANALYSIS
HMM’S INTERPOLATION OF PROTIENS FOR PROFILE ANALYSIS
ijcseit
 
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
csandit
 
Bind database
Bind databaseBind database
Bind database
Ritisha Gupta
 
BCSRCv1.3
BCSRCv1.3BCSRCv1.3
BCSRCv1.3
Kishan Bhut
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
researchinventy
 
MULISA : A New Strategy for Discovery of Protein Functional Motifs and Residues
MULISA : A New Strategy for Discovery of Protein Functional Motifs and ResiduesMULISA : A New Strategy for Discovery of Protein Functional Motifs and Residues
MULISA : A New Strategy for Discovery of Protein Functional Motifs and Residues
csandit
 
RATIONAL DRUG DESIGN.pptx
RATIONAL DRUG DESIGN.pptxRATIONAL DRUG DESIGN.pptx
RATIONAL DRUG DESIGN.pptx
Praveen kumar S
 
Structure based drug designing
Structure based drug designingStructure based drug designing
Structure based drug designing
Seenam Iftikhar
 
Cadd assignment 4 (sarita)
Cadd assignment 4 (sarita)Cadd assignment 4 (sarita)
Cadd assignment 4 (sarita)
University of Allahabad
 
Ppi
PpiPpi
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methods
ratanvishwas
 

Similar to Fit for fitting: Bionext-DSS, a dataset of similars sites (20)

Validation of Clomipramine interactions identified by BioBind against experim...
Validation of Clomipramine interactions identified by BioBind against experim...Validation of Clomipramine interactions identified by BioBind against experim...
Validation of Clomipramine interactions identified by BioBind against experim...
 
Chemistry Reserach as a Social Machine
 Chemistry Reserach as a Social Machine Chemistry Reserach as a Social Machine
Chemistry Reserach as a Social Machine
 
ENHANCED POPULATION BASED ANT COLONY FOR THE 3D HYDROPHOBIC POLAR PROTEIN STR...
ENHANCED POPULATION BASED ANT COLONY FOR THE 3D HYDROPHOBIC POLAR PROTEIN STR...ENHANCED POPULATION BASED ANT COLONY FOR THE 3D HYDROPHOBIC POLAR PROTEIN STR...
ENHANCED POPULATION BASED ANT COLONY FOR THE 3D HYDROPHOBIC POLAR PROTEIN STR...
 
Pep Talk San Diego 011311
Pep Talk San Diego 011311Pep Talk San Diego 011311
Pep Talk San Diego 011311
 
Homology modeling
Homology modelingHomology modeling
Homology modeling
 
PaulZRXie_Nucl. Acids Res.-2013
PaulZRXie_Nucl. Acids Res.-2013PaulZRXie_Nucl. Acids Res.-2013
PaulZRXie_Nucl. Acids Res.-2013
 
E0362430
E0362430E0362430
E0362430
 
Determining stable ligand orientation
Determining stable ligand orientationDetermining stable ligand orientation
Determining stable ligand orientation
 
Cadd and molecular modeling for M.Pharm
Cadd and molecular modeling for M.PharmCadd and molecular modeling for M.Pharm
Cadd and molecular modeling for M.Pharm
 
HMM’S INTERPOLATION OF PROTIENS FOR PROFILE ANALYSIS
HMM’S INTERPOLATION OF PROTIENS FOR PROFILE ANALYSISHMM’S INTERPOLATION OF PROTIENS FOR PROFILE ANALYSIS
HMM’S INTERPOLATION OF PROTIENS FOR PROFILE ANALYSIS
 
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
 
Bind database
Bind databaseBind database
Bind database
 
BCSRCv1.3
BCSRCv1.3BCSRCv1.3
BCSRCv1.3
 
Research Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and ScienceResearch Inventy : International Journal of Engineering and Science
Research Inventy : International Journal of Engineering and Science
 
MULISA : A New Strategy for Discovery of Protein Functional Motifs and Residues
MULISA : A New Strategy for Discovery of Protein Functional Motifs and ResiduesMULISA : A New Strategy for Discovery of Protein Functional Motifs and Residues
MULISA : A New Strategy for Discovery of Protein Functional Motifs and Residues
 
RATIONAL DRUG DESIGN.pptx
RATIONAL DRUG DESIGN.pptxRATIONAL DRUG DESIGN.pptx
RATIONAL DRUG DESIGN.pptx
 
Structure based drug designing
Structure based drug designingStructure based drug designing
Structure based drug designing
 
Cadd assignment 4 (sarita)
Cadd assignment 4 (sarita)Cadd assignment 4 (sarita)
Cadd assignment 4 (sarita)
 
Ppi
PpiPpi
Ppi
 
Threading modeling methods
Threading modeling methodsThreading modeling methods
Threading modeling methods
 

Recently uploaded

Role of Mukta Pishti in the Management of Hyperthyroidism
Role of Mukta Pishti in the Management of HyperthyroidismRole of Mukta Pishti in the Management of Hyperthyroidism
Role of Mukta Pishti in the Management of Hyperthyroidism
Dr. Jyothirmai Paindla
 
Chapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptxChapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptx
Earlene McNair
 
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
Holistified Wellness
 
Tests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptxTests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptx
taiba qazi
 
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
rightmanforbloodline
 
Netter's Atlas of Human Anatomy 7.ed.pdf
Netter's Atlas of Human Anatomy 7.ed.pdfNetter's Atlas of Human Anatomy 7.ed.pdf
Netter's Atlas of Human Anatomy 7.ed.pdf
BrissaOrtiz3
 
The Best Ayurvedic Antacid Tablets in India
The Best Ayurvedic Antacid Tablets in IndiaThe Best Ayurvedic Antacid Tablets in India
The Best Ayurvedic Antacid Tablets in India
Swastik Ayurveda
 
A Classical Text Review on Basavarajeeyam
A Classical Text Review on BasavarajeeyamA Classical Text Review on Basavarajeeyam
A Classical Text Review on Basavarajeeyam
Dr. Jyothirmai Paindla
 
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
Donc Test
 
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
rightmanforbloodline
 
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptxREGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
LaniyaNasrink
 
Histololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptxHistololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptx
AyeshaZaid1
 
Muscles of Mastication by Dr. Rabia Inam Gandapore.pptx
Muscles of Mastication by Dr. Rabia Inam Gandapore.pptxMuscles of Mastication by Dr. Rabia Inam Gandapore.pptx
Muscles of Mastication by Dr. Rabia Inam Gandapore.pptx
Dr. Rabia Inam Gandapore
 
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấuK CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
HongBiThi1
 
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdfCHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
rishi2789
 
Journal Article Review on Rasamanikya
Journal Article Review on RasamanikyaJournal Article Review on Rasamanikya
Journal Article Review on Rasamanikya
Dr. Jyothirmai Paindla
 
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa CentralClinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
19various
 
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdfCHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
rishi2789
 
Efficacy of Avartana Sneha in Ayurveda
Efficacy of Avartana Sneha in AyurvedaEfficacy of Avartana Sneha in Ayurveda
Efficacy of Avartana Sneha in Ayurveda
Dr. Jyothirmai Paindla
 
Cell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune DiseaseCell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune Disease
Health Advances
 

Recently uploaded (20)

Role of Mukta Pishti in the Management of Hyperthyroidism
Role of Mukta Pishti in the Management of HyperthyroidismRole of Mukta Pishti in the Management of Hyperthyroidism
Role of Mukta Pishti in the Management of Hyperthyroidism
 
Chapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptxChapter 11 Nutrition and Chronic Diseases.pptx
Chapter 11 Nutrition and Chronic Diseases.pptx
 
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
8 Surprising Reasons To Meditate 40 Minutes A Day That Can Change Your Life.pptx
 
Tests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptxTests for analysis of different pharmaceutical.pptx
Tests for analysis of different pharmaceutical.pptx
 
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
TEST BANK For An Introduction to Brain and Behavior, 7th Edition by Bryan Kol...
 
Netter's Atlas of Human Anatomy 7.ed.pdf
Netter's Atlas of Human Anatomy 7.ed.pdfNetter's Atlas of Human Anatomy 7.ed.pdf
Netter's Atlas of Human Anatomy 7.ed.pdf
 
The Best Ayurvedic Antacid Tablets in India
The Best Ayurvedic Antacid Tablets in IndiaThe Best Ayurvedic Antacid Tablets in India
The Best Ayurvedic Antacid Tablets in India
 
A Classical Text Review on Basavarajeeyam
A Classical Text Review on BasavarajeeyamA Classical Text Review on Basavarajeeyam
A Classical Text Review on Basavarajeeyam
 
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
TEST BANK For Community Health Nursing A Canadian Perspective, 5th Edition by...
 
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
TEST BANK For Basic and Clinical Pharmacology, 14th Edition by Bertram G. Kat...
 
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptxREGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
REGULATION FOR COMBINATION PRODUCTS AND MEDICAL DEVICES.pptx
 
Histololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptxHistololgy of Female Reproductive System.pptx
Histololgy of Female Reproductive System.pptx
 
Muscles of Mastication by Dr. Rabia Inam Gandapore.pptx
Muscles of Mastication by Dr. Rabia Inam Gandapore.pptxMuscles of Mastication by Dr. Rabia Inam Gandapore.pptx
Muscles of Mastication by Dr. Rabia Inam Gandapore.pptx
 
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấuK CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
K CỔ TỬ CUNG.pdf tự ghi chép, chữ hơi xấu
 
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdfCHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
CHEMOTHERAPY_RDP_CHAPTER 6_Anti Malarial Drugs.pdf
 
Journal Article Review on Rasamanikya
Journal Article Review on RasamanikyaJournal Article Review on Rasamanikya
Journal Article Review on Rasamanikya
 
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa CentralClinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
Clinic ^%[+27633867063*Abortion Pills For Sale In Tembisa Central
 
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdfCHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
CHEMOTHERAPY_RDP_CHAPTER 1_ANTI TB DRUGS.pdf
 
Efficacy of Avartana Sneha in Ayurveda
Efficacy of Avartana Sneha in AyurvedaEfficacy of Avartana Sneha in Ayurveda
Efficacy of Avartana Sneha in Ayurveda
 
Cell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune DiseaseCell Therapy Expansion and Challenges in Autoimmune Disease
Cell Therapy Expansion and Challenges in Autoimmune Disease
 

Fit for fitting: Bionext-DSS, a dataset of similars sites

  • 1. BIONEXT-DSS CONSTRUCTION Four distinct elements are compiled for each test case: - The reference site (bin 0) belongs to a protein of pharma- ceutical interest and is systematically selected as the binding site of a ligand of pharmaceutical interest according to the Linpisky rules5. Many of the small ions, sugars, solvent or bulky HEME cofactors that can be found in Kahraman1 and Hoffmann2 datasets were for instance ruled out. - The positive sites were selected as sites containing the same ligand or a ligand very similar to that of the reference. - Similarity assessment (Binning) : each positive site was manually aligned with its reference site using PyMol6. Che- mical groups of positive sites matching those of the reference site were materialized by spheres (see Fig.1) and a visual assessment of the structural similarity of each positive site to its reference was then performed based on the number and relevance of such spheres. Although this classification is very subjective, we tried to conform to certain rules: Bin 1 contains sites very similar to the reference (up to the overall fold); Bin 2 includes sites with enough fluctuation whilst the global pattern is still retained; Bin 3 contains sites where the similarity is still visible yet getting blurred. In higher bins, few local similarities are still observed whilst the global pattern can be considered dissimilar. - Noise : we believe that control of negative structures is as important as control over positive structures when it comes to algorithms evaluation. Therefore, compared to previous works, we provide “noise” structures instead of using positives from other test cases. We achieved this by clustering structures in the Protein Data Bank (PDB) to 30% similarity. Structures quality was moreover taken into account (only structures with a resolution under 3 Å and chains containing at least 80 amino acids were retained). Construction of a dataset of true negative sites is difficult. Therefore we provide instead a list of “noise” structures built so as to be as exempt as possible of sites similar to a positive site. We ensured this by removing structures sharing Interpro annotation7 with any of the positive structures. FIT FOR FITTING: BIONEXT-DSS, A DATASET OF SIMILARS SITES / RESOURCE SUMMARY Predicting novel binding pockets for a given medicine is a challenging task with numerous pharmaceutical applications. Based on the principle that “similar binding sites share similar ligands”4 , many binding pocket pre- diction algorithms are based on the evaluation and retrieval of pockets being similar to a known binding site. Twodistinctyetconnectedproblematicscanhencebedescribed:(i)theabilitytoretrieveknownsitesforagivenligandindistinct proteins,and(ii)theretrievalofsimilarsites.Nowadays,manyprotocolsanddatasetsexistfortheevaluationofligandpocketpre- diction,mainlybasedonpioneeringworksfromKahraman1 andHoffmann2 .However,toourknowledge,nodatasetexistfor thespecificevaluationofsimilaritybasedalgorithmprobablybecausetheverynotionof“sitesimilarity”eludesastraightdefinition. Forthesereasonswecompiled“Bionext-DSS”,adatasetofsimilarbindingsiteswhichcontains35testcasesconnectedtoa pharmaceuticalapplication.Eachcaseincludesareferencesite,aswellasseveralpositivesitesfurtherclassifiedin6binsac- cordingtotheirsimilarityleveltothereference.Wemoreoverprovidealistofnoisestructurescarefullyselectedsoastobeused asnegativecontrolsinalgorithmsevaluation.Ourdatasetwouldbeusedtobenchourapproach(BioBind3 )toseveralothers. The Bionext-DSS dataset is freely available on our servers (https://goo.gl/n5rAOp). Fig.1 DSS: 35 innovative testcases to define site similarity in a pharmaceutical context. « © BIONEXT 2017. Technology developed in Strasbourg, France. Bionext may be registered trademarks or service marks of Bionext registered in many jurisdictions worldwide. This document is current as of the initial date of publication and may be changed by Bionext at any time. HTTPS:/ /BIOSIGHT.BIO NEXT.COM 01 Dataset Benchmark Site similarity Bin classification according to eye-scored site similarity. The overall fold of the structure of two captopril binding proteins from test case are clearly dissimilar (cartoon repre- sentation). Nevertheless, when sites of both structures are superimposed (residues in line and ligands in stick, in the middle) atoms exhibit some similarity (materialized with spheres : nitrogen and oxygen) which denotes a Bin 2 according to our classification. 4DPR ( Bin 0 ) 4C2P ( Bin 2 )
  • 2. DSS can be used for the two aforementioned problematics : - In order to evaluate the effectiveness of similarity retrieval, one can use the reference site as query and probe the ability to retrieve positive sites versus noise through the measurement of the area under the ROC curve (AUC). - In order to evaluate the algorithms ability to retrieve all known pockets of a given ligand, one can use alternatively all positive sites of a test case as a query and calculate an average AUC. As a complement, the structural alignment of positive sites with the reference provided in DSS can be used to dis- criminate valid prediction of positive sites. FACTS AND COMPARISONS For comparisons we applied our method for the evalua- tion of site similarity to the sites of the Kahraman and Hoffmann datasets. As can be seen in Table 1, DSS contains a larger amount of test cases as well as a larger amount of positive sites. Fig. 2 moreover reveals that DSS contains a bigger pro- portion of trivial similarities (bins 1 and 2), making our dataset more suited to evaluate algorithms on their ability to detect similarity. Moreover, the amount of difficult cases (bins 3 to 5) in DSS is comparable to that in Kahraman and Hoffmann, which makes our dataset also fit for the assessment of the more classical problematics of ligand binding pocket retrieval. With this dataset, we propose an experimental definition of similarity compliant to the intuition of a pharmaceuti- cally trained bioinformatician. To our knowledge, this is the first effort in that direction, and we believe this work is necessary in order to tune similarity-based algorithms. Although our classification in similarity bins is necessarily subjective, experiments conducted with our in-house al- gorithm BioBind as well as with ProBis8 show that lower bins were retrieved first, thus confirming an agreement between our intuitive definition of similarity and these algorithms. © BIONEXT 2017. Technology developed in Strasbourg, France. Bionext may be registered trademarks or service marks of Bionext registered in many jurisdictions worldwide. This document is current as of the initial date of publication and may be changed by Bionext at any time. HTTPS:/ /BIOSIGHT.BIO NEXT.COM ABOUT BIONEXT BIONEXTisaFrenchbioinformaticscompanyfounded in 2009 and dedicated to the better understanding and treatment of various diseases. As such, BIONEXT focuses on pharmaceutical problematics with a ma- jor interest in guiding the prediction of biochemical compounds for pharmaceuticals targets. BIONEXT aims to accelerate and improve the effectiveness of drug development and biological research by developing programs and software-based tools to tackle problematics of prime importance in the phar- maceutical field, such as drug profile assessment or suggestion of drug repurposing. Our first application BioBind is based on the now accepted principle that similar receptors bind similar ligands. W W W.BIO NEXT.COM POSSIBLE USES OF BIONEXT-DSS Dataset Positives Sites Test cases Bionext-DSS Kahraman Hoffman 222 64 97 35 7 10 Table 1: Number of test cases and positive sites for the studied datasets. Fig. 2: Percentage of positive sites per bin for each dataset. Numbers on the top of each bar refer to the numbers of positive sites. Fig.3: Application of DSS to BioBind and ProBis algorithms. Dataset Benchmark Site similarity REFERENCES 1. KAHRAMAN, A. ET AL. (2007). SHAPE VARIATIO N IN PROTEIN BINDIN G PO CKETS AND THEIR LIGANDS. JOURNAL OF MOLECULAR BIOLO GY, 368(1), 283–301. 2. HOFFMAN N, B. ET AL. (2010). A NEW PROTEIN BINDIN G PO CKET SIMILARITY MEASURE BASED O N C OMPARISO N OF CLOUDS OF ATOMS IN 3D: APPLICATIO N TO LIGAND PREDICTIO N. BMC BIOINFORMATICS, 11, 99. 3. BIOBIND: ALG ORITHM PATENTED BY BIO NEXT TO C OMPARE AND CLASSIFY PHYSIC O-CHEMICAL SURFACES. CF OTHER BIO NEXT C OMMUNICATIO NS FOR MORE INFORMATIO N. 4. KLABUNDE, T. (2007). CHEMO GEN OMIC APPROACHES TO DRUG DISCOVERY: SIMILAR RECEPTORS BIND SIMILAR LIGANDS. BR J PHARMAC OL, 152(1), 5–7 5. LIPINSKI, C. A (2004). LEAD- AND DRUG-LIKE C OMPOUNDS: THE RULE-OF-FIVE REVOLUTIO N. DRUG DISC OV TODAY, DEC(4), 337-41 6. DELAN O W. (2002). THE PYMOL MOLECULAR GRAPHICS SYSTEM. SAN CARLOS,CA,USA: DELAN O SCIENTIFIC LLC. HTTP: / /PYMOL.SOURCEFORGE.NET . 7. ROBERT D.FIN N. (2016) INTERPRO IN 2017 - BEYO ND PROTEIN FAMILY AND DOMAIN AN N OTATIO NS. NUCLEIC ACIDS RES (2017), 45 (D1), D190-D199. 8. KO N C, J. ET AL. (2015) PROBIS-CHARMMIN G: WEB INTERFACE FOR PREDICTIO N AND OPTIMIZATIO N OF LIGANDS IN PROTEIN BINDING SITES. J. CHEM. INF. MODEL., 55, 2308-2314. 02