Christopher Southan, Elena Faccenda, Simon J. Harding, Joanna L. Sharman, Adam J. Pawson, and Jamie A. Davies
Centre for Integrative Physiology, The University of Edinburgh, EH8 9XD UK,
www.guidetopharmacology.org http://www.slideshare.net/cdsouthan/assessing-gtopdb-ligand-content-in-pubchem
Assessing the IUPHAR/BPS Guide to
PHARMACOLOGY ligand content in PubChem
INTRODUCTION
The utilities of these intersects are outlined below (in order of counts):
• CNER refers to “Chemical Named Entity Recognition” for the automated extraction of chemistry from patents by sources submitting to PubChem (of
which SureChEMBL is the largest at 16.3 million). This means that users can track most of our ligands back to early patent filings, which can often include
more SAR than eventually appeared in the papers.
• Our low overlap with DrugBank indicates that the two sources are complementary in bioactive compound selection (i.e. the OR union is 12605).
• The possibility of sourcing purchasable compounds is important for experimental pharmacologists. We have nearly an 80% overlap with the 64 million
vendor structures in PubChem, and similarity searches may pick up analogues where there is no exact match.
• The “BioAssay active” tag overlaps extensively with ChEMBL entries, but users can check for a range of activities for a ligand that may be additional to
the values we have extracted from selected papers.
• The MeSH term “pharmacological action” is useful but our impression is that NLM is falling behind in the PubChem indexing of this term.
• PDB ligand structures are valued database cross-references for many reasons.
• We have introduced a new feature that allows users to retrieve just our 1291 approved drug SID entries (Query “approved[Comment] AND
"IUPHAR/BPS Guide to PHARMACOLOGY"[SourceName]”). The “PubChem Same Compound” select then generates 1174 small-molecule CIDs. This
facilitates different types of comparative analysis between drug lists.
• As expected, our overlap with ChEMBL structures is high but we have captured 1147 structures not in this source, mainly due to different journal capture
and shorter release cycles.
• The selection “unique to GtoPdb” indicates those CIDs where we are the only source in the whole of PubChem. These are predominantly novel
structures we have extracted from papers but in some cases we have selected a different structure from other sources.
• There may be interest in which pharmacologically active peptides we have CIDs for. A simple Mw-cut isolates 178 entries.
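The intersects listed above are Boolean combinations of CID sets. A minimal sketch in Python shows how the overlap, OR union and "unique to source" counts relate to each other. The CID values and the helper name `compare_sources` are made up for illustration and are not real GtoPdb or DrugBank content.

```python
def compare_sources(ours, other):
    """Return overlap, OR-union and unique-to-ours counts for two CID sets."""
    overlap = ours & other   # AND: CIDs present in both sources
    union = ours | other     # OR: CIDs present in either source
    unique = ours - other    # NOT: CIDs present only in our set
    return {
        "overlap": len(overlap),
        "union": len(union),
        "unique_to_ours": len(unique),
    }


# Hypothetical CIDs, for illustration only
gtopdb_cids = {101, 102, 103, 104, 105}
drugbank_cids = {104, 105, 106}

counts = compare_sources(gtopdb_cids, drugbank_cids)
print(counts)
```

A low overlap relative to the union, as in the DrugBank comparison, is what indicates that two sources are complementary.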
Further details related to the intersects above are given in this GtoPdb blog post https://blog.guidetopharmacology.org/2016/10/31/gtopdb-ligands-in-pubchem/.
This post about PubChem sources in general may also be of interest https://cdsouthan.blogspot.se/2016/06/pubchem-source-of-month.html.
Reference [1]: “The IUPHAR/BPS Guide to PHARMACOLOGY in 2016: towards curated quantitative interactions between 1300 protein targets and 6000
ligands”. Southan et al., Nucleic Acids Research, 2016 Jan 4;44(D1): Database Issue, D1054-68, PMID: 2646443
The International Union of Basic and Clinical Pharmacology and British
Pharmacological Society (IUPHAR/BPS) Guide to PHARMACOLOGY
database (GtoPdb) and its precursor IUPHAR-DB have been capturing
the structures of pharmacologically relevant ligands since 2005 [1].
The snapshot on the right shows our eight-category ligand
classification. In an active collaboration with the PubChem team, we
have submitted our ligand records for every GtoPdb release since
2012. For release 2016.4 (October) the query ("IUPHAR/BPS Guide
to PHARMACOLOGY"[SourceName]) retrieves 8674 Substance
Identifiers (SIDs) and 6565 Compound Identifiers (CIDs). The excess
of 2109 SIDs is accounted for by antibodies, small proteins and larger
peptides that cannot form CIDs. With just over 92 million CIDs from
473 sources, a range of property filters and full Boolean operations for
combining query sets, PubChem provides an opportunity to “slice and
dice” our ligand set in comparative and informative ways. Just a small
set of example results is shown below.
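The queries quoted on this poster can also be expressed as NCBI E-utilities ESearch requests. The sketch below only builds the URLs and does not send them. It assumes the Entrez database names `pcsubstance` and `pccompound` and that the `[SourceName]` field behaves there as it does in the web interface. The function name `esearch_count_url` is ours, and live counts will differ from the 2016.4 figures.

```python
from urllib.parse import urlencode, urlparse, parse_qs

ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"
SOURCE_QUERY = '"IUPHAR/BPS Guide to PHARMACOLOGY"[SourceName]'
APPROVED_QUERY = "approved[Comment] AND " + SOURCE_QUERY


def esearch_count_url(db, term):
    """Build (but do not send) an ESearch URL that asks only for a hit count."""
    params = {"db": db, "term": term, "rettype": "count"}
    return ESEARCH + "?" + urlencode(params)


# Assumed Entrez database names for PubChem substances and compounds
sid_url = esearch_count_url("pcsubstance", SOURCE_QUERY)
cid_url = esearch_count_url("pccompound", SOURCE_QUERY)
approved_sid_url = esearch_count_url("pcsubstance", APPROVED_QUERY)

# Counts reported on the poster for release 2016.4
sids, cids = 8674, 6565
excess_sids = sids - cids  # substances without a compound record

print(sid_url)
print(excess_sids)  # 2109
```

Building the query string with `urlencode` keeps the quotes, slash and square brackets of the source name correctly escaped.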
RESULTS

ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
Unveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdfUnveiling the Energy Potential of Marshmallow Deposits.pdf
Unveiling the Energy Potential of Marshmallow Deposits.pdf
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
Richard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlandsRichard's aventures in two entangled wonderlands
Richard's aventures in two entangled wonderlands
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 

Assessing GtoPdb ligand content in PubChem

Christopher Southan, Elena Faccenda, Simon J. Harding, Joanna L. Sharman, Adam J. Pawson, and Jamie A. Davies, Centre for Integrative Physiology, The University of Edinburgh, EH8 9XD UK,
www.guidetopharmacology.org http://www.slideshare.net/cdsouthan/assessing-gtopdb-ligand-content-in-pubchem

Assessing the IUPHAR/BPS Guide to PHARMACOLOGY ligand content in PubChem

INTRODUCTION

The utilities of these intersects are outlined below (in order of counts):
• CNER refers to “Chemical Named Entity Recognition” for the automated extraction of chemistry from patents by sources submitting to PubChem (of which SureChEMBL is the largest at 16.3 million). This means that users can track back most of our ligands to early patent filings that can often include more SAR than eventually appeared in the papers.
• Our low overlap with DrugBank indicates both sources are complementary in bioactive compound selection (i.e. the OR union is 12605).
• The possibility of sourcing purchasable compounds is important for experimental pharmacologists. From the 64 million vendor structures in PubChem we have nearly an 80% overlap and similarity searches may pick up analogues where there is no exact match.
• The “BioAssay active” tag overlaps extensively with ChEMBL entries but users can check for a range of activities for a ligand that may be additional to the values we have extracted from selected papers.
• The MeSH term “pharmacological action” is useful but our impression is that NLM is falling behind in the PubChem indexing of this term.
• PDB ligand structures are valued database cross-references for many reasons.
• We have introduced a new feature that allows users to retrieve just our 1291 approved drug SID entries (Query “approved[Comment] AND "IUPHAR/BPS Guide to PHARMACOLOGY"[SourceName]”). The “PubChem Same Compound” select then generates 1174 small-molecule CIDs. This facilitates different types of comparative analysis between drug lists.
• As expected, our overlap with ChEMBL structures is high but we have captured 1147 structures not in this source, mainly due to different journal capture and shorter release cycles.
• The selection “unique to GtoPdb” indicates those CIDs where we are the only source in the whole of PubChem. These are predominantly novel structures we have extracted from papers but in some cases we have selected a different structure from other sources.
• There may be interest in which pharmacologically active peptides we have CIDs for. A simple Mw-cut isolates 178 entries.

Further details related to the intersects above are given in this GtoPdb blog post: https://blog.guidetopharmacology.org/2016/10/31/gtopdb-ligands-in-pubchem/. This post about PubChem sources in general may also be of interest: https://cdsouthan.blogspot.se/2016/06/pubchem-source-of-month.html.

Reference [1]: “The IUPHAR/BPS Guide to PHARMACOLOGY in 2016: towards curated quantitative interactions between 1300 protein targets and 6000 ligands”. Southan et al, Nucleic Acids Research, 2016 Jan 4;44(D1): Database Issue, D1054-68, PMID: 2646443

The International Union of Basic and Clinical Pharmacology and British Pharmacological Society (IUPHAR/BPS) Guide to PHARMACOLOGY database (GtoPdb) and its precursor IUPHAR-DB have been capturing the structures of pharmacologically relevant ligands since 2005 [1]. The snapshot on the right shows our eight-category ligand classification. As an active collaboration with the PubChem team, we have submitted our ligand records for every GtoPdb release since 2012. For release 2016.4 (October) the query ("IUPHAR/BPS Guide to PHARMACOLOGY"[SourceName]) retrieves 8674 Substance Identifiers (SIDs) and 6565 Compound Identifiers (CIDs). The excess of 2109 SIDs is accounted for by antibodies, small proteins and larger peptides that cannot form CIDs.
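As a rough illustration of how the source query quoted above might be issued programmatically, the sketch below builds NCBI E-utilities ESearch URLs for the PubChem Substance database. The poster only describes queries typed into the PubChem search interface; the use of E-utilities, the helper name build_esearch_url, and the choice of parameters are assumptions made here, and live counts will differ from the 2016.4 figures.

```python
from urllib.parse import urlencode

# Assumption: the query is sent to the NCBI E-utilities ESearch endpoint,
# with "pcsubstance" as the Entrez database name for PubChem Substance.
EUTILS_ESEARCH = "https://eutils.ncbi.nlm.nih.gov/entrez/eutils/esearch.fcgi"

# Source filter exactly as quoted in the poster text.
GTOPDB_SOURCE = '"IUPHAR/BPS Guide to PHARMACOLOGY"[SourceName]'


def build_esearch_url(term, db="pcsubstance"):
    """Return an ESearch URL that asks only for the hit count (retmax=0).

    Hypothetical helper for illustration; it performs no network access.
    """
    params = {"db": db, "term": term, "retmode": "json", "retmax": 0}
    return f"{EUTILS_ESEARCH}?{urlencode(params)}"


# All GtoPdb substance records, and the approved-drug subset from the poster.
all_sids_url = build_esearch_url(GTOPDB_SOURCE)
approved_sids_url = build_esearch_url(f"approved[Comment] AND {GTOPDB_SOURCE}")

# Arithmetic check on the release 2016.4 counts quoted in the text.
sid_count, cid_count = 8674, 6565
excess_sids = sid_count - cid_count  # SIDs in excess of CIDs

print(all_sids_url)
print(approved_sids_url)
print(excess_sids)
```

Requesting only the count keeps the response small; retrieving the identifiers themselves would need a larger retmax or the Entrez history server.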
With just over 92 million CIDs from 473 sources, a range of property filters, and full Boolean operations for combining query sets, PubChem provides an opportunity to “slice and dice” our ligand set in comparative and informative ways. Just a small set of example results is shown below.

RESULTS
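The Boolean combinations of query sets described above (overlap, "not in this source", OR union, and "unique to GtoPdb") correspond to ordinary set operations. The sketch below shows them on made-up CID lists; the identifiers and source contents are invented for illustration and are not real PubChem data.

```python
# Toy CID sets; every value here is a made-up placeholder, not a real record.
gtopdb = {101, 102, 103, 104, 105}
chembl = {102, 103, 104, 201}
drugbank = {104, 105, 301}

# AND: structures shared by GtoPdb and ChEMBL.
in_chembl = gtopdb & chembl

# NOT: GtoPdb structures absent from ChEMBL.
not_in_chembl = gtopdb - chembl

# OR: the union of GtoPdb and DrugBank.
gtopdb_or_drugbank = gtopdb | drugbank

# "Unique to GtoPdb": CIDs that no other source in this toy example holds.
unique_to_gtopdb = gtopdb - (chembl | drugbank)

# A union count follows inclusion-exclusion: |A or B| = |A| + |B| - |A and B|.
assert len(gtopdb_or_drugbank) == (
    len(gtopdb) + len(drugbank) - len(gtopdb & drugbank)
)

print(sorted(in_chembl))
print(sorted(not_in_chembl))
print(len(gtopdb_or_drugbank))
print(sorted(unique_to_gtopdb))
```

The same inclusion-exclusion relation explains why a low overlap between two sources gives an OR union close to the sum of their individual counts.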