SlideShare a Scribd company logo
1 of 16
www.guidetopharmacology.org
Virtues and vicissitudes of curatorial data wrangling:
the Guide to PHARMACOLGY experience
Christopher Southan, Adam J. Pawson, Joanna L. Sharman, Elena
Faccenda, Simon Harding, Jamie Davis, IUPHAR/BPS Guide to
PHARMACOLOGY, Centre for Integrative Physiology, University of
Edinburgh
Sun, Mar 13, 2016 CINF 6:Tomayto vs. Tomahto: Overcoming
Incompatibilities in Scientific Data 8:30 AM - 12:00 PM
1
http://www.slideshare.net/cdsouthan/curatorial-data-wrangling-for-the-guide-to-pharmacolgy
Abstract (will be skipped for presenting)
2
A wide range of valuable databases, both academic and commercial, use the curation
model to extract and standardise selected result sets from the literature. This is the
classic unstructured-to-structured transformation, predominantly of target binding data
(e.g. IC50, Ki or Kd) between ligands and targets. Since 2009 the Guide to
PHARMACOLOG (GoPdb) has now curated quantitative interactions between 1300
protein targets and 6000 ligands covering a substantial proportion of the druggable
proteome. The team has thus considerable expertise in the challenges of
standardisation. This needs to be interposed between not only the primary literature
but also other the databases that have extracted data. The wide range of compatibility
and other issues associated with selecting new content for GtoPdb will be outlined.
These will include the problem of equivocal measurement units as well as non-
standard chemical IDs, protein IDs being used by authors. The issue of data gaps will
also be expanded on. The presentation will conclude with an assessment of new
initiatives from several publishers for authors to mark-up their chemical compounds
before publication.
Outline
• Introducing our dataset
• Standardising entities
• Protein differences
• Chemistry
• The units case for BIA 10-2747
• Author mark-up
3
GtoPdb content
4
Human targets
Ligands
Standardising entities:
a major challenge of expert document extraction
• Judgment vs author primacy (e.g. do we transform/round units or fix errors?)
• Which document is it?
• Doi vs PubMed vs European PubMed Central
• Primary vs secondary literature
• Patent no format (WO- vs WO) kind codes (A1 B2) patent families
• Slide sets – is url persistent?
• Is the ligand chemically definable?
• Small molecule (PubChem CID, isomeric SMILES, InChIKey)
• A provenanced name-to-struc mapping for the company code name(s)
• Peptide (IUPAC, SMILES/InChI, sequence, HELM, Swiss-Prot feature line)
• Protein (clinical antibody, IMGT sequence, patent sequence, UniParc ID)
• Can the target be given a protein ID(s)?
• And the species was?
• Protein name/synonym-to-ID-to-sequence
• Was an alternative splice form or sequence variant specified?
• Should a complex be specified? (NC-IUPHAR complex, ChEMBL group, name
e.g. gamma secretase)
5
Protein ID and name standardisation:
philosophical and technical discordance between the pipelines
The 7-way concordance shrinks the proteome by 9%
6
UniProt + human = 151,569
UniProt + human + Swiss-Prot = 20,198
+ neXtProt = 20,040
+ HGNC = 19,836
+ Ensembl = 18,933
+ CCDS = 18,286
+ Entrez Gene ID = 18,245
+ RefSeq = 18,244
+ Evidence at protein level = 14,065
Protein
standardisation:
transatlantic
sequence
differences
UniProtKB - P56817 vs
NP_036236.1
7
Protein standardisation:
transatlantic (and intra-Europe) naming and splice variants
• Hinxton (HGNC) approved (gene) name: beta-site APP-cleaving enzyme 1
• Geneva (Swiss-Prot) recommended (protein) name: Beta-secretase 1
• Bethesda RefSeq NP_036236 beta-secretase 1 isoform A preproprotein
8
Making chemical curation easy:
Amgen giveth in their papers (IUPAC name in the title)
9
Making chemical curation difficult:
Amgen taketh away in their patents…
10
Measurement units: comparing with BIA-10-2474
• Suddenly (from Jan 15th 2016) there was serious need for
standardised comparative affinity values for FAAH inhibitors
• Bial 10-2474 in vitro results in WO2010074588 are 27.8 % inhibition
at 100nM in rat brain extracts
• J&J paper reports inactivation for enzyme but lists IC50
• Pfizer report (k(inact)/K(i) and IC(50) values of 40300 M(-1) s(-1) and
7.2 nM for PF-04457845
• V158866 from Vernalis; no structure (blinded) no papers, no affinity
data, no Phase II results
• Absence of normalised comparators for 10-2474 SAR compromises
in silico modeling and prediction studies
11
Measurement units:
Authors/inventors should read off their own affinity values
12
rather than us
having to
Will author mark-up overcome tomayto vs. tomahto?
• No – but it would help a lot
• Authors not always as au fait with their own entities compared to
biocurators
• Needs strong publisher/editor mandate and referee buy-in
• Automatic mark-up is useful for triage but can produce a dogs breakfast
of false-positives
• Initiatives underway from, Nature Chem. Biol. ACS J.Med Chem, Wiley
Brit. J. Pharmacol and others (see
http://cdsouthan.blogspot.com/2015/08/joining-chemistry-between-
journals-and.html)
• Need to accelerate in crucial areas (e.g. Zika and Dengue virus)
• Not only useful if authors cite their SAR patents but also push-back
against their attorneys to prevent data relationship obfuscation
13
Author-mark example: J Med Chem
14
Author/curator collaboration model: Brit J Pharmacol
15
References, acknowledgments and questions
16
http://www.ncbi.nlm.nih.gov/pubmed/24234439
http://www.ncbi.nlm.nih.gov/pubmed/23159359
GtoPdb FAQ section (explains curation guidelines)
http://www.guidetopharmacology.org/faq.jsp
http://cdsouthan.blogspot.com/2016/01/the-unfortunate-case-of-bia-10-2474.html
BIA 10-2747 story

More Related Content

What's hot

Assessing GtoPdb ligand content in PubChem
Assessing GtoPdb ligand content in PubChemAssessing GtoPdb ligand content in PubChem
Assessing GtoPdb ligand content in PubChemChris Southan
 
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...open_phacts
 
Will the correct drugs please stand up?
Will  the correct drugs please stand up?Will  the correct drugs please stand up?
Will the correct drugs please stand up?Chris Southan
 
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Frederik van den Broek
 
Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...
Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...
Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...Frederik van den Broek
 
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updatesThe IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updatesGuide to PHARMACOLOGY
 
Connecting antimalarial data
Connecting antimalarial dataConnecting antimalarial data
Connecting antimalarial dataChris Southan
 
GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019Guide to PHARMACOLOGY
 
Navigating between publications and databases for drug discovery: IUPHAR/BPS ...
Navigating between publications and databases for drug discovery: IUPHAR/BPS ...Navigating between publications and databases for drug discovery: IUPHAR/BPS ...
Navigating between publications and databases for drug discovery: IUPHAR/BPS ...Guide to PHARMACOLOGY
 
Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019Guide to PHARMACOLOGY
 
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019Guide to PHARMACOLOGY
 
De-siloing data and building knowledge graphs outside of drug discovery: Oppo...
De-siloing data and building knowledge graphs outside of drug discovery: Oppo...De-siloing data and building knowledge graphs outside of drug discovery: Oppo...
De-siloing data and building knowledge graphs outside of drug discovery: Oppo...Frederik van den Broek
 
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...Frederik van den Broek
 
BigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigData_Europe
 
GtoPdb: A resource for cell-based perturbogens
GtoPdb:  A resource for cell-based perturbogensGtoPdb:  A resource for cell-based perturbogens
GtoPdb: A resource for cell-based perturbogensChris Southan
 
Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020Guide to PHARMACOLOGY
 
Sorting bioactive wheat from database chaff: Challenges of discerning correct...
Sorting bioactive wheat from database chaff: Challenges of discerning correct...Sorting bioactive wheat from database chaff: Challenges of discerning correct...
Sorting bioactive wheat from database chaff: Challenges of discerning correct...Guide to PHARMACOLOGY
 
Sorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaffSorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaffChris Southan
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europeopen_phacts
 
Searching for chemical information using PubChem
Searching for chemical information using PubChemSearching for chemical information using PubChem
Searching for chemical information using PubChemSunghwan Kim
 

What's hot (20)

Assessing GtoPdb ligand content in PubChem
Assessing GtoPdb ligand content in PubChemAssessing GtoPdb ligand content in PubChem
Assessing GtoPdb ligand content in PubChem
 
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
 
Will the correct drugs please stand up?
Will  the correct drugs please stand up?Will  the correct drugs please stand up?
Will the correct drugs please stand up?
 
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
 
Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...
Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...
Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...
 
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updatesThe IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates
 
Connecting antimalarial data
Connecting antimalarial dataConnecting antimalarial data
Connecting antimalarial data
 
GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019
 
Navigating between publications and databases for drug discovery: IUPHAR/BPS ...
Navigating between publications and databases for drug discovery: IUPHAR/BPS ...Navigating between publications and databases for drug discovery: IUPHAR/BPS ...
Navigating between publications and databases for drug discovery: IUPHAR/BPS ...
 
Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019
 
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
 
De-siloing data and building knowledge graphs outside of drug discovery: Oppo...
De-siloing data and building knowledge graphs outside of drug discovery: Oppo...De-siloing data and building knowledge graphs outside of drug discovery: Oppo...
De-siloing data and building knowledge graphs outside of drug discovery: Oppo...
 
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
 
BigDataEurope - Big Data & Health
BigDataEurope - Big Data & HealthBigDataEurope - Big Data & Health
BigDataEurope - Big Data & Health
 
GtoPdb: A resource for cell-based perturbogens
GtoPdb:  A resource for cell-based perturbogensGtoPdb:  A resource for cell-based perturbogens
GtoPdb: A resource for cell-based perturbogens
 
Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020
 
Sorting bioactive wheat from database chaff: Challenges of discerning correct...
Sorting bioactive wheat from database chaff: Challenges of discerning correct...Sorting bioactive wheat from database chaff: Challenges of discerning correct...
Sorting bioactive wheat from database chaff: Challenges of discerning correct...
 
Sorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaffSorting bioactive wheat from database chaff
Sorting bioactive wheat from database chaff
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Searching for chemical information using PubChem
Searching for chemical information using PubChemSearching for chemical information using PubChem
Searching for chemical information using PubChem
 

Viewers also liked

Module 7 pharmacology
Module 7 pharmacologyModule 7 pharmacology
Module 7 pharmacologybeccamori
 
Pharmacology basic concept
Pharmacology basic conceptPharmacology basic concept
Pharmacology basic conceptDr Vinay Gupta
 
B A S I C S O F Pharmacology
B A S I C S  O F  PharmacologyB A S I C S  O F  Pharmacology
B A S I C S O F PharmacologyBob Bin
 
Introduction Of Pharmacology
Introduction Of PharmacologyIntroduction Of Pharmacology
Introduction Of Pharmacologyshabeel pn
 
16 Pharmacology.ppt
16 Pharmacology.ppt16 Pharmacology.ppt
16 Pharmacology.pptShama
 
Pharmacology: Class Session 1 and 2 Introduction to Pharmacology
Pharmacology: Class Session 1 and 2 Introduction to PharmacologyPharmacology: Class Session 1 and 2 Introduction to Pharmacology
Pharmacology: Class Session 1 and 2 Introduction to PharmacologyMariaJose2001
 
Introduction to basic principles of pharmacology
Introduction to basic principles of pharmacologyIntroduction to basic principles of pharmacology
Introduction to basic principles of pharmacologyBalmukund Thakkar
 
How to Make Awesome SlideShares: Tips & Tricks
How to Make Awesome SlideShares: Tips & TricksHow to Make Awesome SlideShares: Tips & Tricks
How to Make Awesome SlideShares: Tips & TricksSlideShare
 
Getting Started With SlideShare
Getting Started With SlideShareGetting Started With SlideShare
Getting Started With SlideShareSlideShare
 

Viewers also liked (11)

Module 7 pharmacology
Module 7 pharmacologyModule 7 pharmacology
Module 7 pharmacology
 
Pharmacology basic concept
Pharmacology basic conceptPharmacology basic concept
Pharmacology basic concept
 
B A S I C S O F Pharmacology
B A S I C S  O F  PharmacologyB A S I C S  O F  Pharmacology
B A S I C S O F Pharmacology
 
Introduction Of Pharmacology
Introduction Of PharmacologyIntroduction Of Pharmacology
Introduction Of Pharmacology
 
16 Pharmacology.ppt
16 Pharmacology.ppt16 Pharmacology.ppt
16 Pharmacology.ppt
 
Pharmacology: Class Session 1 and 2 Introduction to Pharmacology
Pharmacology: Class Session 1 and 2 Introduction to PharmacologyPharmacology: Class Session 1 and 2 Introduction to Pharmacology
Pharmacology: Class Session 1 and 2 Introduction to Pharmacology
 
INTRODUCTION TO PHARMACOLOGY
INTRODUCTION TO PHARMACOLOGYINTRODUCTION TO PHARMACOLOGY
INTRODUCTION TO PHARMACOLOGY
 
Pharmacology
PharmacologyPharmacology
Pharmacology
 
Introduction to basic principles of pharmacology
Introduction to basic principles of pharmacologyIntroduction to basic principles of pharmacology
Introduction to basic principles of pharmacology
 
How to Make Awesome SlideShares: Tips & Tricks
How to Make Awesome SlideShares: Tips & TricksHow to Make Awesome SlideShares: Tips & Tricks
How to Make Awesome SlideShares: Tips & Tricks
 
Getting Started With SlideShare
Getting Started With SlideShareGetting Started With SlideShare
Getting Started With SlideShare
 

Similar to Curatorial data wrangling for the Guide to PHARMACOLGY

Peptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdbPeptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdbChris Southan
 
Peptide Tribulations
Peptide TribulationsPeptide Tribulations
Peptide TribulationsChris Southan
 
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-upopen_phacts
 
Estimating bioactivity database error rates, tiikkainen
Estimating bioactivity database error rates, tiikkainenEstimating bioactivity database error rates, tiikkainen
Estimating bioactivity database error rates, tiikkainenPekka Tiikkainen
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAGopen_phacts
 
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5ChemAxon
 
The IUPHAR/MMV Guide to Malaria Pharmacology
The  IUPHAR/MMV Guide to Malaria Pharmacology  The  IUPHAR/MMV Guide to Malaria Pharmacology
The IUPHAR/MMV Guide to Malaria Pharmacology Chris Southan
 
Druggable Proteome sources in UniProt
Druggable Proteome sources in UniProtDruggable Proteome sources in UniProt
Druggable Proteome sources in UniProtChris Southan
 
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...European Data Forum
 
Computational prediction of antimicrobial peptide activity
Computational prediction of antimicrobial peptide activityComputational prediction of antimicrobial peptide activity
Computational prediction of antimicrobial peptide activityThet Su Win
 
Pistoia Alliance webinar on Antibody structures in the PDB
Pistoia Alliance webinar on Antibody structures in the PDBPistoia Alliance webinar on Antibody structures in the PDB
Pistoia Alliance webinar on Antibody structures in the PDBPistoia Alliance
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016Sean Ekins
 
Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Greg Landrum
 
2014-03-20 Open PHACTS - A Data Platform for Drug Discovery
2014-03-20 Open PHACTS - A Data Platform for Drug Discovery2014-03-20 Open PHACTS - A Data Platform for Drug Discovery
2014-03-20 Open PHACTS - A Data Platform for Drug Discoveryopen_phacts
 
Unc slides on computational toxicology
Unc slides on computational toxicologyUnc slides on computational toxicology
Unc slides on computational toxicologySean Ekins
 
Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Syed Muhammad Ali Hasnain
 
Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014LushPrize
 

Similar to Curatorial data wrangling for the Guide to PHARMACOLGY (20)

Peptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdbPeptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdb
 
Peptide Tribulations
Peptide TribulationsPeptide Tribulations
Peptide Tribulations
 
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
 
Practical semantics in the pharmaceutical industry - the Open PHACTS project
Practical semantics in the pharmaceutical industry - the Open PHACTS projectPractical semantics in the pharmaceutical industry - the Open PHACTS project
Practical semantics in the pharmaceutical industry - the Open PHACTS project
 
Estimating bioactivity database error rates, tiikkainen
Estimating bioactivity database error rates, tiikkainenEstimating bioactivity database error rates, tiikkainen
Estimating bioactivity database error rates, tiikkainen
 
2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG2011-11-28 Open PHACTS at RSC CICAG
2011-11-28 Open PHACTS at RSC CICAG
 
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
Pasteur Institute User Story - Cheminfo Stories 2020 Day 5
 
The IUPHAR/MMV Guide to Malaria Pharmacology
The  IUPHAR/MMV Guide to Malaria Pharmacology  The  IUPHAR/MMV Guide to Malaria Pharmacology
The IUPHAR/MMV Guide to Malaria Pharmacology
 
Druggable Proteome sources in UniProt
Druggable Proteome sources in UniProtDruggable Proteome sources in UniProt
Druggable Proteome sources in UniProt
 
SLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
SLAS Screen Design and Assay Technology SIG: SLAS2013 PresentationSLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
SLAS Screen Design and Assay Technology SIG: SLAS2013 Presentation
 
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
EDF2014: Paul Groth, Department of Computer Science & The Network Institute, ...
 
Computational prediction of antimicrobial peptide activity
Computational prediction of antimicrobial peptide activityComputational prediction of antimicrobial peptide activity
Computational prediction of antimicrobial peptide activity
 
Pistoia Alliance webinar on Antibody structures in the PDB
Pistoia Alliance webinar on Antibody structures in the PDBPistoia Alliance webinar on Antibody structures in the PDB
Pistoia Alliance webinar on Antibody structures in the PDB
 
Slas talk 2016
Slas talk 2016Slas talk 2016
Slas talk 2016
 
Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...Reproducibility in cheminformatics and computational chemistry research: cert...
Reproducibility in cheminformatics and computational chemistry research: cert...
 
2014-03-20 Open PHACTS - A Data Platform for Drug Discovery
2014-03-20 Open PHACTS - A Data Platform for Drug Discovery2014-03-20 Open PHACTS - A Data Platform for Drug Discovery
2014-03-20 Open PHACTS - A Data Platform for Drug Discovery
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
Unc slides on computational toxicology
Unc slides on computational toxicologyUnc slides on computational toxicology
Unc slides on computational toxicology
 
Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...
 
Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014Grafström - Lush Prize Conference 2014
Grafström - Lush Prize Conference 2014
 

More from Chris Southan

FAIR connectivity for DARCP
FAIR  connectivity for DARCPFAIR  connectivity for DARCP
FAIR connectivity for DARCPChris Southan
 
Connectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivityConnectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivityChris Southan
 
Peptide tribulations
Peptide tribulationsPeptide tribulations
Peptide tribulationsChris Southan
 
Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Chris Southan
 
Guide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updaeGuide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updaeChris Southan
 
In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentChris Southan
 
Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?Chris Southan
 
Desperately seeking DARCP
Desperately seeking DARCPDesperately seeking DARCP
Desperately seeking DARCPChris Southan
 
Seeking glimmers of light in Pharos “Tdark” proteins
Seeking glimmers of light in  Pharos “Tdark” proteinsSeeking glimmers of light in  Pharos “Tdark” proteins
Seeking glimmers of light in Pharos “Tdark” proteinsChris Southan
 
5HT2A modulators update for SAFER
5HT2A modulators update for SAFER5HT2A modulators update for SAFER
5HT2A modulators update for SAFERChris Southan
 
Quality and noise in big chemistry databases
Quality and noise in big chemistry databasesQuality and noise in big chemistry databases
Quality and noise in big chemistry databasesChris Southan
 
Connecting chemistry-to-biology
Connecting chemistry-to-biology Connecting chemistry-to-biology
Connecting chemistry-to-biology Chris Southan
 
GtoPdb June 2019 poster
GtoPdb June 2019 posterGtoPdb June 2019 poster
GtoPdb June 2019 posterChris Southan
 
PubChem as a source of systems biology perturbagens
PubChem as a source of  systems biology perturbagensPubChem as a source of  systems biology perturbagens
PubChem as a source of systems biology perturbagensChris Southan
 
PubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biologyPubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biologyChris Southan
 
Will the real proteins please stand up
Will the real proteins please stand upWill the real proteins please stand up
Will the real proteins please stand upChris Southan
 
Looking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIRLooking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIRChris Southan
 
Guide to Immunopharmacology update
Guide to Immunopharmacology updateGuide to Immunopharmacology update
Guide to Immunopharmacology updateChris Southan
 
Pub Med to PubChem Connectivity
Pub Med to PubChem ConnectivityPub Med to PubChem Connectivity
Pub Med to PubChem ConnectivityChris Southan
 

More from Chris Southan (20)

FAIR connectivity for DARCP
FAIR  connectivity for DARCPFAIR  connectivity for DARCP
FAIR connectivity for DARCP
 
Connectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivityConnectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivity
 
Peptide tribulations
Peptide tribulationsPeptide tribulations
Peptide tribulations
 
Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2
 
Guide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updaeGuide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updae
 
In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug Development
 
Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?
 
Desperately seeking DARCP
Desperately seeking DARCPDesperately seeking DARCP
Desperately seeking DARCP
 
Seeking glimmers of light in Pharos “Tdark” proteins
Seeking glimmers of light in  Pharos “Tdark” proteinsSeeking glimmers of light in  Pharos “Tdark” proteins
Seeking glimmers of light in Pharos “Tdark” proteins
 
5HT2A modulators update for SAFER
5HT2A modulators update for SAFER5HT2A modulators update for SAFER
5HT2A modulators update for SAFER
 
Quality and noise in big chemistry databases
Quality and noise in big chemistry databasesQuality and noise in big chemistry databases
Quality and noise in big chemistry databases
 
Connecting chemistry-to-biology
Connecting chemistry-to-biology Connecting chemistry-to-biology
Connecting chemistry-to-biology
 
GtoPdb June 2019 poster
GtoPdb June 2019 posterGtoPdb June 2019 poster
GtoPdb June 2019 poster
 
PubChem as a source of systems biology perturbagens
PubChem as a source of  systems biology perturbagensPubChem as a source of  systems biology perturbagens
PubChem as a source of systems biology perturbagens
 
PubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biologyPubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biology
 
Will the real proteins please stand up
Will the real proteins please stand upWill the real proteins please stand up
Will the real proteins please stand up
 
Looking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIRLooking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIR
 
Guide to Immunopharmacology update
Guide to Immunopharmacology updateGuide to Immunopharmacology update
Guide to Immunopharmacology update
 
Patents in PubChem
Patents in PubChemPatents in PubChem
Patents in PubChem
 
Pub Med to PubChem Connectivity
Pub Med to PubChem ConnectivityPub Med to PubChem Connectivity
Pub Med to PubChem Connectivity
 

Recently uploaded

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Lokesh Kothari
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfNAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfWadeK3
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfmuntazimhurra
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfSELF-EXPLANATORY
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxkessiyaTpeter
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​kaibalyasahoo82800
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfnehabiju2046
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfSwapnil Therkar
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PPRINCE C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PPRINCE C P
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)PraveenaKalaiselvan1
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxSwapnil Therkar
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxAleenaTreesaSaji
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 

Recently uploaded (20)

Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdfNAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
NAVSEA PEO USC - Unmanned & Small Combatants 26Oct23.pdf
 
Biological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdfBiological Classification BioHack (3).pdf
Biological Classification BioHack (3).pdf
 
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdfBehavioral Disorder: Schizophrenia & it's Case Study.pdf
Behavioral Disorder: Schizophrenia & it's Case Study.pdf
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptxSOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
SOLUBLE PATTERN RECOGNITION RECEPTORS.pptx
 
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Munirka Delhi 💯Call Us 🔝8264348440🔝
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
A relative description on Sonoporation.pdf
A relative description on Sonoporation.pdfA relative description on Sonoporation.pdf
A relative description on Sonoporation.pdf
 
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdfAnalytical Profile of Coleus Forskohlii | Forskolin .pdf
Analytical Profile of Coleus Forskohlii | Forskolin .pdf
 
Artificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C PArtificial Intelligence In Microbiology by Dr. Prince C P
Artificial Intelligence In Microbiology by Dr. Prince C P
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
The Philosophy of Science
The Philosophy of ScienceThe Philosophy of Science
The Philosophy of Science
 
VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)Recombinant DNA technology (Immunological screening)
Recombinant DNA technology (Immunological screening)
 
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptxAnalytical Profile of Coleus Forskohlii | Forskolin .pptx
Analytical Profile of Coleus Forskohlii | Forskolin .pptx
 
GFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptxGFP in rDNA Technology (Biotechnology).pptx
GFP in rDNA Technology (Biotechnology).pptx
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 

Curatorial data wrangling for the Guide to PHARMACOLGY

  • 1. www.guidetopharmacology.org Virtues and vicissitudes of curatorial data wrangling: the Guide to PHARMACOLGY experience Christopher Southan, Adam J. Pawson, Joanna L. Sharman, Elena Faccenda, Simon Harding, Jamie Davis, IUPHAR/BPS Guide to PHARMACOLOGY, Centre for Integrative Physiology, University of Edinburgh Sun, Mar 13, 2016 CINF 6:Tomayto vs. Tomahto: Overcoming Incompatibilities in Scientific Data 8:30 AM - 12:00 PM 1 http://www.slideshare.net/cdsouthan/curatorial-data-wrangling-for-the-guide-to-pharmacolgy
  • 2. Abstract (will be skipped for presenting) 2 A wide range of valuable databases, both academic and commercial, use the curation model to extract and standardise selected result sets from the literature. This is the classic unstructured-to-structured transformation, predominantly of target binding data (e.g. IC50, Ki or Kd) between ligands and targets. Since 2009 the Guide to PHARMACOLOG (GoPdb) has now curated quantitative interactions between 1300 protein targets and 6000 ligands covering a substantial proportion of the druggable proteome. The team has thus considerable expertise in the challenges of standardisation. This needs to be interposed between not only the primary literature but also other the databases that have extracted data. The wide range of compatibility and other issues associated with selecting new content for GtoPdb will be outlined. These will include the problem of equivocal measurement units as well as non- standard chemical IDs, protein IDs being used by authors. The issue of data gaps will also be expanded on. The presentation will conclude with an assessment of new initiatives from several publishers for authors to mark-up their chemical compounds before publication.
  • 3. Outline • Introducing our dataset • Standardising entities • Protein differences • Chemistry • The units case for BIA 10-2747 • Author mark-up 3
  • 5. Standardising entities: a major challenge of expert document extraction • Judgment vs author primacy (e.g. do we transform/round units or fix errors?) • Which document is it? • Doi vs PubMed vs European PubMed Central • Primary vs secondary literature • Patent no format (WO- vs WO) kind codes (A1 B2) patent families • Slide sets – is url persistent? • Is the ligand chemically definable? • Small molecule (PubChem CID, isomeric SMILES, InChIKey) • A provenanced name-to-struc mapping for the company code name(s) • Peptide (IUPAC, SMILES/InChI, sequence, HELM, Swiss-Prot feature line) • Protein (clinical antibody, IMGT sequence, patent sequence, UniParc ID) • Can the target be given a protein ID(s)? • And the species was? • Protein name/synonym-to-ID-to-sequence • Was an alternative splice form or sequence variant specified? • Should a complex be specified? (NC-IUPHAR complex, ChEMBL group, name e.g. gamma secretase) 5
  • 6. Protein ID and name standardisation: philosophical and technical discordance between the pipelines The 7-way concordance shrinks the proteome by 9% 6 UniProt + human = 151,569 UniProt + human + Swiss-Prot = 20,198 + neXtProt = 20,040 + HGNC = 19,836 + Ensembl = 18,933 + CCDS = 18,286 + Entrez Gene ID = 18,245 + RefSeq = 18,244 + Evidence at protein level = 14,065
  • 8. Protein standardisation: transatlantic (and intra-Europe) naming and splice variants • Hinxton (HGNC) approved (gene) name: beta-site APP-cleaving enzyme 1 • Geneva (Swiss-Prot) recommended (protein) name: Beta-secretase 1 • Bethesda RefSeq NP_036236 beta-secretase 1 isoform A preproprotein 8
  • 9. Making chemical curation easy: Amgen giveth in their papers (IUPAC name in the title) 9
  • 10. Making chemical curation difficult: Amgen taketh away in their patents… 10
  • 11. Measurement units: comparing with BIA-10-2474 • Suddenly (from Jan 15th 2016) there was serious need for standardised comparative affinity values for FAAH inhibitors • Bial 10-2474 in vitro results in WO2010074588 are 27.8 % inhibition at 100nM in rat brain extracts • J&J paper reports inactivation for enzyme but lists IC50 • Pfizer report (k(inact)/K(i) and IC(50) values of 40300 M(-1) s(-1) and 7.2 nM for PF-04457845 • V158866 from Vernalis; no structure (blinded) no papers, no affinity data, no Phase II results • Absence of normalised comparators for 10-2474 SAR compromises in silico modeling and prediction studies 11
  • 12. Measurement units: Authors/inventors should read off their own affinity values 12 rather than us having to
  • 13. Will author mark-up overcome tomayto vs. tomahto? • No – but it would help a lot • Authors not always as au fait with their own entities compared to biocurators • Needs strong publisher/editor mandate and referee buy-in • Automatic mark-up is useful for triage but can produce a dogs breakfast of false-positives • Initiatives underway from, Nature Chem. Biol. ACS J.Med Chem, Wiley Brit. J. Pharmacol and others (see http://cdsouthan.blogspot.com/2015/08/joining-chemistry-between- journals-and.html) • Need to accelerate in crucial areas (e.g. Zika and Dengue virus) • Not only useful if authors cite their SAR patents but also push-back against their attorneys to prevent data relationship obfuscation 13
  • 14. Author-mark example: J Med Chem 14
  • 15. Author/curator collaboration model: Brit J Pharmacol 15
  • 16. References, acknowledgments and questions 16 http://www.ncbi.nlm.nih.gov/pubmed/24234439 http://www.ncbi.nlm.nih.gov/pubmed/23159359 GtoPdb FAQ section (explains curation guidelines) http://www.guidetopharmacology.org/faq.jsp http://cdsouthan.blogspot.com/2016/01/the-unfortunate-case-of-bia-10-2474.html BIA 10-2747 story