SlideShare a Scribd company logo
1 of 22
www.guidetopharmacology.org
Open and Closed Antimalarial Drug Discovery:
Comparing data Connectivity gaps
and Disclosure Speed
Dr Christopher Southan, Senior Database Curator, IUPHAR/BPS Guide to
PHARMACOLGY (GtoPdb), University of Edinburgh
BioIT Boston 2016, Wed 6th ´April, Track 11, Open Source Innovations 16:30
1
http://www.slideshare.net/cdsouthan/antimalarial-drug-dscovery-data-disclosure
Abstract (will be skipped for presentation)
2
Antimalarial research is the poster child for Open Source Drug Discovery
(OSDD). However many leads compounds still have their origins in
Traditional Closed Drug Discovery (TCDD) and uncertainty remains as to the
differences. To provide an assessment, this work examined 32 recent
antimalarial structures in terms of their PubChem connectivity. Of these, 21
had patent matches, only 23 linked to publications and only 21 had BioAssay
records. Major data connectivity problems included 1) leads not findable by
code name, 2) patents not cited in publications 3) leads not reciprocally
linked to Plasmodium protein targets and pathways 4) name-to-structures
only being declared years after patent disclosure. These issues will be
contrasted with the Sydney University Open Source Malaria approach were
open lab books are used to surface structures (e.g. as Google-findable
InChIKey) and crowdsourced collaboration data close to real time, thereby
shaving years of the discovery phase.
Outline
• Introduction to Open Source Drug Discovery (OSDD)
• Differences to Traditional Closed Drug Discovery (TCDD)
• Extracting antimalarial leads from the literature
• Profiling structures in PubChem
• A look into the MMV Pathogen Box
• Introducing Open Source Malaria (OSM)
• Profiling the OSM structure collection
• Speed sharing
• Google searching InChIKeys
• Conclusions
• Open structure sets
• References and questions please
3
Introduction
• The OSDD concept is not tied to any particular group
• While antimalarials have become a poster-child for OSDD many leads still
come through TCDD route so boundaries between the two are blurred
• OSDD has become a test bed (e.g. open data sets from GSK and others,
the Medicines for Malaria Ventures (MMV) “Malaria Box” and WIPO
Re:Search IP sharing)
• Sydney Open Source Malaria project (@O_S_M) adheres to OSDD
principles (see PMID 23985301)
• I have donated voluntary support to the OSM team since 2012 (i.e. in
addition to my Guide to PHARMACOLOGY Senior Database Curator job)
• This has focused on structure searching and data surfacing
• I blog on data connectivity in general, and for antimalarials in particular
• The surfacing speed for structures reflect “shades of openness” that will be
discussed
4
Open vs closed research routes to new medicines
TCDD
• Proprietary data
• Patent filings
• Leads maybe blinded by code
numbers
• Papers after patents
• No direct submissions to public
databases
• Predominantly commercial
software and databases
• Typically ~10 years R&D
• Still the dominant model
OSDD
• Open ELNs
• No patent filings
• Data surfaced rapidly for sharing
• Open access papers
• Submissions to public databases
• Anyone can contribute
• Crowdsourcing
• Preference for open source
software and public databases
• Potential to shorten research
• Pure OSDD relatively rare
5
6
Recent review of
leads - but
• Link-free zone (except
for references)
• PDF “tomb” with
images for structures
• No chemical
specifications
• No database
identifiers
• No target protein
identifiers
• DDD107498 was
blinded at that time
(no structure)
• I mapped to PubChem
CIDs as a community
service
Consequently, much effort was needed
to get from this to this
7
Getting name-to-structure out of primary papers: not trivial
8
• On a good day, MeSH curators will index the lead structures specified in
PubMed and connect them to PubChem
• On a bad day (as in this case), they may record the name but without a link to
a chemical structure
• The code name is still PubChem –ve after a year
Curatorial ferreting: DDD107498 structure and patent
9
IUPAC from supp dat > chemicalize.org > PubChem > SureChEMBL > SAR table
PubChem profile for 32 antimalarial lead structures
10
http://cdsouthan.blogspot.se/2014/06/getting-into-box-with-some-recent.html
http://cdsouthan.blogspot.se/2015/05/entity-resolution-for-antimalarial.html
22 CIDs collated as Pathogen Box proposals plus 16 structures from the PMID
26000721 review (six in common, see blog posts below)
Profile for 114 antimalarial actives from the Pathogen Box
11
http://cdsouthan.blogspot.se/2016/03/a-peek-into-mmv-pathogen-box.html
With OSM finding stuff is easier
12
The entire portfolio is open, including new designs
13
https://docs.google.com/spreadsheets/d/1Rvy6OiM291d1GN_cyT6eSw_C3lSuJ1jaR7AJa8hgGsc/edit#gid=510297618
http://www.cheminfo.org/flavor/malaria/Display_data.html
411 molecular records
in March 2016 OSM
master sheet (Mat
Todd et al.) and custom
ELN (Luc Patiny et al.)
Rapid triage of the OSM portfolio in PubChem
14
250 identity matches from 410 InChIs uploaded
PubChem profile for 250 OSM matches
15
• Note 160 from 410 had no exact matches (e.g. includes design proposals)
• Patents include matches for reference cpds (i.e. not antimalarial claims)
Speed sharing: OSM > Twitter (bot) > chemicalize.org
16
Googling the InChIKey for global findability
17
• Direct from Open
Lab Books
• Or from a
chemicalize
conversion
• Search in ~0.3 sec
• Works with inner
layer
• Can cross-check
PubChem < > ELN
Getting structures into PubChem is not difficult
18
• As TW2Informatics I deposited MMV670437 in 2013 as a test case
• The bioactivity data was later submitted by OSM > ChEMBL > PubChem (but did
not include the code name)
• Both SIDs were merged into CID 71819647 , thereby linking name > struc > activity
Extending connectivity to target and pathway mapping
19
http://www.wikipathwa
ys.org/index.php/Wiki
Pathways
Conclusions
• Encouragingly, published output of antimalarial leads is increasing
• However, challenges of curating and mapping are similar to those
encountered by the GtoPdb team for human targets and ligands
• There is a grey zone between TCDD and OSDD and some leads are
patented
• Authors and stakeholders should ensure their SAR is surfaced and
name-to-structure connected in databases (i.e. FAIR principles, see
PMID 26978244)
• Gaps persist in mappings between leads, targets and pathways
• The practice of OSDD by OSM and collaborators accelerates research
• PubChem MyNCBI collections are useful for sharing structure sets
20
PubChem MyNCBI open structure sets
• 16 clinical candidates from PMID 26000721
• 22 leads from various sources
• 114 from the Pathogen Box
• 250 from the OSM PubChem matches
n.b. Those engaged in antimalarial research can contact me if they need
technical details and/or possible generation of new lists (e.g. CID subsets or
patent extractions)
21
http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/48460617/public/
http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/49901772/public
http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/49700347/public/
http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/48358242/public/
References: questions welcome
22
http://www.ncbi.nlm.nih.gov/pubmed/23618056
http://www.ncbi.nlm.nih.gov/pubmed/23399051
http://cdsouthan.blogspot.com/ various blog posts on antimalarial ferreting
https://github.com/OpenSourceMalaria OSM on GitHub
http://www.ncbi.nlm.nih.gov/pubmed/23985301
http://www.guidetopharmacology.org/faq.jsp GtoPdb FAQ on curating human
bioactives

More Related Content

What's hot

Searching for chemical information using PubChem
Searching for chemical information using PubChemSearching for chemical information using PubChem
Searching for chemical information using PubChem
Sunghwan Kim
 
Session 1 part 3
Session 1 part 3Session 1 part 3
Session 1 part 3
plmiami
 

What's hot (20)

Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
 
Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...
Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...
Finding novel lead compounds in pesticide discovery inspired by pharmaceutica...
 
Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019Guide to Malaria Pharmacology, GEMM 2019
Guide to Malaria Pharmacology, GEMM 2019
 
GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019GuideToImmunopharmacology_SIF_Nov2019
GuideToImmunopharmacology_SIF_Nov2019
 
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
GtoPDB_ELIXIR_UK_AllHands_update_Dec2019
 
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updatesThe IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates
The IUPHAR/BPS Guide to PHARAMCOLOGY in 2018: new features and updates
 
Assessing GtoPdb ligand content in PubChem
Assessing GtoPdb ligand content in PubChemAssessing GtoPdb ligand content in PubChem
Assessing GtoPdb ligand content in PubChem
 
Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020
 
Will the correct drugs please stand up?
Will  the correct drugs please stand up?Will  the correct drugs please stand up?
Will the correct drugs please stand up?
 
Slicing and dicing curated protein targets: Analysing the drugged, druggable ...
Slicing and dicing curated protein targets: Analysing the drugged, druggable ...Slicing and dicing curated protein targets: Analysing the drugged, druggable ...
Slicing and dicing curated protein targets: Analysing the drugged, druggable ...
 
Navigating between publications and databases for drug discovery: IUPHAR/BPS ...
Navigating between publications and databases for drug discovery: IUPHAR/BPS ...Navigating between publications and databases for drug discovery: IUPHAR/BPS ...
Navigating between publications and databases for drug discovery: IUPHAR/BPS ...
 
IUPHAR/BPS Guide to Pharmacology
IUPHAR/BPS Guide to PharmacologyIUPHAR/BPS Guide to Pharmacology
IUPHAR/BPS Guide to Pharmacology
 
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
 
Connecting antimalarial data
Connecting antimalarial dataConnecting antimalarial data
Connecting antimalarial data
 
Searching for chemical information using PubChem
Searching for chemical information using PubChemSearching for chemical information using PubChem
Searching for chemical information using PubChem
 
De-siloing data and building knowledge graphs outside of drug discovery: Oppo...
De-siloing data and building knowledge graphs outside of drug discovery: Oppo...De-siloing data and building knowledge graphs outside of drug discovery: Oppo...
De-siloing data and building knowledge graphs outside of drug discovery: Oppo...
 
PubChem for chemical information literacy training
PubChem for chemical information literacy trainingPubChem for chemical information literacy training
PubChem for chemical information literacy training
 
Session 1 part 3
Session 1 part 3Session 1 part 3
Session 1 part 3
 
How can you access PubChem programmatically?
How can you access PubChem programmatically?How can you access PubChem programmatically?
How can you access PubChem programmatically?
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
 

Similar to Antimalarial drug dscovery data disclosure

Opening up and connecting antimalarial data: Progress with caveats
Opening up and connecting antimalarial data: Progress with caveatsOpening up and connecting antimalarial data: Progress with caveats
Opening up and connecting antimalarial data: Progress with caveats
Chris Southan
 
Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present
Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and PresentToward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present
Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present
Tim Williams
 
ICIC 2017: Tutorial - Digging bioactive chemistry out of patents using open r...
ICIC 2017: Tutorial - Digging bioactive chemistry out of patents using open r...ICIC 2017: Tutorial - Digging bioactive chemistry out of patents using open r...
ICIC 2017: Tutorial - Digging bioactive chemistry out of patents using open r...
Dr. Haxel Consult
 
Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...
Greg Landrum
 
Challenges in Clinical Trials Networks
Challenges in Clinical Trials NetworksChallenges in Clinical Trials Networks
Challenges in Clinical Trials Networks
US Cochrane Center
 

Similar to Antimalarial drug dscovery data disclosure (20)

Digging out Structures for Repurposing: Non-competitive Intelligence ...
Digging out Structures for Repurposing: Non-competitive Intelligence        ...Digging out Structures for Repurposing: Non-competitive Intelligence        ...
Digging out Structures for Repurposing: Non-competitive Intelligence ...
 
Opening up and connecting antimalarial data: Progress with caveats
Opening up and connecting antimalarial data: Progress with caveatsOpening up and connecting antimalarial data: Progress with caveats
Opening up and connecting antimalarial data: Progress with caveats
 
Semantics and linked data at astra zeneca
Semantics and linked data at astra zenecaSemantics and linked data at astra zeneca
Semantics and linked data at astra zeneca
 
Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present
Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and PresentToward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present
Toward F.A.I.R. Pharma. PhUSE Linked Data Initiatives Past and Present
 
PubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biologyPubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biology
 
ICIC 2017: Tutorial - Digging bioactive chemistry out of patents using open r...
ICIC 2017: Tutorial - Digging bioactive chemistry out of patents using open r...ICIC 2017: Tutorial - Digging bioactive chemistry out of patents using open r...
ICIC 2017: Tutorial - Digging bioactive chemistry out of patents using open r...
 
Knowledge Graphs: Changing How We Think About Data
Knowledge Graphs: Changing How We Think About DataKnowledge Graphs: Changing How We Think About Data
Knowledge Graphs: Changing How We Think About Data
 
Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium Lankade data Vinnova webbinarium
Lankade data Vinnova webbinarium
 
Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...Is that a scientific report or just some cool pictures from the lab? Reproduc...
Is that a scientific report or just some cool pictures from the lab? Reproduc...
 
Evolving consensus-based curatorial strategies
Evolving consensus-based curatorial strategiesEvolving consensus-based curatorial strategies
Evolving consensus-based curatorial strategies
 
Mining Drug Targets, Structures and Activity Data
Mining Drug Targets, Structures and Activity DataMining Drug Targets, Structures and Activity Data
Mining Drug Targets, Structures and Activity Data
 
Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019
 
Challenges in Clinical Trials Networks
Challenges in Clinical Trials NetworksChallenges in Clinical Trials Networks
Challenges in Clinical Trials Networks
 
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
 
Sharing and standards christopher hart - clinical innovation and partnering...
Sharing and standards   christopher hart - clinical innovation and partnering...Sharing and standards   christopher hart - clinical innovation and partnering...
Sharing and standards christopher hart - clinical innovation and partnering...
 
How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...How can we ensure research data is re-usable? The role of Publishers in Resea...
How can we ensure research data is re-usable? The role of Publishers in Resea...
 
Shifting the goal post – from high impact journals to high impact data
 Shifting the goal post – from high impact journals to high impact data Shifting the goal post – from high impact journals to high impact data
Shifting the goal post – from high impact journals to high impact data
 
effective data sharing for a learning healthcare system
effective data sharing for a learning healthcare systemeffective data sharing for a learning healthcare system
effective data sharing for a learning healthcare system
 
Connectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivityConnectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivity
 
A practical guide to practicing open science
A practical guide to practicing open scienceA practical guide to practicing open science
A practical guide to practicing open science
 

More from Chris Southan

Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2
Chris Southan
 
In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug Development
Chris Southan
 

More from Chris Southan (20)

FAIR connectivity for DARCP
FAIR  connectivity for DARCPFAIR  connectivity for DARCP
FAIR connectivity for DARCP
 
Peptide tribulations
Peptide tribulationsPeptide tribulations
Peptide tribulations
 
Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2
 
Guide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updaeGuide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updae
 
In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug Development
 
Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?
 
Desperately seeking DARCP
Desperately seeking DARCPDesperately seeking DARCP
Desperately seeking DARCP
 
Seeking glimmers of light in Pharos “Tdark” proteins
Seeking glimmers of light in  Pharos “Tdark” proteinsSeeking glimmers of light in  Pharos “Tdark” proteins
Seeking glimmers of light in Pharos “Tdark” proteins
 
5HT2A modulators update for SAFER
5HT2A modulators update for SAFER5HT2A modulators update for SAFER
5HT2A modulators update for SAFER
 
Quality and noise in big chemistry databases
Quality and noise in big chemistry databasesQuality and noise in big chemistry databases
Quality and noise in big chemistry databases
 
Connecting chemistry-to-biology
Connecting chemistry-to-biology Connecting chemistry-to-biology
Connecting chemistry-to-biology
 
GtoPdb June 2019 poster
GtoPdb June 2019 posterGtoPdb June 2019 poster
GtoPdb June 2019 poster
 
PubChem as a source of systems biology perturbagens
PubChem as a source of  systems biology perturbagensPubChem as a source of  systems biology perturbagens
PubChem as a source of systems biology perturbagens
 
Will the real proteins please stand up
Will the real proteins please stand upWill the real proteins please stand up
Will the real proteins please stand up
 
Peptide Tribulations
Peptide TribulationsPeptide Tribulations
Peptide Tribulations
 
Looking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIRLooking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIR
 
Guide to Immunopharmacology update
Guide to Immunopharmacology updateGuide to Immunopharmacology update
Guide to Immunopharmacology update
 
Druggable Proteome sources in UniProt
Druggable Proteome sources in UniProtDruggable Proteome sources in UniProt
Druggable Proteome sources in UniProt
 
Peptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdbPeptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdb
 
Patents in PubChem
Patents in PubChemPatents in PubChem
Patents in PubChem
 

Recently uploaded

Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Sérgio Sacani
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
gindu3009
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
Sérgio Sacani
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
RohitNehra6
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Lokesh Kothari
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
PirithiRaju
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
anilsa9823
 

Recently uploaded (20)

VIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C PVIRUSES structure and classification ppt by Dr.Prince C P
VIRUSES structure and classification ppt by Dr.Prince C P
 
Animal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptxAnimal Communication- Auditory and Visual.pptx
Animal Communication- Auditory and Visual.pptx
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Presentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptxPresentation Vikram Lander by Vedansh Gupta.pptx
Presentation Vikram Lander by Vedansh Gupta.pptx
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
Nanoparticles synthesis and characterization​ ​
Nanoparticles synthesis and characterization​  ​Nanoparticles synthesis and characterization​  ​
Nanoparticles synthesis and characterization​ ​
 
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCRStunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
Stunning ➥8448380779▻ Call Girls In Panchshil Enclave Delhi NCR
 
Biopesticide (2).pptx .This slides helps to know the different types of biop...
Biopesticide (2).pptx  .This slides helps to know the different types of biop...Biopesticide (2).pptx  .This slides helps to know the different types of biop...
Biopesticide (2).pptx .This slides helps to know the different types of biop...
 
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
Labelling Requirements and Label Claims for Dietary Supplements and Recommend...
 
fundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomologyfundamental of entomology all in one topics of entomology
fundamental of entomology all in one topics of entomology
 
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls AgencyHire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
Hire 💕 9907093804 Hooghly Call Girls Service Call Girls Agency
 
Pests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdfPests of mustard_Identification_Management_Dr.UPR.pdf
Pests of mustard_Identification_Management_Dr.UPR.pdf
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service  🪡
CALL ON ➥8923113531 🔝Call Girls Kesar Bagh Lucknow best Night Fun service 🪡
 
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
Lucknow 💋 Russian Call Girls Lucknow Finest Escorts Service 8923113531 Availa...
 
CELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdfCELL -Structural and Functional unit of life.pdf
CELL -Structural and Functional unit of life.pdf
 
Botany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questionsBotany krishna series 2nd semester Only Mcq type questions
Botany krishna series 2nd semester Only Mcq type questions
 
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatidSpermiogenesis or Spermateleosis or metamorphosis of spermatid
Spermiogenesis or Spermateleosis or metamorphosis of spermatid
 
Forensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdfForensic Biology & Its biological significance.pdf
Forensic Biology & Its biological significance.pdf
 

Antimalarial drug dscovery data disclosure

  • 1. www.guidetopharmacology.org Open and Closed Antimalarial Drug Discovery: Comparing data Connectivity gaps and Disclosure Speed Dr Christopher Southan, Senior Database Curator, IUPHAR/BPS Guide to PHARMACOLGY (GtoPdb), University of Edinburgh BioIT Boston 2016, Wed 6th ´April, Track 11, Open Source Innovations 16:30 1 http://www.slideshare.net/cdsouthan/antimalarial-drug-dscovery-data-disclosure
  • 2. Abstract (will be skipped for presentation) 2 Antimalarial research is the poster child for Open Source Drug Discovery (OSDD). However many leads compounds still have their origins in Traditional Closed Drug Discovery (TCDD) and uncertainty remains as to the differences. To provide an assessment, this work examined 32 recent antimalarial structures in terms of their PubChem connectivity. Of these, 21 had patent matches, only 23 linked to publications and only 21 had BioAssay records. Major data connectivity problems included 1) leads not findable by code name, 2) patents not cited in publications 3) leads not reciprocally linked to Plasmodium protein targets and pathways 4) name-to-structures only being declared years after patent disclosure. These issues will be contrasted with the Sydney University Open Source Malaria approach were open lab books are used to surface structures (e.g. as Google-findable InChIKey) and crowdsourced collaboration data close to real time, thereby shaving years of the discovery phase.
  • 3. Outline • Introduction to Open Source Drug Discovery (OSDD) • Differences to Traditional Closed Drug Discovery (TCDD) • Extracting antimalarial leads from the literature • Profiling structures in PubChem • A look into the MMV Pathogen Box • Introducing Open Source Malaria (OSM) • Profiling the OSM structure collection • Speed sharing • Google searching InChIKeys • Conclusions • Open structure sets • References and questions please 3
  • 4. Introduction • The OSDD concept is not tied to any particular group • While antimalarials have become a poster-child for OSDD many leads still come through TCDD route so boundaries between the two are blurred • OSDD has become a test bed (e.g. open data sets from GSK and others, the Medicines for Malaria Ventures (MMV) “Malaria Box” and WIPO Re:Search IP sharing) • Sydney Open Source Malaria project (@O_S_M) adheres to OSDD principles (see PMID 23985301) • I have donated voluntary support to the OSM team since 2012 (i.e. in addition to my Guide to PHARMACOLOGY Senior Database Curator job) • This has focused on structure searching and data surfacing • I blog on data connectivity in general, and for antimalarials in particular • The surfacing speed for structures reflect “shades of openness” that will be discussed 4
  • 5. Open vs closed research routes to new medicines TCDD • Proprietary data • Patent filings • Leads maybe blinded by code numbers • Papers after patents • No direct submissions to public databases • Predominantly commercial software and databases • Typically ~10 years R&D • Still the dominant model OSDD • Open ELNs • No patent filings • Data surfaced rapidly for sharing • Open access papers • Submissions to public databases • Anyone can contribute • Crowdsourcing • Preference for open source software and public databases • Potential to shorten research • Pure OSDD relatively rare 5
  • 6. 6 Recent review of leads - but • Link-free zone (except for references) • PDF “tomb” with images for structures • No chemical specifications • No database identifiers • No target protein identifiers • DDD107498 was blinded at that time (no structure) • I mapped to PubChem CIDs as a community service
  • 7. Consequently, much effort was needed to get from this to this 7
  • 8. Getting name-to-structure out of primary papers: not trivial 8 • On a good day, MeSH curators will index the lead structures specified in PubMed and connect them to PubChem • On a bad day (as in this case), they may record the name but without a link to a chemical structure • The code name is still PubChem –ve after a year
  • 9. Curatorial ferreting: DDD107498 structure and patent 9 IUPAC from supp dat > chemicalize.org > PubChem > SureChEMBL > SAR table
  • 10. PubChem profile for 32 antimalarial lead structures 10 http://cdsouthan.blogspot.se/2014/06/getting-into-box-with-some-recent.html http://cdsouthan.blogspot.se/2015/05/entity-resolution-for-antimalarial.html 22 CIDs collated as Pathogen Box proposals plus 16 structures from the PMID 26000721 review (six in common, see blog posts below)
  • 11. Profile for 114 antimalarial actives from the Pathogen Box 11 http://cdsouthan.blogspot.se/2016/03/a-peek-into-mmv-pathogen-box.html
  • 12. With OSM finding stuff is easier 12
  • 13. The entire portfolio is open, including new designs 13 https://docs.google.com/spreadsheets/d/1Rvy6OiM291d1GN_cyT6eSw_C3lSuJ1jaR7AJa8hgGsc/edit#gid=510297618 http://www.cheminfo.org/flavor/malaria/Display_data.html 411 molecular records in March 2016 OSM master sheet (Mat Todd et al.) and custom ELN (Luc Patiny et al.)
  • 14. Rapid triage of the OSM portfolio in PubChem 14 250 identity matches from 410 InChIs uploaded
  • 15. PubChem profile for 250 OSM matches 15 • Note 160 from 410 had no exact matches (e.g. includes design proposals) • Patents include matches for reference cpds (i.e. not antimalarial claims)
  • 16. Speed sharing: OSM > Twitter (bot) > chemicalize.org 16
  • 17. Googling the InChIKey for global findability 17 • Direct from Open Lab Books • Or from a chemicalize conversion • Search in ~0.3 sec • Works with inner layer • Can cross-check PubChem < > ELN
  • 18. Getting structures into PubChem is not difficult 18 • As TW2Informatics I deposited MMV670437 in 2013 as a test case • The bioactivity data was later submitted by OSM > ChEMBL > PubChem (but did not include the code name) • Both SIDs were merged into CID 71819647 , thereby linking name > struc > activity
  • 19. Extending connectivity to target and pathway mapping 19 http://www.wikipathwa ys.org/index.php/Wiki Pathways
  • 20. Conclusions • Encouragingly, published output of antimalarial leads is increasing • However, challenges of curating and mapping are similar to those encountered by the GtoPdb team for human targets and ligands • There is a grey zone between TCDD and OSDD and some leads are patented • Authors and stakeholders should ensure their SAR is surfaced and name-to-structure connected in databases (i.e. FAIR principles, see PMID 26978244) • Gaps persist in mappings between leads, targets and pathways • The practice of OSDD by OSM and collaborators accelerates research • PubChem MyNCBI collections are useful for sharing structure sets 20
  • 21. PubChem MyNCBI open structure sets • 16 clinical candidates from PMID 26000721 • 22 leads from various sources • 114 from the Pathogen Box • 250 from the OSM PubChem matches n.b. Those engaged in antimalarial research can contact me if they need technical details and/or possible generation of new lists (e.g. CID subsets or patent extractions) 21 http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/48460617/public/ http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/49901772/public http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/49700347/public/ http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/48358242/public/
  • 22. References: questions welcome 22 http://www.ncbi.nlm.nih.gov/pubmed/23618056 http://www.ncbi.nlm.nih.gov/pubmed/23399051 http://cdsouthan.blogspot.com/ various blog posts on antimalarial ferreting https://github.com/OpenSourceMalaria OSM on GitHub http://www.ncbi.nlm.nih.gov/pubmed/23985301 http://www.guidetopharmacology.org/faq.jsp GtoPdb FAQ on curating human bioactives