SlideShare a Scribd company logo
1 of 1
Discussion
Christopher Southan IUPHAR/BPS Guide to PHARMACOLOGY (GtoPdb), Centre for Integrative Physiology, The
University of Edinburgh, EH8 9XD, UK. http://www.slideshare.net/cdsouthan/connecting-antimalarial-data
As outlined in the introduction to the CINF Symposium, among Jean-Claude
Bradley’s achievements, his work on Open Notebook Science (ONS)
(https://en.wikipedia.org/wiki/Open_notebook_science) has not only perhaps
the largest impact but the ripple effect continues to broaden. This is
particularly the case in Open Source Drug Discovery, OSDD (used here as a
generic term not specific to any group) where ONS forms a core enablement
for the movement (PMID:23985301). This is a radical departure from what
we can call Traditional Closed Drug Discovery (TCDD). While boundaries
between these camps are blurred, the use of ONS is a clear differentiator in
the philosophy of real-time data surfacing (typically via an Electronic
Laboratory Notebook ELN). This means that teams can intersect with, share
and optimise any chemical space since they are no longer competitively
compelled to IP-protect lead structures. The domain of small-molecule
malaria treatments has become a poster child for OSDD and also spawned
the “Box” concept of physically distributable active compound sets.
Opening up and connecting antimalarial data:
Progress with caveats
An ACS SciMix contribution from the CINF session: The Growing Impact of
Openness in Chemistry: A Symposium in Honour of JC Bradley
Jean-Claude Bradley’s pioneering of ONS has the potential to shorten lead discovery and optimisation by years. Consequently it will bring more new
medicines to more patients faster. This is not restricted to NTDs but is likely to be adopted by rare disease consortia. Notwithstanding, as a proportion of the
current antimalarial chemical estate, the ONS contribution is small. Notably, the majority of lead SAR is still instantiated in patents and papers from the TCDD
motus operandi. This was the reason why curating leads for the PB remained a typically arduous exercise (that we are used to at GtoPdb). It is also important
to note that impediments to findability and connectivity of molecular relationships in the “system” (including target and pathway mapping) remain serious
concerns for malaria and other OSDD domains. In the context of drug discovery ONS, like any other approach, has its caveats. The main one is that real-time
data (hot off the instruments or just out of the fume hood) tends to be unstructured and confirmations pending. In this situation of “positive collaborative
anarchy” across different global teams ONS data can be difficult to find, provenance, verify, curate, standardise and mine. Of course, a similar constellation of
informatics challenges also arises from TCDD but (on a good day) open (e.g. in PubChem direct or curated from the literature via ChEMBL and/or GtoPdb)
SAR may surface in a minable form even if some years after the fact. Notwithstanding, the major acceleration that ONS facilitates will ensure its expansion that
will include new drug discovery commercial gaps with unmet clinical needs, as a fitting legacy of Jean-Claude Bradley’s innovation.
As context for this invited presentation, while my day-job is working for the Edinburgh GtoPdb team, I have donated a small amount of voluntary support to the
Sydney OSM team since 2012 (https://www.thinkable.org/submission/2136 at 1.54 on the video). This has focused mainly on chemical structure searching,
data organisation and surfacing strategies. In addition, I blog occasionally on the themes of data connectivity in general and for antimalarial leads in particular.
For the record, MMV have thanked me for contributing the 28 structures. By various criteria they will not all go in to the PB but I hope to find out the inclusions.
Figure 1. Comparing the Genome Ontology function splits between all human proteins (left, 20,198) and the GtoPdb targets with small-
molecule quantitative interactions (right, 978)
Following on from the award-winning success of the Medicines for
Malaria Ventures (MMV) Malaria Box of 400 compounds
(http://www.mmv.org/malariabox) a Pathogen Box (PB) is in preparation
for a range of Neglected Tropical Diseases (NTDs) in addition to malaria
(http://pathogenbox.org/). Since I had already highlighted the vicissitudes
of establishing the explicit molecular identities of published malaria leads
in several blog posts I extended these to 28 structures for possible
inclusion in the PB (http://cdsouthan.blogspot.se/2014/06/getting-into-
box-with-some-recent.html) the first page of which is shown below.
The challenges of curating leads for the PB were similar to those
encountered by the GtoPdb team for human targets and their ligands on a
daily basis (PMID:24234439). They were in fact somewhat worse, as
reflected in the statistics of the 22 PubChem CIDs linked below
http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/
48358242/public/. Quirks encountered are detailed in the blog post but
included;
• The 6 structures not in PubChem are de facto unfindable in open dbs
but some may get Google InChIKey matches via chemicalize.org cache
• The only systematic identifier encountered was the IUPAC name which
often had to be dug out of the supplementary data as in blog page on
the left (i.e. neither SMILES nor InChI in papers or patents)
• No authors made direct database submissions
• The code name was often not a PubChem synonym
• ChEMBL had picked up 16 with data > to PubChem BioAssay
• 13 had patent-extraction matches and 11 chemical vendor matches
• The MeSH annotation had only linked two directly to PMIDs
Out of the documents and into the BoxIntroduction
RESULTS (3)
Finding structures and linking data from the Sydney University OSM team
and their collaborators (http://opensourcemalaria.org/) is much easier than
for the PB 28. This is primarily because of their adoption of ONS, Google
docs, other surfacing routes and direct submissions to ChEMBL. This is
illustrated for MMV670437 (as an OSM 44 nM lead in the 28) by simply
Googling the inner InChIKey layer (PMID:23399051). Matches (including
below left) returned in 0.35 sec, include PubChem, OSM in GitHub and my
blog. The PubChem SID (below right) with MMV code is my submission.
Further ONS utility is exemplified by the surfacing of 250 project structures
in the first link below. The second link maps 167 of these into PubChem
https://docs.google.com/spreadsheets/d/1Rvy6OiM291d1GN_cyT6eSw_C3lSuJ1jaR7AJa8hgGsc/edit#gid=510297618
http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/48338932/public/
Connecting up with Open Source Malaria (OSM)

More Related Content

What's hot

In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug Development
Chris Southan
 
Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...
Valery Tkachenko
 
Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...
Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...
Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...
Alasdair Gray
 

What's hot (20)

In silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug DevelopmentIn silico 360 Analysis for Drug Development
In silico 360 Analysis for Drug Development
 
Update on the Druggable Proteome
Update on the Druggable ProteomeUpdate on the Druggable Proteome
Update on the Druggable Proteome
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
Data-driven drug discovery for rare diseases - Tales from the trenches (CINF ...
 
Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)Badapple: promiscuity patterns from noisy evidence (poster)
Badapple: promiscuity patterns from noisy evidence (poster)
 
Implementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTSImplementing chemistry platform for OpenPHACTS
Implementing chemistry platform for OpenPHACTS
 
Sourcing high quality online data resources for computational toxicology
Sourcing high quality online data resources for computational toxicologySourcing high quality online data resources for computational toxicology
Sourcing high quality online data resources for computational toxicology
 
BIA 10-2474 in GtoPdb
BIA 10-2474 in GtoPdbBIA 10-2474 in GtoPdb
BIA 10-2474 in GtoPdb
 
Assessing GtoPdb ligand content in PubChem
Assessing GtoPdb ligand content in PubChemAssessing GtoPdb ligand content in PubChem
Assessing GtoPdb ligand content in PubChem
 
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
 
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
UDM (Unified Data Model) - Enabling Exchange of Comprehensive Reaction Inform...
 
Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...
 
Will the correct drugs please stand up?
Will  the correct drugs please stand up?Will  the correct drugs please stand up?
Will the correct drugs please stand up?
 
Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...Building linked data large-scale chemistry platform - challenges, lessons and...
Building linked data large-scale chemistry platform - challenges, lessons and...
 
Asking the scientific literature to tell us about metabolism
Asking the scientific literature to tell us about metabolismAsking the scientific literature to tell us about metabolism
Asking the scientific literature to tell us about metabolism
 
Data Visualization to Enhance our Understanding of the Cancer Genome
Data Visualization to Enhance our Understanding of the Cancer GenomeData Visualization to Enhance our Understanding of the Cancer Genome
Data Visualization to Enhance our Understanding of the Cancer Genome
 
A Global Commons for Scientific Data: Molecules and Wikidata
A Global Commons for Scientific Data: Molecules and WikidataA Global Commons for Scientific Data: Molecules and Wikidata
A Global Commons for Scientific Data: Molecules and Wikidata
 
Data Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeData Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the Eye
 
Use of data
Use of dataUse of data
Use of data
 
Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...
Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...
Scientific Lenses over Linked Data: Identity Management in the Open PHACTS p...
 

Viewers also liked

Causes and consequences of automated extraction of patent-specified virtual d...
Causes and consequences of automated extraction of patent-specified virtual d...Causes and consequences of automated extraction of patent-specified virtual d...
Causes and consequences of automated extraction of patent-specified virtual d...
Chris Southan
 
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
Chris Southan
 

Viewers also liked (6)

Causes and consequences of automated extraction of patent-specified virtual d...
Causes and consequences of automated extraction of patent-specified virtual d...Causes and consequences of automated extraction of patent-specified virtual d...
Causes and consequences of automated extraction of patent-specified virtual d...
 
From Biological Data to Clinical Applications: Positioning a digital infrastr...
From Biological Data to Clinical Applications: Positioning a digital infrastr...From Biological Data to Clinical Applications: Positioning a digital infrastr...
From Biological Data to Clinical Applications: Positioning a digital infrastr...
 
20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horse20 million public patent structures: looking at the gift horse
20 million public patent structures: looking at the gift horse
 
Correct drug structures for pharmacology
Correct drug structures for pharmacologyCorrect drug structures for pharmacology
Correct drug structures for pharmacology
 
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
Exploiting Edinburgh's Guide to PHARMACOLOGY database as a source of protein ...
 
The Open Patent Chemistry “Big Bang”: Implications, Opportunities and Caveats
The Open Patent Chemistry “Big Bang”: Implications, Opportunities and CaveatsThe Open Patent Chemistry “Big Bang”: Implications, Opportunities and Caveats
The Open Patent Chemistry “Big Bang”: Implications, Opportunities and Caveats
 

Similar to Connecting antimalarial data

Opening up and connecting antimalarial data: Progress with caveats
Opening up and connecting antimalarial data: Progress with caveatsOpening up and connecting antimalarial data: Progress with caveats
Opening up and connecting antimalarial data: Progress with caveats
Chris Southan
 
Ontology for the Financial Services Industry
Ontology for the Financial Services IndustryOntology for the Financial Services Industry
Ontology for the Financial Services Industry
Barry Smith
 

Similar to Connecting antimalarial data (20)

A Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration FrameworkA Cell-Cycle Knowledge Integration Framework
A Cell-Cycle Knowledge Integration Framework
 
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
Illuminating the Druggable Genome with Knowledge Engineering and Machine Lear...
 
Opening up and connecting antimalarial data: Progress with caveats
Opening up and connecting antimalarial data: Progress with caveatsOpening up and connecting antimalarial data: Progress with caveats
Opening up and connecting antimalarial data: Progress with caveats
 
Ontology for the Financial Services Industry
Ontology for the Financial Services IndustryOntology for the Financial Services Industry
Ontology for the Financial Services Industry
 
Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research  Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research
 
The IUPHAR/MMV Guide to Malaria Pharmacology
The  IUPHAR/MMV Guide to Malaria Pharmacology  The  IUPHAR/MMV Guide to Malaria Pharmacology
The IUPHAR/MMV Guide to Malaria Pharmacology
 
Using OpenTox to map Toxicity data to Adverse Outcome Pathways
Using OpenTox to map Toxicity data to Adverse Outcome PathwaysUsing OpenTox to map Toxicity data to Adverse Outcome Pathways
Using OpenTox to map Toxicity data to Adverse Outcome Pathways
 
NetBioSIG2012 chrisevelo
NetBioSIG2012 chriseveloNetBioSIG2012 chrisevelo
NetBioSIG2012 chrisevelo
 
Bioinformatics in the Clinical Pipeline: Contribution in Genomic Medicine
Bioinformatics in the Clinical Pipeline: Contribution in Genomic MedicineBioinformatics in the Clinical Pipeline: Contribution in Genomic Medicine
Bioinformatics in the Clinical Pipeline: Contribution in Genomic Medicine
 
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
Collaborative Drug Discovery: A Platform For Transforming Neglected Disease R...
 
Molecular and data visualization in drug discovery
Molecular and data visualization in drug discoveryMolecular and data visualization in drug discovery
Molecular and data visualization in drug discovery
 
Genome-Scale Metabolic Models and Systems Medicine of Metabolic Syndrome
Genome-Scale Metabolic Models and Systems Medicine of Metabolic SyndromeGenome-Scale Metabolic Models and Systems Medicine of Metabolic Syndrome
Genome-Scale Metabolic Models and Systems Medicine of Metabolic Syndrome
 
Disruptive Strategies for Removing Drug Discovery Bottlenecks
Disruptive Strategies for Removing Drug Discovery BottlenecksDisruptive Strategies for Removing Drug Discovery Bottlenecks
Disruptive Strategies for Removing Drug Discovery Bottlenecks
 
Disruptive Strategies for Removing Drug Discovery Bottlenecks
Disruptive Strategies for Removing Drug Discovery Bottlenecks Disruptive Strategies for Removing Drug Discovery Bottlenecks
Disruptive Strategies for Removing Drug Discovery Bottlenecks
 
Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020Guide to Pharmacology Poster - ELIXIR All Hands 2020
Guide to Pharmacology Poster - ELIXIR All Hands 2020
 
Four Disruptive Strategies for Removing Drug Discovery Bottlenecks
Four Disruptive Strategies for Removing Drug Discovery BottlenecksFour Disruptive Strategies for Removing Drug Discovery Bottlenecks
Four Disruptive Strategies for Removing Drug Discovery Bottlenecks
 
When pharmaceutical companies publish large datasets an abundance of riches o...
When pharmaceutical companies publish large datasets an abundance of riches o...When pharmaceutical companies publish large datasets an abundance of riches o...
When pharmaceutical companies publish large datasets an abundance of riches o...
 
verlinde1994.pdf
verlinde1994.pdfverlinde1994.pdf
verlinde1994.pdf
 
Illuminating the druggable genome and the quest for new drug targets
Illuminating the druggable genome and the quest for new drug targetsIlluminating the druggable genome and the quest for new drug targets
Illuminating the druggable genome and the quest for new drug targets
 
PubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biologyPubChem for drug discovery and chemical biology
PubChem for drug discovery and chemical biology
 

More from Chris Southan

Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2
Chris Southan
 

More from Chris Southan (20)

FAIR connectivity for DARCP
FAIR  connectivity for DARCPFAIR  connectivity for DARCP
FAIR connectivity for DARCP
 
Connectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivityConnectivity > documents > structures > bioactivity
Connectivity > documents > structures > bioactivity
 
Peptide tribulations
Peptide tribulationsPeptide tribulations
Peptide tribulations
 
Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2 Vicissitudes of target validation for BACE1 and BACE2
Vicissitudes of target validation for BACE1 and BACE2
 
Guide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updaeGuide to Pharmacology database: ELIXIR updae
Guide to Pharmacology database: ELIXIR updae
 
Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?Will the correct BACE ORFs please stand up?
Will the correct BACE ORFs please stand up?
 
Desperately seeking DARCP
Desperately seeking DARCPDesperately seeking DARCP
Desperately seeking DARCP
 
Seeking glimmers of light in Pharos “Tdark” proteins
Seeking glimmers of light in  Pharos “Tdark” proteinsSeeking glimmers of light in  Pharos “Tdark” proteins
Seeking glimmers of light in Pharos “Tdark” proteins
 
5HT2A modulators update for SAFER
5HT2A modulators update for SAFER5HT2A modulators update for SAFER
5HT2A modulators update for SAFER
 
Quality and noise in big chemistry databases
Quality and noise in big chemistry databasesQuality and noise in big chemistry databases
Quality and noise in big chemistry databases
 
Connecting chemistry-to-biology
Connecting chemistry-to-biology Connecting chemistry-to-biology
Connecting chemistry-to-biology
 
GtoPdb June 2019 poster
GtoPdb June 2019 posterGtoPdb June 2019 poster
GtoPdb June 2019 poster
 
PubChem as a source of systems biology perturbagens
PubChem as a source of  systems biology perturbagensPubChem as a source of  systems biology perturbagens
PubChem as a source of systems biology perturbagens
 
Will the real proteins please stand up
Will the real proteins please stand upWill the real proteins please stand up
Will the real proteins please stand up
 
Peptide Tribulations
Peptide TribulationsPeptide Tribulations
Peptide Tribulations
 
Looking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIRLooking at chemistry - protein - papers connectivity in ELIXIR
Looking at chemistry - protein - papers connectivity in ELIXIR
 
Guide to Immunopharmacology update
Guide to Immunopharmacology updateGuide to Immunopharmacology update
Guide to Immunopharmacology update
 
Druggable Proteome sources in UniProt
Druggable Proteome sources in UniProtDruggable Proteome sources in UniProt
Druggable Proteome sources in UniProt
 
Peptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdbPeptide Tribulations in GtoPdb
Peptide Tribulations in GtoPdb
 
Patents in PubChem
Patents in PubChemPatents in PubChem
Patents in PubChem
 

Recently uploaded

Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
Cherry
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
Cherry
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cherry
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
seri bangash
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
Cherry
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
Cherry
 

Recently uploaded (20)

Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
 
Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.Porella : features, morphology, anatomy, reproduction etc.
Porella : features, morphology, anatomy, reproduction etc.
 
Taphonomy and Quality of the Fossil Record
Taphonomy and Quality of the  Fossil RecordTaphonomy and Quality of the  Fossil Record
Taphonomy and Quality of the Fossil Record
 
Role of AI in seed science Predictive modelling and Beyond.pptx
Role of AI in seed science  Predictive modelling and  Beyond.pptxRole of AI in seed science  Predictive modelling and  Beyond.pptx
Role of AI in seed science Predictive modelling and Beyond.pptx
 
GBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismGBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) Metabolism
 
CYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptxCYTOGENETIC MAP................ ppt.pptx
CYTOGENETIC MAP................ ppt.pptx
 
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
GBSN - Biochemistry (Unit 2) Basic concept of organic chemistry
 
Efficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence accelerationEfficient spin-up of Earth System Models usingsequence acceleration
Efficient spin-up of Earth System Models usingsequence acceleration
 
Cyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptxCyanide resistant respiration pathway.pptx
Cyanide resistant respiration pathway.pptx
 
FAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical ScienceFAIRSpectra - Enabling the FAIRification of Analytical Science
FAIRSpectra - Enabling the FAIRification of Analytical Science
 
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIACURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
CURRENT SCENARIO OF POULTRY PRODUCTION IN INDIA
 
Understanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution MethodsUnderstanding Partial Differential Equations: Types and Solution Methods
Understanding Partial Differential Equations: Types and Solution Methods
 
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
Human & Veterinary Respiratory Physilogy_DR.E.Muralinath_Associate Professor....
 
Terpineol and it's characterization pptx
Terpineol and it's characterization pptxTerpineol and it's characterization pptx
Terpineol and it's characterization pptx
 
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.Cyathodium bryophyte: morphology, anatomy, reproduction etc.
Cyathodium bryophyte: morphology, anatomy, reproduction etc.
 
The Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptxThe Mariana Trench remarkable geological features on Earth.pptx
The Mariana Trench remarkable geological features on Earth.pptx
 
LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.LUNULARIA -features, morphology, anatomy ,reproduction etc.
LUNULARIA -features, morphology, anatomy ,reproduction etc.
 
POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.POGONATUM : morphology, anatomy, reproduction etc.
POGONATUM : morphology, anatomy, reproduction etc.
 
Concept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdfConcept of gene and Complementation test.pdf
Concept of gene and Complementation test.pdf
 

Connecting antimalarial data

  • 1. Discussion Christopher Southan IUPHAR/BPS Guide to PHARMACOLOGY (GtoPdb), Centre for Integrative Physiology, The University of Edinburgh, EH8 9XD, UK. http://www.slideshare.net/cdsouthan/connecting-antimalarial-data As outlined in the introduction to the CINF Symposium, among Jean-Claude Bradley’s achievements, his work on Open Notebook Science (ONS) (https://en.wikipedia.org/wiki/Open_notebook_science) has not only perhaps the largest impact but the ripple effect continues to broaden. This is particularly the case in Open Source Drug Discovery, OSDD (used here as a generic term not specific to any group) where ONS forms a core enablement for the movement (PMID:23985301). This is a radical departure from what we can call Traditional Closed Drug Discovery (TCDD). While boundaries between these camps are blurred, the use of ONS is a clear differentiator in the philosophy of real-time data surfacing (typically via an Electronic Laboratory Notebook ELN). This means that teams can intersect with, share and optimise any chemical space since they are no longer competitively compelled to IP-protect lead structures. The domain of small-molecule malaria treatments has become a poster child for OSDD and also spawned the “Box” concept of physically distributable active compound sets. Opening up and connecting antimalarial data: Progress with caveats An ACS SciMix contribution from the CINF session: The Growing Impact of Openness in Chemistry: A Symposium in Honour of JC Bradley Jean-Claude Bradley’s pioneering of ONS has the potential to shorten lead discovery and optimisation by years. Consequently it will bring more new medicines to more patients faster. This is not restricted to NTDs but is likely to be adopted by rare disease consortia. Notwithstanding, as a proportion of the current antimalarial chemical estate, the ONS contribution is small. Notably, the majority of lead SAR is still instantiated in patents and papers from the TCDD motus operandi. This was the reason why curating leads for the PB remained a typically arduous exercise (that we are used to at GtoPdb). It is also important to note that impediments to findability and connectivity of molecular relationships in the “system” (including target and pathway mapping) remain serious concerns for malaria and other OSDD domains. In the context of drug discovery ONS, like any other approach, has its caveats. The main one is that real-time data (hot off the instruments or just out of the fume hood) tends to be unstructured and confirmations pending. In this situation of “positive collaborative anarchy” across different global teams ONS data can be difficult to find, provenance, verify, curate, standardise and mine. Of course, a similar constellation of informatics challenges also arises from TCDD but (on a good day) open (e.g. in PubChem direct or curated from the literature via ChEMBL and/or GtoPdb) SAR may surface in a minable form even if some years after the fact. Notwithstanding, the major acceleration that ONS facilitates will ensure its expansion that will include new drug discovery commercial gaps with unmet clinical needs, as a fitting legacy of Jean-Claude Bradley’s innovation. As context for this invited presentation, while my day-job is working for the Edinburgh GtoPdb team, I have donated a small amount of voluntary support to the Sydney OSM team since 2012 (https://www.thinkable.org/submission/2136 at 1.54 on the video). This has focused mainly on chemical structure searching, data organisation and surfacing strategies. In addition, I blog occasionally on the themes of data connectivity in general and for antimalarial leads in particular. For the record, MMV have thanked me for contributing the 28 structures. By various criteria they will not all go in to the PB but I hope to find out the inclusions. Figure 1. Comparing the Genome Ontology function splits between all human proteins (left, 20,198) and the GtoPdb targets with small- molecule quantitative interactions (right, 978) Following on from the award-winning success of the Medicines for Malaria Ventures (MMV) Malaria Box of 400 compounds (http://www.mmv.org/malariabox) a Pathogen Box (PB) is in preparation for a range of Neglected Tropical Diseases (NTDs) in addition to malaria (http://pathogenbox.org/). Since I had already highlighted the vicissitudes of establishing the explicit molecular identities of published malaria leads in several blog posts I extended these to 28 structures for possible inclusion in the PB (http://cdsouthan.blogspot.se/2014/06/getting-into- box-with-some-recent.html) the first page of which is shown below. The challenges of curating leads for the PB were similar to those encountered by the GtoPdb team for human targets and their ligands on a daily basis (PMID:24234439). They were in fact somewhat worse, as reflected in the statistics of the 22 PubChem CIDs linked below http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/ 48358242/public/. Quirks encountered are detailed in the blog post but included; • The 6 structures not in PubChem are de facto unfindable in open dbs but some may get Google InChIKey matches via chemicalize.org cache • The only systematic identifier encountered was the IUPAC name which often had to be dug out of the supplementary data as in blog page on the left (i.e. neither SMILES nor InChI in papers or patents) • No authors made direct database submissions • The code name was often not a PubChem synonym • ChEMBL had picked up 16 with data > to PubChem BioAssay • 13 had patent-extraction matches and 11 chemical vendor matches • The MeSH annotation had only linked two directly to PMIDs Out of the documents and into the BoxIntroduction RESULTS (3) Finding structures and linking data from the Sydney University OSM team and their collaborators (http://opensourcemalaria.org/) is much easier than for the PB 28. This is primarily because of their adoption of ONS, Google docs, other surfacing routes and direct submissions to ChEMBL. This is illustrated for MMV670437 (as an OSM 44 nM lead in the 28) by simply Googling the inner InChIKey layer (PMID:23399051). Matches (including below left) returned in 0.35 sec, include PubChem, OSM in GitHub and my blog. The PubChem SID (below right) with MMV code is my submission. Further ONS utility is exemplified by the surfacing of 250 project structures in the first link below. The second link maps 167 of these into PubChem https://docs.google.com/spreadsheets/d/1Rvy6OiM291d1GN_cyT6eSw_C3lSuJ1jaR7AJa8hgGsc/edit#gid=510297618 http://www.ncbi.nlm.nih.gov/sites/myncbi/christopher.southan.1/collections/48338932/public/ Connecting up with Open Source Malaria (OSM)