SlideShare a Scribd company logo
CINF20 - 31 March 2019
Dr Frederik van den Broek, Elsevier Professional Services
Data-driven drug
discovery for rare
diseases
Tales from the trenches
This is what we are all after in drug discovery…
Image: Elsevier
If drug discovery and development only were that simple…
Disease
Drug
compound
If drug discovery and development only were that simple…
Disease
Protein
Target
Drug
compound
If drug discovery and development only were that simple…
Disease
Protein
Target
Drug
compound
• Cell processes
• Regulators
• Pathways
• …
• Bioactivity
• Toxicity
• Specificity
• …
If drug discovery and development only were that simple…
Disease
Protein
Target
Drug
compound
• Cell processes
• Regulators
• Pathways
• …
• Bioactivity
• Toxicity
• Specificity
• …
• Availability
• Synthesis
• PK/PD
• …
• Genotype
• Phenotype
• Individual
If drug discovery and development only were that simple…
Disease
Protein
Target
Drug
compound
• Cell processes
• Regulators
• Pathways
• …
• Bioactivity
• Toxicity
• Specificity
• …
• Availability
• Synthesis
• PK/PD
• …
• Genotype
• Phenotype
• Individual
This makes it all a lengthy and costly process
Image: https://www.phrma.org/graphic/the-biopharmaceutical-research-and-development-process
With rare diseases it is even harder
Small(er) patient populations leading to
• Less (integral) medical and scientific knowledge
• Small population for clinical trials
• Unawareness with doctors, researchers, policymakers
• Smaller potential market size for a drug
Image: http://www.campingtourist.com/camping-activities/climbing/difficult-mountains-climb/
Drug repurposing: a new hope for rare diseases
• Less costly and of interest for pharma
• Quicker to Phase II/III tests, so hopefully quicker to market
• Need reliable information from various sources to find suitable repurposing
candidates
Image: https://www.starwars.com/news/poll-what-is-the-best-scene-in-star-wars-a-new-hope
Accelerate with new knowledge and data
Disease
Protein
Target
Drug
compound
• Cell processes
• Regulators
• Pathways
• …
• Bioactivity
• Toxicity
• Specificity
• …
• Availability
• Synthesis
• PK/PD
• …
• Genotype
• Phenotype
• Individual
Various initiatives we were recently involved in
• Project with Findacure to find drug repurposing candidates for Congenital
Hyperinsulinism
• Pistoia Hackaton: Elsevier-Findacure challenge on Friedrich’s Ataxia
• Sub-network enrichment analysis for neuromuscular disorder pathways
• Disease pathway analysis for Huntingdon's Disease
• Pistoia Datathon for drug repurposing for rare diseases
| 13
• A rare genetic disease
• Permanently excessive level of insulin in the
blood
• Develops within the first few days of life
• Can lead to brain injury or even death
• In the most severe cases the only viable treatment is
the removal of the pancreas, consigning the patient to
a lifetime of diabetes
Congenital hyperinsulinsm (CHI)
https://res.cloudinary.com/indiegogo-media-prod-
cld/image/upload/c_limit,w_620/v1440424745/uzvnq
zhvbpsrtthzxqpu.jpg
Creating a comprehensive view of CHI
• CHI Literature Library
• Disease, Target, Pathway, and
Compound Analysis
• Research Landscape Analysis
Information Assets Applied
• Content Elsevier’s vast set of literature and patent data
• Data normalization Taxonomies and dictionaries to
normalize author names, institutions, drugs, targets, and
other important terms
• Information extraction Finding semantic
relationships, targets, pathways, drugs, and bioactivities
Building and refining the CHI disease model
Picked relevant
pathways
(from a collection of 1800
models)
Explored functions of
proteins using 6.2M pre-
text mined relations
and embedded Gene
Ontology
Summarized what is known
about CHI mechanism in an
overview model
From pathways to CHI treatments:
Automated analysis combines bioassay data with pathway data
Mean of activities among
these targets
Me
Targets and activities for
each compound
Drug-likeness
metrics for
sorting/classification
• All compounds that
were observed to bind
to targets in pathway
• Sorted by number of
active targets.
Too many targets may
suggest lack of specificity.
Find all targets that
could be used to affect
the disease state
Query for each target to find
compounds that have high
affinity for them (>6 log units)
Collate data by compound to summarize the
targets/activities related to disease that the
compound hits
• Compute geometric mean of activities for ranking
• Rank by number of targets and geometric mean of
activities against targets
Step 1 Step 2
Step 3
Pistoia Hackathon Challenge (2017)
Elsevier would like you to demonstrate the ability of deep learning to help
Findacure, a UK-based charity, accelerate treatment and clinical research for
Friedreich’s ataxia (FRDA). You’ll have access to a heterogeneous set of
data related to the disease: biological pathway analysis, associated chemical
compounds and bioactivities, potential candidates for drug re-purposing, full-
text scientific literature and clinical trial data.
Basically, giving others a go with the data sets we worked with on CHI….
Promising results, but still hard work
“We spent most of our time the first day just trying to get our heads around
the data, so we could start to find some solutions. Even opening the files was
tricky.” The students used various tools to try to extract data from the
provided XML files, but it was slow going. Daniel [one of the participants]
commented that, “we wound up having to do a lot of things manually, so we
could at least read the files in plain text.”
Sharing disease pathways
• Shared curated pathways (with supporting literature
references) with rare disease organisations to help their
discussions with researchers and fill in potential “blanks”
• Comparing gene expression algorithms for the identification
of expression regulators
• Well-defined datasets, with supporting
literature references which resonate
with researchers
Datathon (2019):
Applying AI in Drug Repurposing for Rare Diseases
“Machine learning
won’t work if your data
is rigidly siloed.”
“One major challenge
is collecting enough
reliable information to
properly train AI systems.
AI is as good as the
data.”
Nick Patience
Founder, 451
Research
“Organizations need to
make sure that the data
being accessed is
treated and defined
consistently across the
sources. Otherwise,
virtualization won't work.”
“All the major AI
advances have been
fueled by advances in
data sets. The algorithms
are easy….
"Collecting, classifying
and labeling datasets
used to train the
algorithms is the grunt
work that’s difficult”
Aspuru-Guzik
Professor of Chemistry &
Machine Learning, Harvard
University JJ Guy
CTO, Jask (AI co.)
‘Siloed’ Lack of standards
Requires labeling and
contextPoor quality1
2 3 4
Using the Entellect Platform and Data Curation
Access, curation of
authoritative life science
data
Integration of disparate
data, structured and
unstructured
Normalized and
standardized data with
industry standard
taxonomies
Build custom and off-the-
shelf analytics tools
‘Un-siloed’ Harmonized Enriched and linkedQuality
Nick Patience
Founder, 451
Research
Aspuru-Guzik
Professor of Chemistry &
Machine Learning, Harvard
University
1
2 3 4
Using the Entellect Platform and Data Curation
Adverse
Event
Person
Org
Which
drugs
affect this
target?
Bio-
Activity
Pathway
Disease
Bioprocess
Trial
Disease
Species
Target
Drug
Assay
Substance:
- provenanceName
- substance
- name
- compoundType
- substanceTypeName
- inchiCode
- molecularFormula
- charge
- numberOfAtoms
- numberOfComponents
- numberOfElements
- numberOfFragments
- numberOfStructure
- molWeightPublishedValue
- molWeightPublishedUnit
- molWeightStandardValue
- mpvalue
Bioactivity:
- provenanceName
- effect
- inducedBy
- target
- targetsCount
- bioactivityParameterName
- displayValue
- publishedValue
- publishedUnit
- pX
Target:
- provenanceName
- target
- uniprotId
- sequence
- targetType
- label
- speciesId
- speciesName
- geneSymbol
Entellect Platform and Data Curation
Various teams using various approaches
• Semantic data: Target Identification
• Semantic data: Small Molecule Binding
• Machine Learning
− Ensemble Learning
− Mol2Vec, Prot2Vec
− Network diffusion
• Expert collaboration
− Virtual docking
− Adverse Event profiling
“I could work on the important stuff straight away, using all the data”
Promising results so far (March 2019)
Aiming to make data-driven drug discovery for rare diseases
a little easier…
Disease
Protein
Target
Drug
compound
• Cell processes
• Regulators
• Pathways
• …
• Bioactivity
• Toxicity
• Specificity
• …
• Availability
• Synthesis
• PK/PD
• …
• Genotype
• Phenotype
• Individual
Conclusions
• Data, data, data…
• Data has to be FAIR and of good and trusted provenance as the
researchers and clinicians will want to see the “chain of evidence” (beware
of black box models)
• Data sets also have to be FAIR for each other: enabling the integral
approaches repurposing needs have to be linked data sets across siloes
and domains to go from disease to target to compound (and back)
Image: Sangya Pundir, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=53414062
Acknowledgements
• Maria Shkrob
• Jabe Wilson
• Anton Yuryev
• Matthew Clark
• Christy Wilson
• Finlay Maclean
• Elsevier’s Entellect team
• Pistioia hackaton and datathon teams
Questions?
By Malis - https://commons.wikimedia.org/w/index.php?curid=2633354
Appendix – Datathon approaches

More Related Content

What's hot

Patent chemisty big bang: utilities for SMEs
Patent chemisty big bang: utilities for SMEsPatent chemisty big bang: utilities for SMEs
Patent chemisty big bang: utilities for SMEs
Chris Southan
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
 
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
Chris Southan
 
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
open_phacts
 
GtoPdb and GtoImmuPdb in context
GtoPdb and GtoImmuPdb in contextGtoPdb and GtoImmuPdb in context
GtoPdb and GtoImmuPdb in context
Chris Southan
 
Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
A brief history of reaction analytics (CINF 144, ACS National Meeting 2018-08...
A brief history of reaction analytics (CINF 144, ACS National Meeting 2018-08...A brief history of reaction analytics (CINF 144, ACS National Meeting 2018-08...
A brief history of reaction analytics (CINF 144, ACS National Meeting 2018-08...
Frederik van den Broek
 
Data Integration vs Transparency: Tackling the tension
Data Integration vs Transparency: Tackling the tensionData Integration vs Transparency: Tackling the tension
Data Integration vs Transparency: Tackling the tension
Paul Groth
 
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
open_phacts
 
PubChem for chemical information literacy training
PubChem for chemical information literacy trainingPubChem for chemical information literacy training
PubChem for chemical information literacy training
Sunghwan Kim
 
Toxicological information in PubChem
Toxicological information in PubChemToxicological information in PubChem
Toxicological information in PubChem
Sunghwan Kim
 
BIA 10-2474 in GtoPdb
BIA 10-2474 in GtoPdbBIA 10-2474 in GtoPdb
BIA 10-2474 in GtoPdb
Chris Southan
 
Generating Biomedical Hypotheses Using Semantic Web Technologies
Generating Biomedical Hypotheses Using Semantic Web TechnologiesGenerating Biomedical Hypotheses Using Semantic Web Technologies
Generating Biomedical Hypotheses Using Semantic Web Technologies
Michel Dumontier
 
Searching for patent information in PubChem
Searching for patent information in PubChem Searching for patent information in PubChem
Searching for patent information in PubChem
Sunghwan Kim
 
Applicationsofbioinformaticsindrugdiscoveryandprocess
ApplicationsofbioinformaticsindrugdiscoveryandprocessApplicationsofbioinformaticsindrugdiscoveryandprocess
Applicationsofbioinformaticsindrugdiscoveryandprocess
jaidev53ster
 
PubChem and its application for cheminformatics education
PubChem and its application for cheminformatics educationPubChem and its application for cheminformatics education
PubChem and its application for cheminformatics education
Sunghwan Kim
 
Health Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven EdwardsHealth Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven EdwardsHealth Data Consortium
 
Aiding Computer Aided Drug Design
Aiding Computer Aided Drug DesignAiding Computer Aided Drug Design
Aiding Computer Aided Drug Design
Shahir Shamsir
 
Automate your literature monitoring for more effective pharmacovigilance
Automate your literature monitoring for more effective pharmacovigilanceAutomate your literature monitoring for more effective pharmacovigilance
Automate your literature monitoring for more effective pharmacovigilance
Ann-Marie Roche
 

What's hot (20)

Patent chemisty big bang: utilities for SMEs
Patent chemisty big bang: utilities for SMEsPatent chemisty big bang: utilities for SMEs
Patent chemisty big bang: utilities for SMEs
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
 
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
2015-04-28 Open PHACTS at Swedish Linked Data Network Meet-up
 
GtoPdb and GtoImmuPdb in context
GtoPdb and GtoImmuPdb in contextGtoPdb and GtoImmuPdb in context
GtoPdb and GtoImmuPdb in context
 
Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...Integrating and curating internet based chemistry resources to serve life sci...
Integrating and curating internet based chemistry resources to serve life sci...
 
Online Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery SystemsOnline Resources to Support Open Drug Discovery Systems
Online Resources to Support Open Drug Discovery Systems
 
A brief history of reaction analytics (CINF 144, ACS National Meeting 2018-08...
A brief history of reaction analytics (CINF 144, ACS National Meeting 2018-08...A brief history of reaction analytics (CINF 144, ACS National Meeting 2018-08...
A brief history of reaction analytics (CINF 144, ACS National Meeting 2018-08...
 
Data Integration vs Transparency: Tackling the tension
Data Integration vs Transparency: Tackling the tensionData Integration vs Transparency: Tackling the tension
Data Integration vs Transparency: Tackling the tension
 
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
2015-02-10 The Open PHACTS Discovery Platform: Semantic Data Integration for ...
 
PubChem for chemical information literacy training
PubChem for chemical information literacy trainingPubChem for chemical information literacy training
PubChem for chemical information literacy training
 
Toxicological information in PubChem
Toxicological information in PubChemToxicological information in PubChem
Toxicological information in PubChem
 
BIA 10-2474 in GtoPdb
BIA 10-2474 in GtoPdbBIA 10-2474 in GtoPdb
BIA 10-2474 in GtoPdb
 
Generating Biomedical Hypotheses Using Semantic Web Technologies
Generating Biomedical Hypotheses Using Semantic Web TechnologiesGenerating Biomedical Hypotheses Using Semantic Web Technologies
Generating Biomedical Hypotheses Using Semantic Web Technologies
 
Searching for patent information in PubChem
Searching for patent information in PubChem Searching for patent information in PubChem
Searching for patent information in PubChem
 
Applicationsofbioinformaticsindrugdiscoveryandprocess
ApplicationsofbioinformaticsindrugdiscoveryandprocessApplicationsofbioinformaticsindrugdiscoveryandprocess
Applicationsofbioinformaticsindrugdiscoveryandprocess
 
PubChem and its application for cheminformatics education
PubChem and its application for cheminformatics educationPubChem and its application for cheminformatics education
PubChem and its application for cheminformatics education
 
Health Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven EdwardsHealth Datapalooza 2013: Datalab - Steven Edwards
Health Datapalooza 2013: Datalab - Steven Edwards
 
Aiding Computer Aided Drug Design
Aiding Computer Aided Drug DesignAiding Computer Aided Drug Design
Aiding Computer Aided Drug Design
 
Automate your literature monitoring for more effective pharmacovigilance
Automate your literature monitoring for more effective pharmacovigilanceAutomate your literature monitoring for more effective pharmacovigilance
Automate your literature monitoring for more effective pharmacovigilance
 

Similar to Data-driven drug discovery for rare diseases - Tales from the trenches (CINF 20, ACS National Meeting 2019-03-31)

Presentation at Rare Disease conference in San-Antonio
Presentation at Rare Disease conference in San-AntonioPresentation at Rare Disease conference in San-Antonio
Presentation at Rare Disease conference in San-Antonio
Anton Yuryev
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designing
W Roseybala Devi
 
Biomedical Literature
Biomedical Literature Biomedical Literature
Biomedical Literature
Arete-Zoe, LLC
 
Indications discovery and drug repurposing
Indications discovery and drug repurposingIndications discovery and drug repurposing
Indications discovery and drug repurposing
Sean Ekins
 
Mobilizing informational resources for rare diseases
Mobilizing informational resources for rare diseasesMobilizing informational resources for rare diseases
Mobilizing informational resources for rare diseases
Maria Shkrob
 
Mobilizing informational resources webinar
Mobilizing informational resources   webinarMobilizing informational resources   webinar
Mobilizing informational resources webinar
Ann-Marie Roche
 
Research methodology
Research methodologyResearch methodology
Research methodologyTosif Ahmad
 
Open Data in Medicine. Application of Mind Maping automation to visualize inf...
Open Data in Medicine. Application of Mind Maping automation to visualize inf...Open Data in Medicine. Application of Mind Maping automation to visualize inf...
Open Data in Medicine. Application of Mind Maping automation to visualize inf...
José M. Guerrero
 
The Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesThe Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across Scales
Philip Payne
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health Domain
BigData_Europe
 
Nursing Research Resources
Nursing Research Resources Nursing Research Resources
Nursing Research Resources
Ann Celestine
 
Patient Centered Care | Unit 8a Lecture
Patient Centered Care | Unit 8a LecturePatient Centered Care | Unit 8a Lecture
Patient Centered Care | Unit 8a Lecture
CMDLMS
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based Medicine
Ida Sim
 
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
Human Variome Project
 
Julian Little & Beth Potter: Rare Disease Day 2016 Conference
Julian Little & Beth Potter: Rare Disease Day 2016 Conference Julian Little & Beth Potter: Rare Disease Day 2016 Conference
Julian Little & Beth Potter: Rare Disease Day 2016 Conference
Canadian Organization for Rare Disorders
 
IRDiRC: progress and expectations
IRDiRC: progress and expectationsIRDiRC: progress and expectations
IRDiRC: progress and expectations
Canadian Organization for Rare Disorders
 
Digital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient AdvocateDigital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient AdvocateDavid Lee Scher, MD
 
Amia tbi-14-final
Amia tbi-14-finalAmia tbi-14-final
Amia tbi-14-final
Russ Altman
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Adam Ford
 

Similar to Data-driven drug discovery for rare diseases - Tales from the trenches (CINF 20, ACS National Meeting 2019-03-31) (20)

Presentation at Rare Disease conference in San-Antonio
Presentation at Rare Disease conference in San-AntonioPresentation at Rare Disease conference in San-Antonio
Presentation at Rare Disease conference in San-Antonio
 
Role of bioinformatics in drug designing
Role of bioinformatics in drug designingRole of bioinformatics in drug designing
Role of bioinformatics in drug designing
 
Biomedical Literature
Biomedical Literature Biomedical Literature
Biomedical Literature
 
Indications discovery and drug repurposing
Indications discovery and drug repurposingIndications discovery and drug repurposing
Indications discovery and drug repurposing
 
Mobilizing informational resources for rare diseases
Mobilizing informational resources for rare diseasesMobilizing informational resources for rare diseases
Mobilizing informational resources for rare diseases
 
Mobilizing informational resources webinar
Mobilizing informational resources   webinarMobilizing informational resources   webinar
Mobilizing informational resources webinar
 
Research methodology
Research methodologyResearch methodology
Research methodology
 
Search for evidence
Search for evidenceSearch for evidence
Search for evidence
 
Open Data in Medicine. Application of Mind Maping automation to visualize inf...
Open Data in Medicine. Application of Mind Maping automation to visualize inf...Open Data in Medicine. Application of Mind Maping automation to visualize inf...
Open Data in Medicine. Application of Mind Maping automation to visualize inf...
 
The Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesThe Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across Scales
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health Domain
 
Nursing Research Resources
Nursing Research Resources Nursing Research Resources
Nursing Research Resources
 
Patient Centered Care | Unit 8a Lecture
Patient Centered Care | Unit 8a LecturePatient Centered Care | Unit 8a Lecture
Patient Centered Care | Unit 8a Lecture
 
The Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based MedicineThe Uneven Future of Evidence-Based Medicine
The Uneven Future of Evidence-Based Medicine
 
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
MseqDR consortium: a grass-roots effort to establish a global resource aimed ...
 
Julian Little & Beth Potter: Rare Disease Day 2016 Conference
Julian Little & Beth Potter: Rare Disease Day 2016 Conference Julian Little & Beth Potter: Rare Disease Day 2016 Conference
Julian Little & Beth Potter: Rare Disease Day 2016 Conference
 
IRDiRC: progress and expectations
IRDiRC: progress and expectationsIRDiRC: progress and expectations
IRDiRC: progress and expectations
 
Digital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient AdvocateDigital Health Technology: The Ultimate Patient Advocate
Digital Health Technology: The Ultimate Patient Advocate
 
Amia tbi-14-final
Amia tbi-14-finalAmia tbi-14-final
Amia tbi-14-final
 
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
Ben Goertzel AIs, Superflies and the Path to Immortality - singsum au 2011
 

Recently uploaded

micro teaching on communication m.sc nursing.pdf
micro teaching on communication m.sc nursing.pdfmicro teaching on communication m.sc nursing.pdf
micro teaching on communication m.sc nursing.pdf
Anurag Sharma
 
Prix Galien International 2024 Forum Program
Prix Galien International 2024 Forum ProgramPrix Galien International 2024 Forum Program
Prix Galien International 2024 Forum Program
Levi Shapiro
 
Novas diretrizes da OMS para os cuidados perinatais de mais qualidade
Novas diretrizes da OMS para os cuidados perinatais de mais qualidadeNovas diretrizes da OMS para os cuidados perinatais de mais qualidade
Novas diretrizes da OMS para os cuidados perinatais de mais qualidade
Prof. Marcus Renato de Carvalho
 
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdfAlcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Dr Jeenal Mistry
 
Surat @ℂall @Girls ꧁❤8527049040❤꧂@ℂall @Girls Service Vip Top Model Safe
Surat @ℂall @Girls ꧁❤8527049040❤꧂@ℂall @Girls Service Vip Top Model SafeSurat @ℂall @Girls ꧁❤8527049040❤꧂@ℂall @Girls Service Vip Top Model Safe
Surat @ℂall @Girls ꧁❤8527049040❤꧂@ℂall @Girls Service Vip Top Model Safe
Savita Shen $i11
 
Physiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of TastePhysiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of Taste
MedicoseAcademics
 
Superficial & Deep Fascia of the NECK.pptx
Superficial & Deep Fascia of the NECK.pptxSuperficial & Deep Fascia of the NECK.pptx
Superficial & Deep Fascia of the NECK.pptx
Dr. Rabia Inam Gandapore
 
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.GawadHemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
NephroTube - Dr.Gawad
 
How STIs Influence the Development of Pelvic Inflammatory Disease.pptx
How STIs Influence the Development of Pelvic Inflammatory Disease.pptxHow STIs Influence the Development of Pelvic Inflammatory Disease.pptx
How STIs Influence the Development of Pelvic Inflammatory Disease.pptx
FFragrant
 
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness JourneyTom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
greendigital
 
heat stroke and heat exhaustion in children
heat stroke and heat exhaustion in childrenheat stroke and heat exhaustion in children
heat stroke and heat exhaustion in children
SumeraAhmad5
 
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptxPharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Dr. Rabia Inam Gandapore
 
Lung Cancer: Artificial Intelligence, Synergetics, Complex System Analysis, S...
Lung Cancer: Artificial Intelligence, Synergetics, Complex System Analysis, S...Lung Cancer: Artificial Intelligence, Synergetics, Complex System Analysis, S...
Lung Cancer: Artificial Intelligence, Synergetics, Complex System Analysis, S...
Oleg Kshivets
 
Charaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Charaka Samhita Sutra sthana Chapter 15 UpakalpaniyaadhyayaCharaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Charaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Dr KHALID B.M
 
How to Give Better Lectures: Some Tips for Doctors
How to Give Better Lectures: Some Tips for DoctorsHow to Give Better Lectures: Some Tips for Doctors
How to Give Better Lectures: Some Tips for Doctors
LanceCatedral
 
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
i3 Health
 
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
VarunMahajani
 
Ocular injury ppt Upendra pal optometrist upums saifai etawah
Ocular injury  ppt  Upendra pal  optometrist upums saifai etawahOcular injury  ppt  Upendra pal  optometrist upums saifai etawah
Ocular injury ppt Upendra pal optometrist upums saifai etawah
pal078100
 
Evaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animalsEvaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animals
Shweta
 
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptxTriangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
Dr. Rabia Inam Gandapore
 

Recently uploaded (20)

micro teaching on communication m.sc nursing.pdf
micro teaching on communication m.sc nursing.pdfmicro teaching on communication m.sc nursing.pdf
micro teaching on communication m.sc nursing.pdf
 
Prix Galien International 2024 Forum Program
Prix Galien International 2024 Forum ProgramPrix Galien International 2024 Forum Program
Prix Galien International 2024 Forum Program
 
Novas diretrizes da OMS para os cuidados perinatais de mais qualidade
Novas diretrizes da OMS para os cuidados perinatais de mais qualidadeNovas diretrizes da OMS para os cuidados perinatais de mais qualidade
Novas diretrizes da OMS para os cuidados perinatais de mais qualidade
 
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdfAlcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
Alcohol_Dr. Jeenal Mistry MD Pharmacology.pdf
 
Surat @ℂall @Girls ꧁❤8527049040❤꧂@ℂall @Girls Service Vip Top Model Safe
Surat @ℂall @Girls ꧁❤8527049040❤꧂@ℂall @Girls Service Vip Top Model SafeSurat @ℂall @Girls ꧁❤8527049040❤꧂@ℂall @Girls Service Vip Top Model Safe
Surat @ℂall @Girls ꧁❤8527049040❤꧂@ℂall @Girls Service Vip Top Model Safe
 
Physiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of TastePhysiology of Special Chemical Sensation of Taste
Physiology of Special Chemical Sensation of Taste
 
Superficial & Deep Fascia of the NECK.pptx
Superficial & Deep Fascia of the NECK.pptxSuperficial & Deep Fascia of the NECK.pptx
Superficial & Deep Fascia of the NECK.pptx
 
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.GawadHemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
Hemodialysis: Chapter 3, Dialysis Water Unit - Dr.Gawad
 
How STIs Influence the Development of Pelvic Inflammatory Disease.pptx
How STIs Influence the Development of Pelvic Inflammatory Disease.pptxHow STIs Influence the Development of Pelvic Inflammatory Disease.pptx
How STIs Influence the Development of Pelvic Inflammatory Disease.pptx
 
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness JourneyTom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
Tom Selleck Health: A Comprehensive Look at the Iconic Actor’s Wellness Journey
 
heat stroke and heat exhaustion in children
heat stroke and heat exhaustion in childrenheat stroke and heat exhaustion in children
heat stroke and heat exhaustion in children
 
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptxPharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
Pharynx and Clinical Correlations BY Dr.Rabia Inam Gandapore.pptx
 
Lung Cancer: Artificial Intelligence, Synergetics, Complex System Analysis, S...
Lung Cancer: Artificial Intelligence, Synergetics, Complex System Analysis, S...Lung Cancer: Artificial Intelligence, Synergetics, Complex System Analysis, S...
Lung Cancer: Artificial Intelligence, Synergetics, Complex System Analysis, S...
 
Charaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Charaka Samhita Sutra sthana Chapter 15 UpakalpaniyaadhyayaCharaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
Charaka Samhita Sutra sthana Chapter 15 Upakalpaniyaadhyaya
 
How to Give Better Lectures: Some Tips for Doctors
How to Give Better Lectures: Some Tips for DoctorsHow to Give Better Lectures: Some Tips for Doctors
How to Give Better Lectures: Some Tips for Doctors
 
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
New Directions in Targeted Therapeutic Approaches for Older Adults With Mantl...
 
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
Pulmonary Thromboembolism - etilogy, types, medical- Surgical and nursing man...
 
Ocular injury ppt Upendra pal optometrist upums saifai etawah
Ocular injury  ppt  Upendra pal  optometrist upums saifai etawahOcular injury  ppt  Upendra pal  optometrist upums saifai etawah
Ocular injury ppt Upendra pal optometrist upums saifai etawah
 
Evaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animalsEvaluation of antidepressant activity of clitoris ternatea in animals
Evaluation of antidepressant activity of clitoris ternatea in animals
 
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptxTriangles of Neck and Clinical Correlation by Dr. RIG.pptx
Triangles of Neck and Clinical Correlation by Dr. RIG.pptx
 

Data-driven drug discovery for rare diseases - Tales from the trenches (CINF 20, ACS National Meeting 2019-03-31)

  • 1. CINF20 - 31 March 2019 Dr Frederik van den Broek, Elsevier Professional Services Data-driven drug discovery for rare diseases Tales from the trenches
  • 2. This is what we are all after in drug discovery… Image: Elsevier
  • 3. If drug discovery and development only were that simple… Disease Drug compound
  • 4. If drug discovery and development only were that simple… Disease Protein Target Drug compound
  • 5. If drug discovery and development only were that simple… Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • …
  • 6. If drug discovery and development only were that simple… Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • … • Availability • Synthesis • PK/PD • … • Genotype • Phenotype • Individual
  • 7. If drug discovery and development only were that simple… Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • … • Availability • Synthesis • PK/PD • … • Genotype • Phenotype • Individual
  • 8. This makes it all a lengthy and costly process Image: https://www.phrma.org/graphic/the-biopharmaceutical-research-and-development-process
  • 9. With rare diseases it is even harder Small(er) patient populations leading to • Less (integral) medical and scientific knowledge • Small population for clinical trials • Unawareness with doctors, researchers, policymakers • Smaller potential market size for a drug Image: http://www.campingtourist.com/camping-activities/climbing/difficult-mountains-climb/
  • 10. Drug repurposing: a new hope for rare diseases • Less costly and of interest for pharma • Quicker to Phase II/III tests, so hopefully quicker to market • Need reliable information from various sources to find suitable repurposing candidates Image: https://www.starwars.com/news/poll-what-is-the-best-scene-in-star-wars-a-new-hope
  • 11. Accelerate with new knowledge and data Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • … • Availability • Synthesis • PK/PD • … • Genotype • Phenotype • Individual
  • 12. Various initiatives we were recently involved in • Project with Findacure to find drug repurposing candidates for Congenital Hyperinsulinism • Pistoia Hackaton: Elsevier-Findacure challenge on Friedrich’s Ataxia • Sub-network enrichment analysis for neuromuscular disorder pathways • Disease pathway analysis for Huntingdon's Disease • Pistoia Datathon for drug repurposing for rare diseases
  • 13. | 13 • A rare genetic disease • Permanently excessive level of insulin in the blood • Develops within the first few days of life • Can lead to brain injury or even death • In the most severe cases the only viable treatment is the removal of the pancreas, consigning the patient to a lifetime of diabetes Congenital hyperinsulinsm (CHI) https://res.cloudinary.com/indiegogo-media-prod- cld/image/upload/c_limit,w_620/v1440424745/uzvnq zhvbpsrtthzxqpu.jpg
  • 14. Creating a comprehensive view of CHI • CHI Literature Library • Disease, Target, Pathway, and Compound Analysis • Research Landscape Analysis Information Assets Applied • Content Elsevier’s vast set of literature and patent data • Data normalization Taxonomies and dictionaries to normalize author names, institutions, drugs, targets, and other important terms • Information extraction Finding semantic relationships, targets, pathways, drugs, and bioactivities
  • 15. Building and refining the CHI disease model Picked relevant pathways (from a collection of 1800 models) Explored functions of proteins using 6.2M pre- text mined relations and embedded Gene Ontology Summarized what is known about CHI mechanism in an overview model
  • 16. From pathways to CHI treatments: Automated analysis combines bioassay data with pathway data Mean of activities among these targets Me Targets and activities for each compound Drug-likeness metrics for sorting/classification • All compounds that were observed to bind to targets in pathway • Sorted by number of active targets. Too many targets may suggest lack of specificity. Find all targets that could be used to affect the disease state Query for each target to find compounds that have high affinity for them (>6 log units) Collate data by compound to summarize the targets/activities related to disease that the compound hits • Compute geometric mean of activities for ranking • Rank by number of targets and geometric mean of activities against targets Step 1 Step 2 Step 3
  • 17. Pistoia Hackathon Challenge (2017) Elsevier would like you to demonstrate the ability of deep learning to help Findacure, a UK-based charity, accelerate treatment and clinical research for Friedreich’s ataxia (FRDA). You’ll have access to a heterogeneous set of data related to the disease: biological pathway analysis, associated chemical compounds and bioactivities, potential candidates for drug re-purposing, full- text scientific literature and clinical trial data. Basically, giving others a go with the data sets we worked with on CHI….
  • 18. Promising results, but still hard work “We spent most of our time the first day just trying to get our heads around the data, so we could start to find some solutions. Even opening the files was tricky.” The students used various tools to try to extract data from the provided XML files, but it was slow going. Daniel [one of the participants] commented that, “we wound up having to do a lot of things manually, so we could at least read the files in plain text.”
  • 19. Sharing disease pathways • Shared curated pathways (with supporting literature references) with rare disease organisations to help their discussions with researchers and fill in potential “blanks” • Comparing gene expression algorithms for the identification of expression regulators • Well-defined datasets, with supporting literature references which resonate with researchers
  • 20. Datathon (2019): Applying AI in Drug Repurposing for Rare Diseases
  • 21. “Machine learning won’t work if your data is rigidly siloed.” “One major challenge is collecting enough reliable information to properly train AI systems. AI is as good as the data.” Nick Patience Founder, 451 Research “Organizations need to make sure that the data being accessed is treated and defined consistently across the sources. Otherwise, virtualization won't work.” “All the major AI advances have been fueled by advances in data sets. The algorithms are easy…. "Collecting, classifying and labeling datasets used to train the algorithms is the grunt work that’s difficult” Aspuru-Guzik Professor of Chemistry & Machine Learning, Harvard University JJ Guy CTO, Jask (AI co.) ‘Siloed’ Lack of standards Requires labeling and contextPoor quality1 2 3 4 Using the Entellect Platform and Data Curation
  • 22. Access, curation of authoritative life science data Integration of disparate data, structured and unstructured Normalized and standardized data with industry standard taxonomies Build custom and off-the- shelf analytics tools ‘Un-siloed’ Harmonized Enriched and linkedQuality Nick Patience Founder, 451 Research Aspuru-Guzik Professor of Chemistry & Machine Learning, Harvard University 1 2 3 4 Using the Entellect Platform and Data Curation
  • 23. Adverse Event Person Org Which drugs affect this target? Bio- Activity Pathway Disease Bioprocess Trial Disease Species Target Drug Assay Substance: - provenanceName - substance - name - compoundType - substanceTypeName - inchiCode - molecularFormula - charge - numberOfAtoms - numberOfComponents - numberOfElements - numberOfFragments - numberOfStructure - molWeightPublishedValue - molWeightPublishedUnit - molWeightStandardValue - mpvalue Bioactivity: - provenanceName - effect - inducedBy - target - targetsCount - bioactivityParameterName - displayValue - publishedValue - publishedUnit - pX Target: - provenanceName - target - uniprotId - sequence - targetType - label - speciesId - speciesName - geneSymbol Entellect Platform and Data Curation
  • 24. Various teams using various approaches • Semantic data: Target Identification • Semantic data: Small Molecule Binding • Machine Learning − Ensemble Learning − Mol2Vec, Prot2Vec − Network diffusion • Expert collaboration − Virtual docking − Adverse Event profiling “I could work on the important stuff straight away, using all the data”
  • 25. Promising results so far (March 2019)
  • 26. Aiming to make data-driven drug discovery for rare diseases a little easier… Disease Protein Target Drug compound • Cell processes • Regulators • Pathways • … • Bioactivity • Toxicity • Specificity • … • Availability • Synthesis • PK/PD • … • Genotype • Phenotype • Individual
  • 27. Conclusions • Data, data, data… • Data has to be FAIR and of good and trusted provenance as the researchers and clinicians will want to see the “chain of evidence” (beware of black box models) • Data sets also have to be FAIR for each other: enabling the integral approaches repurposing needs have to be linked data sets across siloes and domains to go from disease to target to compound (and back) Image: Sangya Pundir, CC BY-SA 4.0, https://commons.wikimedia.org/w/index.php?curid=53414062
  • 28. Acknowledgements • Maria Shkrob • Jabe Wilson • Anton Yuryev • Matthew Clark • Christy Wilson • Finlay Maclean • Elsevier’s Entellect team • Pistioia hackaton and datathon teams
  • 29. Questions? By Malis - https://commons.wikimedia.org/w/index.php?curid=2633354
  • 30. Appendix – Datathon approaches