SlideShare a Scribd company logo
Data Science Meets Drug Discovery
Philip Bourne, University of Virginia
Zheng Zhao, University of Virginia
Lei Xie, The City University of New York
0
Outline
■ One definition of data science
■ A brief history of in silico early-stage drug discovery
■ Bioinformatics methods for structure-based drug
design
■ Machine Learning/Deep Learning solutions to
systems pharmacology
■ Summary
1
One Definition of Data Science – The 4+1 Model
■ Value – ensuring societal
benefit
■ Design - Communication
of the value of data
■ Systems – the means to
communicate and convey
benefit
■ Analytics – models and
methods
■ Practice – where
everything happens – for
today that is early-stage
drug discovery
[Adapted from Raf Alvarado] Bourne PE (2021) Is “bioinformatics” dead? PLoS Biol 19(3): e3001165.
A Snapshot of In Silico Drug Discovery
Today’s Discussion
Methods
Simple Complex
Data
Small
Large
Discovery
Process
One Drug – One Target
One Drug – Multiple Targets
Multiple Drugs – Multiple Targets
Multiple Drugs – Multiple Targets – One Person
Populations/Environment
Molecules
Omics
Model
Organisms
Organs
Cells
Docking MD Bioinformatics ML DL
A Snapshot of In Silico Drug Discovery
Today’s Discussion
Method
Simple Complex
Data
Small
Large
Discovery
Process
One Drug – One Target
One Drug – Multiple Targets
Multiple Drugs – Multiple Targets
Multiple Drugs – Multiple Targets – One Person
Populations
Molecules
Omics
Model
Organisms
Organs
Cells
Docking MD Bioinformatics ML DL
One-drug-one-target Small Molecule Drug Discovery
Process
5
>90% Failure Rate
Druggable
target
identification
Lead compound
identification and
optimization
Pre-clinical
testing
Clinical trial
Omics
data
analysis
Kiriiri et al. Future J Pharm Sci. (2020)
Adapted from http://www.biomech.ulg.ac.be/project/multi-
omics/
Fundamental Challenge of Conventional Drug Discovery
in Combating Complex Diseases
6
disease
association
binding
affinity
animal
response
Clinical
efficacy
& safety
Bender & Cortes-Ciriano, Drug Discov Today (2021)
Disease gene may
not be right drug
target
Genome-wide off-target
bindings and systems
effects are ignored
animal-human
extrapolation
remains challenging
Each step in the drug discovery process is optimized for a
proxy objective but not a clinical endpoint
Power of Artificial Intelligence (Deep Learning) has not
been fully utilized
Adapted from http://www.biomech.ulg.ac.be/project/multi-
omics/
Environmental
factors
(e.g., microbiome)
7
Discovery Process: Multi-Scale Modeling Drug Actions
in the Human Body
Pleiotropic systems
effect of drugs
Xie & Bourne (2012) Annu Rev
Pharmacol Toxicol,
Outline
■ One definition of data science
■ A brief history of in silico early-stage drug discovery
■ Bioinformatics methods for structure-based drug
design
■ Machine Learning/Deep Learning solutions to
systems pharmacology
■ Summary
8
9
Bioinformatics Methods for Drug Design and Discovery
❏ Utilize structural genomics
data
❏ Explore evolutionary and
functional linkage (ligand
binding site similarity)
across fold space
❏ Successfully applied to drug
repurposing, side effect
prediction, and
polypharmacology
Bioinformatics Methods for Drug Design and Discovery
Early-Stage Drug Discovery Patterns:
Categories Principles
Ligand-based Chemical Similarity; Pharmacophore Similarity
Structure-based Binding site (Pocket) Similarity
Protein-ligand-based Pharmacophore Similarity; Interaction Fingerprint Similarity
Bender, et al; Drug Discov. Today; 2021
Bioinformatics Methods for Protein-ligand interaction -
based Drug Design
11
IFPs: Protein-Ligand Interaction Fingerprints
Given a ligand-binding complex
Encoding IFPs
Every residue – ligand
interaction encoded into a
7-bit substring
One 1D string
representing the protein-
ligand interactions
Using pre-defined geometric rules
Interactions Rules*
H-bond
Distance (Donor-Accepter) ≤ 3.5 Å && Tolerance
Angle (Donor-H-Acceptor ∈[-π/3, π/3])
Ionic Distance (Anion-Cation) ≤ 4.0 Å
Aromatic (face to
face)
Distance (two aromatic ring centers) ≤ 4.0 Å &&
Tolerance angle (face to edge) ∈[-π/6, π/6])
Aromatic (face to
edge)
Distance (two aromatic ring centers) ≤ 4.0 Å &&
Tolerance angle (face to edge) ∈[π/6, 5π/6])
Hydrophobic Distance (Donor-Accepter) ≤ 4.5 Å
Deng et al. J. Med. Chem (2004)
Marcou et al. J. Chem. Inf. Model. (2007)
Fs-IFP: Structural Proteome-scale Protein-Ligand
Interaction Fingerprint Method
12
Zhao et al. Drug Discov. Today (2022)
Zhao et al. J. Chem. Inf. Model. (2023)
Zhao et al. ACS Pharmacol. Transl. Sci (2023)
Binding-sites
Alignment
Encoding
Fs-IPFs
Structura
l
Proteome
Binding-site-Aligned
Proteome Database
Encoding each complex
into one IFP string
Covalent Inhibitors,
Allosteric Inhibitors,
Macrocyclic Inhibitors,
Drug (Resistance)
Mechanisms, etc.
Apply to
Fs-IFP-encoded
Structural Proteome
Fs-IFPs: Application to Allosteric Kinase Inhibitors
13
MEK-ligand
Complexes from
PDB and MD
simulations
as Structural
Dataset
Top 15 potential
kinase targets
for allosteric
inhibitor design
Predicting
Zhao et al. PLoS One 12 (6), e0179936, 2017
Fs-IFP
Encoding
Extracting MEK-
inhibitor Interaction
Characteristics
Binding sites
aligned Dataset
Alignment
Advantage: Allosteric Kinase Inhibitor is attractive due to high, unique selectivity.
Background: 76 Kinase Drugs approved; 4 are allosteric type inhibitors targeting MEK.
Fs-IFPs: Application to Macrocyclic Kinase Inhibitors (MKIs)
14
Zhao et al. ACS Pharmacol. Transl. Sci (2023)
Georg et al. Trends Pharmacol. Sci (2022)
Background: 76 Kinase Drugs approved; Challenge: Drug Resistance
Trending: MKIs attracting much attention;
Progress: 2 Approved and 7 in clinical trials (Below)
8 out of 9 are designed by
rational drug design strategies
from the acyclic starting points
■ How to design MKI?
15
Fs-IFPs: Application to Macrocyclic Kinase Inhibitors (MKIs)
Acyclic Inhibitors
MKIs
Zhao et al. ACS Pharmacol. Transl. Sci (2023)
16
Fs-IFPs: Apply to Macrocyclic Kinase Inhibitors (MKIs)
+
Comparing Binding modes of MKIs and their acyclic counterparts
Scaffolds with the
same binding
modes before and
after cyclization
Rigid
Scaffold
MD trajectories analysis using Fs-IFPs
for MKIs and their corresponding acyclic
counterparts
■ What kind of rigid scaffolds are promising?
17
Fs-IFPs: Application to Macrocyclic Kinase Inhibitors (MKIs)
MKI database
Rigid Scaffolds with 3
aromatic rings directly
connected to one another
Analyzing
https://zhengzhster.github.io/MKIs/ Zhao et al. ACS Pharmacol. Transl. Sci (2023)
Outline
■ One definition of data science
■ A brief history of in silico early-stage drug discovery
■ Bioinformatics methods for structure-based drug
design
■ Machine Learning/Deep Learning solutions to
systems pharmacology
■ Summary
18
Discovery Process: A Systems Pharmacology Paradigm
19
Chemical perturbed
Omics profile
Clinical
Biomarkers
Patient Omics profile
Adapted from http://www.biomech.ulg.ac.be/project/multi-
omics/
Sun et al. (2016) Adv. Genetics
Challenges in Systems Pharmacology:
Multi-omics data integration and knowledge discovery
❏ Noisy, high-dimensional, and sparse
❏ Technical (batch effect) and
biological confounding factors (age,
gender etc.)
❏ Complexity of molecular interplays
across biological layers
❏ Common and unique features across
species
❏ Correlation vs causality
Challenges in Systems Pharmacology:
Extensive Dark Chemical and Functional Genomics Space
Machine Learning/Deep Learning is an Indispensable
Tool
■ To overcome the extensive dark genome/proteome:
■ It is unethical and infeasible to screen compounds in humans.
■ Complexity and high dimensionality of omics data are beyond
human comprehension.
■ Understudied biology beyond the reach of experimental
techniques.
■ Hence: only a computational approach can overcome this data gap
22
Filling the Data Gap: Deep Learning Advantages
■ Data-driven feature extraction
■ Integration of incoherent diverse data (both labeled and unlabeled)
■ Explicitly model multi-level organization of biological systems
23
adapted from https://levity.ai/blog/difference-machine-learning-
deep-learning
Filling the Data Gap: Self-Supervised Learning
■ Utilizing a large amount of unlabeled data
■ Facilitating the exploration of dark chemical and biological space
24
protein
chemical
gene expression
P H A R M
(NC2=O)C3=C(C=CC(=C3)S(=O)(=O)N4CCN(CC4)C)OCC)
9 13 20 2 3
English sentence
Filling the Data Gap: Transfer Learning & Domain
Translation
■ Translating data modality (e.g., source = cell line omics; target = patient image biomarker)
■ Removing biological and technological confounding factors across conditions and species
25
Yang et al. (2021) Nature Comm.
Filling the Data Gap: End-to-end Differentiable Biology
■ Protein sequence (unknown structure)->3D protein-ligand binding pose
Final Objective Immutable
Intermediate
objective Final
Objective
26
Final
Objective
Mutable
Classic
ML
Deep
Learning
fine-tuning
pre-training
AlQuraisi et al., Nat. Methods (2021)
End-to-end Differentiable Systems Pharmacology for
Precision Drug Discovery
27
What Can Deep Learning Do for Systems
Pharmacology-Oriented Drug Discovery ?
28
Key Questions
● What are the genome-wide target(s) and off-target(s)?
● How does drug modulate cellular state characterized by omics?
● How to translate drug activities in a disease model to clinical
human response?
Predicting Genome-Wide Protein-Chemical Interactions
for Dark Proteins
Protein similarity Chemical similarity
❏ PortalCG: An end-to-end sequence-structure
function neural network
Zhang et al. IJCAI, 2020; Liu et al. Front Bioinfor, 2021;
Cai et al. JCIM, 2021; Cai et al. Bioinformatics, 2022
Cai et al. PLoS CB, 2023; Zhang et al. Sci. Rep., 2023
Wu et al. under review, 2023
29
meta &
semi-
supervised
learning
Cai et al. (2023) PLoS CB
PortalCG: Application to Polypharmacology
30
■ Selective dual-DRD1/3 antagonist of Opioid Use Disorder (OUD)
Cai et al. (2023) PLoS CB
D1R pharmacophore D3R pharmacophore
Zhuang et al. Cell (2021)
Omics-Driven Phenotype Compound Screening
31
Chemical
structure
Gene expression
after treatment
Dose-response
cell viability
Protein expression
after treatment
Dosage
Clinical
Efficacy &
Toxicity
Cell type-specific
omics profile of patient
DNA-RNA-Protein
Unlabled-
labled
Qiu et al. Bioinformatics, 2020; Pham et al. Nat Mach Intell, 2021; He et al. Bioinformatics, 2021;
Pham et al. Patterns, 2022; He et al. Nat Mach Intell, 2022; Wu et al. PLoS CB, 2022;
Wu et al. Cell Rep Methods, 2023
Omics-Driven Phenotype Compound Screening
control
control
Resistant cancer cell Sensitive cancer cell
Transform agent Cidal agent
cold hot
32
- Drug combination to induce immunotherapy response
Pham et al. (2022) Patterns
Translating Cell Line Screens to Patient Clinical
Responses for Personalized Medicine
He et al. (2022) Nature MI
33
in
vitro
cell
line
Clinical
patient
tissue
?
Context-aware Deconfounding Autoencoder (CODE-AE)
Translating Cell Line Screens to Patient Clinical
Responses for Personalized Medicine
He et al. (2022) Nature MI 34
Put Together: Multi-Scale Modeling of Drug Actions
for AD Drug Repurposing
Drug
IRAK1
Genome-wide
CPIs
TLR/MYD88/
NF-kB pathway
TNF pathway
Lipopolysaccharide
Synthesis pathway
Herpes simplex
virus infection
Pathway
Oliveros et al. (2022) Brain 35
Reduced Plaque Burden (in red)
Reduced Microgliosis
Reduced Tangle Burden (in red)
TGNT TGTR
Improve Learning Memory
Asthma drug
(PDE inhibitor)
Clinical outcome
Will AI be Sufficient?
36
Limitations of ML: Interpretability
37
Oviedo et al. Acc. Mater. Res. (2022)
Limitations of ML: Out-of-distribution Generalization
chemical
phenotype readout
chemical
phenotype readout
38
adapted from https://medium.com/geekculture/out-of-distribution-detection-in-medical-ai-
b638b385c2a3
Integrating Data-driven and Mechanism-based
Modeling
39
■ Predictive modeling of protein-ligand binding kinetics
Causal Learning
Biophysics &
Mathematical
modeling
Outline
■ One definition of data science
■ A brief history of in silico early-stage drug discovery
■ Bioinformatics methods for structure-based drug
design
■ Machine Learning/Deep Learning solutions to
systems pharmacology
■ Summary
40
Summary
41
■ Computational approaches to drug discovery have advanced on 3 axes:
1. The scale, complexity and amount of digital data available
2. The type of computation employed:
a) From the application of measurable physical features
b) To the discovery of unknown features
c) To the prediction of outcomes
3. How we think about the problem
a) One drug-one target
b) Multiple drugs-multiple targets- populations
c) Multiple drugs-multiple targets-one human

More Related Content

Similar to Data Science Meets Drug Discovery

Presentation on rational drug design converted
Presentation on rational drug design convertedPresentation on rational drug design converted
Presentation on rational drug design converted
B V V S Hanagal Shri Kumareshwar College of Pharmacy, Bagalkote
 
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
Chris Southan
 
T-bioinfo overview
T-bioinfo overviewT-bioinfo overview
T-bioinfo overview
Jaclyn Williams
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and Approaches
Elia Brodsky
 
Cadd
CaddCadd
Cadd
HIMELROUF
 
Applications Of Bioinformatics In Drug Discovery And Process
Applications Of Bioinformatics In Drug Discovery And ProcessApplications Of Bioinformatics In Drug Discovery And Process
Applications Of Bioinformatics In Drug Discovery And ProcessProf. Dr. Basavaraj Nanjwade
 
Drug Designing
Drug DesigningDrug Designing
Drug Designing
ZainabSaif
 
Role of bioinformatics of drug designing
Role of bioinformatics of drug designingRole of bioinformatics of drug designing
Role of bioinformatics of drug designing
Dr NEETHU ASOKAN
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
open_phacts
 
Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...
Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...
Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...
jodischneider
 
Computer aided drug designing
Computer aided drug designing Computer aided drug designing
Computer aided drug designing Ayesha Aftab
 
Ontology for the Financial Services Industry
Ontology for the Financial Services IndustryOntology for the Financial Services Industry
Ontology for the Financial Services IndustryBarry Smith
 
Next Generation Data and Opportunities for Clinical Pharmacologists
Next Generation Data and Opportunities for Clinical PharmacologistsNext Generation Data and Opportunities for Clinical Pharmacologists
Next Generation Data and Opportunities for Clinical Pharmacologists
Philip Bourne
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
Maakasaikumar
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance
 
computational chemistry
computational chemistrycomputational chemistry
computational chemistry
SAMUELAKANDE3
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health Domain
BigData_Europe
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
philmaweb
 
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
santosh Kumbhar
 

Similar to Data Science Meets Drug Discovery (20)

Presentation on rational drug design converted
Presentation on rational drug design convertedPresentation on rational drug design converted
Presentation on rational drug design converted
 
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
IUPHAR/BPS Guide to Pharmacology: concise mapping of chemistry, data, and tar...
 
T-bioinfo overview
T-bioinfo overviewT-bioinfo overview
T-bioinfo overview
 
T-BioInfo Methods and Approaches
T-BioInfo Methods and ApproachesT-BioInfo Methods and Approaches
T-BioInfo Methods and Approaches
 
Cadd
CaddCadd
Cadd
 
Applications Of Bioinformatics In Drug Discovery And Process
Applications Of Bioinformatics In Drug Discovery And ProcessApplications Of Bioinformatics In Drug Discovery And Process
Applications Of Bioinformatics In Drug Discovery And Process
 
Drug Designing
Drug DesigningDrug Designing
Drug Designing
 
Role of bioinformatics of drug designing
Role of bioinformatics of drug designingRole of bioinformatics of drug designing
Role of bioinformatics of drug designing
 
2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe2011-10-11 Open PHACTS at BioIT World Europe
2011-10-11 Open PHACTS at BioIT World Europe
 
Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...
Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...
Acquiring and representing drug-drug interaction knowledge and evidence, TRIA...
 
Computer aided drug designing
Computer aided drug designing Computer aided drug designing
Computer aided drug designing
 
Ontology for the Financial Services Industry
Ontology for the Financial Services IndustryOntology for the Financial Services Industry
Ontology for the Financial Services Industry
 
Next Generation Data and Opportunities for Clinical Pharmacologists
Next Generation Data and Opportunities for Clinical PharmacologistsNext Generation Data and Opportunities for Clinical Pharmacologists
Next Generation Data and Opportunities for Clinical Pharmacologists
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
 
computational chemistry
computational chemistrycomputational chemistry
computational chemistry
 
Computer Aided Drug Design
Computer Aided Drug DesignComputer Aided Drug Design
Computer Aided Drug Design
 
Big Data Analytics in the Health Domain
Big Data Analytics in the Health DomainBig Data Analytics in the Health Domain
Big Data Analytics in the Health Domain
 
Introduction to bioinformatics
Introduction to bioinformaticsIntroduction to bioinformatics
Introduction to bioinformatics
 
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
 

More from Philip Bourne

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
Philip Bourne
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
Philip Bourne
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
Philip Bourne
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
Philip Bourne
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
Philip Bourne
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
Philip Bourne
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
Philip Bourne
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
Philip Bourne
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
Philip Bourne
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
Philip Bourne
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
Philip Bourne
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
Philip Bourne
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
Philip Bourne
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
Philip Bourne
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
Philip Bourne
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
Philip Bourne
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
Philip Bourne
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
Philip Bourne
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
Philip Bourne
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
Philip Bourne
 

More from Philip Bourne (20)

Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
Data Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has ChangedData Science and AI in Biomedicine: The World has Changed
Data Science and AI in Biomedicine: The World has Changed
 
AI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a ConversationAI in Medical Education A Meta View to Start a Conversation
AI in Medical Education A Meta View to Start a Conversation
 
AI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We GoingAI+ Now and Then How Did We Get Here And Where Are We Going
AI+ Now and Then How Did We Get Here And Where Are We Going
 
Thoughts on Biological Data Sustainability
Thoughts on Biological Data SustainabilityThoughts on Biological Data Sustainability
Thoughts on Biological Data Sustainability
 
What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?What is FAIR Data and Who Needs It?
What is FAIR Data and Who Needs It?
 
Data Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything ChangeData Science Meets Biomedicine, Does Anything Change
Data Science Meets Biomedicine, Does Anything Change
 
Biomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not AloneBiomedical Data Science: We Are Not Alone
Biomedical Data Science: We Are Not Alone
 
BIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in ResearchBIMS7100-2023. Social Responsibility in Research
BIMS7100-2023. Social Responsibility in Research
 
AI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data ScienceAI from the Perspective of a School of Data Science
AI from the Perspective of a School of Data Science
 
What Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's ViewWhat Data Science Will Mean to You - One Person's View
What Data Science Will Mean to You - One Person's View
 
Novo Nordisk 080522.pptx
Novo Nordisk 080522.pptxNovo Nordisk 080522.pptx
Novo Nordisk 080522.pptx
 
Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)Towards a US Open research Commons (ORC)
Towards a US Open research Commons (ORC)
 
COVID and Precision Education
COVID and Precision EducationCOVID and Precision Education
COVID and Precision Education
 
One View of Data Science
One View of Data ScienceOne View of Data Science
One View of Data Science
 
Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?Cancer Research Meets Data Science — What Can We Do Together?
Cancer Research Meets Data Science — What Can We Do Together?
 
Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?Data Science Meets Open Scholarship – What Comes Next?
Data Science Meets Open Scholarship – What Comes Next?
 
Data to Advance Sustainability
Data to Advance SustainabilityData to Advance Sustainability
Data to Advance Sustainability
 
Frontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular ScalesFrontiers of Computing at the Cellular and Molecular Scales
Frontiers of Computing at the Cellular and Molecular Scales
 
Social Responsibility in Research
Social Responsibility in ResearchSocial Responsibility in Research
Social Responsibility in Research
 

Recently uploaded

Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
EverAndrsGuerraGuerr
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
EugeneSaldivar
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
MIRIAMSALINAS13
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
Tamralipta Mahavidyalaya
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
beazzy04
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
Sandy Millin
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
Celine George
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
Mohd Adib Abd Muin, Senior Lecturer at Universiti Utara Malaysia
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
Anna Sz.
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
Atul Kumar Singh
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
GeoBlogs
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
rosedainty
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
Balvir Singh
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
GeoBlogs
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
Fundacja Rozwoju Społeczeństwa Przedsiębiorczego
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
bennyroshan06
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
Vikramjit Singh
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
Delapenabediema
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
Excellence Foundation for South Sudan
 

Recently uploaded (20)

Thesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.pptThesis Statement for students diagnonsed withADHD.ppt
Thesis Statement for students diagnonsed withADHD.ppt
 
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...TESDA TM1 REVIEWER  FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
TESDA TM1 REVIEWER FOR NATIONAL ASSESSMENT WRITTEN AND ORAL QUESTIONS WITH A...
 
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXXPhrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
Phrasal Verbs.XXXXXXXXXXXXXXXXXXXXXXXXXX
 
Home assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdfHome assignment II on Spectroscopy 2024 Answers.pdf
Home assignment II on Spectroscopy 2024 Answers.pdf
 
Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345Sha'Carri Richardson Presentation 202345
Sha'Carri Richardson Presentation 202345
 
2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...2024.06.01 Introducing a competency framework for languag learning materials ...
2024.06.01 Introducing a competency framework for languag learning materials ...
 
How to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERPHow to Create Map Views in the Odoo 17 ERP
How to Create Map Views in the Odoo 17 ERP
 
Chapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptxChapter 3 - Islamic Banking Products and Services.pptx
Chapter 3 - Islamic Banking Products and Services.pptx
 
Polish students' mobility in the Czech Republic
Polish students' mobility in the Czech RepublicPolish students' mobility in the Czech Republic
Polish students' mobility in the Czech Republic
 
Language Across the Curriculm LAC B.Ed.
Language Across the  Curriculm LAC B.Ed.Language Across the  Curriculm LAC B.Ed.
Language Across the Curriculm LAC B.Ed.
 
Fish and Chips - have they had their chips
Fish and Chips - have they had their chipsFish and Chips - have they had their chips
Fish and Chips - have they had their chips
 
Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)Template Jadual Bertugas Kelas (Boleh Edit)
Template Jadual Bertugas Kelas (Boleh Edit)
 
Operation Blue Star - Saka Neela Tara
Operation Blue Star   -  Saka Neela TaraOperation Blue Star   -  Saka Neela Tara
Operation Blue Star - Saka Neela Tara
 
The geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideasThe geography of Taylor Swift - some ideas
The geography of Taylor Swift - some ideas
 
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdfESC Beyond Borders _From EU to You_ InfoPack general.pdf
ESC Beyond Borders _From EU to You_ InfoPack general.pdf
 
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptxMARUTI SUZUKI- A Successful Joint Venture in India.pptx
MARUTI SUZUKI- A Successful Joint Venture in India.pptx
 
Digital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and ResearchDigital Tools and AI for Teaching Learning and Research
Digital Tools and AI for Teaching Learning and Research
 
How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...How libraries can support authors with open access requirements for UKRI fund...
How libraries can support authors with open access requirements for UKRI fund...
 
The Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official PublicationThe Challenger.pdf DNHS Official Publication
The Challenger.pdf DNHS Official Publication
 
Introduction to Quality Improvement Essentials
Introduction to Quality Improvement EssentialsIntroduction to Quality Improvement Essentials
Introduction to Quality Improvement Essentials
 

Data Science Meets Drug Discovery

  • 1. Data Science Meets Drug Discovery Philip Bourne, University of Virginia Zheng Zhao, University of Virginia Lei Xie, The City University of New York 0
  • 2. Outline ■ One definition of data science ■ A brief history of in silico early-stage drug discovery ■ Bioinformatics methods for structure-based drug design ■ Machine Learning/Deep Learning solutions to systems pharmacology ■ Summary 1
  • 3. One Definition of Data Science – The 4+1 Model ■ Value – ensuring societal benefit ■ Design - Communication of the value of data ■ Systems – the means to communicate and convey benefit ■ Analytics – models and methods ■ Practice – where everything happens – for today that is early-stage drug discovery [Adapted from Raf Alvarado] Bourne PE (2021) Is “bioinformatics” dead? PLoS Biol 19(3): e3001165.
  • 4. A Snapshot of In Silico Drug Discovery Today’s Discussion Methods Simple Complex Data Small Large Discovery Process One Drug – One Target One Drug – Multiple Targets Multiple Drugs – Multiple Targets Multiple Drugs – Multiple Targets – One Person Populations/Environment Molecules Omics Model Organisms Organs Cells Docking MD Bioinformatics ML DL
  • 5. A Snapshot of In Silico Drug Discovery Today’s Discussion Method Simple Complex Data Small Large Discovery Process One Drug – One Target One Drug – Multiple Targets Multiple Drugs – Multiple Targets Multiple Drugs – Multiple Targets – One Person Populations Molecules Omics Model Organisms Organs Cells Docking MD Bioinformatics ML DL
  • 6. One-drug-one-target Small Molecule Drug Discovery Process 5 >90% Failure Rate Druggable target identification Lead compound identification and optimization Pre-clinical testing Clinical trial Omics data analysis Kiriiri et al. Future J Pharm Sci. (2020) Adapted from http://www.biomech.ulg.ac.be/project/multi- omics/
  • 7. Fundamental Challenge of Conventional Drug Discovery in Combating Complex Diseases 6 disease association binding affinity animal response Clinical efficacy & safety Bender & Cortes-Ciriano, Drug Discov Today (2021) Disease gene may not be right drug target Genome-wide off-target bindings and systems effects are ignored animal-human extrapolation remains challenging Each step in the drug discovery process is optimized for a proxy objective but not a clinical endpoint Power of Artificial Intelligence (Deep Learning) has not been fully utilized Adapted from http://www.biomech.ulg.ac.be/project/multi- omics/
  • 8. Environmental factors (e.g., microbiome) 7 Discovery Process: Multi-Scale Modeling Drug Actions in the Human Body Pleiotropic systems effect of drugs Xie & Bourne (2012) Annu Rev Pharmacol Toxicol,
  • 9. Outline ■ One definition of data science ■ A brief history of in silico early-stage drug discovery ■ Bioinformatics methods for structure-based drug design ■ Machine Learning/Deep Learning solutions to systems pharmacology ■ Summary 8
  • 10. 9 Bioinformatics Methods for Drug Design and Discovery ❏ Utilize structural genomics data ❏ Explore evolutionary and functional linkage (ligand binding site similarity) across fold space ❏ Successfully applied to drug repurposing, side effect prediction, and polypharmacology
  • 11. Bioinformatics Methods for Drug Design and Discovery Early-Stage Drug Discovery Patterns: Categories Principles Ligand-based Chemical Similarity; Pharmacophore Similarity Structure-based Binding site (Pocket) Similarity Protein-ligand-based Pharmacophore Similarity; Interaction Fingerprint Similarity Bender, et al; Drug Discov. Today; 2021
  • 12. Bioinformatics Methods for Protein-ligand interaction - based Drug Design 11 IFPs: Protein-Ligand Interaction Fingerprints Given a ligand-binding complex Encoding IFPs Every residue – ligand interaction encoded into a 7-bit substring One 1D string representing the protein- ligand interactions Using pre-defined geometric rules Interactions Rules* H-bond Distance (Donor-Accepter) ≤ 3.5 Å && Tolerance Angle (Donor-H-Acceptor ∈[-π/3, π/3]) Ionic Distance (Anion-Cation) ≤ 4.0 Å Aromatic (face to face) Distance (two aromatic ring centers) ≤ 4.0 Å && Tolerance angle (face to edge) ∈[-π/6, π/6]) Aromatic (face to edge) Distance (two aromatic ring centers) ≤ 4.0 Å && Tolerance angle (face to edge) ∈[π/6, 5π/6]) Hydrophobic Distance (Donor-Accepter) ≤ 4.5 Å Deng et al. J. Med. Chem (2004) Marcou et al. J. Chem. Inf. Model. (2007)
  • 13. Fs-IFP: Structural Proteome-scale Protein-Ligand Interaction Fingerprint Method 12 Zhao et al. Drug Discov. Today (2022) Zhao et al. J. Chem. Inf. Model. (2023) Zhao et al. ACS Pharmacol. Transl. Sci (2023) Binding-sites Alignment Encoding Fs-IPFs Structura l Proteome Binding-site-Aligned Proteome Database Encoding each complex into one IFP string Covalent Inhibitors, Allosteric Inhibitors, Macrocyclic Inhibitors, Drug (Resistance) Mechanisms, etc. Apply to Fs-IFP-encoded Structural Proteome
  • 14. Fs-IFPs: Application to Allosteric Kinase Inhibitors 13 MEK-ligand Complexes from PDB and MD simulations as Structural Dataset Top 15 potential kinase targets for allosteric inhibitor design Predicting Zhao et al. PLoS One 12 (6), e0179936, 2017 Fs-IFP Encoding Extracting MEK- inhibitor Interaction Characteristics Binding sites aligned Dataset Alignment Advantage: Allosteric Kinase Inhibitor is attractive due to high, unique selectivity. Background: 76 Kinase Drugs approved; 4 are allosteric type inhibitors targeting MEK.
  • 15. Fs-IFPs: Application to Macrocyclic Kinase Inhibitors (MKIs) 14 Zhao et al. ACS Pharmacol. Transl. Sci (2023) Georg et al. Trends Pharmacol. Sci (2022) Background: 76 Kinase Drugs approved; Challenge: Drug Resistance Trending: MKIs attracting much attention; Progress: 2 Approved and 7 in clinical trials (Below) 8 out of 9 are designed by rational drug design strategies from the acyclic starting points
  • 16. ■ How to design MKI? 15 Fs-IFPs: Application to Macrocyclic Kinase Inhibitors (MKIs) Acyclic Inhibitors MKIs Zhao et al. ACS Pharmacol. Transl. Sci (2023)
  • 17. 16 Fs-IFPs: Apply to Macrocyclic Kinase Inhibitors (MKIs) + Comparing Binding modes of MKIs and their acyclic counterparts Scaffolds with the same binding modes before and after cyclization Rigid Scaffold MD trajectories analysis using Fs-IFPs for MKIs and their corresponding acyclic counterparts
  • 18. ■ What kind of rigid scaffolds are promising? 17 Fs-IFPs: Application to Macrocyclic Kinase Inhibitors (MKIs) MKI database Rigid Scaffolds with 3 aromatic rings directly connected to one another Analyzing https://zhengzhster.github.io/MKIs/ Zhao et al. ACS Pharmacol. Transl. Sci (2023)
  • 19. Outline ■ One definition of data science ■ A brief history of in silico early-stage drug discovery ■ Bioinformatics methods for structure-based drug design ■ Machine Learning/Deep Learning solutions to systems pharmacology ■ Summary 18
  • 20. Discovery Process: A Systems Pharmacology Paradigm 19 Chemical perturbed Omics profile Clinical Biomarkers Patient Omics profile Adapted from http://www.biomech.ulg.ac.be/project/multi- omics/ Sun et al. (2016) Adv. Genetics
  • 21. Challenges in Systems Pharmacology: Multi-omics data integration and knowledge discovery ❏ Noisy, high-dimensional, and sparse ❏ Technical (batch effect) and biological confounding factors (age, gender etc.) ❏ Complexity of molecular interplays across biological layers ❏ Common and unique features across species ❏ Correlation vs causality
  • 22. Challenges in Systems Pharmacology: Extensive Dark Chemical and Functional Genomics Space
  • 23. Machine Learning/Deep Learning is an Indispensable Tool ■ To overcome the extensive dark genome/proteome: ■ It is unethical and infeasible to screen compounds in humans. ■ Complexity and high dimensionality of omics data are beyond human comprehension. ■ Understudied biology beyond the reach of experimental techniques. ■ Hence: only a computational approach can overcome this data gap 22
  • 24. Filling the Data Gap: Deep Learning Advantages ■ Data-driven feature extraction ■ Integration of incoherent diverse data (both labeled and unlabeled) ■ Explicitly model multi-level organization of biological systems 23 adapted from https://levity.ai/blog/difference-machine-learning- deep-learning
  • 25. Filling the Data Gap: Self-Supervised Learning ■ Utilizing a large amount of unlabeled data ■ Facilitating the exploration of dark chemical and biological space 24 protein chemical gene expression P H A R M (NC2=O)C3=C(C=CC(=C3)S(=O)(=O)N4CCN(CC4)C)OCC) 9 13 20 2 3 English sentence
  • 26. Filling the Data Gap: Transfer Learning & Domain Translation ■ Translating data modality (e.g., source = cell line omics; target = patient image biomarker) ■ Removing biological and technological confounding factors across conditions and species 25 Yang et al. (2021) Nature Comm.
  • 27. Filling the Data Gap: End-to-end Differentiable Biology ■ Protein sequence (unknown structure)->3D protein-ligand binding pose Final Objective Immutable Intermediate objective Final Objective 26 Final Objective Mutable Classic ML Deep Learning fine-tuning pre-training AlQuraisi et al., Nat. Methods (2021)
  • 28. End-to-end Differentiable Systems Pharmacology for Precision Drug Discovery 27
  • 29. What Can Deep Learning Do for Systems Pharmacology-Oriented Drug Discovery ? 28 Key Questions ● What are the genome-wide target(s) and off-target(s)? ● How does drug modulate cellular state characterized by omics? ● How to translate drug activities in a disease model to clinical human response?
  • 30. Predicting Genome-Wide Protein-Chemical Interactions for Dark Proteins Protein similarity Chemical similarity ❏ PortalCG: An end-to-end sequence-structure function neural network Zhang et al. IJCAI, 2020; Liu et al. Front Bioinfor, 2021; Cai et al. JCIM, 2021; Cai et al. Bioinformatics, 2022 Cai et al. PLoS CB, 2023; Zhang et al. Sci. Rep., 2023 Wu et al. under review, 2023 29 meta & semi- supervised learning Cai et al. (2023) PLoS CB
  • 31. PortalCG: Application to Polypharmacology 30 ■ Selective dual-DRD1/3 antagonist of Opioid Use Disorder (OUD) Cai et al. (2023) PLoS CB D1R pharmacophore D3R pharmacophore Zhuang et al. Cell (2021)
  • 32. Omics-Driven Phenotype Compound Screening 31 Chemical structure Gene expression after treatment Dose-response cell viability Protein expression after treatment Dosage Clinical Efficacy & Toxicity Cell type-specific omics profile of patient DNA-RNA-Protein Unlabled- labled Qiu et al. Bioinformatics, 2020; Pham et al. Nat Mach Intell, 2021; He et al. Bioinformatics, 2021; Pham et al. Patterns, 2022; He et al. Nat Mach Intell, 2022; Wu et al. PLoS CB, 2022; Wu et al. Cell Rep Methods, 2023
  • 33. Omics-Driven Phenotype Compound Screening control control Resistant cancer cell Sensitive cancer cell Transform agent Cidal agent cold hot 32 - Drug combination to induce immunotherapy response Pham et al. (2022) Patterns
  • 34. Translating Cell Line Screens to Patient Clinical Responses for Personalized Medicine He et al. (2022) Nature MI 33 in vitro cell line Clinical patient tissue ? Context-aware Deconfounding Autoencoder (CODE-AE)
  • 35. Translating Cell Line Screens to Patient Clinical Responses for Personalized Medicine He et al. (2022) Nature MI 34
  • 36. Put Together: Multi-Scale Modeling of Drug Actions for AD Drug Repurposing Drug IRAK1 Genome-wide CPIs TLR/MYD88/ NF-kB pathway TNF pathway Lipopolysaccharide Synthesis pathway Herpes simplex virus infection Pathway Oliveros et al. (2022) Brain 35 Reduced Plaque Burden (in red) Reduced Microgliosis Reduced Tangle Burden (in red) TGNT TGTR Improve Learning Memory Asthma drug (PDE inhibitor) Clinical outcome
  • 37. Will AI be Sufficient? 36
  • 38. Limitations of ML: Interpretability 37 Oviedo et al. Acc. Mater. Res. (2022)
  • 39. Limitations of ML: Out-of-distribution Generalization chemical phenotype readout chemical phenotype readout 38 adapted from https://medium.com/geekculture/out-of-distribution-detection-in-medical-ai- b638b385c2a3
  • 40. Integrating Data-driven and Mechanism-based Modeling 39 ■ Predictive modeling of protein-ligand binding kinetics Causal Learning Biophysics & Mathematical modeling
  • 41. Outline ■ One definition of data science ■ A brief history of in silico early-stage drug discovery ■ Bioinformatics methods for structure-based drug design ■ Machine Learning/Deep Learning solutions to systems pharmacology ■ Summary 40
  • 42. Summary 41 ■ Computational approaches to drug discovery have advanced on 3 axes: 1. The scale, complexity and amount of digital data available 2. The type of computation employed: a) From the application of measurable physical features b) To the discovery of unknown features c) To the prediction of outcomes 3. How we think about the problem a) One drug-one target b) Multiple drugs-multiple targets- populations c) Multiple drugs-multiple targets-one human