Presented online as part of the NASM series in Advancing Drug Discovery see https://www.nationalacademies.org/event/40883_09-2023_advancing-drug-discovery-data-science-meets-drug-discovery
Big Data and Genomic Medicine by Corey NislowKnome_Inc
View the webinar at: http://www.knome.com/webinar-big-data-genomic-medicine. This presentation covers an overview of genomic medicine, requirements and challenges of next-generation sequencing, bottlenecks to broader healthcare adoption, and why “we want to sequence everyone.”
In spite of extensive effort by industry and academia to develop new drugs, there are still several diseases that are in need of therapeutic agents and have yet to be developed.
10 years the identification rate of disease-associated targets has been higher than the therapeutics identification rate.
Nevertheless, it is apparent that computational tools provide high hopes that many of the diseases under investigation can be brought under control.
Big Data and Genomic Medicine by Corey NislowKnome_Inc
View the webinar at: http://www.knome.com/webinar-big-data-genomic-medicine. This presentation covers an overview of genomic medicine, requirements and challenges of next-generation sequencing, bottlenecks to broader healthcare adoption, and why “we want to sequence everyone.”
In spite of extensive effort by industry and academia to develop new drugs, there are still several diseases that are in need of therapeutic agents and have yet to be developed.
10 years the identification rate of disease-associated targets has been higher than the therapeutics identification rate.
Nevertheless, it is apparent that computational tools provide high hopes that many of the diseases under investigation can be brought under control.
A full picture of -omics cellular networks of regulation brings researchers closer to a realistic and reliable understanding of complex conditions. For more information, please visit: http://tbioinfopb.pine-biotech.com/
T-Bioinfo is a comprehensive bioinformatics platform that allows the user to navigate NGS, Mass-Spec and Structural Biology data analysis pipelines using consistent interface. Analysis and integration of such data allows for better and faster discovery and optimization of personalized and precision treatment of complex diseases and understanding of medical conditions. For more information, go to pine-biotech.com
Computer Added Drug Design is one of the latest technology of medicine world. This short slide will help you to know a little about CADD.If you want to know a vast plz go throw the reference book.
Next Generation Data and Opportunities for Clinical PharmacologistsPhilip Bourne
Presentation at the Pre-meeting Workshop Next-Generation Clinical Pharmacology: Integrating Systems Pharmacology, Data-Driven Therapeutics, and Personalized Medicine. American Society for Clinical Pharmacology and Therapeutics Annual Meeting Atlanta GA March 18, 2014.
In the late Fall and Winter of 2018, the Pistoia Alliance in cooperation with Elsevier and charitable organizations Cures within Reach and Mission: Cure ran a datathon aiming to find drugs suitable for treatment of childhood chronic pancreatitis, a rare disease that causes extreme suffering. The datathon resulted in identification of four candidate compounds in a short time frame of just under three months. In this webinar our speakers discuss the technologies that made this leap possible
A full picture of -omics cellular networks of regulation brings researchers closer to a realistic and reliable understanding of complex conditions. For more information, please visit: http://tbioinfopb.pine-biotech.com/
T-Bioinfo is a comprehensive bioinformatics platform that allows the user to navigate NGS, Mass-Spec and Structural Biology data analysis pipelines using consistent interface. Analysis and integration of such data allows for better and faster discovery and optimization of personalized and precision treatment of complex diseases and understanding of medical conditions. For more information, go to pine-biotech.com
Computer Added Drug Design is one of the latest technology of medicine world. This short slide will help you to know a little about CADD.If you want to know a vast plz go throw the reference book.
Next Generation Data and Opportunities for Clinical PharmacologistsPhilip Bourne
Presentation at the Pre-meeting Workshop Next-Generation Clinical Pharmacology: Integrating Systems Pharmacology, Data-Driven Therapeutics, and Personalized Medicine. American Society for Clinical Pharmacology and Therapeutics Annual Meeting Atlanta GA March 18, 2014.
In the late Fall and Winter of 2018, the Pistoia Alliance in cooperation with Elsevier and charitable organizations Cures within Reach and Mission: Cure ran a datathon aiming to find drugs suitable for treatment of childhood chronic pancreatitis, a rare disease that causes extreme suffering. The datathon resulted in identification of four candidate compounds in a short time frame of just under three months. In this webinar our speakers discuss the technologies that made this leap possible
For a panel discussion at the Associate Research Libraries Spring meeting April 27, 2022, Montreal https://www.arl.org/schedule-for-spring-2022-association-meeting/
Frontiers of Computing at the Cellular and Molecular ScalesPhilip Bourne
3 basic points when establishing a new biomedical initiative. Presented at Frontiers of Computing in Health and Society, George Mason University, September 21, 2021.
2024.06.01 Introducing a competency framework for languag learning materials ...Sandy Millin
http://sandymillin.wordpress.com/iateflwebinar2024
Published classroom materials form the basis of syllabuses, drive teacher professional development, and have a potentially huge influence on learners, teachers and education systems. All teachers also create their own materials, whether a few sentences on a blackboard, a highly-structured fully-realised online course, or anything in between. Despite this, the knowledge and skills needed to create effective language learning materials are rarely part of teacher training, and are mostly learnt by trial and error.
Knowledge and skills frameworks, generally called competency frameworks, for ELT teachers, trainers and managers have existed for a few years now. However, until I created one for my MA dissertation, there wasn’t one drawing together what we need to know and do to be able to effectively produce language learning materials.
This webinar will introduce you to my framework, highlighting the key competencies I identified from my research. It will also show how anybody involved in language teaching (any language, not just English!), teacher training, managing schools or developing language learning materials can benefit from using the framework.
How to Create Map Views in the Odoo 17 ERPCeline George
The map views are useful for providing a geographical representation of data. They allow users to visualize and analyze the data in a more intuitive manner.
Operation “Blue Star” is the only event in the history of Independent India where the state went into war with its own people. Even after about 40 years it is not clear if it was culmination of states anger over people of the region, a political game of power or start of dictatorial chapter in the democratic setup.
The people of Punjab felt alienated from main stream due to denial of their just demands during a long democratic struggle since independence. As it happen all over the word, it led to militant struggle with great loss of lives of military, police and civilian personnel. Killing of Indira Gandhi and massacre of innocent Sikhs in Delhi and other India cities was also associated with this movement.
Read| The latest issue of The Challenger is here! We are thrilled to announce that our school paper has qualified for the NATIONAL SCHOOLS PRESS CONFERENCE (NSPC) 2024. Thank you for your unwavering support and trust. Dive into the stories that made us stand out!
This is a presentation by Dada Robert in a Your Skill Boost masterclass organised by the Excellence Foundation for South Sudan (EFSS) on Saturday, the 25th and Sunday, the 26th of May 2024.
He discussed the concept of quality improvement, emphasizing its applicability to various aspects of life, including personal, project, and program improvements. He defined quality as doing the right thing at the right time in the right way to achieve the best possible results and discussed the concept of the "gap" between what we know and what we do, and how this gap represents the areas we need to improve. He explained the scientific approach to quality improvement, which involves systematic performance analysis, testing and learning, and implementing change ideas. He also highlighted the importance of client focus and a team approach to quality improvement.
1. Data Science Meets Drug Discovery
Philip Bourne, University of Virginia
Zheng Zhao, University of Virginia
Lei Xie, The City University of New York
0
2. Outline
■ One definition of data science
■ A brief history of in silico early-stage drug discovery
■ Bioinformatics methods for structure-based drug
design
■ Machine Learning/Deep Learning solutions to
systems pharmacology
■ Summary
1
3. One Definition of Data Science – The 4+1 Model
■ Value – ensuring societal
benefit
■ Design - Communication
of the value of data
■ Systems – the means to
communicate and convey
benefit
■ Analytics – models and
methods
■ Practice – where
everything happens – for
today that is early-stage
drug discovery
[Adapted from Raf Alvarado] Bourne PE (2021) Is “bioinformatics” dead? PLoS Biol 19(3): e3001165.
4. A Snapshot of In Silico Drug Discovery
Today’s Discussion
Methods
Simple Complex
Data
Small
Large
Discovery
Process
One Drug – One Target
One Drug – Multiple Targets
Multiple Drugs – Multiple Targets
Multiple Drugs – Multiple Targets – One Person
Populations/Environment
Molecules
Omics
Model
Organisms
Organs
Cells
Docking MD Bioinformatics ML DL
5. A Snapshot of In Silico Drug Discovery
Today’s Discussion
Method
Simple Complex
Data
Small
Large
Discovery
Process
One Drug – One Target
One Drug – Multiple Targets
Multiple Drugs – Multiple Targets
Multiple Drugs – Multiple Targets – One Person
Populations
Molecules
Omics
Model
Organisms
Organs
Cells
Docking MD Bioinformatics ML DL
6. One-drug-one-target Small Molecule Drug Discovery
Process
5
>90% Failure Rate
Druggable
target
identification
Lead compound
identification and
optimization
Pre-clinical
testing
Clinical trial
Omics
data
analysis
Kiriiri et al. Future J Pharm Sci. (2020)
Adapted from http://www.biomech.ulg.ac.be/project/multi-
omics/
7. Fundamental Challenge of Conventional Drug Discovery
in Combating Complex Diseases
6
disease
association
binding
affinity
animal
response
Clinical
efficacy
& safety
Bender & Cortes-Ciriano, Drug Discov Today (2021)
Disease gene may
not be right drug
target
Genome-wide off-target
bindings and systems
effects are ignored
animal-human
extrapolation
remains challenging
Each step in the drug discovery process is optimized for a
proxy objective but not a clinical endpoint
Power of Artificial Intelligence (Deep Learning) has not
been fully utilized
Adapted from http://www.biomech.ulg.ac.be/project/multi-
omics/
9. Outline
■ One definition of data science
■ A brief history of in silico early-stage drug discovery
■ Bioinformatics methods for structure-based drug
design
■ Machine Learning/Deep Learning solutions to
systems pharmacology
■ Summary
8
10. 9
Bioinformatics Methods for Drug Design and Discovery
❏ Utilize structural genomics
data
❏ Explore evolutionary and
functional linkage (ligand
binding site similarity)
across fold space
❏ Successfully applied to drug
repurposing, side effect
prediction, and
polypharmacology
11. Bioinformatics Methods for Drug Design and Discovery
Early-Stage Drug Discovery Patterns:
Categories Principles
Ligand-based Chemical Similarity; Pharmacophore Similarity
Structure-based Binding site (Pocket) Similarity
Protein-ligand-based Pharmacophore Similarity; Interaction Fingerprint Similarity
Bender, et al; Drug Discov. Today; 2021
12. Bioinformatics Methods for Protein-ligand interaction -
based Drug Design
11
IFPs: Protein-Ligand Interaction Fingerprints
Given a ligand-binding complex
Encoding IFPs
Every residue – ligand
interaction encoded into a
7-bit substring
One 1D string
representing the protein-
ligand interactions
Using pre-defined geometric rules
Interactions Rules*
H-bond
Distance (Donor-Accepter) ≤ 3.5 Å && Tolerance
Angle (Donor-H-Acceptor ∈[-π/3, π/3])
Ionic Distance (Anion-Cation) ≤ 4.0 Å
Aromatic (face to
face)
Distance (two aromatic ring centers) ≤ 4.0 Å &&
Tolerance angle (face to edge) ∈[-π/6, π/6])
Aromatic (face to
edge)
Distance (two aromatic ring centers) ≤ 4.0 Å &&
Tolerance angle (face to edge) ∈[π/6, 5π/6])
Hydrophobic Distance (Donor-Accepter) ≤ 4.5 Å
Deng et al. J. Med. Chem (2004)
Marcou et al. J. Chem. Inf. Model. (2007)
13. Fs-IFP: Structural Proteome-scale Protein-Ligand
Interaction Fingerprint Method
12
Zhao et al. Drug Discov. Today (2022)
Zhao et al. J. Chem. Inf. Model. (2023)
Zhao et al. ACS Pharmacol. Transl. Sci (2023)
Binding-sites
Alignment
Encoding
Fs-IPFs
Structura
l
Proteome
Binding-site-Aligned
Proteome Database
Encoding each complex
into one IFP string
Covalent Inhibitors,
Allosteric Inhibitors,
Macrocyclic Inhibitors,
Drug (Resistance)
Mechanisms, etc.
Apply to
Fs-IFP-encoded
Structural Proteome
14. Fs-IFPs: Application to Allosteric Kinase Inhibitors
13
MEK-ligand
Complexes from
PDB and MD
simulations
as Structural
Dataset
Top 15 potential
kinase targets
for allosteric
inhibitor design
Predicting
Zhao et al. PLoS One 12 (6), e0179936, 2017
Fs-IFP
Encoding
Extracting MEK-
inhibitor Interaction
Characteristics
Binding sites
aligned Dataset
Alignment
Advantage: Allosteric Kinase Inhibitor is attractive due to high, unique selectivity.
Background: 76 Kinase Drugs approved; 4 are allosteric type inhibitors targeting MEK.
15. Fs-IFPs: Application to Macrocyclic Kinase Inhibitors (MKIs)
14
Zhao et al. ACS Pharmacol. Transl. Sci (2023)
Georg et al. Trends Pharmacol. Sci (2022)
Background: 76 Kinase Drugs approved; Challenge: Drug Resistance
Trending: MKIs attracting much attention;
Progress: 2 Approved and 7 in clinical trials (Below)
8 out of 9 are designed by
rational drug design strategies
from the acyclic starting points
16. ■ How to design MKI?
15
Fs-IFPs: Application to Macrocyclic Kinase Inhibitors (MKIs)
Acyclic Inhibitors
MKIs
Zhao et al. ACS Pharmacol. Transl. Sci (2023)
17. 16
Fs-IFPs: Apply to Macrocyclic Kinase Inhibitors (MKIs)
+
Comparing Binding modes of MKIs and their acyclic counterparts
Scaffolds with the
same binding
modes before and
after cyclization
Rigid
Scaffold
MD trajectories analysis using Fs-IFPs
for MKIs and their corresponding acyclic
counterparts
18. ■ What kind of rigid scaffolds are promising?
17
Fs-IFPs: Application to Macrocyclic Kinase Inhibitors (MKIs)
MKI database
Rigid Scaffolds with 3
aromatic rings directly
connected to one another
Analyzing
https://zhengzhster.github.io/MKIs/ Zhao et al. ACS Pharmacol. Transl. Sci (2023)
19. Outline
■ One definition of data science
■ A brief history of in silico early-stage drug discovery
■ Bioinformatics methods for structure-based drug
design
■ Machine Learning/Deep Learning solutions to
systems pharmacology
■ Summary
18
20. Discovery Process: A Systems Pharmacology Paradigm
19
Chemical perturbed
Omics profile
Clinical
Biomarkers
Patient Omics profile
Adapted from http://www.biomech.ulg.ac.be/project/multi-
omics/
Sun et al. (2016) Adv. Genetics
21. Challenges in Systems Pharmacology:
Multi-omics data integration and knowledge discovery
❏ Noisy, high-dimensional, and sparse
❏ Technical (batch effect) and
biological confounding factors (age,
gender etc.)
❏ Complexity of molecular interplays
across biological layers
❏ Common and unique features across
species
❏ Correlation vs causality
22. Challenges in Systems Pharmacology:
Extensive Dark Chemical and Functional Genomics Space
23. Machine Learning/Deep Learning is an Indispensable
Tool
■ To overcome the extensive dark genome/proteome:
■ It is unethical and infeasible to screen compounds in humans.
■ Complexity and high dimensionality of omics data are beyond
human comprehension.
■ Understudied biology beyond the reach of experimental
techniques.
■ Hence: only a computational approach can overcome this data gap
22
24. Filling the Data Gap: Deep Learning Advantages
■ Data-driven feature extraction
■ Integration of incoherent diverse data (both labeled and unlabeled)
■ Explicitly model multi-level organization of biological systems
23
adapted from https://levity.ai/blog/difference-machine-learning-
deep-learning
25. Filling the Data Gap: Self-Supervised Learning
■ Utilizing a large amount of unlabeled data
■ Facilitating the exploration of dark chemical and biological space
24
protein
chemical
gene expression
P H A R M
(NC2=O)C3=C(C=CC(=C3)S(=O)(=O)N4CCN(CC4)C)OCC)
9 13 20 2 3
English sentence
26. Filling the Data Gap: Transfer Learning & Domain
Translation
■ Translating data modality (e.g., source = cell line omics; target = patient image biomarker)
■ Removing biological and technological confounding factors across conditions and species
25
Yang et al. (2021) Nature Comm.
27. Filling the Data Gap: End-to-end Differentiable Biology
■ Protein sequence (unknown structure)->3D protein-ligand binding pose
Final Objective Immutable
Intermediate
objective Final
Objective
26
Final
Objective
Mutable
Classic
ML
Deep
Learning
fine-tuning
pre-training
AlQuraisi et al., Nat. Methods (2021)
29. What Can Deep Learning Do for Systems
Pharmacology-Oriented Drug Discovery ?
28
Key Questions
● What are the genome-wide target(s) and off-target(s)?
● How does drug modulate cellular state characterized by omics?
● How to translate drug activities in a disease model to clinical
human response?
30. Predicting Genome-Wide Protein-Chemical Interactions
for Dark Proteins
Protein similarity Chemical similarity
❏ PortalCG: An end-to-end sequence-structure
function neural network
Zhang et al. IJCAI, 2020; Liu et al. Front Bioinfor, 2021;
Cai et al. JCIM, 2021; Cai et al. Bioinformatics, 2022
Cai et al. PLoS CB, 2023; Zhang et al. Sci. Rep., 2023
Wu et al. under review, 2023
29
meta &
semi-
supervised
learning
Cai et al. (2023) PLoS CB
31. PortalCG: Application to Polypharmacology
30
■ Selective dual-DRD1/3 antagonist of Opioid Use Disorder (OUD)
Cai et al. (2023) PLoS CB
D1R pharmacophore D3R pharmacophore
Zhuang et al. Cell (2021)
32. Omics-Driven Phenotype Compound Screening
31
Chemical
structure
Gene expression
after treatment
Dose-response
cell viability
Protein expression
after treatment
Dosage
Clinical
Efficacy &
Toxicity
Cell type-specific
omics profile of patient
DNA-RNA-Protein
Unlabled-
labled
Qiu et al. Bioinformatics, 2020; Pham et al. Nat Mach Intell, 2021; He et al. Bioinformatics, 2021;
Pham et al. Patterns, 2022; He et al. Nat Mach Intell, 2022; Wu et al. PLoS CB, 2022;
Wu et al. Cell Rep Methods, 2023
33. Omics-Driven Phenotype Compound Screening
control
control
Resistant cancer cell Sensitive cancer cell
Transform agent Cidal agent
cold hot
32
- Drug combination to induce immunotherapy response
Pham et al. (2022) Patterns
34. Translating Cell Line Screens to Patient Clinical
Responses for Personalized Medicine
He et al. (2022) Nature MI
33
in
vitro
cell
line
Clinical
patient
tissue
?
Context-aware Deconfounding Autoencoder (CODE-AE)
35. Translating Cell Line Screens to Patient Clinical
Responses for Personalized Medicine
He et al. (2022) Nature MI 34
36. Put Together: Multi-Scale Modeling of Drug Actions
for AD Drug Repurposing
Drug
IRAK1
Genome-wide
CPIs
TLR/MYD88/
NF-kB pathway
TNF pathway
Lipopolysaccharide
Synthesis pathway
Herpes simplex
virus infection
Pathway
Oliveros et al. (2022) Brain 35
Reduced Plaque Burden (in red)
Reduced Microgliosis
Reduced Tangle Burden (in red)
TGNT TGTR
Improve Learning Memory
Asthma drug
(PDE inhibitor)
Clinical outcome
38. Limitations of ML: Interpretability
37
Oviedo et al. Acc. Mater. Res. (2022)
39. Limitations of ML: Out-of-distribution Generalization
chemical
phenotype readout
chemical
phenotype readout
38
adapted from https://medium.com/geekculture/out-of-distribution-detection-in-medical-ai-
b638b385c2a3
40. Integrating Data-driven and Mechanism-based
Modeling
39
■ Predictive modeling of protein-ligand binding kinetics
Causal Learning
Biophysics &
Mathematical
modeling
41. Outline
■ One definition of data science
■ A brief history of in silico early-stage drug discovery
■ Bioinformatics methods for structure-based drug
design
■ Machine Learning/Deep Learning solutions to
systems pharmacology
■ Summary
40
42. Summary
41
■ Computational approaches to drug discovery have advanced on 3 axes:
1. The scale, complexity and amount of digital data available
2. The type of computation employed:
a) From the application of measurable physical features
b) To the discovery of unknown features
c) To the prediction of outcomes
3. How we think about the problem
a) One drug-one target
b) Multiple drugs-multiple targets- populations
c) Multiple drugs-multiple targets-one human