SlideShare a Scribd company logo
1 of 21
Download to read offline
Hit and Lead Discovery with Explorative RL and
Fragment-based Molecule Generation
SoojungYang, Doyeong Hwang, Seul Lee,
Seongok Ryu, Sung Ju Hwang
AITRICS and KAIST
Screening library
Hits
Chemical space
Automated small molecule drug discovery
Hits
• Searching for novel hits is a critical task in drug discovery.
2
Hits
• Molecules with high therapeutic potential
• High binding affinity to a given protein target
Small molecule Protein target
+ ➞
Protein-ligand complex
ΔG = binding free energy
Automated small molecule drug discovery
• Searching for novel hits is a critical task in drug discovery.
• RL methods can be used for automated hit search.
Screening library
Hits
RL agent
Chemical space
3
Automated small molecule drug discovery
Screening library
Hits
• Searching for novel hits is a critical task in drug discovery.
• RL methods can be used for automated hit search.
• Molecular docking simulation estimates protein-ligand binding affinity.
• We can guide RL agents with docking scores as reward.
RL agent
Docking
simulation score
reward
Chemical space
4
Drug molecules should satisfy strong structural constraints
➝ Few examples of structural alerts in
PAINS filter.
• Drug molecules should not have toxic or highly
reactive substructures.
• Widely-used medicinal chemistry filters include
several hundreds of diverse structural alerts.
Hits
Chemical space
High-quality hits
5
Drug molecules should satisfy strong structural constraints
• Previous molecular generative models guided by implicit methods cannot
entirely avoid inappropriate structures.
• e.g., multi-objective optimization where
total reward = docking score reward + structure penalty reward
➠ Explicit method to constrain the generation space within acceptable
molecules are necessary.
“Fragment-based molecular generation”
Hits
Chemical space
High-quality hits
6
Our fragment-based molecular generation method
7
Pharmacochemcially acceptable fragment library
Action 1
Next state molecule St+1
Action 2
Action 3 Augmented fragment
Possible attachment sites
Current state molecule St
Markovian embedding and policy network makes the model plausible for
hit generation and scaffold-based generation
8
• The embeddings of the molecule and fragments are autoregressively passed onto the policy network.
• State embedding network and policy network are Markov models.
• Any arbitrary molecule can be the current state.
➠ The model can be used for scaffold-based generation as well as hit generation.
Connectivity-preserving generation prevents unrealistic bonds
9
• We preserve the connectivity information of fragmented molecules as explicit attachment sites.
• In this way, our model can avoid the generation of molecules with unrealistic bonds.
Pyrrolium
Pyrrole Pyrrolium 2-methylpyrrole
New bond formation on N New bond formation on C
Explorative RL algorithm improves model performance in the
strongly constrained generation space
10
Chemical space
RL agent
Acceptable
chemical spaces
Good exploration
• Strong constraint in generation space makes search space less
smooth ➠ Solutions can be trapped in local optima.
• A practical drug discovery model should find as many diverse
optimal areas in the chemical space.
Our strategies
• Employ soft-actor-critic (SAC), an off-policy actor-critic algorithm
based on maximum entropy RL, which encourages exploration.
• Devise explorative algorithms based on prioritized experience replay
(PER) method.
11
priority pt
• Goal: Sample-efficient exploration
• Priority is a measure of how much additional information
we can learn from the transition.
• Pt = TD error of agent’s value estimate (Q or V function)
[Schaul et al.] Original PER
priority pt
Our PER algorithm encourages the agent to visit novel states
[Schaul et al.] Prioritized Experience Replay, ICLR 2016
12
priority pt
• Goal: Sample-efficient exploration
• Priority is a measure of how much additional information
we can learn from the transition.
• Pt = TD error of agent’s value estimate (Q or V function)
[Schaul et al.] Original PER
• Goal: Encourage sufficient diversity
• Priority is a measure of the novelty of the state
• PER(PE) Pt = predictive error of reward estimator
• PER(BU) Pt = Bayesian uncertainty of reward estimate
Ours: PER(PE), PER(BU)
Our PER algorithm encourages the agent to visit novel states
[Schaul et al.] Prioritized Experience Replay, ICLR 2016
reward
rt
reward predictor
rt
^
priority pt = rt
^
- rt
predictive error
FREED Model Overview and Contributions
13
• Our fragment-based generation method allows our model to leverage medicinal chemistry prior knowledge.
• Our proposed explorative RL method based on PER significantly improves the model performance.
FREED : Fragment-based generative RL with Explorative Experience replay for Drug design
Experimental results
14
fa7
protease
parp1
polymerase
5ht1b
G protein-coupled receptor
Three protein targets Evaluation metrics
Quality score (filter score)
: ratio of accepted, valid molecules to total
generated molecules
Hit ratio
: ratio of unique hit molecules to total
generated molecules, where hit is defined as
molecules who have higher docking scores
than the median of known active molecules
Top 5% score
: average docking score of top 5%-scored
generated molecules
* Higher = greater absolute value
Baseline models
MORLD
: atom-wise generative model + MolDQN
REINVENT
: SMILES-based generative model + REINFORCE
Results 1 – Explicit constraints are necessary to avoid problematic structures
15
• Our fragment-based generation model successfully avoids structural alerts.
• Baseline models trained to avoid alerts in multi-objective optimization scheme show suboptimal results.
Results 2 – Our model outperforms the unconstrained models
16
• Our model outperforms or at least show
comparable performance with the
unconstrained baseline models.
Hit ratio
Top 5% score
better
better
Result 3 – Our PER method shows the best performance
17
Hit ratio
Top 5% score
better
better
Result 3 – Our PER method shows the best performance
18
• All explorative algorithms outperform the vanilla SAC and PPO.
• Our PER(PE) and PER(BU) outperform previous algorithms such as PER(TD) and curiosity-driven algorithms.
Hit ratio
Top 5% score
better
better
Result 4 – Hit and scaffold-based generation case study
19
Hit generation Scaffold-based (lead) generation
• Initial structure: a benzene ring • Initial structure: a scaffold extracted from
known active molecules
Result 4 – Hit and scaffold-based generation case study
20
Summary
21
• Our model FREED is a novel RL framework for real-world drug design that couples a
fragment-based molecular generation strategy with a highly explorative RL algorithm.
• FREED can generate pharmacochemically acceptable molecules with high docking scores.
• When we want to avoid many structural alerts, explicitly constraining the molecular
generation space is more effective than implicit methods.
• By defining priority as the novelty of the state, PER method can encourage the model
exploration to find many optima in highly-constrained molecular generation space.

More Related Content

What's hot

Docking techniques
Docking techniquesDocking techniques
Docking techniquesDevika Rana
 
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS santosh Kumbhar
 
Open-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databasesOpen-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databasesGreg Landrum
 
Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modelingpkchoudhury
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug designThanh Truong
 

What's hot (10)

Docking techniques
Docking techniquesDocking techniques
Docking techniques
 
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
MOLECULAR DOCKING AND RELATED DRUG DESIGN ACHIEVEMENTS
 
docking
docking docking
docking
 
Ijcatr04051005
Ijcatr04051005Ijcatr04051005
Ijcatr04051005
 
Protein docking
Protein dockingProtein docking
Protein docking
 
Open-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databasesOpen-source tools for querying and organizing large reaction databases
Open-source tools for querying and organizing large reaction databases
 
Computer Aided Molecular Modeling
Computer Aided Molecular ModelingComputer Aided Molecular Modeling
Computer Aided Molecular Modeling
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug design
 
26.docking
26.docking26.docking
26.docking
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 

Similar to Fragment-Based RL Generates Drug Hits

2015 bioinformatics bio_cheminformatics_wim_vancriekinge
2015 bioinformatics bio_cheminformatics_wim_vancriekinge2015 bioinformatics bio_cheminformatics_wim_vancriekinge
2015 bioinformatics bio_cheminformatics_wim_vancriekingeProf. Wim Van Criekinge
 
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
2016 bioinformatics i_bio_cheminformatics_wimvancriekingeProf. Wim Van Criekinge
 
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013Prof. Wim Van Criekinge
 
Drug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugsDrug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugsKedar Bandekar
 
Drug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugsDrug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugsKedar Bandekar
 
Drug development approaches
Drug development approaches Drug development approaches
Drug development approaches VIOLINA KALITA
 
Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)Jannat985397
 
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014Prof. Wim Van Criekinge
 
DENOVO DRUG DESIGN AS PER PCI SYLLABUS
DENOVO DRUG DESIGN AS PER PCI SYLLABUSDENOVO DRUG DESIGN AS PER PCI SYLLABUS
DENOVO DRUG DESIGN AS PER PCI SYLLABUSShikha Popali
 
Where do recent small molecule clinical development candidates come from?
Where do recent small molecule clinical development candidates come from?Where do recent small molecule clinical development candidates come from?
Where do recent small molecule clinical development candidates come from?Jonas Boström
 
cadd-191129134050 (1).pptx
cadd-191129134050 (1).pptxcadd-191129134050 (1).pptx
cadd-191129134050 (1).pptxNoorelhuda2
 
Molecular modelling for in silico drug discovery
Molecular modelling for in silico drug discoveryMolecular modelling for in silico drug discovery
Molecular modelling for in silico drug discoveryLee Larcombe
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...Kamel Mansouri
 
Slicing and dicing expert-curated protein targets in the Guide to PHARMACOLGY
Slicing and dicing expert-curated protein targets in the Guide to PHARMACOLGYSlicing and dicing expert-curated protein targets in the Guide to PHARMACOLGY
Slicing and dicing expert-curated protein targets in the Guide to PHARMACOLGYChris Southan
 
Explainable AI in Drug Hunting
Explainable AI in Drug HuntingExplainable AI in Drug Hunting
Explainable AI in Drug HuntingEd Griffen
 
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screeningDeependra Ban
 

Similar to Fragment-Based RL Generates Drug Hits (20)

2015 bioinformatics bio_cheminformatics_wim_vancriekinge
2015 bioinformatics bio_cheminformatics_wim_vancriekinge2015 bioinformatics bio_cheminformatics_wim_vancriekinge
2015 bioinformatics bio_cheminformatics_wim_vancriekinge
 
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
 
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
 
Drug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugsDrug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugs
 
Drug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugsDrug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugs
 
Drug development approaches
Drug development approaches Drug development approaches
Drug development approaches
 
Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)Drug Discovery subject (clinical research)
Drug Discovery subject (clinical research)
 
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
 
DENOVO DRUG DESIGN AS PER PCI SYLLABUS
DENOVO DRUG DESIGN AS PER PCI SYLLABUSDENOVO DRUG DESIGN AS PER PCI SYLLABUS
DENOVO DRUG DESIGN AS PER PCI SYLLABUS
 
Where do recent small molecule clinical development candidates come from?
Where do recent small molecule clinical development candidates come from?Where do recent small molecule clinical development candidates come from?
Where do recent small molecule clinical development candidates come from?
 
cadd-191129134050 (1).pptx
cadd-191129134050 (1).pptxcadd-191129134050 (1).pptx
cadd-191129134050 (1).pptx
 
CADD
CADDCADD
CADD
 
Denovo Drug Design
Denovo Drug DesignDenovo Drug Design
Denovo Drug Design
 
Molecular modelling for in silico drug discovery
Molecular modelling for in silico drug discoveryMolecular modelling for in silico drug discovery
Molecular modelling for in silico drug discovery
 
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
The importance of data curation on QSAR Modeling: PHYSPROP open data as a cas...
 
Slicing and dicing expert-curated protein targets in the Guide to PHARMACOLGY
Slicing and dicing expert-curated protein targets in the Guide to PHARMACOLGYSlicing and dicing expert-curated protein targets in the Guide to PHARMACOLGY
Slicing and dicing expert-curated protein targets in the Guide to PHARMACOLGY
 
Virtual sreening
Virtual sreeningVirtual sreening
Virtual sreening
 
Computer aided drug designing (cadd)
Computer aided drug designing (cadd)Computer aided drug designing (cadd)
Computer aided drug designing (cadd)
 
Explainable AI in Drug Hunting
Explainable AI in Drug HuntingExplainable AI in Drug Hunting
Explainable AI in Drug Hunting
 
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening1  -val_gillet_-_ligand-based_and_structure-based_virtual_screening
1 -val_gillet_-_ligand-based_and_structure-based_virtual_screening
 

More from MLAI2

Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...MLAI2
 
Online Hyperparameter Meta-Learning with Hypergradient Distillation
Online Hyperparameter Meta-Learning with Hypergradient DistillationOnline Hyperparameter Meta-Learning with Hypergradient Distillation
Online Hyperparameter Meta-Learning with Hypergradient DistillationMLAI2
 
Online Coreset Selection for Rehearsal-based Continual Learning
Online Coreset Selection for Rehearsal-based Continual LearningOnline Coreset Selection for Rehearsal-based Continual Learning
Online Coreset Selection for Rehearsal-based Continual LearningMLAI2
 
Representational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual LearningRepresentational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual LearningMLAI2
 
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual LearningSequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual LearningMLAI2
 
Skill-Based Meta-Reinforcement Learning
Skill-Based Meta-Reinforcement LearningSkill-Based Meta-Reinforcement Learning
Skill-Based Meta-Reinforcement LearningMLAI2
 
Edge Representation Learning with Hypergraphs
Edge Representation Learning with HypergraphsEdge Representation Learning with Hypergraphs
Edge Representation Learning with HypergraphsMLAI2
 
Mini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
Mini-Batch Consistent Slot Set Encoder For Scalable Set EncodingMini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
Mini-Batch Consistent Slot Set Encoder For Scalable Set EncodingMLAI2
 
Task Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive LearningTask Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive LearningMLAI2
 
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...MLAI2
 
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-LearningMeta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-LearningMLAI2
 
Accurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingAccurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingMLAI2
 
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...MLAI2
 
Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Le...
Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Le...Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Le...
Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Le...MLAI2
 
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and ArchitecturesMetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and ArchitecturesMLAI2
 
Adversarial Self-Supervised Contrastive Learning
Adversarial Self-Supervised Contrastive LearningAdversarial Self-Supervised Contrastive Learning
Adversarial Self-Supervised Contrastive LearningMLAI2
 
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...MLAI2
 
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...
Neural Mask Generator : Learning to Generate Adaptive WordMaskings for Langu...Neural Mask Generator : Learning to Generate Adaptive WordMaskings for Langu...
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...MLAI2
 
Cost-effective Interactive Attention Learning with Neural Attention Process
Cost-effective Interactive Attention Learning with Neural Attention ProcessCost-effective Interactive Attention Learning with Neural Attention Process
Cost-effective Interactive Attention Learning with Neural Attention ProcessMLAI2
 
Adversarial Neural Pruning with Latent Vulnerability Suppression
Adversarial Neural Pruning with Latent Vulnerability SuppressionAdversarial Neural Pruning with Latent Vulnerability Suppression
Adversarial Neural Pruning with Latent Vulnerability SuppressionMLAI2
 

More from MLAI2 (20)

Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
Meta Learning Low Rank Covariance Factors for Energy-Based Deterministic Unce...
 
Online Hyperparameter Meta-Learning with Hypergradient Distillation
Online Hyperparameter Meta-Learning with Hypergradient DistillationOnline Hyperparameter Meta-Learning with Hypergradient Distillation
Online Hyperparameter Meta-Learning with Hypergradient Distillation
 
Online Coreset Selection for Rehearsal-based Continual Learning
Online Coreset Selection for Rehearsal-based Continual LearningOnline Coreset Selection for Rehearsal-based Continual Learning
Online Coreset Selection for Rehearsal-based Continual Learning
 
Representational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual LearningRepresentational Continuity for Unsupervised Continual Learning
Representational Continuity for Unsupervised Continual Learning
 
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual LearningSequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
Sequential Reptile_Inter-Task Gradient Alignment for Multilingual Learning
 
Skill-Based Meta-Reinforcement Learning
Skill-Based Meta-Reinforcement LearningSkill-Based Meta-Reinforcement Learning
Skill-Based Meta-Reinforcement Learning
 
Edge Representation Learning with Hypergraphs
Edge Representation Learning with HypergraphsEdge Representation Learning with Hypergraphs
Edge Representation Learning with Hypergraphs
 
Mini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
Mini-Batch Consistent Slot Set Encoder For Scalable Set EncodingMini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
Mini-Batch Consistent Slot Set Encoder For Scalable Set Encoding
 
Task Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive LearningTask Adaptive Neural Network Search with Meta-Contrastive Learning
Task Adaptive Neural Network Search with Meta-Contrastive Learning
 
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
Federated Semi-Supervised Learning with Inter-Client Consistency & Disjoint L...
 
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-LearningMeta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning
 
Accurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset PoolingAccurate Learning of Graph Representations with Graph Multiset Pooling
Accurate Learning of Graph Representations with Graph Multiset Pooling
 
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
Contrastive Learning with Adversarial Perturbations for Conditional Text Gene...
 
Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Le...
Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Le...Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Le...
Clinical Risk Prediction with Temporal Probabilistic Asymmetric Multi-Task Le...
 
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and ArchitecturesMetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures
 
Adversarial Self-Supervised Contrastive Learning
Adversarial Self-Supervised Contrastive LearningAdversarial Self-Supervised Contrastive Learning
Adversarial Self-Supervised Contrastive Learning
 
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
Learning to Extrapolate Knowledge: Transductive Few-shot Out-of-Graph Link Pr...
 
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...
Neural Mask Generator : Learning to Generate Adaptive WordMaskings for Langu...Neural Mask Generator : Learning to Generate Adaptive WordMaskings for Langu...
Neural Mask Generator : Learning to Generate Adaptive Word Maskings for Langu...
 
Cost-effective Interactive Attention Learning with Neural Attention Process
Cost-effective Interactive Attention Learning with Neural Attention ProcessCost-effective Interactive Attention Learning with Neural Attention Process
Cost-effective Interactive Attention Learning with Neural Attention Process
 
Adversarial Neural Pruning with Latent Vulnerability Suppression
Adversarial Neural Pruning with Latent Vulnerability SuppressionAdversarial Neural Pruning with Latent Vulnerability Suppression
Adversarial Neural Pruning with Latent Vulnerability Suppression
 

Recently uploaded

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Paola De la Torre
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptxLBM Solutions
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?XfilesPro
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 

Recently uploaded (20)

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
Pigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping ElbowsPigging Solutions Piggable Sweeping Elbows
Pigging Solutions Piggable Sweeping Elbows
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 
Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101Salesforce Community Group Quito, Salesforce 101
Salesforce Community Group Quito, Salesforce 101
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Key Features Of Token Development (1).pptx
Key  Features Of Token  Development (1).pptxKey  Features Of Token  Development (1).pptx
Key Features Of Token Development (1).pptx
 
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?How to Remove Document Management Hurdles with X-Docs?
How to Remove Document Management Hurdles with X-Docs?
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024SQL Database Design For Developers at php[tek] 2024
SQL Database Design For Developers at php[tek] 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 

Fragment-Based RL Generates Drug Hits

  • 1. Hit and Lead Discovery with Explorative RL and Fragment-based Molecule Generation SoojungYang, Doyeong Hwang, Seul Lee, Seongok Ryu, Sung Ju Hwang AITRICS and KAIST
  • 2. Screening library Hits Chemical space Automated small molecule drug discovery Hits • Searching for novel hits is a critical task in drug discovery. 2 Hits • Molecules with high therapeutic potential • High binding affinity to a given protein target Small molecule Protein target + ➞ Protein-ligand complex ΔG = binding free energy
  • 3. Automated small molecule drug discovery • Searching for novel hits is a critical task in drug discovery. • RL methods can be used for automated hit search. Screening library Hits RL agent Chemical space 3
  • 4. Automated small molecule drug discovery Screening library Hits • Searching for novel hits is a critical task in drug discovery. • RL methods can be used for automated hit search. • Molecular docking simulation estimates protein-ligand binding affinity. • We can guide RL agents with docking scores as reward. RL agent Docking simulation score reward Chemical space 4
  • 5. Drug molecules should satisfy strong structural constraints ➝ Few examples of structural alerts in PAINS filter. • Drug molecules should not have toxic or highly reactive substructures. • Widely-used medicinal chemistry filters include several hundreds of diverse structural alerts. Hits Chemical space High-quality hits 5
  • 6. Drug molecules should satisfy strong structural constraints • Previous molecular generative models guided by implicit methods cannot entirely avoid inappropriate structures. • e.g., multi-objective optimization where total reward = docking score reward + structure penalty reward ➠ Explicit method to constrain the generation space within acceptable molecules are necessary. “Fragment-based molecular generation” Hits Chemical space High-quality hits 6
  • 7. Our fragment-based molecular generation method 7 Pharmacochemcially acceptable fragment library Action 1 Next state molecule St+1 Action 2 Action 3 Augmented fragment Possible attachment sites Current state molecule St
  • 8. Markovian embedding and policy network makes the model plausible for hit generation and scaffold-based generation 8 • The embeddings of the molecule and fragments are autoregressively passed onto the policy network. • State embedding network and policy network are Markov models. • Any arbitrary molecule can be the current state. ➠ The model can be used for scaffold-based generation as well as hit generation.
  • 9. Connectivity-preserving generation prevents unrealistic bonds 9 • We preserve the connectivity information of fragmented molecules as explicit attachment sites. • In this way, our model can avoid the generation of molecules with unrealistic bonds. Pyrrolium Pyrrole Pyrrolium 2-methylpyrrole New bond formation on N New bond formation on C
  • 10. Explorative RL algorithm improves model performance in the strongly constrained generation space 10 Chemical space RL agent Acceptable chemical spaces Good exploration • Strong constraint in generation space makes search space less smooth ➠ Solutions can be trapped in local optima. • A practical drug discovery model should find as many diverse optimal areas in the chemical space. Our strategies • Employ soft-actor-critic (SAC), an off-policy actor-critic algorithm based on maximum entropy RL, which encourages exploration. • Devise explorative algorithms based on prioritized experience replay (PER) method.
  • 11. 11 priority pt • Goal: Sample-efficient exploration • Priority is a measure of how much additional information we can learn from the transition. • Pt = TD error of agent’s value estimate (Q or V function) [Schaul et al.] Original PER priority pt Our PER algorithm encourages the agent to visit novel states [Schaul et al.] Prioritized Experience Replay, ICLR 2016
  • 12. 12 priority pt • Goal: Sample-efficient exploration • Priority is a measure of how much additional information we can learn from the transition. • Pt = TD error of agent’s value estimate (Q or V function) [Schaul et al.] Original PER • Goal: Encourage sufficient diversity • Priority is a measure of the novelty of the state • PER(PE) Pt = predictive error of reward estimator • PER(BU) Pt = Bayesian uncertainty of reward estimate Ours: PER(PE), PER(BU) Our PER algorithm encourages the agent to visit novel states [Schaul et al.] Prioritized Experience Replay, ICLR 2016 reward rt reward predictor rt ^ priority pt = rt ^ - rt predictive error
  • 13. FREED Model Overview and Contributions 13 • Our fragment-based generation method allows our model to leverage medicinal chemistry prior knowledge. • Our proposed explorative RL method based on PER significantly improves the model performance. FREED : Fragment-based generative RL with Explorative Experience replay for Drug design
  • 14. Experimental results 14 fa7 protease parp1 polymerase 5ht1b G protein-coupled receptor Three protein targets Evaluation metrics Quality score (filter score) : ratio of accepted, valid molecules to total generated molecules Hit ratio : ratio of unique hit molecules to total generated molecules, where hit is defined as molecules who have higher docking scores than the median of known active molecules Top 5% score : average docking score of top 5%-scored generated molecules * Higher = greater absolute value Baseline models MORLD : atom-wise generative model + MolDQN REINVENT : SMILES-based generative model + REINFORCE
  • 15. Results 1 – Explicit constraints are necessary to avoid problematic structures 15 • Our fragment-based generation model successfully avoids structural alerts. • Baseline models trained to avoid alerts in multi-objective optimization scheme show suboptimal results.
  • 16. Results 2 – Our model outperforms the unconstrained models 16 • Our model outperforms or at least show comparable performance with the unconstrained baseline models. Hit ratio Top 5% score better better
  • 17. Result 3 – Our PER method shows the best performance 17 Hit ratio Top 5% score better better
  • 18. Result 3 – Our PER method shows the best performance 18 • All explorative algorithms outperform the vanilla SAC and PPO. • Our PER(PE) and PER(BU) outperform previous algorithms such as PER(TD) and curiosity-driven algorithms. Hit ratio Top 5% score better better
  • 19. Result 4 – Hit and scaffold-based generation case study 19 Hit generation Scaffold-based (lead) generation • Initial structure: a benzene ring • Initial structure: a scaffold extracted from known active molecules
  • 20. Result 4 – Hit and scaffold-based generation case study 20
  • 21. Summary 21 • Our model FREED is a novel RL framework for real-world drug design that couples a fragment-based molecular generation strategy with a highly explorative RL algorithm. • FREED can generate pharmacochemically acceptable molecules with high docking scores. • When we want to avoid many structural alerts, explicitly constraining the molecular generation space is more effective than implicit methods. • By defining priority as the novelty of the state, PER method can encourage the model exploration to find many optima in highly-constrained molecular generation space.