Improved Predictions in Structure Based Drug Design Using Cart and Bayesian Models

•Download as PPTX, PDF•

0 likes•932 views

Salford Systems

Technology

 Traditional Drug Discovery (insert graph)
 In Silico Prediction of ADME (insert graph)
◦ Potency
◦ Absorption
◦ Lead
◦ Drug
◦ Toxicity
◦ Excretion
◦ Metabolism
◦ distribution

 Target IVY(Brute force virtual screening of
very large compound libraries) Lead
Discovery IVY(Utilize predictive models
from Biogen data for more efficient virtual
screening) Lead Optimization candidate

 (insert graph)
◦ Potency
◦ Lead
◦ Drug
◦ Toxicity
◦ Excretion
◦ Metabolism
◦ Distribution
◦ absorption

 Goal: Identify crystallographic binding mode,
Rank order ligands wrt binding with protein
 (insert graph)
 Receptor Docking
 Ligand Shape
 Generate plausible trial binding modes using
docking function then Re-rank modes with
scoring function

 (insert graph)
 341 Active
 47 Non-Active

 (insert graph)
 After filtering by Pharmacophore Feature

 (insert functions for)
◦ F_Score*
◦ D_Score
◦ G_Score
◦ PMF_Score
◦ Chem_Score
◦ ICM_Score*

 Cell Adhesion Assay (50% Serum)
◦ (insert graph)
 Biochemical Adhesion Assay
◦ (insert graph)
 Scoring Functions Are Poor More Often Than
Not

 Receptor Site View Library Design FlexX
Score Consensus Score>=3 e.g. Contact
Map, CLogP MW, HBOND Rotatable bonds
Consensus=5? if yes, substructure exists?
if yes, Pharmacophore<4.2Å? if yes, Publish
Hit Report

 Goal: Predict hit/miss class based on presence of features
(fingerprints)
 Method
◦ Given a set of N samples
◦ Given that some subset A of them are good (‘active’)
 Then we estimate for a new compound: P(good)~ A/N
◦ Given a set of binary features F
 For a given feature F:
 It appears in N samples
 It appears in A good samples
 Can we estimate: P(good l F)~A/N
 (Problem: Error gets worse as Nsmall)
◦ P’(good l F)= (A+P(good)k)/(n+k)
 P’(good l F)p(good)as N0
 P’(good l F) A/N as N large
◦ (If K=1/P(good) this is the Laplacian correction)
 Descriptors (insert)
 Advantages
◦ Can describe huge number of features (up to 4 billion; MDL 1024; Lead
scope 27,000)
◦ Contains tertiary and stereochemistry information
◦ Fast

 Classification Analysis
◦ Developing Non-Linear Scoring Functions to classify
actives and non-actives
◦ (insert graphs)
◦ Cost Function to Minimize: Gini Impurity N= 1-
ΣP^2(ω)

 Training Set Prediction Success
 (insert table)
 10-fold cross validation
 Randomly split training and test sets
 Significant Improvement in Separating Actives
from Non-Actives

 (insert graph)
 Significant Improvement in Finding Hits Using
New SF

 Optimal tree identified (insert graph)
 No random effects (insert graph)

 (insert cluster)
 Able to identify different molecular property
criteria that lead to hits

 (insert graph)
 Size= magnitude of OBA
 OBA values cover range of descriptor space

 (insert graph)
 Choose 1 & 2D Descriptors for ease of
interpretation and lower “noise”

 Build Model (insert graphs) Apply Model

 Features found in high OBA
 Features found in low OBA
 Would be nice if CART did similar view

 Improved scoring functions for separating
hits from non-hits in structure-based drug
design developed with CART and Bayesian
models
 Identified key differences in molecular
physical properties that led to hits
 Built reasonably predictive OBA model
(cannot expect method to extend to other
systems given complexity of OBA, however)

 Biogen IDEC
 Modeling
◦ Rajiah Denny
◦ Claudio Chuaqui
◦ Juswinder Singh
◦ Herman van Vlijmen
◦ Norman Wang
◦ Anuj Patel
◦ Zhan Deng
 Chemistry
◦ Kevin Guckian
◦ Dan Scott
◦ Thomas Durand-Reville
◦ Pat Conlon
◦ Charlie Hammond
◦ Chuck Jewell
 Pharmacology
◦ Tonika Bonhert

Similar to Improved Predictions in Structure Based Drug Design Using Cart and Bayesian Models

Summer 2015 InternshipTaylor Martell

SEM MODELING THROUGH PARTIAL least SQUAREMohitGupta986332

Prediction Of Bioactivity From Chemical StructureJeremy Besnard

Data mining with wekaHein Min Htike

A Validation of Object-Oriented Design Metrics as Quality Indicatorsvie_dels

Face recognition v1San Kim

RBHF_SDM_2011_JieMDO_Lab

Metabolomic Data Analysis Workshop and Tutorials (2014)Dmitry Grapov

Introduction to Chainer ChemistryPreferred Networks

Use of Definitive Screening Designs to Optimize an Analytical MethodPhilip Ramsey

P0126557 slidesNguyen Chien

Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...MLconf

Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016MLconf

MLConf 2016 SigOpt Talk by Scott ClarkSigOpt

ADMET.pptxSantu Chall

Predicting best classifier using properties of data setsAbhishek Vijayvargia

Doctoral Thesis Dissertation 2014-03-20 @PoliMiDavide Chicco

TMPA-2017: Evolutionary Algorithms in Test Generation for digital systemsIosif Itkin

Efficient aggregation for graph summarizationaftab alam

WCTFR : W RAPPING C URVELET T RANSFORM B ASED F ACE R ECOGNITIONcsandit

Similar to Improved Predictions in Structure Based Drug Design Using Cart and Bayesian Models (20)

Summer 2015 Internship

SEM MODELING THROUGH PARTIAL least SQUARE

Prediction Of Bioactivity From Chemical Structure

Data mining with weka

A Validation of Object-Oriented Design Metrics as Quality Indicators

Face recognition v1

RBHF_SDM_2011_Jie

Metabolomic Data Analysis Workshop and Tutorials (2014)

Introduction to Chainer Chemistry

Use of Definitive Screening Designs to Optimize an Analytical Method

P0126557 slides

Hanjun Dai, PhD Student, School of Computational Science and Engineering, Geo...

Scott Clark, Co-Founder and CEO, SigOpt at MLconf SF 2016

MLConf 2016 SigOpt Talk by Scott Clark

ADMET.pptx

Predicting best classifier using properties of data sets

Doctoral Thesis Dissertation 2014-03-20 @PoliMi

TMPA-2017: Evolutionary Algorithms in Test Generation for digital systems

Efficient aggregation for graph summarization

WCTFR : W RAPPING C URVELET T RANSFORM B ASED F ACE R ECOGNITION

Recently uploaded

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 3652toLead Limited

Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi

Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos

Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

Artificial intelligence in cctv survelliance.pptxhariprasad279825

Understanding the Laravel MVC ArchitecturePixlogix Infotech

"ML in Production",Oleksandr BaganFwdays

Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University

SQL Database Design For Developers at php[tek] 2024Scott Keck-Warren

APIForce Zurich 5 April Automation LPDGMarianaLemus7

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada

My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar

SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays

Install Stable Diffusion in windows machinePadma Pradeep

Story boards and shot lists for my a level piececharlottematthew16

Pigging Solutions in Pet Food ManufacturingPigging Solutions

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106

Recently uploaded (20)

Tech-Forward - Achieving Business Readiness For Copilot in Microsoft 365

Vertex AI Gemini Prompt Engineering Tips

Developer Data Modeling Mistakes: From Postgres to NoSQL

Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)

Advanced Test Driven-Development @ php[tek] 2024

New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

Artificial intelligence in cctv survelliance.pptx

Understanding the Laravel MVC Architecture

"ML in Production",Oleksandr Bagan

Nell’iperspazio con Rocket: il Framework Web di Rust!

SQL Database Design For Developers at php[tek] 2024

APIForce Zurich 5 April Automation LPDG

Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024

My Hashitalk Indonesia April 2024 Presentation

SIP trunking in Janus @ Kamailio World 2024

"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack

Install Stable Diffusion in windows machine

Story boards and shot lists for my a level piece

Pigging Solutions in Pet Food Manufacturing

Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics

Improved Predictions in Structure Based Drug Design Using Cart and Bayesian Models

1. Donovan N. Chin & R. Aldrin Denny

2.  Traditional Drug Discovery (insert graph)  In Silico Prediction of ADME (insert graph) ◦ Potency ◦ Absorption ◦ Lead ◦ Drug ◦ Toxicity ◦ Excretion ◦ Metabolism ◦ distribution

3.  Target IVY(Brute force virtual screening of very large compound libraries) Lead Discovery IVY(Utilize predictive models from Biogen data for more efficient virtual screening) Lead Optimization candidate

4.  (insert graph) ◦ Potency ◦ Lead ◦ Drug ◦ Toxicity ◦ Excretion ◦ Metabolism ◦ Distribution ◦ absorption

5.  Goal: Identify crystallographic binding mode, Rank order ligands wrt binding with protein  (insert graph)  Receptor Docking  Ligand Shape  Generate plausible trial binding modes using docking function then Re-rank modes with scoring function

6.  (insert graph)  341 Active  47 Non-Active

7.  (insert graph)  After filtering by Pharmacophore Feature

8.  (insert graph)

9.  (insert functions for) ◦ F_Score* ◦ D_Score ◦ G_Score ◦ PMF_Score ◦ Chem_Score ◦ ICM_Score*

10.  Cell Adhesion Assay (50% Serum) ◦ (insert graph)  Biochemical Adhesion Assay ◦ (insert graph)  Scoring Functions Are Poor More Often Than Not

11.  Receptor Site View Library Design FlexX Score Consensus Score>=3 e.g. Contact Map, CLogP MW, HBOND Rotatable bonds Consensus=5? if yes, substructure exists? if yes, Pharmacophore<4.2Å? if yes, Publish Hit Report

12.  (insert graph)

13.  Goal: Predict hit/miss class based on presence of features (fingerprints)  Method ◦ Given a set of N samples ◦ Given that some subset A of them are good (‘active’)  Then we estimate for a new compound: P(good)~ A/N ◦ Given a set of binary features F  For a given feature F:  It appears in N samples  It appears in A good samples  Can we estimate: P(good l F)~A/N  (Problem: Error gets worse as Nsmall) ◦ P’(good l F)= (A+P(good)k)/(n+k)  P’(good l F)p(good)as N0  P’(good l F) A/N as N large ◦ (If K=1/P(good) this is the Laplacian correction)  Descriptors (insert)  Advantages ◦ Can describe huge number of features (up to 4 billion; MDL 1024; Lead scope 27,000) ◦ Contains tertiary and stereochemistry information ◦ Fast

14.  Classification Analysis ◦ Developing Non-Linear Scoring Functions to classify actives and non-actives ◦ (insert graphs) ◦ Cost Function to Minimize: Gini Impurity N= 1- ΣP^2(ω)

15.  Training Set Prediction Success  (insert table)  10-fold cross validation  Randomly split training and test sets  Significant Improvement in Separating Actives from Non-Actives

16.  (insert graph)  Significant Improvement in Finding Hits Using New SF

17.  Optimal tree identified (insert graph)  No random effects (insert graph)

18.  (insert cluster)  Able to identify different molecular property criteria that lead to hits

19.  (insert graph)

20.  (insert graph)  Size= magnitude of OBA  OBA values cover range of descriptor space

21.  (insert graph)  Choose 1 & 2D Descriptors for ease of interpretation and lower “noise”

22.  Build Model (insert graphs) Apply Model

23.  Features found in high OBA  Features found in low OBA  Would be nice if CART did similar view

24.  Improved scoring functions for separating hits from non-hits in structure-based drug design developed with CART and Bayesian models  Identified key differences in molecular physical properties that led to hits  Built reasonably predictive OBA model (cannot expect method to extend to other systems given complexity of OBA, however)

25.  Biogen IDEC  Modeling ◦ Rajiah Denny ◦ Claudio Chuaqui ◦ Juswinder Singh ◦ Herman van Vlijmen ◦ Norman Wang ◦ Anuj Patel ◦ Zhan Deng  Chemistry ◦ Kevin Guckian ◦ Dan Scott ◦ Thomas Durand-Reville ◦ Pat Conlon ◦ Charlie Hammond ◦ Chuck Jewell  Pharmacology ◦ Tonika Bonhert

Improved Predictions in Structure Based Drug Design Using Cart and Bayesian Models

Recommended

Recommended

More Related Content

Similar to Improved Predictions in Structure Based Drug Design Using Cart and Bayesian Models

Similar to Improved Predictions in Structure Based Drug Design Using Cart and Bayesian Models (20)

More from Salford Systems

More from Salford Systems (20)

Recently uploaded

Recently uploaded (20)

Improved Predictions in Structure Based Drug Design Using Cart and Bayesian Models