SlideShare a Scribd company logo
1 of 20
Download to read offline
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization
and Activity Prediction?
James T. Metz, Ph.D.
Abbott Laboratories
R4DG - Dept. of Discovery Technologies
(Cheminformatics)
Accelrys 2012 US User Group Meeting
May 9, 2012
Boston Marriott Longwharf Hotel
Boston, MA
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
2
SAR Contour Landscape Maps – Motivation?
Cpd # 2191A
pKd (DYRK1A) = 4.50
Cpd # 762
pKd (DYRK1A) = 7.20
Cpd # 41
pKd (DYRK1A) = 8.10
# 41
# 2191 # 762
A) All structures and experimentally measured kinase activities are from Metz, J.T. et al. “Navigating the Kinome,” Nature Chemical Biology, 7 (2011)
200-202, supplementary results
+ 427 other cpds with
experimentally measured
DYRK1A kinase activities
If I can generate an SAR contour landscape map, and if
the compound organization and contours are meaningful
and predictive, I can begin asking some useful questions ...
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
3
1. What does the SAR generally look like as a
2D contour map? Where are the “hot” regions
(active cpds)? What are the chemotypes in
those regions? Where are the “cold” regions
(inactive cpds)? Is the landscape smoothly
varying or rough and jagged?
2. What regions are well-explored (densely
populated)? What regions are under-explored?
3. Can the map be used to predict the activity of virtual compounds?
If so, what compound should I synthesize next? What general direction
(set of compounds) should I consider to get to where I need to go?
4. What else can I do with SARCL maps?
SAR Contour Landscape Map – Possible Questions,
Possible Utility
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
4
How to generate a SARCL map? – Pipeline Pilot
1. Begin with a set of 2D structures and activities for a target
2. Compute the complete Property Distance (1 – Tanimoto Similarity) matrix for
all compound pairs using ECFP_6 fingerprints
0.00 0.89 0.90
0.89 0.00 0.94
0.90 0.94 0.00
Cpd # 2191
Cpd # 762
Cpd # 41
# 2191
# 2191
# 41 # 762
# 41
# 762
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
5
How to generate a SARCL map? – R statistics (igraph)
0.00 0.89 0.90
0.89 0.00 0.94
0.90 0.94 0.00
# 2191
# 2191
# 41 # 762
# 41
# 762
R Statistics
igraph
package
Network X
NetworkY
Distance Matrix
41 2191
762
347
Network Coordinates
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
6
How to generate a SARCL map? – R statistics (AKIMA)
Structure Cpd
ID
“X”
Network
X
“Y”
Network
Y
“Z”
pActivity
(DYRK1A)
2191 122.2 94.8 4.50
41 -365.8 54.0 8.10
762 138.3 278.2 7.20
# 41
# 2191
# 762
R Statistics
AKIMA
package
NOTE! – AKIMA can also be used to predict Z values (pActivity) given an (X,Y)
point, if contours exist around that point e.g., from the training set.
Black filled circle – Training Set (TS)
Grey filled diamond – Prediction Set (PS)
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
7
SARCL Activity Map: DYRK1A Kinase
Error Statistics:
<pKd> TS = 5.92 “trivial model”
PS MUEA (“trivial model”) = +/- 0.65
TS (N = 335) MUE = +/- 0.03 pKd
PS (N = 84) MUE = +/- 0.43 pKd
A) Mean Unsigned Error (MUE) = +/- < | pKd(expt) – pKd(calc) | >
Tooltip Molecule Popup:
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
8
SARCL Activity Map: CDK5 Kinase
Error Statistics:
TS (N = 335) MUE = +/- 0.03 pKd
PS (N = 84) MUE = +/- 0.38 pKd
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
9
SARCL Activity Map: ROCK1 Kinase
Error Statistics:
TS (N = 335) MUE = +/- 0.05 pKd
PS (N = 84) MUE = +/- 0.59 pKd
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
10
DYRK1A SARCL Activity Maps - Contour Landscape
vs. Percentage of Available Training Set (TS) Data
100 % TS Data
TS (N = 335) MUE = +/- 0.03 pKd
PS (N = 84) MUE = +/- 0.43 pKd
90 % TS Data
TS (N = 308) MUE = +/- 0.03 pKd
PS (N = 84) MUE = +/- 0.44 pKd
80 % TS Data
TS (N = 292) MUE = +/- 0.03 pKd
PS (N = 84) MUE = +/- 0.45 pKd
70 % TS Data
TS (N = 257) MUE = +/- 0.03 pKd
PS (N = 84) MUE = +/- 0.46 pKd
60 % TS Data
TS (N = 209) MUE = +/- 0.04 pKd
PS (N = 84) MUE = +/- 0.47 pKd
50 % TS Data
TS (N = 167) MUE = +/- 0.04 pKd
PS (N = 84) MUE = +/- 0.50 pKd
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
11
PS MUE = f (Tanimoto Distance, Count) ?
Tan. Dist. = 0.10
Tan. Dist. = 0.20
Tan. Dist. = 0.30
Hmmm... In Progress!
PS molecule
TS molecule
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
12
1. Selectivity between 2 targets
Selectivity = pKd(DYRK1A) – pKd(CDK5)
2. Activity and Selectivity between 2 targets
Activity and Selectivity = pKd (DYRK1A) +
pKd(DYRK1A) – pKd(CDK5)
3. Activity and Multiple Selectivity for > 2 targets
Activity and Multiple Selectivity = pKd (DYRK1A) +
pKd(DYRK1A) – pKd(CDK5) +
pKd(DYRK1A) – pKd(ROCK1)
What else can I do with SARCL maps? – Fun with
pKd math!
activity
selectivity
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
13
SARCL Selectivity Map: pKd(DYRK1A) – pKd(CDK5)
_
=
Error Statistics:
PS (N = 84) MUE = +/- 0.35 pKd
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
14
SARCL Selectivity Map: pKd(DYRK1A) – pKd(ROCK1)
_
=
Error Statistics:
PS (N = 84) MUE = +/- 0.57 pKd
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
15
SARCL Selectivity Map: pKd(CDK5) – pKd(ROCK1)
_
=
Error Statistics:
PS (N = 84) MUE = +/- 0.52 pKd
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
16
SARCL Activity and Selectivity Map: pKd (DYRK1A) +
(pKd(DYRK1A) – pKd(CDK5))
+
=
Error Statistics:
PS (N = 84) MUE = +/- 0.65 pKd
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
17
SARCL Activity and Multiple Selectivity Map: pKd (DYRK1A) +
(pKd(DYRK1A) – pKd(CDK5)) + (pKd(DYRK1A) – pKd(ROCK1))
+ +
=
Error Statistics:
PS (N = 84) MUE = +/- 1.04 pKd
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
18
1. Why is an activity contouring method
potentially useful for predicting activity?
Patches on the surface represent a
set of spline curves that are fit locally to
the activity data.
Local fitting potentially avoids the “one equation” or “one model”
problem. Data sets may contain multiple and varied chemotypes!
Hmmm... A Parting Thought
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
19
Conclusions and Future Directions
1. SAR Contour Landscape (SARCL) maps of small molecule SAR data sets can be
constructed using Pipeline Pilot and R statistics igraph and AKIMA codes. Training
Set error for 3 kinases is ~ +/- 0.04 pKd units. Prediction Set error is ~ +/- 0.5 pKd
units. Related approaches have appeared in the literature (Peltason, L. et al. J. Chem. Inf. Model.
50 (2010) 1021, Reutlinger, M. et al. Angew. Chem. Int. Ed. 50 (2011) 1).
2. SARCL maps of activity and (multiple) selectivity are also possible via simple
mathematical manipulations of the activity data.
3. More work to do! Current challenges include handling larger data sets, in-depth
assessment of prediction error versus fingerprint types and possible usage of other
properties, compound placement and spline fitting, automatic scaling of points and
contours, deployment in Tibco Spotfire, etc. Ideas, collaborators, and help are
greatly welcomed!
Stay tuned!
SAR Contour Landscape (SARCL) Maps.
Useful for Compound Organization and
Activity Prediction?
20
Many thanks to ...
Abbott Discovery Technologies
(Cheminformatics)
Rishi Gupta
Isabella Haight
Phil Hajduk
Yvonne Martin
Steve Muchmore
Abbott Discovery IT
Robert Gregg
James Kofron
Ravi Mamidipaka
Abbott HTS
Scott Galasinki
Lemma Kifle
Phil Merta
Accelrys Support
Keith Burdick
Mike Cherry
Dana Honeycutt
Ian Kerman
Avinash Kewalramani
And many other helpful folks!

More Related Content

What's hot

What's hot (9)

Numerical calculation of high-order QED contributions to the electron anomalo...
Numerical calculation of high-order QED contributions to the electron anomalo...Numerical calculation of high-order QED contributions to the electron anomalo...
Numerical calculation of high-order QED contributions to the electron anomalo...
 
Multi-core GPU – Fast parallel SAR image generation
Multi-core GPU – Fast parallel SAR image generationMulti-core GPU – Fast parallel SAR image generation
Multi-core GPU – Fast parallel SAR image generation
 
Hydrodynamic Simulations and Time-dependent Photoionization Modeling of Starb...
Hydrodynamic Simulations and Time-dependent Photoionization Modeling of Starb...Hydrodynamic Simulations and Time-dependent Photoionization Modeling of Starb...
Hydrodynamic Simulations and Time-dependent Photoionization Modeling of Starb...
 
Neural Network Control Based on Adaptive Observer for Quadrotor Helicopter
Neural Network Control Based on Adaptive Observer for Quadrotor HelicopterNeural Network Control Based on Adaptive Observer for Quadrotor Helicopter
Neural Network Control Based on Adaptive Observer for Quadrotor Helicopter
 
Scaling out logistic regression with Spark
Scaling out logistic regression with SparkScaling out logistic regression with Spark
Scaling out logistic regression with Spark
 
2014-06-20 Multinomial Logistic Regression with Apache Spark
2014-06-20 Multinomial Logistic Regression with Apache Spark2014-06-20 Multinomial Logistic Regression with Apache Spark
2014-06-20 Multinomial Logistic Regression with Apache Spark
 
Tuning of PID, SVFB and LQ Controllers Using Genetic Algorithms
Tuning of PID, SVFB and LQ Controllers Using Genetic AlgorithmsTuning of PID, SVFB and LQ Controllers Using Genetic Algorithms
Tuning of PID, SVFB and LQ Controllers Using Genetic Algorithms
 
Large scale logistic regression and linear support vector machines using spark
Large scale logistic regression and linear support vector machines using sparkLarge scale logistic regression and linear support vector machines using spark
Large scale logistic regression and linear support vector machines using spark
 
口頭発表 @ 物理学会秋季大会 2012.09.13
口頭発表 @ 物理学会秋季大会 2012.09.13口頭発表 @ 物理学会秋季大会 2012.09.13
口頭発表 @ 物理学会秋季大会 2012.09.13
 

Viewers also liked (11)

Contours And Its Uses
Contours And Its UsesContours And Its Uses
Contours And Its Uses
 
Asian Architecture Case Study Report
Asian Architecture Case Study Report Asian Architecture Case Study Report
Asian Architecture Case Study Report
 
Asian Architecture Presentation Slides
Asian Architecture Presentation SlidesAsian Architecture Presentation Slides
Asian Architecture Presentation Slides
 
Final review contour
Final review  contourFinal review  contour
Final review contour
 
Case Study on Belum Rainforest Resort, Malaysia
Case Study on Belum Rainforest Resort, MalaysiaCase Study on Belum Rainforest Resort, Malaysia
Case Study on Belum Rainforest Resort, Malaysia
 
Case studies on heritage conservation
Case studies on heritage conservationCase studies on heritage conservation
Case studies on heritage conservation
 
Contouring
ContouringContouring
Contouring
 
Contouring
ContouringContouring
Contouring
 
Case study -country resort
Case study -country resortCase study -country resort
Case study -country resort
 
Case study/ Literature of a School
Case study/ Literature of a SchoolCase study/ Literature of a School
Case study/ Literature of a School
 
Case study on Resort
Case study on ResortCase study on Resort
Case study on Resort
 

Similar to Jim Metz.SAR Contour Landscapes

Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outsid...
Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outsid...Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outsid...
Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outsid...
inside-BigData.com
 
Presentation for the 21th EUROSTAR Users Conference - June 2013
Presentation for the 21th EUROSTAR Users Conference - June 2013 Presentation for the 21th EUROSTAR Users Conference - June 2013
Presentation for the 21th EUROSTAR Users Conference - June 2013
Antonios Arkas
 
Representing and Querying Geospatial Information in the Semantic Web
Representing and Querying Geospatial Information in the Semantic WebRepresenting and Querying Geospatial Information in the Semantic Web
Representing and Querying Geospatial Information in the Semantic Web
Kostis Kyzirakos
 
Indian satellite based navigation system description and implementation status
Indian satellite based navigation system description and implementation statusIndian satellite based navigation system description and implementation status
Indian satellite based navigation system description and implementation status
huangxj73
 
Synthetic aperture radar_advanced
Synthetic aperture radar_advancedSynthetic aperture radar_advanced
Synthetic aperture radar_advanced
Naivedya Mishra
 

Similar to Jim Metz.SAR Contour Landscapes (20)

QR Factorizations and SVDs for Tall-and-skinny Matrices in MapReduce Architec...
QR Factorizations and SVDs for Tall-and-skinny Matrices in MapReduce Architec...QR Factorizations and SVDs for Tall-and-skinny Matrices in MapReduce Architec...
QR Factorizations and SVDs for Tall-and-skinny Matrices in MapReduce Architec...
 
Direct QR factorizations for tall-and-skinny matrices in MapReduce architectu...
Direct QR factorizations for tall-and-skinny matrices in MapReduce architectu...Direct QR factorizations for tall-and-skinny matrices in MapReduce architectu...
Direct QR factorizations for tall-and-skinny matrices in MapReduce architectu...
 
PCA and SVD in brief
PCA and SVD in briefPCA and SVD in brief
PCA and SVD in brief
 
Scalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduceScalable and Adaptive Graph Querying with MapReduce
Scalable and Adaptive Graph Querying with MapReduce
 
Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outsid...
Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outsid...Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outsid...
Extreme Computing for Extreme Adaptive Optics: The Key to Finding Life Outsid...
 
OPTIMAL PID CONTROLLER DESIGN FOR SPEED CONTROL OF A SEPARATELY EXCITED DC MO...
OPTIMAL PID CONTROLLER DESIGN FOR SPEED CONTROL OF A SEPARATELY EXCITED DC MO...OPTIMAL PID CONTROLLER DESIGN FOR SPEED CONTROL OF A SEPARATELY EXCITED DC MO...
OPTIMAL PID CONTROLLER DESIGN FOR SPEED CONTROL OF A SEPARATELY EXCITED DC MO...
 
OPTIMAL PID CONTROLLER DESIGN FOR SPEED CONTROL OF A SEPARATELY EXCITED DC MO...
OPTIMAL PID CONTROLLER DESIGN FOR SPEED CONTROL OF A SEPARATELY EXCITED DC MO...OPTIMAL PID CONTROLLER DESIGN FOR SPEED CONTROL OF A SEPARATELY EXCITED DC MO...
OPTIMAL PID CONTROLLER DESIGN FOR SPEED CONTROL OF A SEPARATELY EXCITED DC MO...
 
Optimal PID Controller Design for Speed Control of a Separately Excited DC Mo...
Optimal PID Controller Design for Speed Control of a Separately Excited DC Mo...Optimal PID Controller Design for Speed Control of a Separately Excited DC Mo...
Optimal PID Controller Design for Speed Control of a Separately Excited DC Mo...
 
Text Mining Applied to SQL Queries: a Case Study for SDSS SkyServer
Text Mining Applied to SQL Queries: a Case Study for SDSS SkyServerText Mining Applied to SQL Queries: a Case Study for SDSS SkyServer
Text Mining Applied to SQL Queries: a Case Study for SDSS SkyServer
 
Presentation for the 21th EUROSTAR Users Conference - June 2013
Presentation for the 21th EUROSTAR Users Conference - June 2013 Presentation for the 21th EUROSTAR Users Conference - June 2013
Presentation for the 21th EUROSTAR Users Conference - June 2013
 
Representing and Querying Geospatial Information in the Semantic Web
Representing and Querying Geospatial Information in the Semantic WebRepresenting and Querying Geospatial Information in the Semantic Web
Representing and Querying Geospatial Information in the Semantic Web
 
Materials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum ComputationMaterials Design in the Age of Deep Learning and Quantum Computation
Materials Design in the Age of Deep Learning and Quantum Computation
 
Indian satellite based navigation system description and implementation status
Indian satellite based navigation system description and implementation statusIndian satellite based navigation system description and implementation status
Indian satellite based navigation system description and implementation status
 
Design and Implementation of Variable Radius Sphere Decoding Algorithm
Design and Implementation of Variable Radius Sphere Decoding AlgorithmDesign and Implementation of Variable Radius Sphere Decoding Algorithm
Design and Implementation of Variable Radius Sphere Decoding Algorithm
 
T2K status in 2017
T2K status in 2017T2K status in 2017
T2K status in 2017
 
Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...
Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...
Ml srhwt-machine-learning-based-superlative-rapid-haar-wavelet-transformation...
 
Synthetic aperture radar_advanced
Synthetic aperture radar_advancedSynthetic aperture radar_advanced
Synthetic aperture radar_advanced
 
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
2018 IMSM: MIT Lincoln Laboratories  Problem Presentation - Light Reflections...2018 IMSM: MIT Lincoln Laboratories  Problem Presentation - Light Reflections...
2018 IMSM: MIT Lincoln Laboratories Problem Presentation - Light Reflections...
 
Shape Recognition and Retrieval Based on Edit Distance and Dynamic Programming
Shape Recognition and Retrieval Based on Edit Distance and  Dynamic ProgrammingShape Recognition and Retrieval Based on Edit Distance and  Dynamic Programming
Shape Recognition and Retrieval Based on Edit Distance and Dynamic Programming
 
Data profiling in Apache Calcite
Data profiling in Apache CalciteData profiling in Apache Calcite
Data profiling in Apache Calcite
 

Jim Metz.SAR Contour Landscapes

  • 1. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? James T. Metz, Ph.D. Abbott Laboratories R4DG - Dept. of Discovery Technologies (Cheminformatics) Accelrys 2012 US User Group Meeting May 9, 2012 Boston Marriott Longwharf Hotel Boston, MA
  • 2. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 2 SAR Contour Landscape Maps – Motivation? Cpd # 2191A pKd (DYRK1A) = 4.50 Cpd # 762 pKd (DYRK1A) = 7.20 Cpd # 41 pKd (DYRK1A) = 8.10 # 41 # 2191 # 762 A) All structures and experimentally measured kinase activities are from Metz, J.T. et al. “Navigating the Kinome,” Nature Chemical Biology, 7 (2011) 200-202, supplementary results + 427 other cpds with experimentally measured DYRK1A kinase activities If I can generate an SAR contour landscape map, and if the compound organization and contours are meaningful and predictive, I can begin asking some useful questions ...
  • 3. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 3 1. What does the SAR generally look like as a 2D contour map? Where are the “hot” regions (active cpds)? What are the chemotypes in those regions? Where are the “cold” regions (inactive cpds)? Is the landscape smoothly varying or rough and jagged? 2. What regions are well-explored (densely populated)? What regions are under-explored? 3. Can the map be used to predict the activity of virtual compounds? If so, what compound should I synthesize next? What general direction (set of compounds) should I consider to get to where I need to go? 4. What else can I do with SARCL maps? SAR Contour Landscape Map – Possible Questions, Possible Utility
  • 4. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 4 How to generate a SARCL map? – Pipeline Pilot 1. Begin with a set of 2D structures and activities for a target 2. Compute the complete Property Distance (1 – Tanimoto Similarity) matrix for all compound pairs using ECFP_6 fingerprints 0.00 0.89 0.90 0.89 0.00 0.94 0.90 0.94 0.00 Cpd # 2191 Cpd # 762 Cpd # 41 # 2191 # 2191 # 41 # 762 # 41 # 762
  • 5. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 5 How to generate a SARCL map? – R statistics (igraph) 0.00 0.89 0.90 0.89 0.00 0.94 0.90 0.94 0.00 # 2191 # 2191 # 41 # 762 # 41 # 762 R Statistics igraph package Network X NetworkY Distance Matrix 41 2191 762 347 Network Coordinates
  • 6. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 6 How to generate a SARCL map? – R statistics (AKIMA) Structure Cpd ID “X” Network X “Y” Network Y “Z” pActivity (DYRK1A) 2191 122.2 94.8 4.50 41 -365.8 54.0 8.10 762 138.3 278.2 7.20 # 41 # 2191 # 762 R Statistics AKIMA package NOTE! – AKIMA can also be used to predict Z values (pActivity) given an (X,Y) point, if contours exist around that point e.g., from the training set. Black filled circle – Training Set (TS) Grey filled diamond – Prediction Set (PS)
  • 7. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 7 SARCL Activity Map: DYRK1A Kinase Error Statistics: <pKd> TS = 5.92 “trivial model” PS MUEA (“trivial model”) = +/- 0.65 TS (N = 335) MUE = +/- 0.03 pKd PS (N = 84) MUE = +/- 0.43 pKd A) Mean Unsigned Error (MUE) = +/- < | pKd(expt) – pKd(calc) | > Tooltip Molecule Popup:
  • 8. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 8 SARCL Activity Map: CDK5 Kinase Error Statistics: TS (N = 335) MUE = +/- 0.03 pKd PS (N = 84) MUE = +/- 0.38 pKd
  • 9. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 9 SARCL Activity Map: ROCK1 Kinase Error Statistics: TS (N = 335) MUE = +/- 0.05 pKd PS (N = 84) MUE = +/- 0.59 pKd
  • 10. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 10 DYRK1A SARCL Activity Maps - Contour Landscape vs. Percentage of Available Training Set (TS) Data 100 % TS Data TS (N = 335) MUE = +/- 0.03 pKd PS (N = 84) MUE = +/- 0.43 pKd 90 % TS Data TS (N = 308) MUE = +/- 0.03 pKd PS (N = 84) MUE = +/- 0.44 pKd 80 % TS Data TS (N = 292) MUE = +/- 0.03 pKd PS (N = 84) MUE = +/- 0.45 pKd 70 % TS Data TS (N = 257) MUE = +/- 0.03 pKd PS (N = 84) MUE = +/- 0.46 pKd 60 % TS Data TS (N = 209) MUE = +/- 0.04 pKd PS (N = 84) MUE = +/- 0.47 pKd 50 % TS Data TS (N = 167) MUE = +/- 0.04 pKd PS (N = 84) MUE = +/- 0.50 pKd
  • 11. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 11 PS MUE = f (Tanimoto Distance, Count) ? Tan. Dist. = 0.10 Tan. Dist. = 0.20 Tan. Dist. = 0.30 Hmmm... In Progress! PS molecule TS molecule
  • 12. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 12 1. Selectivity between 2 targets Selectivity = pKd(DYRK1A) – pKd(CDK5) 2. Activity and Selectivity between 2 targets Activity and Selectivity = pKd (DYRK1A) + pKd(DYRK1A) – pKd(CDK5) 3. Activity and Multiple Selectivity for > 2 targets Activity and Multiple Selectivity = pKd (DYRK1A) + pKd(DYRK1A) – pKd(CDK5) + pKd(DYRK1A) – pKd(ROCK1) What else can I do with SARCL maps? – Fun with pKd math! activity selectivity
  • 13. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 13 SARCL Selectivity Map: pKd(DYRK1A) – pKd(CDK5) _ = Error Statistics: PS (N = 84) MUE = +/- 0.35 pKd
  • 14. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 14 SARCL Selectivity Map: pKd(DYRK1A) – pKd(ROCK1) _ = Error Statistics: PS (N = 84) MUE = +/- 0.57 pKd
  • 15. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 15 SARCL Selectivity Map: pKd(CDK5) – pKd(ROCK1) _ = Error Statistics: PS (N = 84) MUE = +/- 0.52 pKd
  • 16. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 16 SARCL Activity and Selectivity Map: pKd (DYRK1A) + (pKd(DYRK1A) – pKd(CDK5)) + = Error Statistics: PS (N = 84) MUE = +/- 0.65 pKd
  • 17. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 17 SARCL Activity and Multiple Selectivity Map: pKd (DYRK1A) + (pKd(DYRK1A) – pKd(CDK5)) + (pKd(DYRK1A) – pKd(ROCK1)) + + = Error Statistics: PS (N = 84) MUE = +/- 1.04 pKd
  • 18. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 18 1. Why is an activity contouring method potentially useful for predicting activity? Patches on the surface represent a set of spline curves that are fit locally to the activity data. Local fitting potentially avoids the “one equation” or “one model” problem. Data sets may contain multiple and varied chemotypes! Hmmm... A Parting Thought
  • 19. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 19 Conclusions and Future Directions 1. SAR Contour Landscape (SARCL) maps of small molecule SAR data sets can be constructed using Pipeline Pilot and R statistics igraph and AKIMA codes. Training Set error for 3 kinases is ~ +/- 0.04 pKd units. Prediction Set error is ~ +/- 0.5 pKd units. Related approaches have appeared in the literature (Peltason, L. et al. J. Chem. Inf. Model. 50 (2010) 1021, Reutlinger, M. et al. Angew. Chem. Int. Ed. 50 (2011) 1). 2. SARCL maps of activity and (multiple) selectivity are also possible via simple mathematical manipulations of the activity data. 3. More work to do! Current challenges include handling larger data sets, in-depth assessment of prediction error versus fingerprint types and possible usage of other properties, compound placement and spline fitting, automatic scaling of points and contours, deployment in Tibco Spotfire, etc. Ideas, collaborators, and help are greatly welcomed! Stay tuned!
  • 20. SAR Contour Landscape (SARCL) Maps. Useful for Compound Organization and Activity Prediction? 20 Many thanks to ... Abbott Discovery Technologies (Cheminformatics) Rishi Gupta Isabella Haight Phil Hajduk Yvonne Martin Steve Muchmore Abbott Discovery IT Robert Gregg James Kofron Ravi Mamidipaka Abbott HTS Scott Galasinki Lemma Kifle Phil Merta Accelrys Support Keith Burdick Mike Cherry Dana Honeycutt Ian Kerman Avinash Kewalramani And many other helpful folks!