Using Matched Series to decide what compound to make next

NextMove Software

UK-QSAR meeting Spring 2014 Eli Lilly, Erl Wood

Noel M. O’Boyle,1 Jonas Boström,2 Roger A. Sayle,1 Adrian Gill2
1 NextMove Software Ltd, Cambridge, UK
2 AstraZeneca, Mölndal, Sweden

NextMove Software Limited
Innovation Centre (Unit 23)
Cambridge Science Park
Milton Road, Cambridge
UK CB4 0EY
www.nextmovesoftware.com
@nmsoftware

Summary
Large amounts of activity data across a broad set of targets are
available either publicly (ChEMBL) or internally within a
pharmaceutical company. We describe a method that exploits these
data to predict R-groups that will improve binding affinity [1].

Bibliography
1. O’Boyle, N. M.; Boström, J.; Sayle, R. A.; Gill, A. Using Matched Molecular Series as a
Predictive Tool To Optimize Biological Activity. J. Med. Chem. 2014, 57, 2704.
2. Wawer, M.; Bajorath, J. Local Structural Changes, Global Data Views: Graphical
Substructure−Activity Relationship Trailing. J. Med. Chem. 2011, 54, 2944.
3. Topliss, J. G. Utilization of Operational Schemes for Analog Synthesis in Drug
Design. J. Med. Chem. 1972, 15, 1006.

What is a matched molecular series?
A matched (molecular) series is a set of molecules with the
same scaffold but different R-groups at a particular position [2]. The
related term, matched pair, refers to a matched series
consisting of just two molecules.

The Matsy algorithm: predicting R-groups likely to improve activity
Given a query matched series, the Matsy algorithm searches an
activity database for all R-groups that have been measured along
with the query, and calculates the percentage of times each R-group
increased the activity beyond the most active R-group in the query.
The R-groups with the highest percentages are presented as the best
candidates to try next.

Updating the Topliss Tree
In a landmark paper [3], Topliss described a decision tree that guides
a medicinal chemist to the most potent analogue of a substituted
phenyl ring. Topliss based his tree on a rational analysis of the
activity order. Using the Matsy algorithm, we have created a similar
tree that is based on observed experimental results in ChEMBL.

Figure: Matsy GUI

Enriched activity orders
For an ordered matched series of N R-groups, there are N! ways of
arranging the R-groups by activity. An interesting and useful
observation is that, for a particular matched series, certain
activity orders of the R-groups may be preferred. These preferences
are more pronounced for longer series. For example, using pIC50 data
from ChEMBL, there are 982 ordered matched series involving H, F, Cl,
and Br, which can be arranged in 4! = 24 possible ways.
The origin of these preferences is assumed to be the existence of
particular binding site environments that occur again and again across
multiple targets. Using these order preferences, we can predict
whether a particular R-group is likely to increase activity given an
observed activity order.
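
To make the idea of a preferred activity order concrete, the following is a minimal sketch that enumerates the N! possible orders and compares observed counts with a uniform expectation. The function name `order_enrichment`, the observed/expected ratio used as the enrichment measure, and the toy observations are all assumptions for illustration; they are not taken from the poster or from ChEMBL.

```python
from collections import Counter
from itertools import permutations
from math import factorial


def order_enrichment(rgroups, observed_orders):
    """Return {order: observed / expected} for every possible activity order.

    'expected' assumes (hypothetically) that all N! orders are equally likely.
    Each order is a tuple of R-groups, most active first.
    """
    all_orders = list(permutations(rgroups))
    # Sanity check: N distinct R-groups give N! possible orders.
    assert len(all_orders) == factorial(len(rgroups))
    counts = Counter(tuple(order) for order in observed_orders)
    expected = len(observed_orders) / len(all_orders)
    return {order: counts.get(order, 0) / expected for order in all_orders}


# Hypothetical toy observations (NOT real ChEMBL data).
observed = (
    [("Br", "Cl", "F", "H")] * 6
    + [("H", "F", "Cl", "Br")] * 3
    + [("Cl", "Br", "F", "H")] * 3
)
enrichment = order_enrichment(["H", "F", "Cl", "Br"], observed)
print(len(enrichment))                      # 24 possible orders
print(enrichment[("Br", "Cl", "F", "H")])   # 6 observed / 0.5 expected = 12.0
```

A ratio above 1 marks an order seen more often than chance would suggest under this simple null model; a real analysis would also need a significance test, which is omitted here.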
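J. Med. Chem. 2014, 57, 2704
Figure: Matsy workflow. Labels: activity database (e.g. ChEMBL, in-house); matched series database; query A > B; matching series A > B > C, C > A > B, D > A > B > C, D > A > C > B, E > D > A > B, …
Figure: the 24 possible activity orders ranked by preference. Highlighted: 1. most preferred order; 3. most preferred order “reversed”; 24. least preferred order.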
A large-scale retrospective test using ChEMBL16 showed that the
longer the series the greater the predictive power [1].
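
The prediction step described under the Matsy algorithm heading can be sketched as follows. This is an illustrative reading of the poster text, not NextMove's implementation: the function name `matsy_predict`, the list-based database format, the toy series, and the requirement that a database series contain the query R-groups in the same relative activity order are all assumptions. Ties in activity are ignored.

```python
from collections import defaultdict


def matsy_predict(query, database):
    """Rank candidate R-groups by how often they beat the best query R-group.

    query:    list of R-groups, most active first (e.g. ["A", "B"]).
    database: iterable of matched series, each a list of distinct R-groups,
              most active first.
    Returns a list of (rgroup, percent_improved, n_observations) tuples,
    sorted by percentage (highest first), then by observation count.
    """
    query_set = set(query)
    improved = defaultdict(int)
    observed = defaultdict(int)
    for series in database:
        # Assumption: keep only series that contain every query R-group
        # in the same relative activity order as the query.
        if [r for r in series if r in query_set] != list(query):
            continue
        best_query_rank = series.index(query[0])
        for rank, rgroup in enumerate(series):
            if rgroup in query_set:
                continue
            observed[rgroup] += 1
            # A lower rank means higher activity than the best query R-group.
            if rank < best_query_rank:
                improved[rgroup] += 1
    results = [(r, 100.0 * improved[r] / n, n) for r, n in observed.items()]
    return sorted(results, key=lambda t: (-t[1], -t[2], t[0]))


# Hypothetical toy database using placeholder R-group names.
toy_database = [
    ["A", "B", "C"],
    ["C", "A", "B"],
    ["D", "A", "B", "C"],
    ["D", "A", "C", "B"],
    ["E", "D", "A", "B"],
    ["B", "A", "F"],  # query order reversed, so this series is skipped
]
predictions = matsy_predict(["A", "B"], toy_database)
print(predictions)  # [('D', 100.0, 3), ('E', 100.0, 1), ('C', 25.0, 4)]
```

Reporting the observation count alongside each percentage matters in practice: in this toy case E scores 100% from a single series, which is much weaker evidence than D's 100% from three.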
