SlideShare a Scribd company logo
Seed Suggestions % in
SureChEMBL
222 43
234 21
protease, 6536
phosphatase,
260
kinase, 12686
ion_channel,
4370
GPCR_7TM,
19523
Δ data A to B
MedChemica
Potency and Patents, new arenas for Matched Molecular
Pair analysis (MMPA)
Dr. Al G. Dossetter, Dr. Ed J. Griffen, Dr. Andrew G. Leach, Dr. Shane Montague
References
1Griffen, E. et al. Matched Molecular Pairs as a Medicinal Chemistry Tool. J. Med. Chem. 2011, 54(22), pp.7739-7750.
2Leach, A.G. et. al. Matched Molecular Pairs as a Guide in the Optimization of Pharmaceutical Properties; a Study of Aqueous Solubility, Plasma Protein Binding and Oral Exposure. J. Med. Chem. 2006, 49(23), pp.6672-6682.
3Papadatos, G. et al. Lead Optimization Using Matched Molecular Pairs: Inclusion of Contextual Information for Enhanced of hERG Inhibition, Solubility, and Lipophilicity. J. Chem. Inf. Model. 2010, 50(10), pp.1872-1886.
Problem
Can we understand the relationship between
patents, identify critical compounds and
automatically extract SAR?
Solution
Combine all the compounds and perform MMPA to find all the pair relationships
independent of patent membership. Use graph theory to identify critical compounds
and exploit public data to suggest further analogues and estimate their potency.
MMPA - a method of determining structure activity relationships (SAR’s) within sets of compounds. Matched Molecular Pairs
(MMP’s) are identified and differences in their measured data are used to link properties to structure.1
contact@medchemica.com
Selecting rules
Statistical analysis of data sets of SMIRKS to extract chemical
transformations that are most likely to be genuine.
3)
4) Extract rules from public potency data
Learning
• Useful potency SAR knowledge can be extracted from public data
• MMP network analysis of patents identifies pivotal compounds
• The method is validated by finding that large numbers of compounds suggested using these rules are now patented
• Extending MMP based network analysis by application of machine learning methods and exploiting MCSS structures within clusters to improve predictive accuracy
Advanced MMP’s
• Two pair finding techniques are available
• Not all pairs are found by a single method, both methods are
needed to maximize the MMP output
Molecules that differ only by a particular, well-
defined, structural transformation2
A MMP found by both methods:
1)
Fragment and Index method
Maximum Common Sub-Structure method (MCSS)
Environment Capture
• Chemical transformations are encoded as SMIRKS and recorded
along with their delta property value(s)
• The SMIRKS contain the structural change along with the chemical
environment spanning up to 4 atoms out
Essential for understanding the context of the transformation3
[c:6]1[c:4]([H])[c:2]([H])[c:1]([c:3]([H])[c:5]1[c:
7])([H])>>[c:6]1[c:4]([H])[c:2]([H])[c:1]([c:3]([H]
)[c:5]1[c:7])[F]
2)
[c:4][c:2]([H])[c:1]([c:3]([H])[c:5])([H])
>>[c:4][c:2]([H])[c:1]([c:3]([H])[c:5])[F]
[c:2][c:1]([c:3])([H])>>[c:2][c:1]([c:3])[F] [c:1]([H])>>[c:1][F]
The MMP as a transformation:
4 atom environment: 3 atom environment:
2 atom environment: 1 atom environment:
Δ data A to BΔ data A to B
Δ data A to B
FragA >> FragB
Kinase class number of rules
kinase_agc 1576
kinase_atypical 788
kinase_camk 2376
kinase_ck1 32
kinase_cmgc 1010
kinase_reg 256
kinase_ste 110
kinase_tk 4696
kinase_tkl 1842
• Clean:
• ChEMBL structures,
• convert measurements to pIC50 / pKi,
• aggregate multiple measurements on same compound by
target
• Find MMPA based rules per target
• Organize targets by protein class and sub-class
• Rules can by applied by target, sub-class or class
• The distribution of rules mirrors the distribution of data
5) Identify pivotal compound in patents
• Clean SureChEMBL structures with patent identifiers
• Generate a network map showing MMP relationships
between patents
• Network analysis identifies the key compounds within
patents
• Points are compounds colored by the patent they
were first disclosed in (green / blue), or the clinically
used compounds(red) or yellow – most highly
connected compound in each patent
• Links represent a matched molecular pair
relationships
• Distances are based on a spring force model and
are for visualization only
O
ON
O
N
N
HN Cl
F
O
O
O
O
N
N
HN
N O
O N
N
HN
Cl
F
O
O N
N
HN
2 steps to Gefitinib
3 steps to Erlotinib
Gefitinib
Erlotinib
Focus the rules used to
generate new
compounds by applying
those from the right
kinase sub class
Apply rules
to pivotal
compounds
O
O N
N
HN
N O
O N
N
HN
Cl
F
6) Estimate potency from network models
• Extending the network analysis to all the public EGF
potency data:
• MMP based clusters can be identified and
characterized by their potency
• Being a MMP neighbor in a cluster is sufficient to
estimate a compounds potency to within 1 log.
• The MMP methods used generate sets of maximum
common substructures for each cluster enabling
further direction of chemistry
• Points represent individual compounds
• Links represent a matched molecular pair relationship
pIC5
0
>8
6-8
<6
EGFR tyrosine kinase network based
potency analysis
Size of cluster Clusters Compounds
<8 compounds 133 415
>=8 compounds 59 3213
Total 192 3628
Simple regression modeling of potency based on just
cluster membership(10 fold cross validation): R2 0.44,
RMSE 0.97
Further modeling based on the maximum common
substructures within clusters in progress.

More Related Content

What's hot

Drug design based on bioinformatic tools
Drug design based on bioinformatic toolsDrug design based on bioinformatic tools
Drug design based on bioinformatic tools
SujeethKrishnan
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
Shrihith.A Ananthram
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
Maakasaikumar
 
Molecular docking and_virtual_screening
Molecular docking and_virtual_screeningMolecular docking and_virtual_screening
Molecular docking and_virtual_screening
Florent Barbault
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
Rahul B S
 
In Silico methods for ADMET prediction of new molecules
 In Silico methods for ADMET prediction of new molecules In Silico methods for ADMET prediction of new molecules
In Silico methods for ADMET prediction of new molecules
MadhuraDatar
 
Docking
DockingDocking
Molecular Docking
 Molecular Docking Molecular Docking
Molecular Docking
Malharraiyani
 
MOLECULAR DOCKING
MOLECULAR DOCKINGMOLECULAR DOCKING
MOLECULAR DOCKING
Bhavesh Amrute
 
Basics Of Molecular Docking
Basics Of Molecular DockingBasics Of Molecular Docking
Basics Of Molecular Docking
Satarupa Deb
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug design
Thanh Truong
 
Lecture 5 pharmacophore and qsar
Lecture 5  pharmacophore and  qsarLecture 5  pharmacophore and  qsar
Lecture 5 pharmacophore and qsar
RAJAN ROLTA
 
Connecting Metabolomic Data with Context
Connecting Metabolomic Data with ContextConnecting Metabolomic Data with Context
Connecting Metabolomic Data with Context
Dmitry Grapov
 
Basics of QSAR Modeling
Basics of QSAR ModelingBasics of QSAR Modeling
Basics of QSAR Modeling
Prachi Pradeep
 
Structure based drug design- kiranmayi
Structure based drug design- kiranmayiStructure based drug design- kiranmayi
Structure based drug design- kiranmayi
KiranmayiKnv
 
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
csandit
 
Pharmacophore mapping joon
Pharmacophore mapping joonPharmacophore mapping joon
Pharmacophore mapping joon
Joon Jyoti Sahariah
 
STRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERY
STRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERYSTRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERY
STRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERY
THILAKAR MANI
 
Docking
DockingDocking
Docking
Hema Mallika
 

What's hot (19)

Drug design based on bioinformatic tools
Drug design based on bioinformatic toolsDrug design based on bioinformatic tools
Drug design based on bioinformatic tools
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 
Molecular docking and_virtual_screening
Molecular docking and_virtual_screeningMolecular docking and_virtual_screening
Molecular docking and_virtual_screening
 
Molecular docking
Molecular dockingMolecular docking
Molecular docking
 
In Silico methods for ADMET prediction of new molecules
 In Silico methods for ADMET prediction of new molecules In Silico methods for ADMET prediction of new molecules
In Silico methods for ADMET prediction of new molecules
 
Docking
DockingDocking
Docking
 
Molecular Docking
 Molecular Docking Molecular Docking
Molecular Docking
 
MOLECULAR DOCKING
MOLECULAR DOCKINGMOLECULAR DOCKING
MOLECULAR DOCKING
 
Basics Of Molecular Docking
Basics Of Molecular DockingBasics Of Molecular Docking
Basics Of Molecular Docking
 
Structure based computer aided drug design
Structure based computer aided drug designStructure based computer aided drug design
Structure based computer aided drug design
 
Lecture 5 pharmacophore and qsar
Lecture 5  pharmacophore and  qsarLecture 5  pharmacophore and  qsar
Lecture 5 pharmacophore and qsar
 
Connecting Metabolomic Data with Context
Connecting Metabolomic Data with ContextConnecting Metabolomic Data with Context
Connecting Metabolomic Data with Context
 
Basics of QSAR Modeling
Basics of QSAR ModelingBasics of QSAR Modeling
Basics of QSAR Modeling
 
Structure based drug design- kiranmayi
Structure based drug design- kiranmayiStructure based drug design- kiranmayi
Structure based drug design- kiranmayi
 
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO
 
Pharmacophore mapping joon
Pharmacophore mapping joonPharmacophore mapping joon
Pharmacophore mapping joon
 
STRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERY
STRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERYSTRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERY
STRUCTURE BASED DRUG DESIGN - MOLECULAR MODELLING AND DRUG DISCOVERY
 
Docking
DockingDocking
Docking
 

Similar to RSC Hatfield 2018 Kinase meeting : potency patents MMPA approaches

2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai
Sirris
 
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
Ellen Berg
 
SOT short course on computational toxicology
SOT short course on computational toxicology SOT short course on computational toxicology
SOT short course on computational toxicology
Sean Ekins
 
Metabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plantsMetabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plants
N Poorin
 
Virtual sreening
Virtual sreeningVirtual sreening
Virtual sreening
Mahendra G S
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
Kamel Mansouri
 
Experimental methods and the big data sets
Experimental methods and the big data sets Experimental methods and the big data sets
Experimental methods and the big data sets
improvemed
 
Application of Mass Spectrometry In Biotechnology
Application of Mass Spectrometry In BiotechnologyApplication of Mass Spectrometry In Biotechnology
Application of Mass Spectrometry In Biotechnology
Bhanu Krishan
 
Homology Modeling of CCR3 Receptor
Homology Modeling of CCR3 ReceptorHomology Modeling of CCR3 Receptor
Homology Modeling of CCR3 Receptor
dharmakarma
 
Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...
Sunghwan Kim
 
Proteomics
ProteomicsProteomics
Proteomics
Shereen Shehata
 
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
Ed Griffen
 
Quantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserQuantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserNeil Swainston
 
“Proteomics” to study genes and genomes
“Proteomics” to study genes and genomes“Proteomics” to study genes and genomes
“Proteomics” to study genes and genomes
Nazish_Nehal
 
Presentation july 31_2015
Presentation july 31_2015Presentation july 31_2015
Presentation july 31_2015
gkoytiger
 
-Unprotected Docs-Student Posters 2013-Marchelle Meza (2)
-Unprotected Docs-Student Posters 2013-Marchelle Meza (2)-Unprotected Docs-Student Posters 2013-Marchelle Meza (2)
-Unprotected Docs-Student Posters 2013-Marchelle Meza (2)Marchelle Meza
 
Weber_FinalProposal
Weber_FinalProposalWeber_FinalProposal
Weber_FinalProposalAnna Weber
 
Applied Bioinformatics Assignment 5docx
Applied Bioinformatics Assignment  5docxApplied Bioinformatics Assignment  5docx
Applied Bioinformatics Assignment 5docx
University of Allahabad
 

Similar to RSC Hatfield 2018 Kinase meeting : potency patents MMPA approaches (20)

2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai2 partners ed_kickoff_dtai
2 partners ed_kickoff_dtai
 
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...
 
JClinChem_2003
JClinChem_2003JClinChem_2003
JClinChem_2003
 
SOT short course on computational toxicology
SOT short course on computational toxicology SOT short course on computational toxicology
SOT short course on computational toxicology
 
Metabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plantsMetabolic engineering approaches in medicinal plants
Metabolic engineering approaches in medicinal plants
 
Virtual sreening
Virtual sreeningVirtual sreening
Virtual sreening
 
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
 
Experimental methods and the big data sets
Experimental methods and the big data sets Experimental methods and the big data sets
Experimental methods and the big data sets
 
Application of Mass Spectrometry In Biotechnology
Application of Mass Spectrometry In BiotechnologyApplication of Mass Spectrometry In Biotechnology
Application of Mass Spectrometry In Biotechnology
 
Homology Modeling of CCR3 Receptor
Homology Modeling of CCR3 ReceptorHomology Modeling of CCR3 Receptor
Homology Modeling of CCR3 Receptor
 
Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...Development of machine learning-based prediction models for chemical modulato...
Development of machine learning-based prediction models for chemical modulato...
 
Proteomics
ProteomicsProteomics
Proteomics
 
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
MedChemica Large scale analysis and sharing of Medicinal chemistry Knowledge ...
 
Quantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To BrowserQuantitative Proteomics: From Instrument To Browser
Quantitative Proteomics: From Instrument To Browser
 
“Proteomics” to study genes and genomes
“Proteomics” to study genes and genomes“Proteomics” to study genes and genomes
“Proteomics” to study genes and genomes
 
Presentation july 31_2015
Presentation july 31_2015Presentation july 31_2015
Presentation july 31_2015
 
dream
dreamdream
dream
 
-Unprotected Docs-Student Posters 2013-Marchelle Meza (2)
-Unprotected Docs-Student Posters 2013-Marchelle Meza (2)-Unprotected Docs-Student Posters 2013-Marchelle Meza (2)
-Unprotected Docs-Student Posters 2013-Marchelle Meza (2)
 
Weber_FinalProposal
Weber_FinalProposalWeber_FinalProposal
Weber_FinalProposal
 
Applied Bioinformatics Assignment 5docx
Applied Bioinformatics Assignment  5docxApplied Bioinformatics Assignment  5docx
Applied Bioinformatics Assignment 5docx
 

More from Ed Griffen

MedChemica Levinthal Lecture at Openeye CUP XX 2020
MedChemica Levinthal Lecture at Openeye CUP XX 2020MedChemica Levinthal Lecture at Openeye CUP XX 2020
MedChemica Levinthal Lecture at Openeye CUP XX 2020
Ed Griffen
 
Explainable AI in Drug Hunting
Explainable AI in Drug HuntingExplainable AI in Drug Hunting
Explainable AI in Drug Hunting
Ed Griffen
 
SCI What can Big Data do for Chemistry 2017 MedChemica
SCI What can Big Data do for Chemistry 2017 MedChemicaSCI What can Big Data do for Chemistry 2017 MedChemica
SCI What can Big Data do for Chemistry 2017 MedChemica
Ed Griffen
 
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
Ed Griffen
 
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
Ed Griffen
 
Extracting actionable knowledge from large scale in vitro pharmacology data
Extracting actionable knowledge from large scale in vitro pharmacology dataExtracting actionable knowledge from large scale in vitro pharmacology data
Extracting actionable knowledge from large scale in vitro pharmacology data
Ed Griffen
 
Pharmacophore extraction from Matched Molecular Pair Analysis
Pharmacophore extraction from Matched Molecular Pair AnalysisPharmacophore extraction from Matched Molecular Pair Analysis
Pharmacophore extraction from Matched Molecular Pair Analysis
Ed Griffen
 

More from Ed Griffen (7)

MedChemica Levinthal Lecture at Openeye CUP XX 2020
MedChemica Levinthal Lecture at Openeye CUP XX 2020MedChemica Levinthal Lecture at Openeye CUP XX 2020
MedChemica Levinthal Lecture at Openeye CUP XX 2020
 
Explainable AI in Drug Hunting
Explainable AI in Drug HuntingExplainable AI in Drug Hunting
Explainable AI in Drug Hunting
 
SCI What can Big Data do for Chemistry 2017 MedChemica
SCI What can Big Data do for Chemistry 2017 MedChemicaSCI What can Big Data do for Chemistry 2017 MedChemica
SCI What can Big Data do for Chemistry 2017 MedChemica
 
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
Learning Medicinal Chemistry ADMET rules UKQSAR Sept 2017
 
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
Extracting medicinal chemistry knowledge by a secured Matched Molecular Pair ...
 
Extracting actionable knowledge from large scale in vitro pharmacology data
Extracting actionable knowledge from large scale in vitro pharmacology dataExtracting actionable knowledge from large scale in vitro pharmacology data
Extracting actionable knowledge from large scale in vitro pharmacology data
 
Pharmacophore extraction from Matched Molecular Pair Analysis
Pharmacophore extraction from Matched Molecular Pair AnalysisPharmacophore extraction from Matched Molecular Pair Analysis
Pharmacophore extraction from Matched Molecular Pair Analysis
 

Recently uploaded

Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
muralinath2
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
yusufzako14
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
aishnasrivastava
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SELF-EXPLANATORY
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
ssuserbfdca9
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
Areesha Ahmad
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
Richard Gill
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
subedisuryaofficial
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
Lokesh Patil
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
NathanBaughman3
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
Scintica Instrumentation
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
Columbia Weather Systems
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
DiyaBiswas10
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
AlaminAfendy1
 

Recently uploaded (20)

Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptxBody fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
Body fluids_tonicity_dehydration_hypovolemia_hypervolemia.pptx
 
in vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptxin vitro propagation of plants lecture note.pptx
in vitro propagation of plants lecture note.pptx
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Structural Classification Of Protein (SCOP)
Structural Classification Of Protein  (SCOP)Structural Classification Of Protein  (SCOP)
Structural Classification Of Protein (SCOP)
 
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdfSCHIZOPHRENIA Disorder/ Brain Disorder.pdf
SCHIZOPHRENIA Disorder/ Brain Disorder.pdf
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
4. An Overview of Sugarcane White Leaf Disease in Vietnam.pdf
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
GBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram StainingGBSN- Microbiology (Lab 3) Gram Staining
GBSN- Microbiology (Lab 3) Gram Staining
 
Richard's entangled aventures in wonderland
Richard's entangled aventures in wonderlandRichard's entangled aventures in wonderland
Richard's entangled aventures in wonderland
 
Citrus Greening Disease and its Management
Citrus Greening Disease and its ManagementCitrus Greening Disease and its Management
Citrus Greening Disease and its Management
 
Nutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technologyNutraceutical market, scope and growth: Herbal drug technology
Nutraceutical market, scope and growth: Herbal drug technology
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
Astronomy Update- Curiosity’s exploration of Mars _ Local Briefs _ leadertele...
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...
 
Orion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWSOrion Air Quality Monitoring Systems - CWS
Orion Air Quality Monitoring Systems - CWS
 
extra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdfextra-chromosomal-inheritance[1].pptx.pdfpdf
extra-chromosomal-inheritance[1].pptx.pdfpdf
 
In silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptxIn silico drugs analogue design: novobiocin analogues.pptx
In silico drugs analogue design: novobiocin analogues.pptx
 

RSC Hatfield 2018 Kinase meeting : potency patents MMPA approaches

  • 1. Seed Suggestions % in SureChEMBL 222 43 234 21 protease, 6536 phosphatase, 260 kinase, 12686 ion_channel, 4370 GPCR_7TM, 19523 Δ data A to B MedChemica Potency and Patents, new arenas for Matched Molecular Pair analysis (MMPA) Dr. Al G. Dossetter, Dr. Ed J. Griffen, Dr. Andrew G. Leach, Dr. Shane Montague References 1Griffen, E. et al. Matched Molecular Pairs as a Medicinal Chemistry Tool. J. Med. Chem. 2011, 54(22), pp.7739-7750. 2Leach, A.G. et. al. Matched Molecular Pairs as a Guide in the Optimization of Pharmaceutical Properties; a Study of Aqueous Solubility, Plasma Protein Binding and Oral Exposure. J. Med. Chem. 2006, 49(23), pp.6672-6682. 3Papadatos, G. et al. Lead Optimization Using Matched Molecular Pairs: Inclusion of Contextual Information for Enhanced of hERG Inhibition, Solubility, and Lipophilicity. J. Chem. Inf. Model. 2010, 50(10), pp.1872-1886. Problem Can we understand the relationship between patents, identify critical compounds and automatically extract SAR? Solution Combine all the compounds and perform MMPA to find all the pair relationships independent of patent membership. Use graph theory to identify critical compounds and exploit public data to suggest further analogues and estimate their potency. MMPA - a method of determining structure activity relationships (SAR’s) within sets of compounds. Matched Molecular Pairs (MMP’s) are identified and differences in their measured data are used to link properties to structure.1 contact@medchemica.com Selecting rules Statistical analysis of data sets of SMIRKS to extract chemical transformations that are most likely to be genuine. 3) 4) Extract rules from public potency data Learning • Useful potency SAR knowledge can be extracted from public data • MMP network analysis of patents identifies pivotal compounds • The method is validated by finding that large numbers of compounds suggested using these rules are now patented • Extending MMP based network analysis by application of machine learning methods and exploiting MCSS structures within clusters to improve predictive accuracy Advanced MMP’s • Two pair finding techniques are available • Not all pairs are found by a single method, both methods are needed to maximize the MMP output Molecules that differ only by a particular, well- defined, structural transformation2 A MMP found by both methods: 1) Fragment and Index method Maximum Common Sub-Structure method (MCSS) Environment Capture • Chemical transformations are encoded as SMIRKS and recorded along with their delta property value(s) • The SMIRKS contain the structural change along with the chemical environment spanning up to 4 atoms out Essential for understanding the context of the transformation3 [c:6]1[c:4]([H])[c:2]([H])[c:1]([c:3]([H])[c:5]1[c: 7])([H])>>[c:6]1[c:4]([H])[c:2]([H])[c:1]([c:3]([H] )[c:5]1[c:7])[F] 2) [c:4][c:2]([H])[c:1]([c:3]([H])[c:5])([H]) >>[c:4][c:2]([H])[c:1]([c:3]([H])[c:5])[F] [c:2][c:1]([c:3])([H])>>[c:2][c:1]([c:3])[F] [c:1]([H])>>[c:1][F] The MMP as a transformation: 4 atom environment: 3 atom environment: 2 atom environment: 1 atom environment: Δ data A to BΔ data A to B Δ data A to B FragA >> FragB Kinase class number of rules kinase_agc 1576 kinase_atypical 788 kinase_camk 2376 kinase_ck1 32 kinase_cmgc 1010 kinase_reg 256 kinase_ste 110 kinase_tk 4696 kinase_tkl 1842 • Clean: • ChEMBL structures, • convert measurements to pIC50 / pKi, • aggregate multiple measurements on same compound by target • Find MMPA based rules per target • Organize targets by protein class and sub-class • Rules can by applied by target, sub-class or class • The distribution of rules mirrors the distribution of data 5) Identify pivotal compound in patents • Clean SureChEMBL structures with patent identifiers • Generate a network map showing MMP relationships between patents • Network analysis identifies the key compounds within patents • Points are compounds colored by the patent they were first disclosed in (green / blue), or the clinically used compounds(red) or yellow – most highly connected compound in each patent • Links represent a matched molecular pair relationships • Distances are based on a spring force model and are for visualization only O ON O N N HN Cl F O O O O N N HN N O O N N HN Cl F O O N N HN 2 steps to Gefitinib 3 steps to Erlotinib Gefitinib Erlotinib Focus the rules used to generate new compounds by applying those from the right kinase sub class Apply rules to pivotal compounds O O N N HN N O O N N HN Cl F 6) Estimate potency from network models • Extending the network analysis to all the public EGF potency data: • MMP based clusters can be identified and characterized by their potency • Being a MMP neighbor in a cluster is sufficient to estimate a compounds potency to within 1 log. • The MMP methods used generate sets of maximum common substructures for each cluster enabling further direction of chemistry • Points represent individual compounds • Links represent a matched molecular pair relationship pIC5 0 >8 6-8 <6 EGFR tyrosine kinase network based potency analysis Size of cluster Clusters Compounds <8 compounds 133 415 >=8 compounds 59 3213 Total 192 3628 Simple regression modeling of potency based on just cluster membership(10 fold cross validation): R2 0.44, RMSE 0.97 Further modeling based on the maximum common substructures within clusters in progress.