Virtual Toxicity panels focussed on interpretable machine learning models that can guide medicinal chemists to identify critical substructures that are assocaited with toxicities.
Accelerating lead optimisation with active learning by exploiting MMPA based ...Ed Griffen
Presented at the 15th GCC - German Conference on Cheminformatics November 2019
We combine regression forest machine learning with our MMPA based generative methods to deliver an active learning system to accelerate lead optimisation. In the process we identify permutative MMPA as a method to leverage SAR information from small data sets.
Published by MedChemica Ltd
Emerging Challenges for Artificial Intelligence in Medicinal ChemistryEd Griffen
Presentation by Dr Ed Griffen of MedChemica Ltd, at The IBSA Conference "How Artificial Intelligence Can Change the Pharmaceutical Landscape“ - LUGANO, October 9th 2019.
1. Scoring functions are the mathematical functions used to approximately predict the binding affinity between two molecules after they have been docked.
The evaluation and ranking of predicted ligand conformations is a crucial aspect of structure-based virtual screening.
2. Scoring functions implemented in docking programs make simplifications in the evaluation of modeled complexes.
3. Affinity scoring functions are applied to the energetically best pose found for each molecule, and comparing the affinity scores for different molecules gives their relative rank-ordering.
Limitations of in silico drug discovery methodsAlichy Sowmya
In drug discovery there are various in silico approaches such as Virtual high throughput screening, Molecular docking, Homology modelling, QSAR, CoMFA, Molecular Dynamics, and Pharmacophore mapping. In this presentation various limitations of these approaches are given
Virtual Toxicity panels focussed on interpretable machine learning models that can guide medicinal chemists to identify critical substructures that are assocaited with toxicities.
Accelerating lead optimisation with active learning by exploiting MMPA based ...Ed Griffen
Presented at the 15th GCC - German Conference on Cheminformatics November 2019
We combine regression forest machine learning with our MMPA based generative methods to deliver an active learning system to accelerate lead optimisation. In the process we identify permutative MMPA as a method to leverage SAR information from small data sets.
Published by MedChemica Ltd
Emerging Challenges for Artificial Intelligence in Medicinal ChemistryEd Griffen
Presentation by Dr Ed Griffen of MedChemica Ltd, at The IBSA Conference "How Artificial Intelligence Can Change the Pharmaceutical Landscape“ - LUGANO, October 9th 2019.
1. Scoring functions are the mathematical functions used to approximately predict the binding affinity between two molecules after they have been docked.
The evaluation and ranking of predicted ligand conformations is a crucial aspect of structure-based virtual screening.
2. Scoring functions implemented in docking programs make simplifications in the evaluation of modeled complexes.
3. Affinity scoring functions are applied to the energetically best pose found for each molecule, and comparing the affinity scores for different molecules gives their relative rank-ordering.
Limitations of in silico drug discovery methodsAlichy Sowmya
In drug discovery there are various in silico approaches such as Virtual high throughput screening, Molecular docking, Homology modelling, QSAR, CoMFA, Molecular Dynamics, and Pharmacophore mapping. In this presentation various limitations of these approaches are given
A lecture on molecular docking that I give for master students at University Paris Diderot.
Warning: this presentation has numerous animations which are not included in the slideshare document.
https://florentbarbault.wordpress.com/
Chemical risk assessment is often limited by the lack of experimental toxicity data for a large number of diverse chemicals. In the absence of experimental data, potential chemical hazard is often predicted using data gap filling techniques such as quantitative structure activity relationship (QSAR) models. QSARs are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR tools are a widely utilized alternative to time-consuming clinical and animal testing methods, yet concerns over reliability and uncertainty limit application of QSAR models for regulatory chemical risk assessments. The reliability of a QSAR model depends on the quality and quantity of experimental training data and the applicability domain of the model. This talk will describe the basics concepts and best practices in QSAR modeling, principles associated with validation of QSAR models, summary of available QSAR tools, limitations and challenges in the acceptance of QSAR models, and the current status and prospects of QSAR modeling methods in the medical devices community.
Structure based drug design- kiranmayiKiranmayiKnv
This presentation helps in detail learning about the structure based drug design. It includes types of structure based drug design and detailed study of docking, de novo drug design.
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO csandit
Each and every biological function in living organism happens as a result of protein-protein interactions. The diseases are no exception to this. Identifying one or more proteins for a
particular disease and then designing a suitable chemical compound (known as drug) to destroy these proteins has been an interesting topic of research in bio-informatics. In previous methods,drugs were designed using only seven chemical components and were represented as a fixedlength
tree. But in reality, a drug contains many chemical groups collectively known as
pharmacophore. Moreover, the chemical length of the drug cannot be determined before
designing the drug.
In the present work, a Particle Swarm Optimization (PSO) based methodology has been
proposed to find out a suitable drug for a particular disease so that the drug-protein interaction
becomes stable. In the proposed algorithm, the drug is represented as a variable length tree and essential functional groups are arranged in different positions of that drug. Finally, the structure of the drug is obtained and its docking energy is minimized simultaneously. Also, the
orientation of chemical groups in the drug is tested so that it can bind to a particular active site of a target protein and the drug fits well inside the active site of target protein. Here, several inter-molecular forces have been considered for accuracy of the docking energy. Results showthat PSO performs better than the earlier methods.
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...Ellen Berg
Predictive Models for Mechanism of Action Classification from Phenotypic Assay Data – Application to Phenotypic Drug Discovery
Presentation at SLAS 2014 conference in San Diego, 21 January 2014
A lecture on molecular docking that I give for master students at University Paris Diderot.
Warning: this presentation has numerous animations which are not included in the slideshare document.
https://florentbarbault.wordpress.com/
Chemical risk assessment is often limited by the lack of experimental toxicity data for a large number of diverse chemicals. In the absence of experimental data, potential chemical hazard is often predicted using data gap filling techniques such as quantitative structure activity relationship (QSAR) models. QSARs are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR tools are a widely utilized alternative to time-consuming clinical and animal testing methods, yet concerns over reliability and uncertainty limit application of QSAR models for regulatory chemical risk assessments. The reliability of a QSAR model depends on the quality and quantity of experimental training data and the applicability domain of the model. This talk will describe the basics concepts and best practices in QSAR modeling, principles associated with validation of QSAR models, summary of available QSAR tools, limitations and challenges in the acceptance of QSAR models, and the current status and prospects of QSAR modeling methods in the medical devices community.
Structure based drug design- kiranmayiKiranmayiKnv
This presentation helps in detail learning about the structure based drug design. It includes types of structure based drug design and detailed study of docking, de novo drug design.
Stable Drug Designing by Minimizing Drug Protein Interaction Energy Using PSO csandit
Each and every biological function in living organism happens as a result of protein-protein interactions. The diseases are no exception to this. Identifying one or more proteins for a
particular disease and then designing a suitable chemical compound (known as drug) to destroy these proteins has been an interesting topic of research in bio-informatics. In previous methods,drugs were designed using only seven chemical components and were represented as a fixedlength
tree. But in reality, a drug contains many chemical groups collectively known as
pharmacophore. Moreover, the chemical length of the drug cannot be determined before
designing the drug.
In the present work, a Particle Swarm Optimization (PSO) based methodology has been
proposed to find out a suitable drug for a particular disease so that the drug-protein interaction
becomes stable. In the proposed algorithm, the drug is represented as a variable length tree and essential functional groups are arranged in different positions of that drug. Finally, the structure of the drug is obtained and its docking energy is minimized simultaneously. Also, the
orientation of chemical groups in the drug is tested so that it can bind to a particular active site of a target protein and the drug fits well inside the active site of target protein. Here, several inter-molecular forces have been considered for accuracy of the docking energy. Results showthat PSO performs better than the earlier methods.
Predictive Models for Mechanism of Action Classification from Phenotypic Assa...Ellen Berg
Predictive Models for Mechanism of Action Classification from Phenotypic Assay Data – Application to Phenotypic Drug Discovery
Presentation at SLAS 2014 conference in San Diego, 21 January 2014
Humans are potentially exposed to tens of thousands of man-made chemicals in the environment. It is well known that some environmental chemicals mimic natural hormones and thus have the potential to be endocrine disruptors. Most of these environmental chemicals have never been tested for their ability to disrupt the endocrine system, in particular, their ability to interact with the estrogen receptor. EPA needs tools to prioritize thousands of chemicals, for instance in the Endocrine Disruptor Screening Program (EDSP). This project was intended to be a demonstration of the use of predictive computational models on HTS data including ToxCast and Tox21 assays to prioritize a large chemical universe of 32464 unique structures for one specific molecular target – the estrogen receptor. CERAPP combined multiple computational models for prediction of estrogen receptor activity, and used the predicted results to build a unique consensus model. Models were developed in collaboration between 17 groups in the U.S. and Europe and applied to predict the common set of chemicals. Structure-based techniques such as docking and several QSAR modeling approaches were employed, mostly using a common training set of 1677 compounds provided by U.S. EPA, to build a total of 42 classification models and 8 regression models for binding, agonist and antagonist activity. All predictions were evaluated on ToxCast data and on an external validation set collected from the literature. In order to overcome the limitations of single models, a consensus was built weighting models based on their prediction accuracy scores (including sensitivity and specificity against training and external sets). Individual model scores ranged from 0.69 to 0.85, showing high prediction reliabilities. The final consensus predicted 4001 chemicals as actives to be considered as high priority for further testing and 6742 as suspicious chemicals. This abstract does not necessarily reflect U.S. EPA policy
This report describes a homology model of CCR3, a chemokine GPCR receptor, based on the relatively new crystallographic CCR5 template.
This model was not published at the time of writing this report. The report also discusses concisely the criteria for selection of a template.
Development of machine learning-based prediction models for chemical modulato...Sunghwan Kim
Presented at the 2018 Research Festival at the National Institutes of Health (NIH) in Bethesda, MD (September 13, 2018).
==== Abstract ====
The retinoid X receptor (RXR) is a nuclear hormone receptor that functions as a transcription factor with roles in development, cell differentiation, metabolism, and cell death. Chemicals that interfere the RXR signaling pathway may cause adverse effects on human health. In this study, public-domain bioactivity data available in PubChem (https://pubchem.ncbi.nlm.nih.gov) were used to develop machine learning-based prediction models for chemical modulators of RXR-alpha, which is a subtype of RXR that plays a role in metabolic signaling pathways, dermal cysts, cardiac development, insulin sensitization, etc. The models were constructed from quantitative high-throughput screening (qHTS) data from the Tox21 project, using popular supervised machine learning methods (including support vector machine, random forest, neural network, k-nearest neighbors, decision tree, and naïve Bayes). The general applicability of the developed models was evaluated with external data sets from ChEMBL and the NCATS Chemical Genomics Center (NCGC). This study showcases how open data in the public domain can be used to develop prediction models for bioactivity of small molecules.
Presented at Artificial Intelligence and Machine Learning for Advanced Drug Discovery & Development 2019 on 28th May 2019 by Dr Ed Griffen of MedChemica Ltd
Lecture given by Ed Griffen UKQSAR meeting Sept 2017. Covers material from work in our paper http://pubs.acs.org/doi/10.1021/acs.jmedchem.7b00935 background discussed in https://www.linkedin.com/pulse/first-draft-medicinal-chemistry-admet-encyclopedia-ed-griffen/
What is greenhouse gasses and how many gasses are there to affect the Earth.moosaasad1975
What are greenhouse gasses how they affect the earth and its environment what is the future of the environment and earth how the weather and the climate effects.
A brief information about the SCOP protein database used in bioinformatics.
The Structural Classification of Proteins (SCOP) database is a comprehensive and authoritative resource for the structural and evolutionary relationships of proteins. It provides a detailed and curated classification of protein structures, grouping them into families, superfamilies, and folds based on their structural and sequence similarities.
This pdf is about the Schizophrenia.
For more details visit on YouTube; @SELF-EXPLANATORY;
https://www.youtube.com/channel/UCAiarMZDNhe1A3Rnpr_WkzA/videos
Thanks...!
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...University of Maribor
Slides from:
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Track: Artificial Intelligence
https://www.etran.rs/2024/en/home-english/
Richard's entangled aventures in wonderlandRichard Gill
Since the loophole-free Bell experiments of 2020 and the Nobel prizes in physics of 2022, critics of Bell's work have retreated to the fortress of super-determinism. Now, super-determinism is a derogatory word - it just means "determinism". Palmer, Hance and Hossenfelder argue that quantum mechanics and determinism are not incompatible, using a sophisticated mathematical construction based on a subtle thinning of allowed states and measurements in quantum mechanics, such that what is left appears to make Bell's argument fail, without altering the empirical predictions of quantum mechanics. I think however that it is a smoke screen, and the slogan "lost in math" comes to my mind. I will discuss some other recent disproofs of Bell's theorem using the language of causality based on causal graphs. Causal thinking is also central to law and justice. I will mention surprising connections to my work on serial killer nurse cases, in particular the Dutch case of Lucia de Berk and the current UK case of Lucy Letby.
Nutraceutical market, scope and growth: Herbal drug technologyLokesh Patil
As consumer awareness of health and wellness rises, the nutraceutical market—which includes goods like functional meals, drinks, and dietary supplements that provide health advantages beyond basic nutrition—is growing significantly. As healthcare expenses rise, the population ages, and people want natural and preventative health solutions more and more, this industry is increasing quickly. Further driving market expansion are product formulation innovations and the use of cutting-edge technology for customized nutrition. With its worldwide reach, the nutraceutical industry is expected to keep growing and provide significant chances for research and investment in a number of categories, including vitamins, minerals, probiotics, and herbal supplements.
Seminar of U.V. Spectroscopy by SAMIR PANDASAMIR PANDA
Spectroscopy is a branch of science dealing the study of interaction of electromagnetic radiation with matter.
Ultraviolet-visible spectroscopy refers to absorption spectroscopy or reflect spectroscopy in the UV-VIS spectral region.
Ultraviolet-visible spectroscopy is an analytical method that can measure the amount of light received by the analyte.
This presentation explores a brief idea about the structural and functional attributes of nucleotides, the structure and function of genetic materials along with the impact of UV rays and pH upon them.
(May 29th, 2024) Advancements in Intravital Microscopy- Insights for Preclini...Scintica Instrumentation
Intravital microscopy (IVM) is a powerful tool utilized to study cellular behavior over time and space in vivo. Much of our understanding of cell biology has been accomplished using various in vitro and ex vivo methods; however, these studies do not necessarily reflect the natural dynamics of biological processes. Unlike traditional cell culture or fixed tissue imaging, IVM allows for the ultra-fast high-resolution imaging of cellular processes over time and space and were studied in its natural environment. Real-time visualization of biological processes in the context of an intact organism helps maintain physiological relevance and provide insights into the progression of disease, response to treatments or developmental processes.
In this webinar we give an overview of advanced applications of the IVM system in preclinical research. IVIM technology is a provider of all-in-one intravital microscopy systems and solutions optimized for in vivo imaging of live animal models at sub-micron resolution. The system’s unique features and user-friendly software enables researchers to probe fast dynamic biological processes such as immune cell tracking, cell-cell interaction as well as vascularization and tumor metastasis with exceptional detail. This webinar will also give an overview of IVM being utilized in drug development, offering a view into the intricate interaction between drugs/nanoparticles and tissues in vivo and allows for the evaluation of therapeutic intervention in a variety of tissues and organs. This interdisciplinary collaboration continues to drive the advancements of novel therapeutic strategies.
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
Slide 1: Title Slide
Extrachromosomal Inheritance
Slide 2: Introduction to Extrachromosomal Inheritance
Definition: Extrachromosomal inheritance refers to the transmission of genetic material that is not found within the nucleus.
Key Components: Involves genes located in mitochondria, chloroplasts, and plasmids.
Slide 3: Mitochondrial Inheritance
Mitochondria: Organelles responsible for energy production.
Mitochondrial DNA (mtDNA): Circular DNA molecule found in mitochondria.
Inheritance Pattern: Maternally inherited, meaning it is passed from mothers to all their offspring.
Diseases: Examples include Leber’s hereditary optic neuropathy (LHON) and mitochondrial myopathy.
Slide 4: Chloroplast Inheritance
Chloroplasts: Organelles responsible for photosynthesis in plants.
Chloroplast DNA (cpDNA): Circular DNA molecule found in chloroplasts.
Inheritance Pattern: Often maternally inherited in most plants, but can vary in some species.
Examples: Variegation in plants, where leaf color patterns are determined by chloroplast DNA.
Slide 5: Plasmid Inheritance
Plasmids: Small, circular DNA molecules found in bacteria and some eukaryotes.
Features: Can carry antibiotic resistance genes and can be transferred between cells through processes like conjugation.
Significance: Important in biotechnology for gene cloning and genetic engineering.
Slide 6: Mechanisms of Extrachromosomal Inheritance
Non-Mendelian Patterns: Do not follow Mendel’s laws of inheritance.
Cytoplasmic Segregation: During cell division, organelles like mitochondria and chloroplasts are randomly distributed to daughter cells.
Heteroplasmy: Presence of more than one type of organellar genome within a cell, leading to variation in expression.
Slide 7: Examples of Extrachromosomal Inheritance
Four O’clock Plant (Mirabilis jalapa): Shows variegated leaves due to different cpDNA in leaf cells.
Petite Mutants in Yeast: Result from mutations in mitochondrial DNA affecting respiration.
Slide 8: Importance of Extrachromosomal Inheritance
Evolution: Provides insight into the evolution of eukaryotic cells.
Medicine: Understanding mitochondrial inheritance helps in diagnosing and treating mitochondrial diseases.
Agriculture: Chloroplast inheritance can be used in plant breeding and genetic modification.
Slide 9: Recent Research and Advances
Gene Editing: Techniques like CRISPR-Cas9 are being used to edit mitochondrial and chloroplast DNA.
Therapies: Development of mitochondrial replacement therapy (MRT) for preventing mitochondrial diseases.
Slide 10: Conclusion
Summary: Extrachromosomal inheritance involves the transmission of genetic material outside the nucleus and plays a crucial role in genetics, medicine, and biotechnology.
Future Directions: Continued research and technological advancements hold promise for new treatments and applications.
Slide 11: Questions and Discussion
Invite Audience: Open the floor for any questions or further discussion on the topic.
1. Seed Suggestions % in
SureChEMBL
222 43
234 21
protease, 6536
phosphatase,
260
kinase, 12686
ion_channel,
4370
GPCR_7TM,
19523
Δ data A to B
MedChemica
Potency and Patents, new arenas for Matched Molecular
Pair analysis (MMPA)
Dr. Al G. Dossetter, Dr. Ed J. Griffen, Dr. Andrew G. Leach, Dr. Shane Montague
References
1Griffen, E. et al. Matched Molecular Pairs as a Medicinal Chemistry Tool. J. Med. Chem. 2011, 54(22), pp.7739-7750.
2Leach, A.G. et. al. Matched Molecular Pairs as a Guide in the Optimization of Pharmaceutical Properties; a Study of Aqueous Solubility, Plasma Protein Binding and Oral Exposure. J. Med. Chem. 2006, 49(23), pp.6672-6682.
3Papadatos, G. et al. Lead Optimization Using Matched Molecular Pairs: Inclusion of Contextual Information for Enhanced of hERG Inhibition, Solubility, and Lipophilicity. J. Chem. Inf. Model. 2010, 50(10), pp.1872-1886.
Problem
Can we understand the relationship between
patents, identify critical compounds and
automatically extract SAR?
Solution
Combine all the compounds and perform MMPA to find all the pair relationships
independent of patent membership. Use graph theory to identify critical compounds
and exploit public data to suggest further analogues and estimate their potency.
MMPA - a method of determining structure activity relationships (SAR’s) within sets of compounds. Matched Molecular Pairs
(MMP’s) are identified and differences in their measured data are used to link properties to structure.1
contact@medchemica.com
Selecting rules
Statistical analysis of data sets of SMIRKS to extract chemical
transformations that are most likely to be genuine.
3)
4) Extract rules from public potency data
Learning
• Useful potency SAR knowledge can be extracted from public data
• MMP network analysis of patents identifies pivotal compounds
• The method is validated by finding that large numbers of compounds suggested using these rules are now patented
• Extending MMP based network analysis by application of machine learning methods and exploiting MCSS structures within clusters to improve predictive accuracy
Advanced MMP’s
• Two pair finding techniques are available
• Not all pairs are found by a single method, both methods are
needed to maximize the MMP output
Molecules that differ only by a particular, well-
defined, structural transformation2
A MMP found by both methods:
1)
Fragment and Index method
Maximum Common Sub-Structure method (MCSS)
Environment Capture
• Chemical transformations are encoded as SMIRKS and recorded
along with their delta property value(s)
• The SMIRKS contain the structural change along with the chemical
environment spanning up to 4 atoms out
Essential for understanding the context of the transformation3
[c:6]1[c:4]([H])[c:2]([H])[c:1]([c:3]([H])[c:5]1[c:
7])([H])>>[c:6]1[c:4]([H])[c:2]([H])[c:1]([c:3]([H]
)[c:5]1[c:7])[F]
2)
[c:4][c:2]([H])[c:1]([c:3]([H])[c:5])([H])
>>[c:4][c:2]([H])[c:1]([c:3]([H])[c:5])[F]
[c:2][c:1]([c:3])([H])>>[c:2][c:1]([c:3])[F] [c:1]([H])>>[c:1][F]
The MMP as a transformation:
4 atom environment: 3 atom environment:
2 atom environment: 1 atom environment:
Δ data A to BΔ data A to B
Δ data A to B
FragA >> FragB
Kinase class number of rules
kinase_agc 1576
kinase_atypical 788
kinase_camk 2376
kinase_ck1 32
kinase_cmgc 1010
kinase_reg 256
kinase_ste 110
kinase_tk 4696
kinase_tkl 1842
• Clean:
• ChEMBL structures,
• convert measurements to pIC50 / pKi,
• aggregate multiple measurements on same compound by
target
• Find MMPA based rules per target
• Organize targets by protein class and sub-class
• Rules can by applied by target, sub-class or class
• The distribution of rules mirrors the distribution of data
5) Identify pivotal compound in patents
• Clean SureChEMBL structures with patent identifiers
• Generate a network map showing MMP relationships
between patents
• Network analysis identifies the key compounds within
patents
• Points are compounds colored by the patent they
were first disclosed in (green / blue), or the clinically
used compounds(red) or yellow – most highly
connected compound in each patent
• Links represent a matched molecular pair
relationships
• Distances are based on a spring force model and
are for visualization only
O
ON
O
N
N
HN Cl
F
O
O
O
O
N
N
HN
N O
O N
N
HN
Cl
F
O
O N
N
HN
2 steps to Gefitinib
3 steps to Erlotinib
Gefitinib
Erlotinib
Focus the rules used to
generate new
compounds by applying
those from the right
kinase sub class
Apply rules
to pivotal
compounds
O
O N
N
HN
N O
O N
N
HN
Cl
F
6) Estimate potency from network models
• Extending the network analysis to all the public EGF
potency data:
• MMP based clusters can be identified and
characterized by their potency
• Being a MMP neighbor in a cluster is sufficient to
estimate a compounds potency to within 1 log.
• The MMP methods used generate sets of maximum
common substructures for each cluster enabling
further direction of chemistry
• Points represent individual compounds
• Links represent a matched molecular pair relationship
pIC5
0
>8
6-8
<6
EGFR tyrosine kinase network based
potency analysis
Size of cluster Clusters Compounds
<8 compounds 133 415
>=8 compounds 59 3213
Total 192 3628
Simple regression modeling of potency based on just
cluster membership(10 fold cross validation): R2 0.44,
RMSE 0.97
Further modeling based on the maximum common
substructures within clusters in progress.