This document provides an overview of quantitative structure-activity relationship (QSAR) modeling techniques. It discusses:
1) The history and background of QSAR, dating back to the 19th century, and key contributors like Hammett who developed linear free energy relationships.
2) Common QSAR methodologies like multiple linear regression, principal component analysis, partial least squares, artificial neural networks, and genetic algorithm-based approaches.
3) Steps for validating QSAR models, including correlation coefficients, cross-validation, and assessing the applicability domain for making predictions.
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)AkshayYadav176
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)
Concept of pharmacophore, Pharmacophore mapping, Identification of pharmacophore features and pharmacophore modeling, Conformation search used in pharmacophore mapping, Virtual screening.
What is QSAR?, introduction to 3D QSAR, CoMFA, CoMSIA, Case Study on CoMFA contour maps analysis and CoMSIA interactive forces between ligand and receptor, various Statistical techniques involved in QSAR
PHARMACOHORE MAPPING AND VIRTUAL SCRRENING FOR RESEARCH DEPARTMENTShikha Popali
THE PHARMACOPHORE MAPPING AND VIRTUAL SCRRENING , THESE PRESENTATION INCLUDES THE DEATIL ACCOUNT ON PHARMACOPHORE, MAPPING, ITS IDENTIFIATION FEATURES, ITS CONFORMATIONAL SEARCH, INSILICO DRUG DESIGN, VIRTUAL SCREENING, PHARMACOPHORE BASED SCREENING
In this slide I covered the detailed about hansch analysis, Free-Wilson analysis, and Mixed approach. I also gave a detailed application for each points.
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)AkshayYadav176
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)
Concept of pharmacophore, Pharmacophore mapping, Identification of pharmacophore features and pharmacophore modeling, Conformation search used in pharmacophore mapping, Virtual screening.
What is QSAR?, introduction to 3D QSAR, CoMFA, CoMSIA, Case Study on CoMFA contour maps analysis and CoMSIA interactive forces between ligand and receptor, various Statistical techniques involved in QSAR
PHARMACOHORE MAPPING AND VIRTUAL SCRRENING FOR RESEARCH DEPARTMENTShikha Popali
THE PHARMACOPHORE MAPPING AND VIRTUAL SCRRENING , THESE PRESENTATION INCLUDES THE DEATIL ACCOUNT ON PHARMACOPHORE, MAPPING, ITS IDENTIFIATION FEATURES, ITS CONFORMATIONAL SEARCH, INSILICO DRUG DESIGN, VIRTUAL SCREENING, PHARMACOPHORE BASED SCREENING
In this slide I covered the detailed about hansch analysis, Free-Wilson analysis, and Mixed approach. I also gave a detailed application for each points.
A quantitative structure-activity relationship
(QSAR) correlates measurable or calculable
physical or molecular properties to some
specific biological activity in terms of an
equation.
Analog design is usually defined as the modification of a drug molecule or of any bioactive compound in order to prepare a new molecule showing chemical and biological similarity with the original model compound
SAR versus QSAR, History and development of QSAR, Types of physicochemical
parameters, experimental and theoretical approaches for the determination of
physicochemical parameters such as Partition coefficient, Hammet’s substituent
constant and Taft’s steric constant. Hansch analysis, Free Wilson analysis, 3D-QSAR
approaches like COMFA and COMSIA.
CHEMISTRY OF PEPTIDES [M.PHARM, M.SC, BSC, B.PHARM]Shikha Popali
THE CHEMISTRY OF PEPTIDES THE DIFFICULT TO COLLECT DATA FOR READERS , THREFORE HERE WE HAVE COLLECTED ALL THE DATA AT A PLACE AND PROVIDED EASIER TO CHEMISTRIANS.
Chemical risk assessment is often limited by the lack of experimental toxicity data for a large number of diverse chemicals. In the absence of experimental data, potential chemical hazard is often predicted using data gap filling techniques such as quantitative structure activity relationship (QSAR) models. QSARs are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR tools are a widely utilized alternative to time-consuming clinical and animal testing methods, yet concerns over reliability and uncertainty limit application of QSAR models for regulatory chemical risk assessments. The reliability of a QSAR model depends on the quality and quantity of experimental training data and the applicability domain of the model. This talk will describe the basics concepts and best practices in QSAR modeling, principles associated with validation of QSAR models, summary of available QSAR tools, limitations and challenges in the acceptance of QSAR models, and the current status and prospects of QSAR modeling methods in the medical devices community.
A quantitative structure-activity relationship
(QSAR) correlates measurable or calculable
physical or molecular properties to some
specific biological activity in terms of an
equation.
Analog design is usually defined as the modification of a drug molecule or of any bioactive compound in order to prepare a new molecule showing chemical and biological similarity with the original model compound
SAR versus QSAR, History and development of QSAR, Types of physicochemical
parameters, experimental and theoretical approaches for the determination of
physicochemical parameters such as Partition coefficient, Hammet’s substituent
constant and Taft’s steric constant. Hansch analysis, Free Wilson analysis, 3D-QSAR
approaches like COMFA and COMSIA.
CHEMISTRY OF PEPTIDES [M.PHARM, M.SC, BSC, B.PHARM]Shikha Popali
THE CHEMISTRY OF PEPTIDES THE DIFFICULT TO COLLECT DATA FOR READERS , THREFORE HERE WE HAVE COLLECTED ALL THE DATA AT A PLACE AND PROVIDED EASIER TO CHEMISTRIANS.
Chemical risk assessment is often limited by the lack of experimental toxicity data for a large number of diverse chemicals. In the absence of experimental data, potential chemical hazard is often predicted using data gap filling techniques such as quantitative structure activity relationship (QSAR) models. QSARs are theoretical models that relate a quantitative measure of chemical structure to a physical property or a biological effect. QSAR tools are a widely utilized alternative to time-consuming clinical and animal testing methods, yet concerns over reliability and uncertainty limit application of QSAR models for regulatory chemical risk assessments. The reliability of a QSAR model depends on the quality and quantity of experimental training data and the applicability domain of the model. This talk will describe the basics concepts and best practices in QSAR modeling, principles associated with validation of QSAR models, summary of available QSAR tools, limitations and challenges in the acceptance of QSAR models, and the current status and prospects of QSAR modeling methods in the medical devices community.
A QSAR is a mathematical relationship between a biological activity of a molecular system and its geometric and chemical characteristics.
QSAR attempts to find consistent relationship between biological activity and molecular properties, so that these “rules” can be used to evaluate the activity of new compounds.
Quantitative structure - activity relationship (QSAR)
Why QSAR?
costs – 800M$ to bring a new drug to market
Patent life time is limited (generic drugs)
Synthesis / Purification of compounds is expensive and time consume-able
It is like find a needle in the haystack
QSAR helps for focusing most promising drug candidates
QSAR is a mathematical relationship between a “biological activity of a molecular system” and its “geometric and chemical characteristics”.
Such relationships holds – Equations can be drawn up- some confidence
to which should be Fit to the target
QSAR what actually do?
IDENTIFY AND QUANTIFY the Physico-chemical properties effect on Drug’s Biological activity
Aims
To relate the biological activity of a series of compounds to their physicochemical parameters in a quantitative fashion using a mathematical formula
Requirements
Quantitative measurements for biological and physicochemical properties
Physicochemical Properties
Hydrophobicity of the molecule
Hydrophobicity of substituents
Electronic properties of substituents
Steric properties of substituents
QSAR equations are only applicable to compounds in the same structural class (e.g. ethers)
However, log Po is similar for anaesthetics of different structural classes (ca. 2.3)
Structures with log P ca. 2.3 enter the CNS easily
(e.g. potent barbiturates have a log P of approximately 2.0)
Can alter log P value of drugs away from 2.0 to avoid CNS side effects
Physical properties are measured for the molecule as a whole
Properties are calculated using computer software
No experimental constants or measurements are involved
Properties are known as ‘Fields’
Steric field - defines the size and shape of the molecule
Electrostatic field - defines electron rich/poor regions of molecule
Hydrophobic properties are relatively unimportant
No reliance on experimental values
Can be applied to molecules with unusual substituents
Not restricted to molecules of the same structural class
Predictive capability
Comparative molecular field analysis (CoMFA) - Tripos
Build each molecule using modelling software
Identify the active conformation for each molecule
Identify the pharmacophore
THANKING YOU
This is a PRESENTATION just to help students to easily understand one of the method of drug designing i.e. QSAR.. this is a combination of many slides and books..this is not my personal.
Insilico methods for design of novel inhibitors of Human leukocyte elastaseJayashankar Lakshmanan
Oral contributed paper “Insilico methods for design of novel inhibitors of Human leukocyte elastase” in the International conference on Systemics, Cybernetics and Informatics-2006
The Brønsted catalysis relationship is a Linear Free Energy Relationship (LFER) that relates ionization of an acid or base which catalyzes a reaction and the rate of the reaction.
Contributed by: Quincy Davis, Jonathan Greenhalgh, Joshua Visser (Undergraduates), University of Utah, 2016
What is greenhouse gasses and how many gasses are there to affect the Earth.moosaasad1975
What are greenhouse gasses how they affect the earth and its environment what is the future of the environment and earth how the weather and the climate effects.
Professional air quality monitoring systems provide immediate, on-site data for analysis, compliance, and decision-making.
Monitor common gases, weather parameters, particulates.
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...University of Maribor
Slides from talk:
Aleš Zamuda: Remote Sensing and Computational, Evolutionary, Supercomputing, and Intelligent Systems.
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Inter-Society Networking Panel GRSS/MTT-S/CIS Panel Session: Promoting Connection and Cooperation
https://www.etran.rs/2024/en/home-english/
This presentation explores a brief idea about the structural and functional attributes of nucleotides, the structure and function of genetic materials along with the impact of UV rays and pH upon them.
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...University of Maribor
Slides from:
11th International Conference on Electrical, Electronics and Computer Engineering (IcETRAN), Niš, 3-6 June 2024
Track: Artificial Intelligence
https://www.etran.rs/2024/en/home-english/
Phenomics assisted breeding in crop improvementIshaGoswami9
As the population is increasing and will reach about 9 billion upto 2050. Also due to climate change, it is difficult to meet the food requirement of such a large population. Facing the challenges presented by resource shortages, climate
change, and increasing global population, crop yield and quality need to be improved in a sustainable way over the coming decades. Genetic improvement by breeding is the best way to increase crop productivity. With the rapid progression of functional
genomics, an increasing number of crop genomes have been sequenced and dozens of genes influencing key agronomic traits have been identified. However, current genome sequence information has not been adequately exploited for understanding
the complex characteristics of multiple gene, owing to a lack of crop phenotypic data. Efficient, automatic, and accurate technologies and platforms that can capture phenotypic data that can
be linked to genomics information for crop improvement at all growth stages have become as important as genotyping. Thus,
high-throughput phenotyping has become the major bottleneck restricting crop breeding. Plant phenomics has been defined as the high-throughput, accurate acquisition and analysis of multi-dimensional phenotypes
during crop growing stages at the organism level, including the cell, tissue, organ, individual plant, plot, and field levels. With the rapid development of novel sensors, imaging technology,
and analysis methods, numerous infrastructure platforms have been developed for phenotyping.
Toxic effects of heavy metals : Lead and Arsenicsanjana502982
Heavy metals are naturally occuring metallic chemical elements that have relatively high density, and are toxic at even low concentrations. All toxic metals are termed as heavy metals irrespective of their atomic mass and density, eg. arsenic, lead, mercury, cadmium, thallium, chromium, etc.
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Ana Luísa Pinho
Functional Magnetic Resonance Imaging (fMRI) provides means to characterize brain activations in response to behavior. However, cognitive neuroscience has been limited to group-level effects referring to the performance of specific tasks. To obtain the functional profile of elementary cognitive mechanisms, the combination of brain responses to many tasks is required. Yet, to date, both structural atlases and parcellation-based activations do not fully account for cognitive function and still present several limitations. Further, they do not adapt overall to individual characteristics. In this talk, I will give an account of deep-behavioral phenotyping strategies, namely data-driven methods in large task-fMRI datasets, to optimize functional brain-data collection and improve inference of effects-of-interest related to mental processes. Key to this approach is the employment of fast multi-functional paradigms rich on features that can be well parametrized and, consequently, facilitate the creation of psycho-physiological constructs to be modelled with imaging data. Particular emphasis will be given to music stimuli when studying high-order cognitive mechanisms, due to their ecological nature and quality to enable complex behavior compounded by discrete entities. I will also discuss how deep-behavioral phenotyping and individualized models applied to neuroimaging data can better account for the subject-specific organization of domain-general cognitive systems in the human brain. Finally, the accumulation of functional brain signatures brings the possibility to clarify relationships among tasks and create a univocal link between brain systems and mental functions through: (1) the development of ontologies proposing an organization of cognitive processes; and (2) brain-network taxonomies describing functional specialization. To this end, tools to improve commensurability in cognitive science are necessary, such as public repositories, ontology-based platforms and automated meta-analysis tools. I will thus discuss some brain-atlasing resources currently under development, and their applicability in cognitive as well as clinical neuroscience.
2. • Developing New QSAR methodologies
•CoRIA and its Variants
•HomoSAR
•LISA
•eCoRIA and eQSAR
•CoOAN
•Solving Protein Structures (using NMR)
•Computational Prediction of Resistance and QMAR
•Lead optimization strategies for Anti-TB, Dengue, AD etc
• Studies on reaction pathways and transition states using ab initio and
Quantum Mechanics.
• Molecular dynamics of Drug-Cyclodextrin complexes
Research in Prof. Coutinho’s Lab
3. Molecular Modeling in Drug Design
Receptor
Unknown-
Ligand
Unknown
Receptor
Known- Ligand
Unknown
Receptor
Known- Ligand
Known
Receptor
Unknown –
Ligand Known
4. What is QSAR?
Compounds + biological
activity
New compounds with
improved biological
activity
QSAR
5. The number of compounds required for synthesis in
order to place 10 different groups in 4 positions of
benzene ring is 104
Solution: synthesize a small number of compounds
and from their data derive rules to predict the
biological activity of other compounds.
Why QSAR?
6. QSAR date back to the 19th century
A.F.A. Cros (University of Strasbourg; 1863)
Increased toxicity of alcohols with decrease in water solubility
H. H. Meyer (University of Marburg; 1890’s) and Charles Ernest
Overton (University of Zurich; 1890’s) [working independently]
Toxicity of organic compounds depended on their lipophilicity
Crum-Brown and Fraser
the physiological action of a substance was a function of its chemical
composition and constitution
Richet
inverse relationship between the cytotoxicities of a diverse set of simple
organic molecules with water solubilities
7. Hammett,
"sigma-rho” culture; to understand the effect of substituents on
organic reactions
Taft
devised a way to separate polar, steric, and resonance effects and
introduced the first steric parameter, Es
Hansch and Fujita
The contributions of Hammett and Taft together laid the
mechanistic basis for the development of the QSAR paradigm
8. Hammett Equation
Linear Free Energy Relationships
Louis Hammett (1894-1987), correlated electronic properties of organic
acids and bases with their equilibrium constants and reactivity
Measures the electron withdrawing or electron donating effects
in comparison to benzoic acid & how affected its ionization)
Consider the dissociation of benzoic acid:
9. Hammett Equation
› m-NO2 increases dissociation constant (nitro
group is EWG stabilizing the negative charge)
› p-NO2 exhibits greater electron withdrawing effect
› p-C2H5 group on benzoic acid
10. Hammett observed similar substituent effects on the organic
acids and bases dissociation like phenyl acetic acid.
Hammett Equation
11. A linear free-energy relationship is said to exist if ‘the same series of changes
in conditions affects the rate or equilibrium of a second reaction in exactly the
same way as the first’
The free energy is proportional to the logarithm of the equilibrium
constant
Graph for a linear free energy
relationship
12. › The following equation was derived as the relationship is linear;
where r is the slope of the line and the abscissa values are always
those for benzoic acid and are given the symbol, s (substituent
constant); equation simplified as:
› r (reaction constant) relates the effect of substituents on that
equilibrium to the effect of those substituents on the benzoic acid
equilibrium
› The reaction constant depends on the nature of the chemical
reaction as well as the reaction conditions (solvent, temperature,
etc.)
› The sign and magnitude of the reaction constant are indicative
of the extent of charge build up during the reaction progress
13. › Reactions with ρ > 0 are favored by electron withdrawing
groups (i.e., the stabilization of negative charge)
› Reactions with ρ < 0 are favored by electron donating groups
(i.e., the stabilization of positive charge)
› For benzoic acid r is equal to 1.00 in pure water at 25oC
› s is a descriptor of the substituents;
› The magnitude of s gives the relative strength of the electron-
withdrawing or -donating properties of the substituents
› s is positive if the substituent is electron-withdrawing and;
› s is negative if substituent is electron-donating
› The relationships as developed by Hammett are termed linear
free energy relationships
14. › By definition, s for hydrogen is ZERO
› Positive s for the NO2 group indicate electron-withdrawing effect
m-NO2 (inductive effect); while p-NO2 (inductive + resonance
effect)
› Electronegative chlorine produce an inductive electron-withdrawing
effect
The magnitude of the effect in the p-Cl position being less than in
the m-Cl, and only the inductive effect is possible with chlorine
› CH3O- group can be electron-donating or -withdrawing, depending
on the position of substitution
m-CH3O an inductive electron-withdrawing effect is seen
p-CH3O only a small inductive effect is expected; an electron-
donating resonance effect occurs for p-CH3O, giving an overall
electron-donating effect
Hammett Constant
15. Applications of the Hammett Equation
› The prediction of the pKa of ionization equilibria
› Therefore,
› For benzoic acid the equation is
› Consider for substituted benzoic acid
› Given smeta=0.71 for NO2 and spara=-0.13 for CH3 groups,
calculated pKa=2.91, compared to the experimental value of
2.97
16. Applications of the Hammett Equation
› The applicability of Hammett's electronic descriptors in a QSAR
relating the inhibition of bacterial growth by a series of
sulfonamides
› where X represents various substituents
› A QSAR was developed based on the s values of the substituents
› where C is the minimum concentration of compound that inhibited
growth of E. coli
› The electron-withdrawing substituents favor inhibition of growth
17. Log P is a measure of the drug’s hydrophobicity, which was
selected as a measure of its ability to pass through cell
membranes.
The log P (or log Po/w) value reflects the relative solubility of the
drug in octanol (representing the lipid bilayer of a cell
membrane) and water (the fluid within the cell and in blood).
Log P values may be measured experimentally or, more
commonly, calculated.
Hansch’s Approach
19. › The Hammett substituent constant (s) reflects the drug
molecule’s intrinsic reactivity, related to electronic
factors caused by aryl substituents.
› In chemical reactions, aromatic ring substituents can
alter the rate of reaction by up to 6 orders of magnitude!
› For example, the rate of the reaction below is ~105 times
slower when X = NO2 than when X = CH3
CH3OH
C Cl
H
X
C OCH3 + HCl
H
X
20. › Log 1/C = S ai + m
where C=predicted activity,
ai= contribution per group, and
m=activity of reference
Free-Wilson Analysis
Log 1/C = -0.30 [m-F] + 0.21 [m-Cl] + 0.43 [m-Br]
+ 0.58 [m-I] + 0.45 [m-Me] + 0.34 [p-F] + 0.77 [p-Cl]
+ 1.02 [p-Br] + 1.43 [p-I] + 1.26 [p-Me] + 7.82
N
Br
X
Y HCl
21. 8. Topliss Scheme
Used to decide which substituents to use if optimising compounds
one by one (where synthesis is complex and slow)
Example: Aromatic substituents
L E M
ML EL E M
L E M
L E M
See Central
Branch
L E M
H
4-Cl
4-CH34-OMe 3,4-Cl2
4-But
3-CF3-4-Cl
3-Cl 3-Cl 4-CF3
2,4-Cl2
4-NO2
3-NMe2
3-CF3-4-NO2
3-CH3
2-Cl
4-NO2
3-CF3
3,5-Cl2
3-NO2
4-F
4-NMe2
3-Me-4-NMe2
4-NH2
22. Rationale
Replace H with
para-Cl (+p and +s)
+p and/or +s
advantageous
favourable p
unfavourable s
+p and/or +s
disadvantageous
Act. Little
change
Act.
add second Cl to
increase p and s
further
replace with OMe
(-p and -s)
replace with Me
(+p and -s)
Further changes suggested based on arguments of p, s and
steric strain
8. Topliss Scheme
24. Contents
I. Basics of regression analysis - linear and multiple
linear regression,
II. Introduction to PCA & PCR, PLS, ANN and GFA.
III. Validation of QSAR models
A. Correlation coefficients (r2 and r2 pred), F-test,
standard error,
B. cross-validation by calculation of q2, boot-strap
analysis and randomization.
IV.Applicability domain for predictions using a QSAR model.
V. Design of training and test sets using factorial design
25. Linear and multiple linear Regression
(Image Coutesy: CAMO Software AS)
Linear Data
Non-Linear Data
27. b0
b1
y=b0+b1x+e
x
y
Least squares (LS) used
for estimation of regression coefficients
Simple linear regression
])(][)([
))((
22
yyxx
yyxx
b
Error
29. Linear and Multiple linear Regression
• When to use
• When no. of observations more than no. of variables
• Not used in current QSAR formalisms
• Limitations
• Inaccurate when inter-correlated variable are present
• Cannot be applied when no. of variables are more than
observations
42. Selection of training and test set using factorial
designs
1. In factorial designs the investigated factors are varied
at fixed levels.
2. Each factor (chemical feature or descriptors) is
investigated at levels based on type of factorial
experiment.
3. Full factorial design for K chemical
features/descriptors at two levels gives nK compounds.
44. Applicability Domain in QSAR
• OECD Definition: Applicability domain (AD) of a QSAR model is
the physico-chemical, structural or biological space, knowledge or
information on which the training set of the model has been
developed, and for which it is applicable to make predictions for new
compounds.
• A new European legislation on chemicals – REACH (Registration,
Evaluation, Authorization and restriction of Chemicals) came into
force in 2007.
• Purpose
• Reliably application of (Q)SAR
• Intrapolation is better Extrapolation
45. What are the key aspects in defining the AD of
QSAR models ?
• Identification of the subspace of chemical structures.
• Defined AD determines the degree of generalization of a given
predictive model.
• A well defined AD indicates if the endpoint for the chemical
structures under evaluation can be reliably predicted.
• Characterization of the interpolation space is very significant
to define the AD for a given QSAR model
46. How can the AD of a model be defined ?
• Range Based methods
• Bounding Box or convex hull
• PCA Bounding Box
• Distance based methods
• Geometric Methods
• Probability Density Distribution based methods
Empty Region
Dense region
Bounding Box or convex hull
48. Is it correct to say :
• “prediction result is always
reliable for a point within
the application region” ?
• “prediction is always
unreliable if the point is
outside the application
region” ?
Concluding remark