SlideShare a Scribd company logo
1 of 18
Statistical Method Used in QSAR
SUBMITTED BY:
UPASANA SHARMA
(M.PHARMA :Pharmaceutical Chemistry)
SUBMITTED TO:
Dr. SALAHUDDIN
(PROFESSOR)
NOIDA INSTITUTE OF ENGINEERING AND TECHNOLOGY (PHARMACY INSTITUTE)
GREATER NOIDA 1
UPASANA SHARMA (16/05/2023)
Collection of ligand
Generation of descriptors
Features selection
Construction of model
Validation of model
QSAR Work Flow:
2
UPASANA SHARMA (16/05/2023)
Build models and calculate minimum energy
conformation
Calculate descriptors
Short list descriptors
 Based on correlation coefficient
 Based on cross correlation coefficient
 Based on dissimilarity distance
 Based on cluster analysis
 Based on genetic function approach
Develop regression relationship and estimate
statistics (R2,R2adj,R2pre,F test, Residuals)
Test model with external data set
QSAR Model
3
UPASANA SHARMA (16/05/2023)
Model construction +Features selection= Statistical analysis
( for large no. of descriptors or few no. of descriptors)
Method
Regression Based approach
Classification Based approach
Machine Learning Technology
Simple Linear Regression method
Multiple linear regression method
Partial least square method
Cluster analysis
Principal component analysis
Logistic regression
Artificial neural network
Support vector machine
Gene expressing programming
Linear properties Linear regression
Partial regression
Semi supervised learning
algorithm
Unsupervised learning
algorithm
Non-linear properties
Supervised learning algorithm
Model
Artificial neural network
Model can be construct using
4
UPASANA SHARMA (16/05/2023)
Validation:
Regression based QSAR
model
Validation metrics for internal validation
Least square fitting
Chi-Squared x2 and
root mean squared
error (RMSE)
Cross
validation
Leave one out cross
validation LOO
Leave some out cross
validation LSO
True Q2 and rm2
metrics
Validation metrics for external
validation
Predictive R2 (Q2F1)
Q2 F2 and Q3 F3
Golbraikh and tropsha's
criteria
Metrics include
(RMSEP) root mean
square for prediction
Validation metrics for
classification based
method
Wilks lamda statistics
Lower value
Canonical index (Rc)
Chi-square x2
Squared mahalanobis
distance 5
UPASANA SHARMA (16/05/2023)
 Simple Linear regression method :
1 descriptor
Y= b+b1x1+e
 Multiple Linear Regression method:
Y=b+b1x1+b2x2+bnxn+e
 Non-linear regression method :
1parameter is not linear
Y=n(x ,B)+e (B= unknown parameter)
Observational data are molded by a function which is non-linear combination of the model
parameter and depend on one or more independent variables.
Regression based method :
6
UPASANA SHARMA (16/05/2023)
UPASANA SHARMA (16/05/2023) 7
 Partial least square method:
 The principal component regression.
 predict or analyze a set of dependent variables from a set of independent variables by
multivariant statistical method done from multiple regression analysis.
 Applied in 3D-QSAR technique, Comparative Molecular Field Analysis (CoMFA)
 It used combination with GPLS genetic, FAPLS (factor analysis), OSCPLS (orthogonal
signal correction)
 COMPACT( computer optimized molecular parametric analysis of chemical toxicity)
predict carcinogenicity and other forms of toxicity.
 Software:
SIMCA-P
UNSCRAMBLER
SPM
Classification based approach :
 Cluster analysis:
 Clustering involves placing similar data into a group
in a way that maximizes similarity within groups and
dissimilarity between groups.
 Multivariate technique which analysis the
group based on distance (proximity).
 In hierarchical clustering, in a agglomerative and
divisive form
 The k-means clustering is a partition
based clustering
 Data reduction and hypotheses generation use in
data mining ,statistical data analysis
,pattern recognition.
8
UPASANA SHARMA (16/05/2023)
UPASANA SHARMA (16/05/2023) 9
 PRINCIPAL COMPONENT ANALYSIS:
 It transforms a number of possibly correlated variables
into a smaller number of uncorrelated variables called
principal components.
 PCA reduces attribute space from a larger number of
variables to a small number of factors (non-dependent
variable).
 PCA is a dimensionality reduction or data
compression method and there is no guarantee that the
dimensions are interpretable.
 Objective : To select a subset of variables from a larger
set, based on which original variables have the highest
correlations with the principal component.
Machine learning techniques :
 Artificial neural network
 Mimics the behavior of biological neurons.
 It has input layer
hidden layer
output layer
 Types:
Propagation neural networks, probabilistic
neural networks, Kohonen self-organizing
maps and Bayesian regularized neural
networks.
 Support vector machine
 Uses a linear classifier to classify data
into two categories.
 It used in combination with other
methods like MLR, PLS and so forth for
building more powerful and accurate
QSAR models
 Gene expressing programming
 The genetic algorithm and genetic
programming.
 It used for calculate the dermal
penetration, EC50, Binding Affinity
,Improved gene expressing
programming
10
UPASANA SHARMA (16/05/2023)
Validation :
 It avoid chance correlation of numerous
descriptors used in the model and also over-
fitting of data. It assign the accuracy and
prediction of the model. (training set and test
set).
 The Organization for Economic Cooperation
and Development (OECD) give 5 principles to
test the model.
1) a defined endpoint
2) an unambiguous algorithm
3) a defined domain of applicability
4) appropriate measures of goodness-of-fit,
robustness and prediction accuracy
5) a mechanistic interpretation
11
UPASANA SHARMA (16/05/2023)
Regression based QSAR model :
Validation metrics for internal validation
The use of molecules from training set to test the predictability of the model
 Least square fitting
It is the measure of square correlation coefficient (R2) between the predicted and experimental
value of activity.
The difference between R2 and R2 adj < 0.3 , QSAR validation is good.
 The χ2 Chi-squared and RMSE:
The values are used to assess the predictive quality of a model.
χ2 value shows the difference between experimentally determined bioactivity values and the
values predicted by the model
RMSE value of for large R2 value (that is >=7), values of χ2 and RMSE should < than 0.5 and
0.3 respectively.
12
UPASANA SHARMA (16/05/2023)
Cross validation :
It is internal validation include Leave-Group-Out (LGO), which involves leaving of a molecule
or a group of molecules while creating model and evaluating the predictability of the model
using the molecules left.
 In LOO cross validation (leave-one-out)
 one compound is left out and the QSAR model is constructed using remaining
compounds.
 The eliminated compound is used as a test for the predicted model
 The predictability of the model is assessed by PRESS (Predicted Residual Sum of
Squares) and cross-validated R2 (Q2 ) when SDEP (Standard Deviation of Error of
Prediction) is obtained from PRESS
13
UPASANA SHARMA (16/05/2023)
 True Q2 value:
 True Q2 is used for small data sets
 Q2 should not be treated as an ultimate proof for good predictability of models.
 Value of Q2 = < 0.5
 LSO (Leave-Some-Out) or LMO (Leave-Many-Out)
 It is set of data compounds are eliminated and models are created with rest of the
compounds.
 The left out compounds are then used to check the predictability of the model.
14
UPASANA SHARMA (16/05/2023)
Low RMSE value and High R2 value
Validation metrics for external validation
 Predictive R2 or Q2 (F1) = correlation of observed and predicted data. Model is good
if predictive power has value of Q2 (F1) =< 0.5
 Q2 (F2) and Q2 (F3)= using the mean of test data set and training data . For validation of
QSAR model, threshold value of 0.5 is defined for both metrics
 Golbraikh and Tropsha’s criteria= forth condition for selection of training and test data
sets.
 For having a good predictive power, QSAR model should satisfy following condition
i. Q 2 training > 0.5
ii. R 2 test > 0.6
iii. (r2 - r 2 0)/ r 2 < 0.1 or (r2 – r’2 0)/ r 2 < 0.1, where r2 0 is R2 of predicted vs. observed
activities and r’2 0 is R2 of observed vs. predicted activities.
iv. 0.85 <= k <= 1.15 or 0.85 <= k’<= 1.15, where k and k’ are the slopes of regression lines
 Other metrics includes RMSEP (Root Mean Square Error of Prediction) to calculate
prediction error of QSAR model
15
UPASANA SHARMA (16/05/2023)
Validation metrics for classification based methods
-cluster analysis and PCA (principal component analysis):
The validation matrix employed in classification-based methods is
 Wilks lambda (λ) statistics: It is sum of squares to total dispersion. The value ranges
between 0< λ <1
lower value corresponding = higher level of discrimination.
 Canonical index (Rc): It is used to estimate the strength of relationship between various
dependent and independent variables
 Chi-square (χ2): to check the quality of the classification based model
 Squared Mahalanobis: distance is a measure calculated using random data points
16
UPASANA SHARMA (16/05/2023)
 REFRENCES:
Pirhadi S, Shiri F, Ghasemi JB. Multivariate statistical analysis methods in QSAR.
Rsc Advances. 2015;5(127):104635-65.
Damme SV, Bultinck AR. Journal of Computational Chemistry. 2007
Aug;28(11):1924-8.
De Oliveira DB, Gaudio AC. BuildQSAR: a new computer program for QSAR
analysis. Quantitative Structure‐Activity Relationships: An International Journal
Devoted to Fundamental and Practical Aspects of Electroanalysis. 2000
Dec;19(6):599-601.
Verma J, Khedkar VM, Coutinho EC. 3D-QSAR in drug design-a review. Current
topics in medicinal chemistry. 2010 Jan 1;10(1):95-115.
UPASANA SHARMA (16/05/2023) 17
UPASANA SHARMA (16/05/2023) 18
THANKYOU

More Related Content

What's hot

3 d qsar approaches structure
3 d qsar approaches structure3 d qsar approaches structure
3 d qsar approaches structureROHIT PAL
 
Hansch and Free-Wilson QSAR Models
Hansch and Free-Wilson QSAR ModelsHansch and Free-Wilson QSAR Models
Hansch and Free-Wilson QSAR ModelsAkshay Kank
 
(Kartik Tiwari) Denovo Drug Design.pptx
(Kartik Tiwari) Denovo Drug Design.pptx(Kartik Tiwari) Denovo Drug Design.pptx
(Kartik Tiwari) Denovo Drug Design.pptxKartik Tiwari
 
In Silico methods for ADMET prediction of new molecules
 In Silico methods for ADMET prediction of new molecules In Silico methods for ADMET prediction of new molecules
In Silico methods for ADMET prediction of new moleculesMadhuraDatar
 
CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)Pinky Vincent
 
Free wilson analysis
Free wilson analysisFree wilson analysis
Free wilson analysisASHOK GAUTAM
 
in silico drug design and virtual screening technique
 in silico drug design and virtual screening technique in silico drug design and virtual screening technique
in silico drug design and virtual screening techniqueMO.SHAHANAWAZ
 
Quantitative Structure Activity Relationship (QSAR)
Quantitative Structure Activity Relationship (QSAR)Quantitative Structure Activity Relationship (QSAR)
Quantitative Structure Activity Relationship (QSAR)Theabhi.in
 
QSAR applications: Hansch analysis and Free Wilson analysis, CADD
QSAR applications: Hansch analysis and Free Wilson analysis, CADDQSAR applications: Hansch analysis and Free Wilson analysis, CADD
QSAR applications: Hansch analysis and Free Wilson analysis, CADDGagangowda58
 
Pharmacophore mapping
Pharmacophore mapping Pharmacophore mapping
Pharmacophore mapping GamitKinjal
 
De novo drug design
De novo drug designDe novo drug design
De novo drug designmojdeh y
 
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)AkshayYadav176
 
Structure based in silico virtual screening
Structure based in silico virtual screeningStructure based in silico virtual screening
Structure based in silico virtual screeningJoon Jyoti Sahariah
 
Free wilson analysis qsar
Free wilson analysis qsarFree wilson analysis qsar
Free wilson analysis qsarRahul B S
 

What's hot (20)

3 d qsar approaches structure
3 d qsar approaches structure3 d qsar approaches structure
3 d qsar approaches structure
 
Hansch and Free-Wilson QSAR Models
Hansch and Free-Wilson QSAR ModelsHansch and Free-Wilson QSAR Models
Hansch and Free-Wilson QSAR Models
 
(Kartik Tiwari) Denovo Drug Design.pptx
(Kartik Tiwari) Denovo Drug Design.pptx(Kartik Tiwari) Denovo Drug Design.pptx
(Kartik Tiwari) Denovo Drug Design.pptx
 
In Silico methods for ADMET prediction of new molecules
 In Silico methods for ADMET prediction of new molecules In Silico methods for ADMET prediction of new molecules
In Silico methods for ADMET prediction of new molecules
 
CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)
 
Rationale of prodrug design and practical consideration of
Rationale of prodrug design and practical consideration ofRationale of prodrug design and practical consideration of
Rationale of prodrug design and practical consideration of
 
Free wilson analysis
Free wilson analysisFree wilson analysis
Free wilson analysis
 
Virtual sreening
Virtual sreeningVirtual sreening
Virtual sreening
 
QSAR.pptx
QSAR.pptxQSAR.pptx
QSAR.pptx
 
in silico drug design and virtual screening technique
 in silico drug design and virtual screening technique in silico drug design and virtual screening technique
in silico drug design and virtual screening technique
 
3d qsar
3d qsar3d qsar
3d qsar
 
Quantitative Structure Activity Relationship (QSAR)
Quantitative Structure Activity Relationship (QSAR)Quantitative Structure Activity Relationship (QSAR)
Quantitative Structure Activity Relationship (QSAR)
 
QSAR applications: Hansch analysis and Free Wilson analysis, CADD
QSAR applications: Hansch analysis and Free Wilson analysis, CADDQSAR applications: Hansch analysis and Free Wilson analysis, CADD
QSAR applications: Hansch analysis and Free Wilson analysis, CADD
 
Pharmacophore mapping
Pharmacophore mapping Pharmacophore mapping
Pharmacophore mapping
 
Denovo
DenovoDenovo
Denovo
 
De novo drug design
De novo drug designDe novo drug design
De novo drug design
 
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)
Pharmacophore Mapping and Virtual Screening (Computer aided Drug design)
 
Pharmacophore mapping joon
Pharmacophore mapping joonPharmacophore mapping joon
Pharmacophore mapping joon
 
Structure based in silico virtual screening
Structure based in silico virtual screeningStructure based in silico virtual screening
Structure based in silico virtual screening
 
Free wilson analysis qsar
Free wilson analysis qsarFree wilson analysis qsar
Free wilson analysis qsar
 

Similar to Statistical Methods for QSAR Modeling

Small Molecules and siRNA: Methods to Explore Bioactivity Data
Small Molecules and siRNA: Methods to Explore Bioactivity DataSmall Molecules and siRNA: Methods to Explore Bioactivity Data
Small Molecules and siRNA: Methods to Explore Bioactivity DataRajarshi Guha
 
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...ijcsa
 
Quantitative Structure Activity Relationship
Quantitative Structure Activity RelationshipQuantitative Structure Activity Relationship
Quantitative Structure Activity RelationshipRaniBhagat1
 
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...ijcisjournal
 
Treatment by alternative methods of regression gas chromatographic retention ...
Treatment by alternative methods of regression gas chromatographic retention ...Treatment by alternative methods of regression gas chromatographic retention ...
Treatment by alternative methods of regression gas chromatographic retention ...ijics
 
Treatment by Alternative Methods of Regression Gas Chromatographic Retention ...
Treatment by Alternative Methods of Regression Gas Chromatographic Retention ...Treatment by Alternative Methods of Regression Gas Chromatographic Retention ...
Treatment by Alternative Methods of Regression Gas Chromatographic Retention ...ijsc
 
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...ijsc
 
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...Simone Brogi
 
Regression kriging
Regression krigingRegression kriging
Regression krigingFAO
 
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGESCASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGESIRJET Journal
 
POSTERIOR RESOLUTION AND STRUCTURAL MODIFICATION FOR PARAMETER DETERMINATION ...
POSTERIOR RESOLUTION AND STRUCTURAL MODIFICATION FOR PARAMETER DETERMINATION ...POSTERIOR RESOLUTION AND STRUCTURAL MODIFICATION FOR PARAMETER DETERMINATION ...
POSTERIOR RESOLUTION AND STRUCTURAL MODIFICATION FOR PARAMETER DETERMINATION ...IJCI JOURNAL
 
HEART DISEASE PREDICTION USING MACHINE LEARNING TECHNIQUES
HEART DISEASE PREDICTION USING MACHINE LEARNING TECHNIQUESHEART DISEASE PREDICTION USING MACHINE LEARNING TECHNIQUES
HEART DISEASE PREDICTION USING MACHINE LEARNING TECHNIQUESIRJET Journal
 
Threshold benchmarking for feature ranking techniques
Threshold benchmarking for feature ranking techniquesThreshold benchmarking for feature ranking techniques
Threshold benchmarking for feature ranking techniquesjournalBEEI
 
IJBB-51-3-188-200
IJBB-51-3-188-200IJBB-51-3-188-200
IJBB-51-3-188-200sankar basu
 
journal.pone.0161879.PDF
journal.pone.0161879.PDFjournal.pone.0161879.PDF
journal.pone.0161879.PDFsankar basu
 
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...
Forecasting Default Probabilities  in Emerging Markets and   Dynamical Regula...Forecasting Default Probabilities  in Emerging Markets and   Dynamical Regula...
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...SSA KPI
 
A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...
A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...
A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...ijaia
 
A Comparative study of locality Preserving Projection & Principle Component A...
A Comparative study of locality Preserving Projection & Principle Component A...A Comparative study of locality Preserving Projection & Principle Component A...
A Comparative study of locality Preserving Projection & Principle Component A...RAHUL WAGAJ
 
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...IRJET Journal
 

Similar to Statistical Methods for QSAR Modeling (20)

Small Molecules and siRNA: Methods to Explore Bioactivity Data
Small Molecules and siRNA: Methods to Explore Bioactivity DataSmall Molecules and siRNA: Methods to Explore Bioactivity Data
Small Molecules and siRNA: Methods to Explore Bioactivity Data
 
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
A ROBUST MISSING VALUE IMPUTATION METHOD MIFOIMPUTE FOR INCOMPLETE MOLECULAR ...
 
Quantitative Structure Activity Relationship
Quantitative Structure Activity RelationshipQuantitative Structure Activity Relationship
Quantitative Structure Activity Relationship
 
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
 
Treatment by alternative methods of regression gas chromatographic retention ...
Treatment by alternative methods of regression gas chromatographic retention ...Treatment by alternative methods of regression gas chromatographic retention ...
Treatment by alternative methods of regression gas chromatographic retention ...
 
Treatment by Alternative Methods of Regression Gas Chromatographic Retention ...
Treatment by Alternative Methods of Regression Gas Chromatographic Retention ...Treatment by Alternative Methods of Regression Gas Chromatographic Retention ...
Treatment by Alternative Methods of Regression Gas Chromatographic Retention ...
 
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
TREATMENT BY ALTERNATIVE METHODS OF REGRESSION GAS CHROMATOGRAPHIC RETENTION ...
 
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
PROGRAM PHASE IN LIGAND-BASED PHARMACOPHORE MODEL GENERATION AND 3D DATABASE ...
 
Regression kriging
Regression krigingRegression kriging
Regression kriging
 
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGESCASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
CASE STUDY: ADMISSION PREDICTION IN ENGINEERING AND TECHNOLOGY COLLEGES
 
POSTERIOR RESOLUTION AND STRUCTURAL MODIFICATION FOR PARAMETER DETERMINATION ...
POSTERIOR RESOLUTION AND STRUCTURAL MODIFICATION FOR PARAMETER DETERMINATION ...POSTERIOR RESOLUTION AND STRUCTURAL MODIFICATION FOR PARAMETER DETERMINATION ...
POSTERIOR RESOLUTION AND STRUCTURAL MODIFICATION FOR PARAMETER DETERMINATION ...
 
HEART DISEASE PREDICTION USING MACHINE LEARNING TECHNIQUES
HEART DISEASE PREDICTION USING MACHINE LEARNING TECHNIQUESHEART DISEASE PREDICTION USING MACHINE LEARNING TECHNIQUES
HEART DISEASE PREDICTION USING MACHINE LEARNING TECHNIQUES
 
Threshold benchmarking for feature ranking techniques
Threshold benchmarking for feature ranking techniquesThreshold benchmarking for feature ranking techniques
Threshold benchmarking for feature ranking techniques
 
IJBB-51-3-188-200
IJBB-51-3-188-200IJBB-51-3-188-200
IJBB-51-3-188-200
 
journal.pone.0161879.PDF
journal.pone.0161879.PDFjournal.pone.0161879.PDF
journal.pone.0161879.PDF
 
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...
Forecasting Default Probabilities  in Emerging Markets and   Dynamical Regula...Forecasting Default Probabilities  in Emerging Markets and   Dynamical Regula...
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...
 
A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...
A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...
A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...
 
Ijetr042111
Ijetr042111Ijetr042111
Ijetr042111
 
A Comparative study of locality Preserving Projection & Principle Component A...
A Comparative study of locality Preserving Projection & Principle Component A...A Comparative study of locality Preserving Projection & Principle Component A...
A Comparative study of locality Preserving Projection & Principle Component A...
 
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...IRJET-  	  Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
IRJET- Comparative Study of PCA, KPCA, KFA and LDA Algorithms for Face Re...
 

Recently uploaded

Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...hotbabesbook
 
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service KochiLow Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service KochiSuhani Kapoor
 
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...narwatsonia7
 
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...Taniya Sharma
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...narwatsonia7
 
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...Arohi Goyal
 
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls JaipurRussian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipurparulsinha
 
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomdiscovermytutordmt
 
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...Neha Kaur
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...aartirawatdelhi
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiAlinaDevecerski
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...astropune
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...Taniya Sharma
 

Recently uploaded (20)

Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
Night 7k to 12k Chennai City Center Call Girls 👉👉 7427069034⭐⭐ 100% Genuine E...
 
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service KochiLow Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
Low Rate Call Girls Kochi Anika 8250192130 Independent Escort Service Kochi
 
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
 
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
💎VVIP Kolkata Call Girls Parganas🩱7001035870🩱Independent Girl ( Ac Rooms Avai...
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
 
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Coimbatore Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Jabalpur Just Call 9907093804 Top Class Call Girl Service Available
 
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls JaipurRussian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
 
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Kochi Just Call 9907093804 Top Class Call Girl Service Available
 
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel roomLucknow Call girls - 8800925952 - 24x7 service with hotel room
Lucknow Call girls - 8800925952 - 24x7 service with hotel room
 
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
VIP Russian Call Girls in Varanasi Samaira 8250192130 Independent Escort Serv...
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
 
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
♛VVIP Hyderabad Call Girls Chintalkunta🖕7001035870🖕Riya Kappor Top Call Girl ...
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
 

Statistical Methods for QSAR Modeling

  • 1. Statistical Method Used in QSAR SUBMITTED BY: UPASANA SHARMA (M.PHARMA :Pharmaceutical Chemistry) SUBMITTED TO: Dr. SALAHUDDIN (PROFESSOR) NOIDA INSTITUTE OF ENGINEERING AND TECHNOLOGY (PHARMACY INSTITUTE) GREATER NOIDA 1 UPASANA SHARMA (16/05/2023)
  • 2. Collection of ligand Generation of descriptors Features selection Construction of model Validation of model QSAR Work Flow: 2 UPASANA SHARMA (16/05/2023)
  • 3. Build models and calculate minimum energy conformation Calculate descriptors Short list descriptors  Based on correlation coefficient  Based on cross correlation coefficient  Based on dissimilarity distance  Based on cluster analysis  Based on genetic function approach Develop regression relationship and estimate statistics (R2,R2adj,R2pre,F test, Residuals) Test model with external data set QSAR Model 3 UPASANA SHARMA (16/05/2023)
  • 4. Model construction +Features selection= Statistical analysis ( for large no. of descriptors or few no. of descriptors) Method Regression Based approach Classification Based approach Machine Learning Technology Simple Linear Regression method Multiple linear regression method Partial least square method Cluster analysis Principal component analysis Logistic regression Artificial neural network Support vector machine Gene expressing programming Linear properties Linear regression Partial regression Semi supervised learning algorithm Unsupervised learning algorithm Non-linear properties Supervised learning algorithm Model Artificial neural network Model can be construct using 4 UPASANA SHARMA (16/05/2023)
  • 5. Validation: Regression based QSAR model Validation metrics for internal validation Least square fitting Chi-Squared x2 and root mean squared error (RMSE) Cross validation Leave one out cross validation LOO Leave some out cross validation LSO True Q2 and rm2 metrics Validation metrics for external validation Predictive R2 (Q2F1) Q2 F2 and Q3 F3 Golbraikh and tropsha's criteria Metrics include (RMSEP) root mean square for prediction Validation metrics for classification based method Wilks lamda statistics Lower value Canonical index (Rc) Chi-square x2 Squared mahalanobis distance 5 UPASANA SHARMA (16/05/2023)
  • 6.  Simple Linear regression method : 1 descriptor Y= b+b1x1+e  Multiple Linear Regression method: Y=b+b1x1+b2x2+bnxn+e  Non-linear regression method : 1parameter is not linear Y=n(x ,B)+e (B= unknown parameter) Observational data are molded by a function which is non-linear combination of the model parameter and depend on one or more independent variables. Regression based method : 6 UPASANA SHARMA (16/05/2023)
  • 7. UPASANA SHARMA (16/05/2023) 7  Partial least square method:  The principal component regression.  predict or analyze a set of dependent variables from a set of independent variables by multivariant statistical method done from multiple regression analysis.  Applied in 3D-QSAR technique, Comparative Molecular Field Analysis (CoMFA)  It used combination with GPLS genetic, FAPLS (factor analysis), OSCPLS (orthogonal signal correction)  COMPACT( computer optimized molecular parametric analysis of chemical toxicity) predict carcinogenicity and other forms of toxicity.  Software: SIMCA-P UNSCRAMBLER SPM
  • 8. Classification based approach :  Cluster analysis:  Clustering involves placing similar data into a group in a way that maximizes similarity within groups and dissimilarity between groups.  Multivariate technique which analysis the group based on distance (proximity).  In hierarchical clustering, in a agglomerative and divisive form  The k-means clustering is a partition based clustering  Data reduction and hypotheses generation use in data mining ,statistical data analysis ,pattern recognition. 8 UPASANA SHARMA (16/05/2023)
  • 9. UPASANA SHARMA (16/05/2023) 9  PRINCIPAL COMPONENT ANALYSIS:  It transforms a number of possibly correlated variables into a smaller number of uncorrelated variables called principal components.  PCA reduces attribute space from a larger number of variables to a small number of factors (non-dependent variable).  PCA is a dimensionality reduction or data compression method and there is no guarantee that the dimensions are interpretable.  Objective : To select a subset of variables from a larger set, based on which original variables have the highest correlations with the principal component.
  • 10. Machine learning techniques :  Artificial neural network  Mimics the behavior of biological neurons.  It has input layer hidden layer output layer  Types: Propagation neural networks, probabilistic neural networks, Kohonen self-organizing maps and Bayesian regularized neural networks.  Support vector machine  Uses a linear classifier to classify data into two categories.  It used in combination with other methods like MLR, PLS and so forth for building more powerful and accurate QSAR models  Gene expressing programming  The genetic algorithm and genetic programming.  It used for calculate the dermal penetration, EC50, Binding Affinity ,Improved gene expressing programming 10 UPASANA SHARMA (16/05/2023)
  • 11. Validation :  It avoid chance correlation of numerous descriptors used in the model and also over- fitting of data. It assign the accuracy and prediction of the model. (training set and test set).  The Organization for Economic Cooperation and Development (OECD) give 5 principles to test the model. 1) a defined endpoint 2) an unambiguous algorithm 3) a defined domain of applicability 4) appropriate measures of goodness-of-fit, robustness and prediction accuracy 5) a mechanistic interpretation 11 UPASANA SHARMA (16/05/2023)
  • 12. Regression based QSAR model : Validation metrics for internal validation The use of molecules from training set to test the predictability of the model  Least square fitting It is the measure of square correlation coefficient (R2) between the predicted and experimental value of activity. The difference between R2 and R2 adj < 0.3 , QSAR validation is good.  The χ2 Chi-squared and RMSE: The values are used to assess the predictive quality of a model. χ2 value shows the difference between experimentally determined bioactivity values and the values predicted by the model RMSE value of for large R2 value (that is >=7), values of χ2 and RMSE should < than 0.5 and 0.3 respectively. 12 UPASANA SHARMA (16/05/2023)
  • 13. Cross validation : It is internal validation include Leave-Group-Out (LGO), which involves leaving of a molecule or a group of molecules while creating model and evaluating the predictability of the model using the molecules left.  In LOO cross validation (leave-one-out)  one compound is left out and the QSAR model is constructed using remaining compounds.  The eliminated compound is used as a test for the predicted model  The predictability of the model is assessed by PRESS (Predicted Residual Sum of Squares) and cross-validated R2 (Q2 ) when SDEP (Standard Deviation of Error of Prediction) is obtained from PRESS 13 UPASANA SHARMA (16/05/2023)
  • 14.  True Q2 value:  True Q2 is used for small data sets  Q2 should not be treated as an ultimate proof for good predictability of models.  Value of Q2 = < 0.5  LSO (Leave-Some-Out) or LMO (Leave-Many-Out)  It is set of data compounds are eliminated and models are created with rest of the compounds.  The left out compounds are then used to check the predictability of the model. 14 UPASANA SHARMA (16/05/2023) Low RMSE value and High R2 value
  • 15. Validation metrics for external validation  Predictive R2 or Q2 (F1) = correlation of observed and predicted data. Model is good if predictive power has value of Q2 (F1) =< 0.5  Q2 (F2) and Q2 (F3)= using the mean of test data set and training data . For validation of QSAR model, threshold value of 0.5 is defined for both metrics  Golbraikh and Tropsha’s criteria= forth condition for selection of training and test data sets.  For having a good predictive power, QSAR model should satisfy following condition i. Q 2 training > 0.5 ii. R 2 test > 0.6 iii. (r2 - r 2 0)/ r 2 < 0.1 or (r2 – r’2 0)/ r 2 < 0.1, where r2 0 is R2 of predicted vs. observed activities and r’2 0 is R2 of observed vs. predicted activities. iv. 0.85 <= k <= 1.15 or 0.85 <= k’<= 1.15, where k and k’ are the slopes of regression lines  Other metrics includes RMSEP (Root Mean Square Error of Prediction) to calculate prediction error of QSAR model 15 UPASANA SHARMA (16/05/2023)
  • 16. Validation metrics for classification based methods -cluster analysis and PCA (principal component analysis): The validation matrix employed in classification-based methods is  Wilks lambda (λ) statistics: It is sum of squares to total dispersion. The value ranges between 0< λ <1 lower value corresponding = higher level of discrimination.  Canonical index (Rc): It is used to estimate the strength of relationship between various dependent and independent variables  Chi-square (χ2): to check the quality of the classification based model  Squared Mahalanobis: distance is a measure calculated using random data points 16 UPASANA SHARMA (16/05/2023)
  • 17.  REFRENCES: Pirhadi S, Shiri F, Ghasemi JB. Multivariate statistical analysis methods in QSAR. Rsc Advances. 2015;5(127):104635-65. Damme SV, Bultinck AR. Journal of Computational Chemistry. 2007 Aug;28(11):1924-8. De Oliveira DB, Gaudio AC. BuildQSAR: a new computer program for QSAR analysis. Quantitative Structure‐Activity Relationships: An International Journal Devoted to Fundamental and Practical Aspects of Electroanalysis. 2000 Dec;19(6):599-601. Verma J, Khedkar VM, Coutinho EC. 3D-QSAR in drug design-a review. Current topics in medicinal chemistry. 2010 Jan 1;10(1):95-115. UPASANA SHARMA (16/05/2023) 17