SlideShare a Scribd company logo
1 of 25
QSAR STATISTICAL
METHODS
PRESENTED BY-GAYATRI SATI
CLASS-M.PHARMA-2nd sem
(PHARMACOLOGY)
TABLE OF CONTENTS
1. INTRODUCTION OF QSAR
2. QSAR STATISTICAL METHODS
3. REGRESSION ANALYSIS
4. APPLICATION OF REGRESSION ANALYSIS
5. PARTIAL LEAST SQUARE ANALYSIS
6. APLICATION OF PLS
7. OTHER METHODS
8. REFERENCES
INTRODUCTION OF QSAR
• Quantitative structure activity relationship (QSAR) is a strategy of the essential
importance for chemistry and pharmacy, based on the idea that when we change a
structure of a molecule then also the activity or property of the substance will be
modified.
• QSAR are mathematical relationships between the physicochemical properties and
pharmacological/biological activity in a quantitative manner for a series of compound.
Biological activity=f (physicochemical properties and /or structure properties)
• Statistics is a branch of mathematics dealing with data collection, organization, analysis,
interpretation and presentation
QSAR
STATISTICAL
METHODS
REGRESSION
ANALYSIS
Simple
regression
analysis
Multiple
regression
PARTIAL
LEAST
SQUARE(PLS)
OTHER
METHODS
Cluster
analysis
Principal
component
analysis
Regression
based analysis
Ordinary least square
regression
Generalized linear
models
INDTRODUCTION REGRESSION ANALYSIS
• In statistical modeling, regression analysis is a set of statistical processes for
estimating the relationships among variables.
• Regression analysis correlates independent X variables with dependent Y variables.
• If two variables are involved, the variable that is basis of estimation is called the
independent variable and the variable whose value is to be estimated is called as
dependent variable.
• For any given values of X, the Y values are independent and follow a normal
distribution curve.
DEFINITION OF REGRESSION ANALYSIS
Regression analysis is a technique of studying the dependence of one variable (called
dependent Y variable e.g. biological data) on one or more variables (called independent X
variable e.g. physicochemical parameters) with a view to estimate or predict the average
value of dependent variable in terms of known or fixed values of the independent variable.
The dependent variable is also called as-
»Explained »Response »Endogenous
The independent variable is also called as-
»Explanatory »Regressor »Exogenous
REGRESSION MODELS
• Regression models involve the following parameters and variables. The unknown
parameter known as β, which may be a scalar or vector
• A regression model relates Y to a function of X and β
Y ≈ f (X, β)
where;
f = function
β = unknown parameter
X=independent variable
Y=dependent variable
Assume now that the vector of unknown parameters β is of length K, In order to
perform a regression analysis the user must provide information about the dependent
variable Y
 If N data points of the form (Y, X) are observed, where N < K, most classical
approaches to regression analysis cannot be performed.
If N = K data points are observed, and the function f is linear, the equations Y ≈ f (X,
β) can be solved exactly rather than approximately.
If N > K data points are observed, there is enough information in the data to estimate
the unique value for β.
REGRESSION
ANALYSIS
SIMPLE REGRESSION
ANALYSIS
LINEAR
NON-LINEAR
MULTIPLE REGRESSION
ANALYSIS
LINEAR
NON-LINEAR
SIMPLE LINEAR REGRESSION MODEL
• In simple linear regression there is only single explanatory variable
• Simple linear regression is applied when you to want to predict the value of one
variable, given values of other variables.
0
10
20
30
40
50
60
0 10 20 30 40 50 60
WEIGHT
HEIGHT
Linear regression fit plot
SIMPLE LINEAR REGRESSION
Simple linear regression for a derivation of these formulas
Yᵢ = β̥+ β1 Xᵢ + εᵢ
Where,
Yᵢ=Dependent variable
β̥=Population Y intercept
β1= Population slope coefficient linear component
Xᵢ=Independent variable
εᵢ=Random error term Random error component
MULTIPLE LINEAR REGRESSION
• Multiple linear regression is the same idea as simple linear regression, except how you
have several independent variables predicting the dependent variables
• It is used when we want to predict the value of a variable based on the value of two or
more other variables
Y=β ̥+ β1X1 + β2 X2 +……….+ βn Xn + ε
Where,
N=number of variable
β̥=intercept term
βn=Coefficients for independent variable
β= unknown parameter
USES OF REGRESSION ANALYSIS
1. Regression analysis helps in establishing the relationship between two or more
variables
2. Regression analysis predicts the value of dependent variables from the values of
independent variables
3. Coefficient of correlation and coefficient of determination can be calculated with the
help of regression analysis
4. Regression analysis is widely used as statistical tool in QSAR.
PARTIAL LEAST SQUARE ANALYSIS(PLS)
• Partial least square analysis (PLS) is a method for constructing predictive models when
the factors are many and collinear
• It is a recent technique that generalizes and combines features from principal
component analysis and multiple regression
• Goal-predict set of dependent variables Y from a set of independent variables X
describe their common structure
• Used to Find the fundamental relations between the two variables/matrices (X and Y)
• COMPACT (computer optimized molecular parametric analysis of chemical toxicity),
a PLS approach, is described to predict carcinogenicity and other forms of toxicity.
SOFTWARES USED IN PLS
Its application depends on the availability of software
• SIMCA-P
• UNSCRAMBLER
• SPM
• SAS PROC PLS
APPLICATIONS OF PLS
• PLS is used to find the fundamental relations between two matrices (X and Y)
• PLS model will try to find the maximum multidirectional direction in the X space
and the maximum multidimensional direction in the Y space
• PLS regression is widely used in chemo metrics especially in the case where the
number of independent variables is significantly larger than the number of data
points and related areas
• It is also used in bioinformatics, sensometrics, neuroscience and anthropology.
OTHER MULTIVARIABLE STATISTICAL MODELS
1. Cluster analysis
2. Principal component analysis
3. Regression based analysis methods
a) Ordinary least square regression
b) Generalized linear models
1. CLUSTER ANALYSIS
• Cluster analysis is a group of multivariate techniques whose primary purpose is to group
objects based on the characteristics they possess.
• In cluster analysis, the grouping is based on the distance (proximity)
• It is the main task of exploratory data mining, statistical data analysis, pattern
recognition, image analysis, bioinformatics, data compression and computer graphics
ROLE &APPLICATIONS OF CLUSTER ANALYSIS
ROLES-
1. Data reduction
2. Hypotheses generation
APPLICATIONS-
1. Medicine
2. Analysis of antimicrobial activity
3. Biology & bioinformatics
4. Field of psychiatry
5. Climate
6. Sequence analysis
7. Crime analysis & transcriptomic
2. PRINCIPAL COMPONENT ANALYSIS
• It is a exploratory technique used to reduce the dimensionality of data set to 2D or 3D
• PCA is a procedure that transforms a number of possibly correlated variables into a
smaller number of uncorrelated variables called principal components
• Objective of PCA:-
PCA is a dimensionality reduction or data compression method
• Goal of PCA:-
To select a subset of variables from a larger set, based on which original variables have
the highest correlations with the principal component
APPLICATIONS OF PCA
1. Neuroscience: A variant of PCA is used in neuroscience to identify the specific
properties of a stimulus that increase a neuron’s probability of generating an action
potential. This technique is known as spike triggered covariance analysis. In
neuroscience, PCA is also used to discern the identify of a neuron from the shape of
its action potential.
2. Quantitative finance: PCA can be directly applied to the risk management of interest
rate derivatives portfolios.
3. REGRESSION BASED ANALYSIS
a) Ordinary least squares:-
• In statistics, ordinary least squares (OLS) is a type of linear least squares
method for estimating the unknown parameters in a linear regression model.
• OLS is used in fields as diverse as economics (econometrics), data science,
political science, psychology and engineering (control theory and signal
processing)
b) Generalized linear model:-
• In statistics, the generalized linear model (GLM) is a flexible generalization
of ordinary linear regression that allows for response variables that have error
distribution models other than a normal distribution.
REFERENCES
1. https://en.wikipedia.org/wiki/Statistics
2. www.statstutor.ac.uk/resources/uploaded/1introduction3.pdf
3. http://home.iitk.ac.in/~kundu/Statistical
4. Methods.pdfhttps://en.wikipedia.org/wiki/Regression_analysis#Linear_regression
”NEVER TRUST A STATISTICS YOU DIDN’T FORGE YOURSELF ”
-WINSTON CHURCHILL

More Related Content

What's hot

Rational drug design method
Rational drug design methodRational drug design method
Rational drug design methodRangnathChikane
 
Hansch and Free-Wilson QSAR Models
Hansch and Free-Wilson QSAR ModelsHansch and Free-Wilson QSAR Models
Hansch and Free-Wilson QSAR ModelsAkshay Kank
 
Quantitative Structure Activity Relationship
Quantitative Structure Activity RelationshipQuantitative Structure Activity Relationship
Quantitative Structure Activity RelationshipRaniBhagat1
 
QSAR applications: Hansch analysis and Free Wilson analysis, CADD
QSAR applications: Hansch analysis and Free Wilson analysis, CADDQSAR applications: Hansch analysis and Free Wilson analysis, CADD
QSAR applications: Hansch analysis and Free Wilson analysis, CADDGagangowda58
 
De novo drug design
De novo drug designDe novo drug design
De novo drug designmojdeh y
 
Traditional and Rational Drug Designing
Traditional and Rational Drug DesigningTraditional and Rational Drug Designing
Traditional and Rational Drug DesigningManish Kumar
 
Rationale of prodrug design and practical considertions of prodrug design
Rationale of prodrug design and practical considertions of prodrug designRationale of prodrug design and practical considertions of prodrug design
Rationale of prodrug design and practical considertions of prodrug designKeshari Sriwastawa
 
Quantitative Structure Activity Relationship (QSAR)
Quantitative Structure Activity Relationship (QSAR)Quantitative Structure Activity Relationship (QSAR)
Quantitative Structure Activity Relationship (QSAR)Theabhi.in
 
CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)Pinky Vincent
 
Pharmacophore mapping
Pharmacophore mapping Pharmacophore mapping
Pharmacophore mapping GamitKinjal
 
Structure based in silico virtual screening
Structure based in silico virtual screeningStructure based in silico virtual screening
Structure based in silico virtual screeningJoon Jyoti Sahariah
 
Virtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryVirtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryAbhik Seal
 
Docking based screening of drugs.
Docking based screening of drugs.Docking based screening of drugs.
Docking based screening of drugs.Himanshu Yadav
 
Role of nuclicacid microarray &protein micro array for drug discovery process
Role of nuclicacid microarray &protein micro array for drug discovery processRole of nuclicacid microarray &protein micro array for drug discovery process
Role of nuclicacid microarray &protein micro array for drug discovery processmohamed abusalih
 
Combinatorial chemistry and high throughput screening
Combinatorial chemistry and high throughput screeningCombinatorial chemistry and high throughput screening
Combinatorial chemistry and high throughput screeningAnji Reddy
 

What's hot (20)

Rational drug design method
Rational drug design methodRational drug design method
Rational drug design method
 
Hansch and Free-Wilson QSAR Models
Hansch and Free-Wilson QSAR ModelsHansch and Free-Wilson QSAR Models
Hansch and Free-Wilson QSAR Models
 
Quantitative Structure Activity Relationship
Quantitative Structure Activity RelationshipQuantitative Structure Activity Relationship
Quantitative Structure Activity Relationship
 
QSAR applications: Hansch analysis and Free Wilson analysis, CADD
QSAR applications: Hansch analysis and Free Wilson analysis, CADDQSAR applications: Hansch analysis and Free Wilson analysis, CADD
QSAR applications: Hansch analysis and Free Wilson analysis, CADD
 
De novo drug design
De novo drug designDe novo drug design
De novo drug design
 
3D QSAR
3D QSAR3D QSAR
3D QSAR
 
Virtual sreening
Virtual sreeningVirtual sreening
Virtual sreening
 
Traditional and Rational Drug Designing
Traditional and Rational Drug DesigningTraditional and Rational Drug Designing
Traditional and Rational Drug Designing
 
Rationale of prodrug design and practical considertions of prodrug design
Rationale of prodrug design and practical considertions of prodrug designRationale of prodrug design and practical considertions of prodrug design
Rationale of prodrug design and practical considertions of prodrug design
 
Quantitative Structure Activity Relationship (QSAR)
Quantitative Structure Activity Relationship (QSAR)Quantitative Structure Activity Relationship (QSAR)
Quantitative Structure Activity Relationship (QSAR)
 
CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)CoMFA CoMFA Comparative Molecular Field Analysis)
CoMFA CoMFA Comparative Molecular Field Analysis)
 
Pharmacophore mapping
Pharmacophore mapping Pharmacophore mapping
Pharmacophore mapping
 
Presentation on concept of pharmacophore mapping and pharmacophore based scre...
Presentation on concept of pharmacophore mapping and pharmacophore based scre...Presentation on concept of pharmacophore mapping and pharmacophore based scre...
Presentation on concept of pharmacophore mapping and pharmacophore based scre...
 
Structure based in silico virtual screening
Structure based in silico virtual screeningStructure based in silico virtual screening
Structure based in silico virtual screening
 
QSAR.pptx
QSAR.pptxQSAR.pptx
QSAR.pptx
 
Denovo Drug Design
Denovo Drug DesignDenovo Drug Design
Denovo Drug Design
 
Virtual Screening in Drug Discovery
Virtual Screening in Drug DiscoveryVirtual Screening in Drug Discovery
Virtual Screening in Drug Discovery
 
Docking based screening of drugs.
Docking based screening of drugs.Docking based screening of drugs.
Docking based screening of drugs.
 
Role of nuclicacid microarray &protein micro array for drug discovery process
Role of nuclicacid microarray &protein micro array for drug discovery processRole of nuclicacid microarray &protein micro array for drug discovery process
Role of nuclicacid microarray &protein micro array for drug discovery process
 
Combinatorial chemistry and high throughput screening
Combinatorial chemistry and high throughput screeningCombinatorial chemistry and high throughput screening
Combinatorial chemistry and high throughput screening
 

Similar to QSAR statistical methods for drug discovery(pharmacology m.pharm2nd sem)

[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)mohamedchaouche
 
KIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfKIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfDr. Radhey Shyam
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptvigia41
 
Factor analysis
Factor analysis Factor analysis
Factor analysis Nima
 
Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8ParulSharma130721
 
Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis pptElkana Rorio
 
cannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdfcannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdfJermaeDizon2
 
Singular Value Decomposition (SVD).pptx
Singular Value Decomposition (SVD).pptxSingular Value Decomposition (SVD).pptx
Singular Value Decomposition (SVD).pptxrajalakshmi5921
 
EDAB Module 5 Singular Value Decomposition (SVD).pptx
EDAB Module 5 Singular Value Decomposition (SVD).pptxEDAB Module 5 Singular Value Decomposition (SVD).pptx
EDAB Module 5 Singular Value Decomposition (SVD).pptxrajalakshmi5921
 
Analysis of data (pratik)
Analysis of data (pratik)Analysis of data (pratik)
Analysis of data (pratik)Patel Parth
 
Factor analysis ppt
Factor analysis pptFactor analysis ppt
Factor analysis pptMukesh Bisht
 
An Introduction to Factor analysis ppt
An Introduction to Factor analysis pptAn Introduction to Factor analysis ppt
An Introduction to Factor analysis pptMukesh Bisht
 
Anomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep LearningAnomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep LearningQuantUniversity
 

Similar to QSAR statistical methods for drug discovery(pharmacology m.pharm2nd sem) (20)

[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
[Xin yan, xiao_gang_su]_linear_regression_analysis(book_fi.org)
 
KIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdfKIT-601 Lecture Notes-UNIT-2.pdf
KIT-601 Lecture Notes-UNIT-2.pdf
 
Discriminant analysis.pptx
Discriminant analysis.pptxDiscriminant analysis.pptx
Discriminant analysis.pptx
 
Unit-3 Data Analytics.pdf
Unit-3 Data Analytics.pdfUnit-3 Data Analytics.pdf
Unit-3 Data Analytics.pdf
 
Unit-3 Data Analytics.pdf
Unit-3 Data Analytics.pdfUnit-3 Data Analytics.pdf
Unit-3 Data Analytics.pdf
 
Unit-3 Data Analytics.pdf
Unit-3 Data Analytics.pdfUnit-3 Data Analytics.pdf
Unit-3 Data Analytics.pdf
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.ppt
 
Priya
PriyaPriya
Priya
 
Factor analysis
Factor analysis Factor analysis
Factor analysis
 
Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8Biostatistics and Research Methodology Semester 8
Biostatistics and Research Methodology Semester 8
 
Linear regression
Linear regressionLinear regression
Linear regression
 
Regression analysis ppt
Regression analysis pptRegression analysis ppt
Regression analysis ppt
 
cannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdfcannonicalpresentation-110505114327-phpapp01.pdf
cannonicalpresentation-110505114327-phpapp01.pdf
 
Singular Value Decomposition (SVD).pptx
Singular Value Decomposition (SVD).pptxSingular Value Decomposition (SVD).pptx
Singular Value Decomposition (SVD).pptx
 
EDAB Module 5 Singular Value Decomposition (SVD).pptx
EDAB Module 5 Singular Value Decomposition (SVD).pptxEDAB Module 5 Singular Value Decomposition (SVD).pptx
EDAB Module 5 Singular Value Decomposition (SVD).pptx
 
Analysis of data (pratik)
Analysis of data (pratik)Analysis of data (pratik)
Analysis of data (pratik)
 
Factor analysis ppt
Factor analysis pptFactor analysis ppt
Factor analysis ppt
 
An Introduction to Factor analysis ppt
An Introduction to Factor analysis pptAn Introduction to Factor analysis ppt
An Introduction to Factor analysis ppt
 
Anomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep LearningAnomaly detection: Core Techniques and Advances in Big Data and Deep Learning
Anomaly detection: Core Techniques and Advances in Big Data and Deep Learning
 
Machine learning meetup
Machine learning meetupMachine learning meetup
Machine learning meetup
 

Recently uploaded

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...Marc Dusseiller Dusjagr
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTiammrhaywood
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupJonathanParaisoCruz
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Celine George
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...JhezDiaz1
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for BeginnersSabitha Banu
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...jaredbarbolino94
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxAvyJaneVismanos
 

Recently uploaded (20)

Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
“Oh GOSH! Reflecting on Hackteria's Collaborative Practices in a Global Do-It...
 
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPTECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
ECONOMIC CONTEXT - LONG FORM TV DRAMA - PPT
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
MARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized GroupMARGINALIZATION (Different learners in Marginalized Group
MARGINALIZATION (Different learners in Marginalized Group
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17Difference Between Search & Browse Methods in Odoo 17
Difference Between Search & Browse Methods in Odoo 17
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)ESSENTIAL of (CS/IT/IS) class 06 (database)
ESSENTIAL of (CS/IT/IS) class 06 (database)
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
ENGLISH 7_Q4_LESSON 2_ Employing a Variety of Strategies for Effective Interp...
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
Full Stack Web Development Course for Beginners
Full Stack Web Development Course  for BeginnersFull Stack Web Development Course  for Beginners
Full Stack Web Development Course for Beginners
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...Historical philosophical, theoretical, and legal foundations of special and i...
Historical philosophical, theoretical, and legal foundations of special and i...
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Final demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptxFinal demo Grade 9 for demo Plan dessert.pptx
Final demo Grade 9 for demo Plan dessert.pptx
 

QSAR statistical methods for drug discovery(pharmacology m.pharm2nd sem)

  • 1. QSAR STATISTICAL METHODS PRESENTED BY-GAYATRI SATI CLASS-M.PHARMA-2nd sem (PHARMACOLOGY)
  • 2. TABLE OF CONTENTS 1. INTRODUCTION OF QSAR 2. QSAR STATISTICAL METHODS 3. REGRESSION ANALYSIS 4. APPLICATION OF REGRESSION ANALYSIS 5. PARTIAL LEAST SQUARE ANALYSIS 6. APLICATION OF PLS 7. OTHER METHODS 8. REFERENCES
  • 3. INTRODUCTION OF QSAR • Quantitative structure activity relationship (QSAR) is a strategy of the essential importance for chemistry and pharmacy, based on the idea that when we change a structure of a molecule then also the activity or property of the substance will be modified. • QSAR are mathematical relationships between the physicochemical properties and pharmacological/biological activity in a quantitative manner for a series of compound. Biological activity=f (physicochemical properties and /or structure properties) • Statistics is a branch of mathematics dealing with data collection, organization, analysis, interpretation and presentation
  • 4.
  • 6. INDTRODUCTION REGRESSION ANALYSIS • In statistical modeling, regression analysis is a set of statistical processes for estimating the relationships among variables. • Regression analysis correlates independent X variables with dependent Y variables. • If two variables are involved, the variable that is basis of estimation is called the independent variable and the variable whose value is to be estimated is called as dependent variable. • For any given values of X, the Y values are independent and follow a normal distribution curve.
  • 7. DEFINITION OF REGRESSION ANALYSIS Regression analysis is a technique of studying the dependence of one variable (called dependent Y variable e.g. biological data) on one or more variables (called independent X variable e.g. physicochemical parameters) with a view to estimate or predict the average value of dependent variable in terms of known or fixed values of the independent variable. The dependent variable is also called as- »Explained »Response »Endogenous The independent variable is also called as- »Explanatory »Regressor »Exogenous
  • 8. REGRESSION MODELS • Regression models involve the following parameters and variables. The unknown parameter known as β, which may be a scalar or vector • A regression model relates Y to a function of X and β Y ≈ f (X, β) where; f = function β = unknown parameter X=independent variable Y=dependent variable
  • 9. Assume now that the vector of unknown parameters β is of length K, In order to perform a regression analysis the user must provide information about the dependent variable Y  If N data points of the form (Y, X) are observed, where N < K, most classical approaches to regression analysis cannot be performed. If N = K data points are observed, and the function f is linear, the equations Y ≈ f (X, β) can be solved exactly rather than approximately. If N > K data points are observed, there is enough information in the data to estimate the unique value for β.
  • 11. SIMPLE LINEAR REGRESSION MODEL • In simple linear regression there is only single explanatory variable • Simple linear regression is applied when you to want to predict the value of one variable, given values of other variables. 0 10 20 30 40 50 60 0 10 20 30 40 50 60 WEIGHT HEIGHT Linear regression fit plot
  • 12. SIMPLE LINEAR REGRESSION Simple linear regression for a derivation of these formulas Yᵢ = β̥+ β1 Xᵢ + εᵢ Where, Yᵢ=Dependent variable β̥=Population Y intercept β1= Population slope coefficient linear component Xᵢ=Independent variable εᵢ=Random error term Random error component
  • 13. MULTIPLE LINEAR REGRESSION • Multiple linear regression is the same idea as simple linear regression, except how you have several independent variables predicting the dependent variables • It is used when we want to predict the value of a variable based on the value of two or more other variables Y=β ̥+ β1X1 + β2 X2 +……….+ βn Xn + ε Where, N=number of variable β̥=intercept term βn=Coefficients for independent variable β= unknown parameter
  • 14. USES OF REGRESSION ANALYSIS 1. Regression analysis helps in establishing the relationship between two or more variables 2. Regression analysis predicts the value of dependent variables from the values of independent variables 3. Coefficient of correlation and coefficient of determination can be calculated with the help of regression analysis 4. Regression analysis is widely used as statistical tool in QSAR.
  • 15. PARTIAL LEAST SQUARE ANALYSIS(PLS) • Partial least square analysis (PLS) is a method for constructing predictive models when the factors are many and collinear • It is a recent technique that generalizes and combines features from principal component analysis and multiple regression • Goal-predict set of dependent variables Y from a set of independent variables X describe their common structure • Used to Find the fundamental relations between the two variables/matrices (X and Y) • COMPACT (computer optimized molecular parametric analysis of chemical toxicity), a PLS approach, is described to predict carcinogenicity and other forms of toxicity.
  • 16. SOFTWARES USED IN PLS Its application depends on the availability of software • SIMCA-P • UNSCRAMBLER • SPM • SAS PROC PLS
  • 17. APPLICATIONS OF PLS • PLS is used to find the fundamental relations between two matrices (X and Y) • PLS model will try to find the maximum multidirectional direction in the X space and the maximum multidimensional direction in the Y space • PLS regression is widely used in chemo metrics especially in the case where the number of independent variables is significantly larger than the number of data points and related areas • It is also used in bioinformatics, sensometrics, neuroscience and anthropology.
  • 18. OTHER MULTIVARIABLE STATISTICAL MODELS 1. Cluster analysis 2. Principal component analysis 3. Regression based analysis methods a) Ordinary least square regression b) Generalized linear models
  • 19. 1. CLUSTER ANALYSIS • Cluster analysis is a group of multivariate techniques whose primary purpose is to group objects based on the characteristics they possess. • In cluster analysis, the grouping is based on the distance (proximity) • It is the main task of exploratory data mining, statistical data analysis, pattern recognition, image analysis, bioinformatics, data compression and computer graphics
  • 20. ROLE &APPLICATIONS OF CLUSTER ANALYSIS ROLES- 1. Data reduction 2. Hypotheses generation APPLICATIONS- 1. Medicine 2. Analysis of antimicrobial activity 3. Biology & bioinformatics 4. Field of psychiatry 5. Climate 6. Sequence analysis 7. Crime analysis & transcriptomic
  • 21. 2. PRINCIPAL COMPONENT ANALYSIS • It is a exploratory technique used to reduce the dimensionality of data set to 2D or 3D • PCA is a procedure that transforms a number of possibly correlated variables into a smaller number of uncorrelated variables called principal components • Objective of PCA:- PCA is a dimensionality reduction or data compression method • Goal of PCA:- To select a subset of variables from a larger set, based on which original variables have the highest correlations with the principal component
  • 22. APPLICATIONS OF PCA 1. Neuroscience: A variant of PCA is used in neuroscience to identify the specific properties of a stimulus that increase a neuron’s probability of generating an action potential. This technique is known as spike triggered covariance analysis. In neuroscience, PCA is also used to discern the identify of a neuron from the shape of its action potential. 2. Quantitative finance: PCA can be directly applied to the risk management of interest rate derivatives portfolios.
  • 23. 3. REGRESSION BASED ANALYSIS a) Ordinary least squares:- • In statistics, ordinary least squares (OLS) is a type of linear least squares method for estimating the unknown parameters in a linear regression model. • OLS is used in fields as diverse as economics (econometrics), data science, political science, psychology and engineering (control theory and signal processing) b) Generalized linear model:- • In statistics, the generalized linear model (GLM) is a flexible generalization of ordinary linear regression that allows for response variables that have error distribution models other than a normal distribution.
  • 24. REFERENCES 1. https://en.wikipedia.org/wiki/Statistics 2. www.statstutor.ac.uk/resources/uploaded/1introduction3.pdf 3. http://home.iitk.ac.in/~kundu/Statistical 4. Methods.pdfhttps://en.wikipedia.org/wiki/Regression_analysis#Linear_regression
  • 25. ”NEVER TRUST A STATISTICS YOU DIDN’T FORGE YOURSELF ” -WINSTON CHURCHILL