SlideShare a Scribd company logo
1 of 58
CORRELATION &REGRESSION
ANALYSIS Using SPSS
Dr Parag Shah | M.Sc., M.Phil., Ph.D. ( Statistics)
www.paragstatistics.wordpress.com
Correlation
Correlation analysis is used to study the strength of
relationship between two or more quantitative
variables. Correlation shows the degree of linear
dependence between the two variables.
Correlation doesn’t imply causation.
If variables are not related by cause and effect
relationship but show correlation then such
correlation is called Spurious or Non-sense
correlation.
Correlation
Correlation can be positive, negative or zero
depending on the change between two variables.
If the change in two variables is in the same
direction it is positive correlation.
If the change in two variables is in the opposite
direction it is negative correlation.
If the change in one variable does not affect the
change in the other variable it is zero correlation.
Correlation
Coefficient
Correlation coefficient (r) is the measure of extent
of correlation between two variables.
There are several types of correlation coefficient
but the most popular is Karl Pearson’s correlation
coefficient.
Testing
Correlation
Coefficient
Null Hypothesis H0: 𝜌 = 0
[There is no significant linear correlation between two variables]
Alternative Hypothesis H1: πœŒβ‰  0
[There is significant linear correlation between two variables]
Test statistics: 𝐭 =
π‘Ÿ π‘›βˆ’2
1βˆ’π‘Ÿ2
The test statistics t follows Student’s t distribution with 𝒏 βˆ’ 𝟐
degrees of freedom.
Case Study
The body temperature (in 0
𝐹) for 100 adults were measured along with
their gender, age, and heart rate. Data: body_temp.xlsx .
Obtain correlation coefficient between body temperature and heart rate.
Also check its significance.
Null & Alternative
Hypothesis
Null Hypothesis H0: 𝜌 = 0
[There is no significant linear correlation between body
temperature and heart rate]
Alternative Hypothesis H1: πœŒβ‰  0
[There is significant linear correlation between body temperature
and heart rate]
Test Statistics t
and p value
Test Statistics t
and p value
Correlation coefficient (r) between two variables heart rate
and temperature is 0.448.
Here p value = 0.000 < 0.05, so null hypothesis is rejected.
Thus, there is significant linear correlation between Heart rate
and Temperature
Regression
Regression analysis is a set of statistical processes
for estimating the relationships between
a dependent variable (often called the 'outcome' or
'response' variable) and one or more independent
variables (often called 'predictors', 'covariates',
'explanatory variables' or 'features’).
Regression
Analysis
Regression analysis helps you understand how the
dependent variable changes when one of the
independent variables varies and allows to
mathematically determine which of those
variables really has an impact.
Regression analysis includes several variations,
such as linear, multiple linear, and nonlinear. The
most common models are simple linear and
multiple linear.
Types of Regression
Dependent variable Independent variable Type of
Regression
Relationship
between variables
One
(Scale )
One
(Scale)
Simple Linear Linear
One
(Scale)
Two or more
(Continuous / Categorical)
Multiple Linear Linear
One
( Categorical – binary)
Two or more
(Continuous / Categorical)
Logistic Need not be linear
One
( Categorical )
Two or more
(Continuous / Categorical)
Multinomial
Logistic
Need not be linear
Simple
Regression
The simple linear regression model is used to predict one
response (dependent) variable based on one predictor
(independent) variable.
The linear regression model can be stated as follows
𝑦𝑖 = 𝛽0 + 𝛽1π‘₯𝑖 + 𝑒𝑖 , 𝑖 = 1, 2, Β· Β· Β· , n.
where
β€’ 𝑦𝑖 is value of the response variable,
β€’ π‘₯𝑖 is the value of the predictor variable,
β€’ 𝛽0 , 𝛽1are the parameters (regression coefficients),
β€’ 𝑒𝑖 is random error term with E(𝑒𝑖 ) = 0 and V (𝑒𝑖 ) = 𝜎2.
Random Error
for this Xi value
Y
X
Observed Value
of Y for Xi
Predicted Value
of Y for Xi
i
i
1
0
i Ξ΅
x
Ξ²
Ξ²
y 

ο€½
Xi
Slope = Ξ²1
Intercept = Ξ²0
Ξ΅i
Graphical representation
Assumptions of
Simple
Regression
The four important assumptions for a simple linear
regression model are :
β€’ The regression model is Linear in parameter.
β€’ The errors are Independently distributed.
β€’ The errors are Normally distributed.
β€’ The errors have Equal variances. i.e. V (𝑒𝑖 ) = 𝜎2
.
( Homoscedasticity)
Method
The best line of fit can be obtained by the method of
least squares. It calculates the best line of fit for the
observed data by minimizing the sum of squares of the
vertical deviations from each data point to the line,
i.e., (𝑦𝑖 βˆ’ 𝑦𝑖)2
Total variation is made up of two parts:
SSE
SSR
SST 
ο€½
Total Sum of
Squares
Regression Sum
of Squares
Error Sum of
Squares
οƒ₯ ο€­
ο€½ 2
i )
Y
Y
(
SST οƒ₯ ο€­
ο€½ 2
i
i )
YΜ‚
Y
(
SSE
οƒ₯ ο€­
ο€½ 2
i )
Y
YΜ‚
(
SSR
where: = Mean value of the dependent variable
Yi = Observed value of the dependent variable
= Predicted value of Y for the given Xi value
i
Y
Λ†
Y
β€’ SST = total sum of squares (Total Variation)
β€’ Measures the variation of the Yi values around their mean π‘Œ
β€’ SSR = regression sum of squares (Explained Variation)
β€’ Variation attributable to the relationship between X and Y
β€’ SSE = error sum of squares (Unexplained Variation)
β€’ Variation in Y attributable to factors other than X
Measures of Variations
Xi
Y
X
Yi
SST = οƒ₯(Yi - Y)2
SSE = οƒ₯(Yi - Yi )2

SSR = οƒ₯(Yi - Y)2

_
_
_
Y

Y
Y
_
Y

Measures of Variations
The Coefficient of determination is the portion of the total variation in the
dependent variable that is explained by variation in the independent variable.
The coefficient of determination is denoted as R2
1
R
0 2
ο‚£
ο‚£
Note:
ο€½
ο€½
SST
SSR
R2
Coefficient of Determination
π‘…π‘’π‘”π‘Ÿπ‘’π‘ π‘ π‘–π‘œπ‘› π‘ π‘’π‘š π‘œπ‘“ π‘ π‘žπ‘’π‘Žπ‘Ÿπ‘’π‘ 
π‘‡π‘œπ‘‘π‘Žπ‘™ π‘†π‘’π‘š π‘œπ‘“ π‘ π‘žπ‘’π‘Žπ‘Ÿπ‘’π‘ 
The Adjusted R-squared is a modified
version of R-squared that adjusts for
predictors that are not significant in a
regression model.
Adjusted R Square
R-squared increases every time you add an
independent variable to the model. Adjusted R-
squared value increases only when the new term
improves the model fit more than expected by
chance alone. The adjusted R-squared value
actually decreases when the term doesn’t
improve the model fit by a sufficient amount.
Multiple
Regression
The multiple linear regression model is used to predict a
response (independent) variable based on two or more
predictor variable (dependent) variable.
The multiple linear regression model can be stated as follows
𝑦𝑖 = 𝛽0 + 𝛽1π‘₯𝑖1 + 𝛽2π‘₯𝑖2 + β‹― … … + 𝛽𝑝π‘₯𝑖𝑝 + 𝑒𝑖 , 𝑖 = 1,2, Β· Β· , n.
where
β€’ 𝑦𝑖 is π‘–π‘‘β„Žvalue of the response variable,
β€’ π‘₯𝑖𝑗 is the π‘–π‘‘β„Ž
observation of π‘—π‘‘β„Ž
predictor variable,
β€’ 𝛽0, 𝛽1, 𝛽2 …. 𝛽𝑝 are the parameters (regression coefficients),
β€’ 𝑒𝑖 is random error term with E(𝑒𝑖 ) = 0 and V (𝑒𝑖 ) = 𝜎2
.
Case Study 1
The body temperature (in 0
𝐹) for 100 adults were measured along with
their gender, age, and heart rate. The data is stored in body_temp.xlsx file.
Built a linear regression model for body temperature using heart rate as a
predictor.
Regression
Regression
Multiple R = Correlation Coefficient = 0.45
R Square = Coefficient of Determination = 0.20
R Square = 0.20 shows that 20% of variations in temperature due to Heart Rate.
Model Summary
p value = 0 < 0.05.
So, there is enough evidence that fitted regression model is significant.
The regression model predicts the dependent variable – Temperature,
significantly well.
ANOVA
H0: 𝛽1=0 [Regression coefficient for Heart Rate is
not significant]
H1: 𝛽1β‰  0 [Regression coefficient for Heart Rate is
significant]
p value of regression coefficient of Heart Rate = 0
< 0.05, H0 is rejected.
So , regression coefficient of Heart Rate is
significant.
Regression Coefficients
Regression Model:
Temperature = 92.391 + 0.081 Heart Rate
Checking
Assumptions
β€’ The regression model is Linear in parameter.
β€’ The errors are Independently distributed.
β€’ The errors are Normally distributed.
β€’ The errors have Equal variances. That is V (𝑒𝑖 ) = 𝜎2
.
( Homoscedasticity)
Linearity Assumption
Linearity Assumption
Assumption - Errors are Independently distributed
Assumption - Errors are Independently distributed
Value of Durbin-Watson is
1.804,which is close to 2.
So, the assumption that errors
are independently distributed is
met
Normality & Homoscedasticity Assumptions
Normality Assumptions
Points are very close to the
diagonal line, so the variable -
temperature is normally distributed
Homoscedastic Assumptions
The data does not have an obvious
pattern, there are points equally
distributed above and below zero on the
X axis, and to the left and right of zero
on the Y axis.
So homoscedasticity assumption is met.
Case Study 2
The data were collected on a simple random sample of 20
patients with hypertension. The dataset is in arterialBp.csv.
The variables are
Y = mean arterial blood pressure (mm Hg)
X1 = age (years), X2 = weight (kgs)
X3 = body surface area (sq. m)
X4 = duration of hypertension (years)
X5 = basal pulse (beats /min), X6 = measure of stress
Fit an appropriate regression equation.
Case Study 2
Regression
Regression
Multiple R = Correlation Coefficient = 0.997
R Square = Coefficient of Determination = 0.995
R Square = 0.995 shows that 99.5% of variations in blood pressure is due to age,
weight, bsa, hypertension, pulse and stress.
Model Summary
p value = 0 < 0.05.
So, there is enough evidence that fitted regression model is significant.
The regression model predicts the dependent variable – blood pressure,
significantly well.
ANOVA
Regression Coefficients
Running the regression again after removing the insignificant variables:
hyper, pulse and stress
Multiple R = Correlation Coefficient = 0.997
R Square = Coefficient of Determination = 0.993
R Square = 0.993 shows that 99.3% of variations in blood pressure is due to age,
weight, bsa.
Model Summary
p value = 0 < 0.05.
So, there is enough evidence that fitted regression model is significant.
The regression model predicts the dependent variable – blood pressure,
significantly well.
ANOVA
Regression Coefficients
Regression Model:
Bp = -13.401 + 0.718 * Age + 0.896 * weight + 4.553 * bsa
Checking
Assumptions
β€’ The regression model is Linear in parameter.
β€’ The errors are Independently distributed.
β€’ The errors are Normally distributed.
β€’ The errors have Equal variances. That is V (𝑒𝑖 ) = 𝜎2
.
( Homoscedasticity)
β€’ There is no Multicollinearity
(No significant correlation between independent variables)
Linearity Assumptions
Linearity Assumptions
Linearity Assumptions
Normality & Homoscedasticity Assumptions
Normality Assumptions
Points are very close to the
diagonal line, so the variable - Bp is
normally distributed
Homoscedastic Assumptions
The data does not have an obvious
pattern, there are points equally
distributed above and below zero on the
X axis, and to the left and right of zero
on the Y axis.
So homoscedasticity assumption is met.
Assumption - Errors are Independently distributed
Assumption - Errors are Independently distributed
Value of Durbin-Watson is
1.537,which is close to 2.
So, the assumption that errors
are independently distributed
is met
Multicollinearity Assumptions
Multicollinearity Assumptions
Variance Inflation Factor(VIF) for all variables lie between 1 & 10, so there is no
multicollinearity. i.e. independent variables are do not have significant correlation between
them.
THANK YOU
Dr Parag Shah | M.Sc., M.Phil., Ph.D. ( Statistics)
www.paragstatistics.wordpress.com

More Related Content

What's hot

Simple Linier Regression
Simple Linier RegressionSimple Linier Regression
Simple Linier Regressiondessybudiyanti
Β 
Regression
Regression Regression
Regression Ali Raza
Β 
Chapter8
Chapter8Chapter8
Chapter8Vu Vo
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression AnalysisASAD ALI
Β 
Simple Linear Regression
Simple Linear RegressionSimple Linear Regression
Simple Linear RegressionYesica Adicondro
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression AnalysisSalim Azad
Β 
Linear regression and correlation analysis ppt @ bec doms
Linear regression and correlation analysis ppt @ bec domsLinear regression and correlation analysis ppt @ bec doms
Linear regression and correlation analysis ppt @ bec domsBabasab Patil
Β 
Regression analysis.
Regression analysis.Regression analysis.
Regression analysis.sonia gupta
Β 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regressionalok tiwari
Β 
Linear regression theory
Linear regression theoryLinear regression theory
Linear regression theorySaurav Mukherjee
Β 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysisRabin BK
Β 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regressionpankaj8108
Β 
Regression analysis
Regression analysisRegression analysis
Regression analysisbijuhari
Β 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionMohit Asija
Β 

What's hot (20)

Regression analysis
Regression analysisRegression analysis
Regression analysis
Β 
Simple Linier Regression
Simple Linier RegressionSimple Linier Regression
Simple Linier Regression
Β 
Regression
Regression Regression
Regression
Β 
Chapter8
Chapter8Chapter8
Chapter8
Β 
Regression analysis on SPSS
Regression analysis on SPSSRegression analysis on SPSS
Regression analysis on SPSS
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Β 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
Β 
Simple Linear Regression
Simple Linear RegressionSimple Linear Regression
Simple Linear Regression
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Β 
Linear regression and correlation analysis ppt @ bec doms
Linear regression and correlation analysis ppt @ bec domsLinear regression and correlation analysis ppt @ bec doms
Linear regression and correlation analysis ppt @ bec doms
Β 
Correlation Analysis
Correlation AnalysisCorrelation Analysis
Correlation Analysis
Β 
Regression analysis.
Regression analysis.Regression analysis.
Regression analysis.
Β 
Presentation On Regression
Presentation On RegressionPresentation On Regression
Presentation On Regression
Β 
Linear regression theory
Linear regression theoryLinear regression theory
Linear regression theory
Β 
Statistics-Regression analysis
Statistics-Regression analysisStatistics-Regression analysis
Statistics-Regression analysis
Β 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
Β 
Regression analysis
Regression analysisRegression analysis
Regression analysis
Β 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
Β 
Hypothesis testing
Hypothesis testingHypothesis testing
Hypothesis testing
Β 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
Β 

Similar to Correlation & Regression Analysis using SPSS

Regression &amp; correlation coefficient
Regression &amp; correlation coefficientRegression &amp; correlation coefficient
Regression &amp; correlation coefficientMuhamamdZiaSamad
Β 
STATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELSSTATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELSAneesa K Ayoob
Β 
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsData Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsDerek Kane
Β 
Simple Regression.pptx
Simple Regression.pptxSimple Regression.pptx
Simple Regression.pptxVictoria Bozhenko
Β 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...Smarten Augmented Analytics
Β 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...Smarten Augmented Analytics
Β 
Quantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA ProgramQuantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA ProgramMohamed Farouk, CFA, CFTe I
Β 
Research Methodology Module-06
Research Methodology Module-06Research Methodology Module-06
Research Methodology Module-06Kishor Ade
Β 
Biostatistics Lecture on Correlation.pptx
Biostatistics Lecture on Correlation.pptxBiostatistics Lecture on Correlation.pptx
Biostatistics Lecture on Correlation.pptxFantahun Dugassa
Β 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression Dr. Tushar J Bhatt
Β 
Chapter13
Chapter13Chapter13
Chapter13rwmiller
Β 
Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxAnusuya123
Β 
Unit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdfUnit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdfRavinandan A P
Β 
MSC III_Research Methodology and Statistics_Inferrential ststistics.pdf
MSC III_Research Methodology and Statistics_Inferrential ststistics.pdfMSC III_Research Methodology and Statistics_Inferrential ststistics.pdf
MSC III_Research Methodology and Statistics_Inferrential ststistics.pdfSuchita Rawat
Β 
Stat 1163 -correlation and regression
Stat 1163 -correlation and regressionStat 1163 -correlation and regression
Stat 1163 -correlation and regressionKhulna University
Β 
Regression
RegressionRegression
RegressionSauravurp
Β 

Similar to Correlation & Regression Analysis using SPSS (20)

Regression &amp; correlation coefficient
Regression &amp; correlation coefficientRegression &amp; correlation coefficient
Regression &amp; correlation coefficient
Β 
Ders 2 ols .ppt
Ders 2 ols .pptDers 2 ols .ppt
Ders 2 ols .ppt
Β 
STATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELSSTATISTICAL REGRESSION MODELS
STATISTICAL REGRESSION MODELS
Β 
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic NetsData Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Data Science - Part XII - Ridge Regression, LASSO, and Elastic Nets
Β 
Simple Regression.pptx
Simple Regression.pptxSimple Regression.pptx
Simple Regression.pptx
Β 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
Β 
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...What is Simple Linear Regression and How Can an Enterprise Use this Technique...
What is Simple Linear Regression and How Can an Enterprise Use this Technique...
Β 
Regression ppt
Regression pptRegression ppt
Regression ppt
Β 
Quantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA ProgramQuantitative Methods - Level II - CFA Program
Quantitative Methods - Level II - CFA Program
Β 
Research Methodology Module-06
Research Methodology Module-06Research Methodology Module-06
Research Methodology Module-06
Β 
Biostatistics Lecture on Correlation.pptx
Biostatistics Lecture on Correlation.pptxBiostatistics Lecture on Correlation.pptx
Biostatistics Lecture on Correlation.pptx
Β 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression
Β 
Chapter13
Chapter13Chapter13
Chapter13
Β 
Unit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptxUnit-III Correlation and Regression.pptx
Unit-III Correlation and Regression.pptx
Β 
Regression -Linear.pptx
Regression -Linear.pptxRegression -Linear.pptx
Regression -Linear.pptx
Β 
Unit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdfUnit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdf
Β 
MSC III_Research Methodology and Statistics_Inferrential ststistics.pdf
MSC III_Research Methodology and Statistics_Inferrential ststistics.pdfMSC III_Research Methodology and Statistics_Inferrential ststistics.pdf
MSC III_Research Methodology and Statistics_Inferrential ststistics.pdf
Β 
Stat 1163 -correlation and regression
Stat 1163 -correlation and regressionStat 1163 -correlation and regression
Stat 1163 -correlation and regression
Β 
Correlations
CorrelationsCorrelations
Correlations
Β 
Regression
RegressionRegression
Regression
Β 

More from Parag Shah

Basic Statistics in 1 hour.pptx
Basic Statistics in 1 hour.pptxBasic Statistics in 1 hour.pptx
Basic Statistics in 1 hour.pptxParag Shah
Β 
Non- Parametric Tests
Non- Parametric TestsNon- Parametric Tests
Non- Parametric TestsParag Shah
Β 
Proportion test using Chi square
Proportion test using Chi squareProportion test using Chi square
Proportion test using Chi squareParag Shah
Β 
Chi square tests using spss
Chi square tests using spssChi square tests using spss
Chi square tests using spssParag Shah
Β 
Chi square tests using SPSS
Chi square tests using SPSSChi square tests using SPSS
Chi square tests using SPSSParag Shah
Β 
t test using spss
t test using spsst test using spss
t test using spssParag Shah
Β 
Basics of Hypothesis testing for Pharmacy
Basics of Hypothesis testing for PharmacyBasics of Hypothesis testing for Pharmacy
Basics of Hypothesis testing for PharmacyParag Shah
Β 
Exploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical SciencesExploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical SciencesParag Shah
Β 
Probability
Probability    Probability
Probability Parag Shah
Β 
Basic stat analysis using excel
Basic stat analysis using excelBasic stat analysis using excel
Basic stat analysis using excelParag Shah
Β 
Statistical inference: Estimation
Statistical inference: EstimationStatistical inference: Estimation
Statistical inference: EstimationParag Shah
Β 
Small sample test
Small sample testSmall sample test
Small sample testParag Shah
Β 
F test and ANOVA
F test and ANOVAF test and ANOVA
F test and ANOVAParag Shah
Β 
Testing of hypothesis - Chi-Square test
Testing of hypothesis - Chi-Square testTesting of hypothesis - Chi-Square test
Testing of hypothesis - Chi-Square testParag Shah
Β 
Testing of hypothesis - large sample test
Testing of hypothesis - large sample testTesting of hypothesis - large sample test
Testing of hypothesis - large sample testParag Shah
Β 
Statistics for Physical Education
Statistics for Physical EducationStatistics for Physical Education
Statistics for Physical EducationParag Shah
Β 
Career option for stats
Career option for statsCareer option for stats
Career option for statsParag Shah
Β 

More from Parag Shah (17)

Basic Statistics in 1 hour.pptx
Basic Statistics in 1 hour.pptxBasic Statistics in 1 hour.pptx
Basic Statistics in 1 hour.pptx
Β 
Non- Parametric Tests
Non- Parametric TestsNon- Parametric Tests
Non- Parametric Tests
Β 
Proportion test using Chi square
Proportion test using Chi squareProportion test using Chi square
Proportion test using Chi square
Β 
Chi square tests using spss
Chi square tests using spssChi square tests using spss
Chi square tests using spss
Β 
Chi square tests using SPSS
Chi square tests using SPSSChi square tests using SPSS
Chi square tests using SPSS
Β 
t test using spss
t test using spsst test using spss
t test using spss
Β 
Basics of Hypothesis testing for Pharmacy
Basics of Hypothesis testing for PharmacyBasics of Hypothesis testing for Pharmacy
Basics of Hypothesis testing for Pharmacy
Β 
Exploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical SciencesExploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Exploratory Data Analysis for Biotechnology and Pharmaceutical Sciences
Β 
Probability
Probability    Probability
Probability
Β 
Basic stat analysis using excel
Basic stat analysis using excelBasic stat analysis using excel
Basic stat analysis using excel
Β 
Statistical inference: Estimation
Statistical inference: EstimationStatistical inference: Estimation
Statistical inference: Estimation
Β 
Small sample test
Small sample testSmall sample test
Small sample test
Β 
F test and ANOVA
F test and ANOVAF test and ANOVA
F test and ANOVA
Β 
Testing of hypothesis - Chi-Square test
Testing of hypothesis - Chi-Square testTesting of hypothesis - Chi-Square test
Testing of hypothesis - Chi-Square test
Β 
Testing of hypothesis - large sample test
Testing of hypothesis - large sample testTesting of hypothesis - large sample test
Testing of hypothesis - large sample test
Β 
Statistics for Physical Education
Statistics for Physical EducationStatistics for Physical Education
Statistics for Physical Education
Β 
Career option for stats
Career option for statsCareer option for stats
Career option for stats
Β 

Recently uploaded

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% SecurePooja Nehwal
Β 
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Callshivangimorya083
Β 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxolyaivanovalion
Β 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...Pooja Nehwal
Β 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionfulawalesam
Β 
CALL ON βž₯8923113531 πŸ”Call Girls Chinhat Lucknow best sexual service Online
CALL ON βž₯8923113531 πŸ”Call Girls Chinhat Lucknow best sexual service OnlineCALL ON βž₯8923113531 πŸ”Call Girls Chinhat Lucknow best sexual service Online
CALL ON βž₯8923113531 πŸ”Call Girls Chinhat Lucknow best sexual service Onlineanilsa9823
Β 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightDelhi Call girls
Β 
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...amitlee9823
Β 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxfirstjob4
Β 
Call Girls 🫀 Dwarka ➑️ 9711199171 ➑️ Delhi 🫦 Two shot with one girl
Call Girls 🫀 Dwarka ➑️ 9711199171 ➑️ Delhi 🫦 Two shot with one girlCall Girls 🫀 Dwarka ➑️ 9711199171 ➑️ Delhi 🫦 Two shot with one girl
Call Girls 🫀 Dwarka ➑️ 9711199171 ➑️ Delhi 🫦 Two shot with one girlkumarajju5765
Β 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxolyaivanovalion
Β 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra
Β 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Delhi Call girls
Β 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxolyaivanovalion
Β 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAroojKhan71
Β 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
Β 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
Β 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfadriantubila
Β 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxolyaivanovalion
Β 

Recently uploaded (20)

Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% SecureCall me @ 9892124323  Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Call me @ 9892124323 Cheap Rate Call Girls in Vashi with Real Photo 100% Secure
Β 
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip CallDelhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Delhi Call Girls CP 9711199171 β˜Žβœ”πŸ‘Œβœ” Whatsapp Hard And Sexy Vip Call
Β 
Edukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFxEdukaciniai dropshipping via API with DroFx
Edukaciniai dropshipping via API with DroFx
Β 
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...{Pooja:  9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
{Pooja: 9892124323 } Call Girl in Mumbai | Jas Kaur Rate 4500 Free Hotel Del...
Β 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Β 
Week-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interactionWeek-01-2.ppt BBB human Computer interaction
Week-01-2.ppt BBB human Computer interaction
Β 
CALL ON βž₯8923113531 πŸ”Call Girls Chinhat Lucknow best sexual service Online
CALL ON βž₯8923113531 πŸ”Call Girls Chinhat Lucknow best sexual service OnlineCALL ON βž₯8923113531 πŸ”Call Girls Chinhat Lucknow best sexual service Online
CALL ON βž₯8923113531 πŸ”Call Girls Chinhat Lucknow best sexual service Online
Β 
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 nightCheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Cheap Rate Call girls Sarita Vihar Delhi 9205541914 shot 1500 night
Β 
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Junnasandra Call Girls: πŸ“ 7737669865 πŸ“ High Profile Model Escorts | Bangalore...
Β 
Introduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptxIntroduction-to-Machine-Learning (1).pptx
Introduction-to-Machine-Learning (1).pptx
Β 
Call Girls 🫀 Dwarka ➑️ 9711199171 ➑️ Delhi 🫦 Two shot with one girl
Call Girls 🫀 Dwarka ➑️ 9711199171 ➑️ Delhi 🫦 Two shot with one girlCall Girls 🫀 Dwarka ➑️ 9711199171 ➑️ Delhi 🫦 Two shot with one girl
Call Girls 🫀 Dwarka ➑️ 9711199171 ➑️ Delhi 🫦 Two shot with one girl
Β 
Smarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptxSmarteg dropshipping via API with DroFx.pptx
Smarteg dropshipping via API with DroFx.pptx
Β 
Sampling (random) method and Non random.ppt
Sampling (random) method and Non random.pptSampling (random) method and Non random.ppt
Sampling (random) method and Non random.ppt
Β 
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Best VIP Call Girls Noida Sector 22 Call Me: 8448380779
Β 
Carero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptxCarero dropshipping via API with DroFx.pptx
Carero dropshipping via API with DroFx.pptx
Β 
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al BarshaAl Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Al Barsha Escorts $#$ O565212860 $#$ Escort Service In Al Barsha
Β 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
Β 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
Β 
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdfAccredited-Transport-Cooperatives-Jan-2021-Web.pdf
Accredited-Transport-Cooperatives-Jan-2021-Web.pdf
Β 
CebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptxCebaBaby dropshipping via API with DroFX.pptx
CebaBaby dropshipping via API with DroFX.pptx
Β 

Correlation & Regression Analysis using SPSS

  • 1. CORRELATION &REGRESSION ANALYSIS Using SPSS Dr Parag Shah | M.Sc., M.Phil., Ph.D. ( Statistics) www.paragstatistics.wordpress.com
  • 2. Correlation Correlation analysis is used to study the strength of relationship between two or more quantitative variables. Correlation shows the degree of linear dependence between the two variables. Correlation doesn’t imply causation. If variables are not related by cause and effect relationship but show correlation then such correlation is called Spurious or Non-sense correlation.
  • 3. Correlation Correlation can be positive, negative or zero depending on the change between two variables. If the change in two variables is in the same direction it is positive correlation. If the change in two variables is in the opposite direction it is negative correlation. If the change in one variable does not affect the change in the other variable it is zero correlation.
  • 4. Correlation Coefficient Correlation coefficient (r) is the measure of extent of correlation between two variables. There are several types of correlation coefficient but the most popular is Karl Pearson’s correlation coefficient.
  • 5. Testing Correlation Coefficient Null Hypothesis H0: 𝜌 = 0 [There is no significant linear correlation between two variables] Alternative Hypothesis H1: πœŒβ‰  0 [There is significant linear correlation between two variables] Test statistics: 𝐭 = π‘Ÿ π‘›βˆ’2 1βˆ’π‘Ÿ2 The test statistics t follows Student’s t distribution with 𝒏 βˆ’ 𝟐 degrees of freedom.
  • 6. Case Study The body temperature (in 0 𝐹) for 100 adults were measured along with their gender, age, and heart rate. Data: body_temp.xlsx . Obtain correlation coefficient between body temperature and heart rate. Also check its significance.
  • 7. Null & Alternative Hypothesis Null Hypothesis H0: 𝜌 = 0 [There is no significant linear correlation between body temperature and heart rate] Alternative Hypothesis H1: πœŒβ‰  0 [There is significant linear correlation between body temperature and heart rate]
  • 9.
  • 10. Test Statistics t and p value Correlation coefficient (r) between two variables heart rate and temperature is 0.448. Here p value = 0.000 < 0.05, so null hypothesis is rejected. Thus, there is significant linear correlation between Heart rate and Temperature
  • 11. Regression Regression analysis is a set of statistical processes for estimating the relationships between a dependent variable (often called the 'outcome' or 'response' variable) and one or more independent variables (often called 'predictors', 'covariates', 'explanatory variables' or 'features’).
  • 12. Regression Analysis Regression analysis helps you understand how the dependent variable changes when one of the independent variables varies and allows to mathematically determine which of those variables really has an impact. Regression analysis includes several variations, such as linear, multiple linear, and nonlinear. The most common models are simple linear and multiple linear.
  • 13. Types of Regression Dependent variable Independent variable Type of Regression Relationship between variables One (Scale ) One (Scale) Simple Linear Linear One (Scale) Two or more (Continuous / Categorical) Multiple Linear Linear One ( Categorical – binary) Two or more (Continuous / Categorical) Logistic Need not be linear One ( Categorical ) Two or more (Continuous / Categorical) Multinomial Logistic Need not be linear
  • 14. Simple Regression The simple linear regression model is used to predict one response (dependent) variable based on one predictor (independent) variable. The linear regression model can be stated as follows 𝑦𝑖 = 𝛽0 + 𝛽1π‘₯𝑖 + 𝑒𝑖 , 𝑖 = 1, 2, Β· Β· Β· , n. where β€’ 𝑦𝑖 is value of the response variable, β€’ π‘₯𝑖 is the value of the predictor variable, β€’ 𝛽0 , 𝛽1are the parameters (regression coefficients), β€’ 𝑒𝑖 is random error term with E(𝑒𝑖 ) = 0 and V (𝑒𝑖 ) = 𝜎2.
  • 15. Random Error for this Xi value Y X Observed Value of Y for Xi Predicted Value of Y for Xi i i 1 0 i Ξ΅ x Ξ² Ξ² y   ο€½ Xi Slope = Ξ²1 Intercept = Ξ²0 Ξ΅i Graphical representation
  • 16. Assumptions of Simple Regression The four important assumptions for a simple linear regression model are : β€’ The regression model is Linear in parameter. β€’ The errors are Independently distributed. β€’ The errors are Normally distributed. β€’ The errors have Equal variances. i.e. V (𝑒𝑖 ) = 𝜎2 . ( Homoscedasticity)
  • 17. Method The best line of fit can be obtained by the method of least squares. It calculates the best line of fit for the observed data by minimizing the sum of squares of the vertical deviations from each data point to the line, i.e., (𝑦𝑖 βˆ’ 𝑦𝑖)2
  • 18. Total variation is made up of two parts: SSE SSR SST  ο€½ Total Sum of Squares Regression Sum of Squares Error Sum of Squares οƒ₯ ο€­ ο€½ 2 i ) Y Y ( SST οƒ₯ ο€­ ο€½ 2 i i ) YΜ‚ Y ( SSE οƒ₯ ο€­ ο€½ 2 i ) Y YΜ‚ ( SSR where: = Mean value of the dependent variable Yi = Observed value of the dependent variable = Predicted value of Y for the given Xi value i Y Λ† Y β€’ SST = total sum of squares (Total Variation) β€’ Measures the variation of the Yi values around their mean π‘Œ β€’ SSR = regression sum of squares (Explained Variation) β€’ Variation attributable to the relationship between X and Y β€’ SSE = error sum of squares (Unexplained Variation) β€’ Variation in Y attributable to factors other than X Measures of Variations
  • 19. Xi Y X Yi SST = οƒ₯(Yi - Y)2 SSE = οƒ₯(Yi - Yi )2  SSR = οƒ₯(Yi - Y)2  _ _ _ Y  Y Y _ Y  Measures of Variations
  • 20. The Coefficient of determination is the portion of the total variation in the dependent variable that is explained by variation in the independent variable. The coefficient of determination is denoted as R2 1 R 0 2 ο‚£ ο‚£ Note: ο€½ ο€½ SST SSR R2 Coefficient of Determination π‘…π‘’π‘”π‘Ÿπ‘’π‘ π‘ π‘–π‘œπ‘› π‘ π‘’π‘š π‘œπ‘“ π‘ π‘žπ‘’π‘Žπ‘Ÿπ‘’π‘  π‘‡π‘œπ‘‘π‘Žπ‘™ π‘†π‘’π‘š π‘œπ‘“ π‘ π‘žπ‘’π‘Žπ‘Ÿπ‘’π‘ 
  • 21. The Adjusted R-squared is a modified version of R-squared that adjusts for predictors that are not significant in a regression model. Adjusted R Square R-squared increases every time you add an independent variable to the model. Adjusted R- squared value increases only when the new term improves the model fit more than expected by chance alone. The adjusted R-squared value actually decreases when the term doesn’t improve the model fit by a sufficient amount.
  • 22. Multiple Regression The multiple linear regression model is used to predict a response (independent) variable based on two or more predictor variable (dependent) variable. The multiple linear regression model can be stated as follows 𝑦𝑖 = 𝛽0 + 𝛽1π‘₯𝑖1 + 𝛽2π‘₯𝑖2 + β‹― … … + 𝛽𝑝π‘₯𝑖𝑝 + 𝑒𝑖 , 𝑖 = 1,2, Β· Β· , n. where β€’ 𝑦𝑖 is π‘–π‘‘β„Žvalue of the response variable, β€’ π‘₯𝑖𝑗 is the π‘–π‘‘β„Ž observation of π‘—π‘‘β„Ž predictor variable, β€’ 𝛽0, 𝛽1, 𝛽2 …. 𝛽𝑝 are the parameters (regression coefficients), β€’ 𝑒𝑖 is random error term with E(𝑒𝑖 ) = 0 and V (𝑒𝑖 ) = 𝜎2 .
  • 23. Case Study 1 The body temperature (in 0 𝐹) for 100 adults were measured along with their gender, age, and heart rate. The data is stored in body_temp.xlsx file. Built a linear regression model for body temperature using heart rate as a predictor.
  • 26. Multiple R = Correlation Coefficient = 0.45 R Square = Coefficient of Determination = 0.20 R Square = 0.20 shows that 20% of variations in temperature due to Heart Rate. Model Summary
  • 27. p value = 0 < 0.05. So, there is enough evidence that fitted regression model is significant. The regression model predicts the dependent variable – Temperature, significantly well. ANOVA
  • 28. H0: 𝛽1=0 [Regression coefficient for Heart Rate is not significant] H1: 𝛽1β‰  0 [Regression coefficient for Heart Rate is significant] p value of regression coefficient of Heart Rate = 0 < 0.05, H0 is rejected. So , regression coefficient of Heart Rate is significant. Regression Coefficients Regression Model: Temperature = 92.391 + 0.081 Heart Rate
  • 29. Checking Assumptions β€’ The regression model is Linear in parameter. β€’ The errors are Independently distributed. β€’ The errors are Normally distributed. β€’ The errors have Equal variances. That is V (𝑒𝑖 ) = 𝜎2 . ( Homoscedasticity)
  • 32. Assumption - Errors are Independently distributed
  • 33. Assumption - Errors are Independently distributed Value of Durbin-Watson is 1.804,which is close to 2. So, the assumption that errors are independently distributed is met
  • 35. Normality Assumptions Points are very close to the diagonal line, so the variable - temperature is normally distributed
  • 36. Homoscedastic Assumptions The data does not have an obvious pattern, there are points equally distributed above and below zero on the X axis, and to the left and right of zero on the Y axis. So homoscedasticity assumption is met.
  • 37. Case Study 2 The data were collected on a simple random sample of 20 patients with hypertension. The dataset is in arterialBp.csv. The variables are Y = mean arterial blood pressure (mm Hg) X1 = age (years), X2 = weight (kgs) X3 = body surface area (sq. m) X4 = duration of hypertension (years) X5 = basal pulse (beats /min), X6 = measure of stress Fit an appropriate regression equation.
  • 41. Multiple R = Correlation Coefficient = 0.997 R Square = Coefficient of Determination = 0.995 R Square = 0.995 shows that 99.5% of variations in blood pressure is due to age, weight, bsa, hypertension, pulse and stress. Model Summary
  • 42. p value = 0 < 0.05. So, there is enough evidence that fitted regression model is significant. The regression model predicts the dependent variable – blood pressure, significantly well. ANOVA
  • 43. Regression Coefficients Running the regression again after removing the insignificant variables: hyper, pulse and stress
  • 44. Multiple R = Correlation Coefficient = 0.997 R Square = Coefficient of Determination = 0.993 R Square = 0.993 shows that 99.3% of variations in blood pressure is due to age, weight, bsa. Model Summary
  • 45. p value = 0 < 0.05. So, there is enough evidence that fitted regression model is significant. The regression model predicts the dependent variable – blood pressure, significantly well. ANOVA
  • 46. Regression Coefficients Regression Model: Bp = -13.401 + 0.718 * Age + 0.896 * weight + 4.553 * bsa
  • 47. Checking Assumptions β€’ The regression model is Linear in parameter. β€’ The errors are Independently distributed. β€’ The errors are Normally distributed. β€’ The errors have Equal variances. That is V (𝑒𝑖 ) = 𝜎2 . ( Homoscedasticity) β€’ There is no Multicollinearity (No significant correlation between independent variables)
  • 52. Normality Assumptions Points are very close to the diagonal line, so the variable - Bp is normally distributed
  • 53. Homoscedastic Assumptions The data does not have an obvious pattern, there are points equally distributed above and below zero on the X axis, and to the left and right of zero on the Y axis. So homoscedasticity assumption is met.
  • 54. Assumption - Errors are Independently distributed
  • 55. Assumption - Errors are Independently distributed Value of Durbin-Watson is 1.537,which is close to 2. So, the assumption that errors are independently distributed is met
  • 57. Multicollinearity Assumptions Variance Inflation Factor(VIF) for all variables lie between 1 & 10, so there is no multicollinearity. i.e. independent variables are do not have significant correlation between them.
  • 58. THANK YOU Dr Parag Shah | M.Sc., M.Phil., Ph.D. ( Statistics) www.paragstatistics.wordpress.com