SlideShare a Scribd company logo
1 of 37
Multiple Regression Analysis
Chapter 14
Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
McGraw-Hill/Irwin
LEARNING OBJECTIVES
LO 14-1 Describe the relationship between several independent variables
and a dependent variable using multiple regression analysis.
LO 14-2 Develop and interpret an ANOVA table.
LO 14-3 Compute and interpret measures of association in multiple
regression.
LO 14-4 Conduct a hypothesis test to determine whether a set of
regression coefficients differ from zero. – F-Test (Textbook Table E5 p.
777-780)
LO 14-5 Conduct a hypothesis test of each regression coefficient.
LO 14-6 Use residual analysis to evaluate the assumptions of multiple
regression analysis.
14-2
Multiple Regression Analysis (Many IV)
• X1 … Xk are the independent variables.
• a is the Y-intercept
• b1 is the net change in Y for each unit change in X1 holding X2 … Xk constant.
• The least squares criterion is used to develop this equation.
• Determining b1, b2, tedious, etc. use Excel
LO 14-1 multiple regression analysis.
14-3
Multiple Linear Regression—Example
Salsberry Realty sells homes along the east coast of
the United States. One of the questions
most frequently asked by prospective buyers
is: If we purchase this home, how much can
we expect to pay to heat it during the
winter?
Three variables are thought to relate to the heating
costs: (1) the mean daily outside
temperature, (2) the number of inches of
insulation in the attic, and (3) the age in
years of the furnace.
To investigate, Salsberry’s research department
selected a random sample of 20 recently
sold homes. It determined the cost to heat
each home last January, as well
^
Y X1 X2
X3
LO 14-1
14-4
Multiple Linear Regression — ANOVA Table
LO 14-2 Develop and interpret ANOVA table
14-5
Multiple Linear Regression — ANOVA Table
a
b3
b1
b2
LO 14-2
14-6
The Multiple Regression Equation—Interpreting the Measures
of Association and Applying the Model for Estimation
Interpreting the Regression Coefficients
Mean Outside Temp, X1 −4.583
Attic Insulation, X2 −14.83
Age of Furnace, X3 +6.10
What is the estimated heating cost for a home if the
mean outside temperature is 30 degrees, there are 5
inches of insulation in the attic, and the furnace is 10
years old?
LO 14-2
14-7
Multiple Standard Error of Estimate
The multiple standard
error of estimate is a
measure of the
effectiveness of the
regression equation.
 It is measured in the
same units as the
dependent variable.
 It is difficult to
determine what is a
large value and what
is a small value of the
standard error.
LO 14-2
14-8
Multiple Regression and
Correlation Assumptions
 The independent variables and the dependent variable
have a linear relationship.
 The dependent variable must be continuous and at least
interval-scale.
 The residual must be the same for all values of Y.
 The residuals should follow the normal distributed with
mean 0.
 Successive values of the dependent variable must be
uncorrelated.
LO 14-2
14-9
Coefficient of Multiple Determination (r2)
Coefficient of Multiple Determination:
1. Symbolized by r2.
2. Ranges from 0 to 1.
3. Cannot assume negative values.
4. Easy to interpret.
The Adjusted r2:
1. The number of independent
variables in a multiple regression
equation makes the coefficient
of determination larger.
2. Adjusted r2 is used instead to
balance the effect that the
number of independent
variables has on the coefficient
of multiple determination.
LO 14-3 Compute and interpret measures of
association in multiple regression.
14-10
Global Test: Testing the Multiple Regression Model
The global test is used to investigate whether any of the
independent variables have significant coefficients.
The hypotheses are:
0
equal
s
all
Not
:
0
...
:
1
2
1
0




H
H k 



Decision Rule: Reject H0 if
Computed F > Critical F (,k,n − k − 1)
LO 14-4 Conduct a hypothesis test to determine whether a set
of regression coefficients differ from zero.
14-11
Finding the Computed and Critical F
F (,k,n−k−1)
F (.05,3,16)
LO 14-4
Excel function: =finv(.05,3,16)
14-12
Comparing Means of Two or More
Populations – Example
LO 14-4
F distribution
d.f. of
Denominator
14-13
Interpretation
 The computed value of F is
21.90, which is in the rejection
region.
 The null hypothesis that all the
multiple regression coefficients
are zero is therefore rejected.
 Interpretation: Some of the
independent variables (amount
of insulation, etc.) do have the
ability to explain the variation in
the dependent variable (heating
cost).
 Logical question – which ones?
21.9
LO 14-4
14-14
 This test is used to determine which independent variables
have nonzero regression coefficients.
 The variables that have zero regression coefficients are usually
dropped from the analysis.
 The test statistic is the t distribution with n−(k+1) degrees of
freedom.
 The hypothesis test is as follows:
H0: βi = 0
H1: βi ≠ 0
Reject H0 if computed t > critical t or
computed t < −critical t
Evaluating Individual Regression Coefficients (βi = 0)
LO 14-5 Conduct a hypothesis test of each
regression coefficient.
14-15
Critical t for the Coefficients
-2.120 2.120
Reject H0 if computed t > critical t or computed t < −critical t
LO 14-5
14-16
Computed t for the Coefficients
LO 14-5
14-17
Conclusion on Significance of the Coefficients
−2.120 2.120
–5.93
(Temp)
–3.119
(Insulation)
1.521
(Age)
LO 14-5
14-18
New Regression Model without Variable “Age” – Minitab
LO 14-5
14-19
New Regression Model without Variable “Age” –
Minitab
LO 14-5
14-20
Testing the New Model for Significance
d.f. (2,17)
3.59
Computed F = 29.42
LO 14-5
14-21
Critical t-stat for the New Coefficients
−2.110 2.110
Reject H0 if computed t > critical t or computed t < −critical t
LO 14-5
14-22
-2.110 2.110
−7.34
(Temp)
−2.98
Insulation
Conclusion on Significance of New Coefficients
LO 14-5
14-23
Evaluating the Assumptions of Multiple Regression
***SELF- STUDY FROM THIS SLIDE ONWARDS
1. There is a linear relationship.
2. The variation in the residuals is the same for both
large and small values of the estimated Y.
3. The residuals follow the normal probability
distribution.
4. The independent variables should not be correlated.
5. The residuals are independent.
A residual is the difference between the actual value
of Y and the predicted value of Y.
LO 14-6 Use residual analysis to evaluate the
assumptions of multiple regression analysis.
14-24
Scatter and Residual Plots
A plot of the residuals and their corresponding Y values is used for showing
that there are no trends or patterns in the residuals.
LO 14-6
Scatter plot of Cost vs. Temp Scatter plot of Cost vs. Insul
14-25
Distribution of Residuals
Both Minitab and Excel offer another graph that helps to evaluate the
assumption of normally distributed residuals. It is a called a normal
probability plot and is shown to the right of the histogram.
LO 14-6
14-26
Multicollinearity
 Multicollinearity—when independent variables (X’s) are correlated.
 Effects of multicollinearity on the model:
1. An independent variable known to be an important predictor ends
up having a regression coefficient that is not significant.
2. A regression coefficient that should have a positive sign turns out
to be negative, or vice versa.
3. When an independent variable is added or removed, there is a
drastic change in the values of the remaining regression coefficients.
 However, correlated independent variables do not affect a multiple
regression equation’s ability to predict the dependent variable (Y).
LO 14-7 Evaluate the effects of
correlated independent variables.
14-27
Variance Inflation Factor
 General rule: correlation between two independent variables
is between −0.70 and 0.70; not a problem using both of the
independent variables.
 A more precise test is to use the variance inflation factor (VIF).
 A VIF > 10 is unsatisfactory. Remove that independent variable
from the analysis.
 The value of VIF is found as follows:
2
1
1
j
R
VIF


LO7
LO 14-7
14-28
Correlation Matrix of the Variables
Multicollinearity – Example
Refer to the data in the table,
which relates the heating
cost to the independent
variables outside
temperature, amount of
insulation, and age of
furnace.
Develop a correlation matrix
for all the independent
variables.
Does it appear there is a
problem with
multicollinearity?
Find and interpret the variance
inflation factor for each
of the independent
variables.
LO 14-7
14-29
VIF – Minitab Example
Coefficient of
Determination
The VIF value of 1.32 is less than
the upper limit of 10.
Interpretation: Temperature is not
strongly correlated with the other
independent variables.
LO 14-7
14-30
Independence Assumption
 An assumption about regression and
correlation analysis is that successive
residuals should be independent.
 When successive residuals are correlated, we
refer to this condition as autocorrelation.
Autocorrelation frequently occurs when the
data are collected over a period of time.
LO 14-7
14-31
Residual Plot vs. Fitted Values:
Testing the Independence Assumption
 When successive residuals are
correlated, we refer to this condition
as autocorrelation, which frequently
occurs when the data are collected
over a period of time.
 Note the run of residuals above the
mean of the residuals, followed by a
run below the mean. A scatter
plot such as this would indicate
possible autocorrelation.
LO 14-7
14-32
Qualitative Variable – Example
 Nominal-scale variables—such as
gender, ethic background—can be
included in model. These are called
qualitative variables.
 Use dummy variables in which one of
the two possible conditions is coded
0 and the other 1.
EXAMPLE
Suppose in the Salsberry Realty example that
the independent variable “garage” is
added. For those homes without an
attached garage, 0 is used; for homes with
an attached garage, a 1 is used. We will
refer to the “garage” variable as the data
from Table 14–2 are entered into the
Minitab system.
LO 14-8 Evaluate and use qualitative
independent variables.
14-33
Qualitative Variable – Minitab
Garage as
“dummy”
variable
LO 14-8
14-34
Using the Model for Estimation
What is the effect of the garage variable? Suppose we have two houses exactly alike next
to each other in Buffalo, New York; one has an attached garage, and the other does
not. Both homes have 3 inches of insulation, and the mean January temperature in
Buffalo is 20 degrees.
For the house without an attached garage, a 0 is substituted for in the regression
equation. The estimated heating cost is $280.90, found by:
For the house with an attached garage, a 1 is substituted for in the regression equation.
The estimated heating cost is $358.30, found by:
Without garage
With garage
LO 14-8
14-35
 This test is used to determine which independent variables have non-zero
regression coefficients.
 The variables that have zero regression coefficients are usually dropped from
the analysis.
 The test statistic is the t distribution with n − (k + 1) or n − k − 1 degrees of
freedom.
 The hypothesis test is as follows:
H0: βi = 0
H1: βi ≠ 0
Reject H0 if computed t > critical t, or
computed t < −critical t
Evaluating Individual Regression Coefficients
(βi = 0)
LO 14-8
14-36
Testing Variable “Garage” for Significance
-2.120 2.120
Conclusion: The regression coefficient is not zero. The independent
variable garage should be included in the analysis.`
Reject H0 if computed t > critical t or computed t < −critical t
(0.05 significance level, d.f. = 16)
LO 14-8
14-37

More Related Content

Similar to Chap014 BStat SemA201.ppt

Group 5 - Regression Analysis.pdf
Group 5 - Regression Analysis.pdfGroup 5 - Regression Analysis.pdf
Group 5 - Regression Analysis.pdffahlevet40
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.pptTanyaWadhwani4
 
IBM401 Lecture 5
IBM401 Lecture 5IBM401 Lecture 5
IBM401 Lecture 5saark
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptvigia41
 
linear Regression, multiple Regression and Annova
linear Regression, multiple Regression and Annovalinear Regression, multiple Regression and Annova
linear Regression, multiple Regression and AnnovaMansi Rastogi
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regressionmejikpg
 
Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help HelpWithAssignment.com
 
Applied statistics lecture_6
Applied statistics lecture_6Applied statistics lecture_6
Applied statistics lecture_6Daria Bogdanova
 
Overview of Multivariate Statistical Methods
Overview of Multivariate Statistical MethodsOverview of Multivariate Statistical Methods
Overview of Multivariate Statistical MethodsThomasUttaro1
 
Non parametrics tests
Non parametrics testsNon parametrics tests
Non parametrics testsrodrick koome
 
10.Analysis of Variance.ppt
10.Analysis of Variance.ppt10.Analysis of Variance.ppt
10.Analysis of Variance.pptAbdulhaqAli
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regressionjasondroesch
 

Similar to Chap014 BStat SemA201.ppt (20)

Group 5 - Regression Analysis.pdf
Group 5 - Regression Analysis.pdfGroup 5 - Regression Analysis.pdf
Group 5 - Regression Analysis.pdf
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
IBM401 Lecture 5
IBM401 Lecture 5IBM401 Lecture 5
IBM401 Lecture 5
 
A presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.pptA presentation for Multiple linear regression.ppt
A presentation for Multiple linear regression.ppt
 
linear Regression, multiple Regression and Annova
linear Regression, multiple Regression and Annovalinear Regression, multiple Regression and Annova
linear Regression, multiple Regression and Annova
 
Chap013.ppt
Chap013.pptChap013.ppt
Chap013.ppt
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
Regression
RegressionRegression
Regression
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help
 
Applied statistics lecture_6
Applied statistics lecture_6Applied statistics lecture_6
Applied statistics lecture_6
 
Overview of Multivariate Statistical Methods
Overview of Multivariate Statistical MethodsOverview of Multivariate Statistical Methods
Overview of Multivariate Statistical Methods
 
Regression
RegressionRegression
Regression
 
Non parametrics tests
Non parametrics testsNon parametrics tests
Non parametrics tests
 
Chap013.ppt
Chap013.pptChap013.ppt
Chap013.ppt
 
Miich4 ftests
Miich4 ftestsMiich4 ftests
Miich4 ftests
 
Math Module 4 Review
Math Module 4 ReviewMath Module 4 Review
Math Module 4 Review
 
10.Analysis of Variance.ppt
10.Analysis of Variance.ppt10.Analysis of Variance.ppt
10.Analysis of Variance.ppt
 
Correlation and Regression
Correlation and RegressionCorrelation and Regression
Correlation and Regression
 

More from najwalyaa

Bateman12eChapter15ADAkp.pptx
Bateman12eChapter15ADAkp.pptxBateman12eChapter15ADAkp.pptx
Bateman12eChapter15ADAkp.pptxnajwalyaa
 
Chapter 1 Introduction to Strategic Management.ppt
Chapter 1 Introduction to Strategic Management.pptChapter 1 Introduction to Strategic Management.ppt
Chapter 1 Introduction to Strategic Management.pptnajwalyaa
 
Chap1 BStat SemA201 Intro.ppt
Chap1 BStat SemA201 Intro.pptChap1 BStat SemA201 Intro.ppt
Chap1 BStat SemA201 Intro.pptnajwalyaa
 

More from najwalyaa (9)

Bateman12eChapter15ADAkp.pptx
Bateman12eChapter15ADAkp.pptxBateman12eChapter15ADAkp.pptx
Bateman12eChapter15ADAkp.pptx
 
Chapter 1 Introduction to Strategic Management.ppt
Chapter 1 Introduction to Strategic Management.pptChapter 1 Introduction to Strategic Management.ppt
Chapter 1 Introduction to Strategic Management.ppt
 
Chap1 BStat SemA201 Intro.ppt
Chap1 BStat SemA201 Intro.pptChap1 BStat SemA201 Intro.ppt
Chap1 BStat SemA201 Intro.ppt
 
Chap002.ppt
Chap002.pptChap002.ppt
Chap002.ppt
 
Chap003.ppt
Chap003.pptChap003.ppt
Chap003.ppt
 
Chap008.ppt
Chap008.pptChap008.ppt
Chap008.ppt
 
Chap006.ppt
Chap006.pptChap006.ppt
Chap006.ppt
 
Chap007.ppt
Chap007.pptChap007.ppt
Chap007.ppt
 
Chap004.ppt
Chap004.pptChap004.ppt
Chap004.ppt
 

Recently uploaded

FULL ENJOY - 9953040155 Call Girls in Chhatarpur | Delhi
FULL ENJOY - 9953040155 Call Girls in Chhatarpur | DelhiFULL ENJOY - 9953040155 Call Girls in Chhatarpur | Delhi
FULL ENJOY - 9953040155 Call Girls in Chhatarpur | DelhiMalviyaNagarCallGirl
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024christinemoorman
 
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...lizamodels9
 
Call Girls in Mehrauli Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Mehrauli Delhi 💯Call Us 🔝8264348440🔝Call Girls in Mehrauli Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Mehrauli Delhi 💯Call Us 🔝8264348440🔝soniya singh
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfPaul Menig
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsApsara Of India
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation SlidesKeppelCorporation
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in managementchhavia330
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Servicediscovermytutordmt
 
VIP Call Girls Pune Kirti 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Kirti 8617697112 Independent Escort Service PuneVIP Call Girls Pune Kirti 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Kirti 8617697112 Independent Escort Service PuneCall girls in Ahmedabad High profile
 
rishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdfrishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdfmuskan1121w
 
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfIntro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfpollardmorgan
 
Marketing Management Business Plan_My Sweet Creations
Marketing Management Business Plan_My Sweet CreationsMarketing Management Business Plan_My Sweet Creations
Marketing Management Business Plan_My Sweet Creationsnakalysalcedo61
 
(8264348440) 🔝 Call Girls In Mahipalpur 🔝 Delhi NCR
(8264348440) 🔝 Call Girls In Mahipalpur 🔝 Delhi NCR(8264348440) 🔝 Call Girls In Mahipalpur 🔝 Delhi NCR
(8264348440) 🔝 Call Girls In Mahipalpur 🔝 Delhi NCRsoniya singh
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...lizamodels9
 
Eni 2024 1Q Results - 24.04.24 business.
Eni 2024 1Q Results - 24.04.24 business.Eni 2024 1Q Results - 24.04.24 business.
Eni 2024 1Q Results - 24.04.24 business.Eni
 
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Howrah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Roomdivyansh0kumar0
 

Recently uploaded (20)

FULL ENJOY - 9953040155 Call Girls in Chhatarpur | Delhi
FULL ENJOY - 9953040155 Call Girls in Chhatarpur | DelhiFULL ENJOY - 9953040155 Call Girls in Chhatarpur | Delhi
FULL ENJOY - 9953040155 Call Girls in Chhatarpur | Delhi
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024
 
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
Call Girls In Connaught Place Delhi ❤️88604**77959_Russian 100% Genuine Escor...
 
Call Girls in Mehrauli Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Mehrauli Delhi 💯Call Us 🔝8264348440🔝Call Girls in Mehrauli Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Mehrauli Delhi 💯Call Us 🔝8264348440🔝
 
Grateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdfGrateful 7 speech thanking everyone that has helped.pdf
Grateful 7 speech thanking everyone that has helped.pdf
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
 
GD Birla and his contribution in management
GD Birla and his contribution in managementGD Birla and his contribution in management
GD Birla and his contribution in management
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Service
 
VIP Call Girls Pune Kirti 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Kirti 8617697112 Independent Escort Service PuneVIP Call Girls Pune Kirti 8617697112 Independent Escort Service Pune
VIP Call Girls Pune Kirti 8617697112 Independent Escort Service Pune
 
rishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdfrishikeshgirls.in- Rishikesh call girl.pdf
rishikeshgirls.in- Rishikesh call girl.pdf
 
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdfIntro to BCG's Carbon Emissions Benchmark_vF.pdf
Intro to BCG's Carbon Emissions Benchmark_vF.pdf
 
Marketing Management Business Plan_My Sweet Creations
Marketing Management Business Plan_My Sweet CreationsMarketing Management Business Plan_My Sweet Creations
Marketing Management Business Plan_My Sweet Creations
 
(8264348440) 🔝 Call Girls In Mahipalpur 🔝 Delhi NCR
(8264348440) 🔝 Call Girls In Mahipalpur 🔝 Delhi NCR(8264348440) 🔝 Call Girls In Mahipalpur 🔝 Delhi NCR
(8264348440) 🔝 Call Girls In Mahipalpur 🔝 Delhi NCR
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
 
Eni 2024 1Q Results - 24.04.24 business.
Eni 2024 1Q Results - 24.04.24 business.Eni 2024 1Q Results - 24.04.24 business.
Eni 2024 1Q Results - 24.04.24 business.
 
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130  Available With RoomVIP Kolkata Call Girl Howrah 👉 8250192130  Available With Room
VIP Kolkata Call Girl Howrah 👉 8250192130 Available With Room
 

Chap014 BStat SemA201.ppt

  • 1. Multiple Regression Analysis Chapter 14 Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved. McGraw-Hill/Irwin
  • 2. LEARNING OBJECTIVES LO 14-1 Describe the relationship between several independent variables and a dependent variable using multiple regression analysis. LO 14-2 Develop and interpret an ANOVA table. LO 14-3 Compute and interpret measures of association in multiple regression. LO 14-4 Conduct a hypothesis test to determine whether a set of regression coefficients differ from zero. – F-Test (Textbook Table E5 p. 777-780) LO 14-5 Conduct a hypothesis test of each regression coefficient. LO 14-6 Use residual analysis to evaluate the assumptions of multiple regression analysis. 14-2
  • 3. Multiple Regression Analysis (Many IV) • X1 … Xk are the independent variables. • a is the Y-intercept • b1 is the net change in Y for each unit change in X1 holding X2 … Xk constant. • The least squares criterion is used to develop this equation. • Determining b1, b2, tedious, etc. use Excel LO 14-1 multiple regression analysis. 14-3
  • 4. Multiple Linear Regression—Example Salsberry Realty sells homes along the east coast of the United States. One of the questions most frequently asked by prospective buyers is: If we purchase this home, how much can we expect to pay to heat it during the winter? Three variables are thought to relate to the heating costs: (1) the mean daily outside temperature, (2) the number of inches of insulation in the attic, and (3) the age in years of the furnace. To investigate, Salsberry’s research department selected a random sample of 20 recently sold homes. It determined the cost to heat each home last January, as well ^ Y X1 X2 X3 LO 14-1 14-4
  • 5. Multiple Linear Regression — ANOVA Table LO 14-2 Develop and interpret ANOVA table 14-5
  • 6. Multiple Linear Regression — ANOVA Table a b3 b1 b2 LO 14-2 14-6
  • 7. The Multiple Regression Equation—Interpreting the Measures of Association and Applying the Model for Estimation Interpreting the Regression Coefficients Mean Outside Temp, X1 −4.583 Attic Insulation, X2 −14.83 Age of Furnace, X3 +6.10 What is the estimated heating cost for a home if the mean outside temperature is 30 degrees, there are 5 inches of insulation in the attic, and the furnace is 10 years old? LO 14-2 14-7
  • 8. Multiple Standard Error of Estimate The multiple standard error of estimate is a measure of the effectiveness of the regression equation.  It is measured in the same units as the dependent variable.  It is difficult to determine what is a large value and what is a small value of the standard error. LO 14-2 14-8
  • 9. Multiple Regression and Correlation Assumptions  The independent variables and the dependent variable have a linear relationship.  The dependent variable must be continuous and at least interval-scale.  The residual must be the same for all values of Y.  The residuals should follow the normal distributed with mean 0.  Successive values of the dependent variable must be uncorrelated. LO 14-2 14-9
  • 10. Coefficient of Multiple Determination (r2) Coefficient of Multiple Determination: 1. Symbolized by r2. 2. Ranges from 0 to 1. 3. Cannot assume negative values. 4. Easy to interpret. The Adjusted r2: 1. The number of independent variables in a multiple regression equation makes the coefficient of determination larger. 2. Adjusted r2 is used instead to balance the effect that the number of independent variables has on the coefficient of multiple determination. LO 14-3 Compute and interpret measures of association in multiple regression. 14-10
  • 11. Global Test: Testing the Multiple Regression Model The global test is used to investigate whether any of the independent variables have significant coefficients. The hypotheses are: 0 equal s all Not : 0 ... : 1 2 1 0     H H k     Decision Rule: Reject H0 if Computed F > Critical F (,k,n − k − 1) LO 14-4 Conduct a hypothesis test to determine whether a set of regression coefficients differ from zero. 14-11
  • 12. Finding the Computed and Critical F F (,k,n−k−1) F (.05,3,16) LO 14-4 Excel function: =finv(.05,3,16) 14-12
  • 13. Comparing Means of Two or More Populations – Example LO 14-4 F distribution d.f. of Denominator 14-13
  • 14. Interpretation  The computed value of F is 21.90, which is in the rejection region.  The null hypothesis that all the multiple regression coefficients are zero is therefore rejected.  Interpretation: Some of the independent variables (amount of insulation, etc.) do have the ability to explain the variation in the dependent variable (heating cost).  Logical question – which ones? 21.9 LO 14-4 14-14
  • 15.  This test is used to determine which independent variables have nonzero regression coefficients.  The variables that have zero regression coefficients are usually dropped from the analysis.  The test statistic is the t distribution with n−(k+1) degrees of freedom.  The hypothesis test is as follows: H0: βi = 0 H1: βi ≠ 0 Reject H0 if computed t > critical t or computed t < −critical t Evaluating Individual Regression Coefficients (βi = 0) LO 14-5 Conduct a hypothesis test of each regression coefficient. 14-15
  • 16. Critical t for the Coefficients -2.120 2.120 Reject H0 if computed t > critical t or computed t < −critical t LO 14-5 14-16
  • 17. Computed t for the Coefficients LO 14-5 14-17
  • 18. Conclusion on Significance of the Coefficients −2.120 2.120 –5.93 (Temp) –3.119 (Insulation) 1.521 (Age) LO 14-5 14-18
  • 19. New Regression Model without Variable “Age” – Minitab LO 14-5 14-19
  • 20. New Regression Model without Variable “Age” – Minitab LO 14-5 14-20
  • 21. Testing the New Model for Significance d.f. (2,17) 3.59 Computed F = 29.42 LO 14-5 14-21
  • 22. Critical t-stat for the New Coefficients −2.110 2.110 Reject H0 if computed t > critical t or computed t < −critical t LO 14-5 14-22
  • 23. -2.110 2.110 −7.34 (Temp) −2.98 Insulation Conclusion on Significance of New Coefficients LO 14-5 14-23
  • 24. Evaluating the Assumptions of Multiple Regression ***SELF- STUDY FROM THIS SLIDE ONWARDS 1. There is a linear relationship. 2. The variation in the residuals is the same for both large and small values of the estimated Y. 3. The residuals follow the normal probability distribution. 4. The independent variables should not be correlated. 5. The residuals are independent. A residual is the difference between the actual value of Y and the predicted value of Y. LO 14-6 Use residual analysis to evaluate the assumptions of multiple regression analysis. 14-24
  • 25. Scatter and Residual Plots A plot of the residuals and their corresponding Y values is used for showing that there are no trends or patterns in the residuals. LO 14-6 Scatter plot of Cost vs. Temp Scatter plot of Cost vs. Insul 14-25
  • 26. Distribution of Residuals Both Minitab and Excel offer another graph that helps to evaluate the assumption of normally distributed residuals. It is a called a normal probability plot and is shown to the right of the histogram. LO 14-6 14-26
  • 27. Multicollinearity  Multicollinearity—when independent variables (X’s) are correlated.  Effects of multicollinearity on the model: 1. An independent variable known to be an important predictor ends up having a regression coefficient that is not significant. 2. A regression coefficient that should have a positive sign turns out to be negative, or vice versa. 3. When an independent variable is added or removed, there is a drastic change in the values of the remaining regression coefficients.  However, correlated independent variables do not affect a multiple regression equation’s ability to predict the dependent variable (Y). LO 14-7 Evaluate the effects of correlated independent variables. 14-27
  • 28. Variance Inflation Factor  General rule: correlation between two independent variables is between −0.70 and 0.70; not a problem using both of the independent variables.  A more precise test is to use the variance inflation factor (VIF).  A VIF > 10 is unsatisfactory. Remove that independent variable from the analysis.  The value of VIF is found as follows: 2 1 1 j R VIF   LO7 LO 14-7 14-28
  • 29. Correlation Matrix of the Variables Multicollinearity – Example Refer to the data in the table, which relates the heating cost to the independent variables outside temperature, amount of insulation, and age of furnace. Develop a correlation matrix for all the independent variables. Does it appear there is a problem with multicollinearity? Find and interpret the variance inflation factor for each of the independent variables. LO 14-7 14-29
  • 30. VIF – Minitab Example Coefficient of Determination The VIF value of 1.32 is less than the upper limit of 10. Interpretation: Temperature is not strongly correlated with the other independent variables. LO 14-7 14-30
  • 31. Independence Assumption  An assumption about regression and correlation analysis is that successive residuals should be independent.  When successive residuals are correlated, we refer to this condition as autocorrelation. Autocorrelation frequently occurs when the data are collected over a period of time. LO 14-7 14-31
  • 32. Residual Plot vs. Fitted Values: Testing the Independence Assumption  When successive residuals are correlated, we refer to this condition as autocorrelation, which frequently occurs when the data are collected over a period of time.  Note the run of residuals above the mean of the residuals, followed by a run below the mean. A scatter plot such as this would indicate possible autocorrelation. LO 14-7 14-32
  • 33. Qualitative Variable – Example  Nominal-scale variables—such as gender, ethic background—can be included in model. These are called qualitative variables.  Use dummy variables in which one of the two possible conditions is coded 0 and the other 1. EXAMPLE Suppose in the Salsberry Realty example that the independent variable “garage” is added. For those homes without an attached garage, 0 is used; for homes with an attached garage, a 1 is used. We will refer to the “garage” variable as the data from Table 14–2 are entered into the Minitab system. LO 14-8 Evaluate and use qualitative independent variables. 14-33
  • 34. Qualitative Variable – Minitab Garage as “dummy” variable LO 14-8 14-34
  • 35. Using the Model for Estimation What is the effect of the garage variable? Suppose we have two houses exactly alike next to each other in Buffalo, New York; one has an attached garage, and the other does not. Both homes have 3 inches of insulation, and the mean January temperature in Buffalo is 20 degrees. For the house without an attached garage, a 0 is substituted for in the regression equation. The estimated heating cost is $280.90, found by: For the house with an attached garage, a 1 is substituted for in the regression equation. The estimated heating cost is $358.30, found by: Without garage With garage LO 14-8 14-35
  • 36.  This test is used to determine which independent variables have non-zero regression coefficients.  The variables that have zero regression coefficients are usually dropped from the analysis.  The test statistic is the t distribution with n − (k + 1) or n − k − 1 degrees of freedom.  The hypothesis test is as follows: H0: βi = 0 H1: βi ≠ 0 Reject H0 if computed t > critical t, or computed t < −critical t Evaluating Individual Regression Coefficients (βi = 0) LO 14-8 14-36
  • 37. Testing Variable “Garage” for Significance -2.120 2.120 Conclusion: The regression coefficient is not zero. The independent variable garage should be included in the analysis.` Reject H0 if computed t > critical t or computed t < −critical t (0.05 significance level, d.f. = 16) LO 14-8 14-37