SlideShare a Scribd company logo
1 of 71
Copyright © 2012 Pearson Education 4-1
Chapter 4
To accompany
Quantitative Analysis for Management, Eleventh Edition, Global Edition
by Render, Stair, and Hanna
Power Point slides created by Brian Peterson
Regression Models
4-2
Learning Objectives
1. Identify variables and use them in a regression
model.
2. Develop simple linear regression equations.
from sample data and interpret the slope and
intercept.
3. Compute the coefficient of determination and
the coefficient of correlation and interpret their
meanings.
4. Interpret the F-test in a linear regression model.
5. List the assumptions used in regression and
use residual plots to identify problems.
After completing this chapter, students will be able to:After completing this chapter, students will be able to:
4-3
Learning Objectives
6. Develop a multiple regression model and use it
for prediction purposes.
7. Use dummy variables to model categorical
data.
8. Determine which variables should be included
in a multiple regression model.
9. Transform a nonlinear function into a linear
one for use in regression.
10. Understand and avoid common mistakes made
in the use of regression analysis.
After completing this chapter, students will be able to:After completing this chapter, students will be able to:
4-4
Chapter Outline
4.1 Introduction
4.2 Scatter Diagrams
4.3 Simple Linear Regression
4.4 Measuring the Fit of the Regression
Model
4.5 Using Computer Software for Regression
4.6 Assumptions of the Regression Model
4-5
Chapter Outline
4.7 Testing the Model for Significance
4.8 Multiple Regression Analysis
4.9 Binary or Dummy Variables
4.10 Model Building
4.11 Nonlinear Regression
4.12 Cautions and Pitfalls in Regression
Analysis
4-6
Introduction
 Regression analysisRegression analysis is a very valuable
tool for a manager.
 Regression can be used to:
 Understand the relationship between
variables.
 Predict the value of one variable based on
another variable.
 Simple linear regression models have
only two variables.
 Multiple regression models have more
variables.
4-7
Introduction
 The variable to be predicted is called
the dependent variabledependent variable.
 This is sometimes called the responseresponse
variable.variable.
 The value of this variable depends on
the value of the independent variable.independent variable.
 This is sometimes called the explanatoryexplanatory
or predictor variable.predictor variable.
Independent
variable
Dependent
variable
Independent
variable
= +
4-8
Scatter Diagram
 A scatter diagramscatter diagram or scatter plotscatter plot is
often used to investigate the
relationship between variables.
 The independent variable is normally
plotted on the X axis.
 The dependent variable is normally
plotted on the Y axis.
4-9
Triple A Construction
 Triple A Construction renovates old homes.
 Managers have found that the dollar volume of
renovation work is dependent on the area
payroll.
TRIPLE A’S SALES
($100,000s)
LOCAL PAYROLL
($100,000,000s)
6 3
8 4
9 6
5 4
4.5 2
9.5 5
Table 4.1
4-10
Triple A Construction
Figure 4.1
Scatter Diagram of Triple A Construction Company Data
4-11
Simple Linear Regression
where
Y = dependent variable (response)
X = independent variable (predictor or explanatory)
β0 = intercept (value of Y when X = 0)
β1 = slope of the regression line
ε = random error
 Regression models are used to test if there is a
relationship between variables.
 There is some random error that cannot be
predicted.
εββ ++= XY 10
4-12
Simple Linear Regression
 True values for the slope and intercept are not
known so they are estimated using sample data.
XbbY 10 +=ˆ
where
Y = predicted value of Y
b0 = estimate of β0, based on sample results
b1 = estimate of β1, based on sample results
^
4-13
Triple A Construction
Triple A Construction is trying to predict sales
based on area payroll.
Y = Sales
X = Area payroll
The line chosen in Figure 4.1 is the one that
minimizes the errors.
Error = (Actual value) – (Predicted value)
YYe ˆ−=
4-14
Triple A Construction
For the simple linear regression model, the values
of the intercept and slope can be calculated using
the formulas below.
XbbY 10 +=ˆ
valuesof(mean)average X
n
X
X ==
∑
valuesof(mean)average Y
n
Y
Y ==
∑
∑
∑
−
−−
= 21
)(
))((
XX
YYXX
b
XbYb 10 −=
4-15
Triple A Construction
Y X (X – X)2
(X – X)(Y – Y)
6 3 (3 – 4)2
= 1 (3 – 4)(6 – 7) = 1
8 4 (4 – 4)2
= 0 (4 – 4)(8 – 7) = 0
9 6 (6 – 4)2
= 4 (6 – 4)(9 – 7) = 4
5 4 (4 – 4)2
= 0 (4 – 4)(5 – 7) = 0
4.5 2 (2 – 4)2
= 4 (2 – 4)(4.5 – 7) = 5
9.5 5 (5 – 4)2
= 1 (5 – 4)(9.5 – 7) = 2.5
ΣY = 42
Y = 42/6 = 7
ΣX = 24
X = 24/6 = 4
Σ(X – X)2
= 10 Σ(X – X)(Y – Y) = 12.5
Table 4.2
Regression calculations for Triple A Construction
4-16
Triple A Construction
4
6
24
6
===
∑ X
X
7
6
42
6
===
∑Y
Y
251
10
512
21 .
.
)(
))((
==
−
−−
=
∑
∑
XX
YYXX
b
24251710 =−=−= ))(.(XbYb
Regression calculations
XY 2512 .ˆ +=Therefore
4-17
Triple A Construction
4
6
24
6
===
∑ X
X
7
6
42
6
===
∑Y
Y
251
10
512
21 .
.
)(
))((
==
−
−−
=
∑
∑
XX
YYXX
b
24251710 =−=−= ))(.(XbYb
Regression calculations
XY 2512 .ˆ +=Therefore
sales = 2 + 1.25(payroll)
If the payroll next
year is $600 million
000950$or5962512 ,.)(.ˆ =+=Y
4-18
Measuring the Fit
of the Regression Model
 Regression models can be developed
for any variables X and Y.
 How do we know the model is actually
helpful in predicting Y based on X?
 We could just take the average error, but
the positive and negative errors would
cancel each other out.
 Three measures of variability are:
 SST – Total variability about the mean.
 SSE – Variability about the regression line.
 SSR – Total variability that is explained by
the model.
4-19
Measuring the Fit
of the Regression Model
 Sum of the squares total:
2
)(∑ −= YYSST
 Sum of the squared error:
∑ ∑ −== 22
)ˆ( YYeSSE
 Sum of squares due to regression:
∑ −= 2
)ˆ( YYSSR
 An important relationship:
SSESSRSST +=
4-20
Measuring the Fit
of the Regression Model
Y X (Y – Y)2
Y (Y – Y)2
(Y – Y)2
6 3 (6 – 7)2
= 1 2 + 1.25(3) = 5.75 0.0625 1.563
8 4 (8 – 7)2
= 1 2 + 1.25(4) = 7.00 1 0
9 6 (9 – 7)2
= 4 2 + 1.25(6) = 9.50 0.25 6.25
5 4 (5 – 7)2
= 4 2 + 1.25(4) = 7.00 4 0
4.5 2 (4.5 – 7)2
= 6.25 2 + 1.25(2) = 4.50 0 6.25
9.5 5 (9.5 – 7)2
= 6.25 2 + 1.25(5) = 8.25 1.5625 1.563
∑(Y – Y)2
= 22.5 ∑(Y – Y)2
= 6.875
∑(Y – Y)2
=
15.625
Y = 7 SST = 22.5 SSE = 6.875 SSR = 15.625
^
^^
^^
Table 4.3
Sum of Squares for Triple A Construction
4-21
 Sum of the squares total
2
)(∑ −= YYSST
 Sum of the squared error
∑ ∑ −== 22
)ˆ( YYeSSE
 Sum of squares due to regression
∑ −= 2
)ˆ( YYSSR
 An important relationship
SSESSRSST +=
Measuring the Fit
of the Regression Model
For Triple A Construction
SST = 22.5
SSE = 6.875
SSR = 15.625
4-22
Measuring the Fit
of the Regression Model
Figure 4.2
Deviations from the Regression Line and from the Mean
4-23
Coefficient of Determination
 The proportion of the variability in Y explained by
the regression equation is called the coefficientcoefficient
of determination.of determination.
 The coefficient of determination is r2
.
SST
SSE
SST
SSR
r −== 12
 For Triple A Construction:
69440
522
625152
.
.
.
==r
 About 69% of the variability in Y is explained by
the equation based on payroll (X).
4-24
Correlation Coefficient
 The correlation coefficientcorrelation coefficient is an expression of the
strength of the linear relationship.
 It will always be between +1 and –1.
 The correlation coefficient is r.
2
rr ±=
 For Triple A Construction:
8333069440 .. ==r
4-25
Four Values of the Correlation
Coefficient
*
*
*
*
(a) Perfect Positive
Correlation:
r = +1
X
Y
*
* *
*
(c) No Correlation:
r = 0
X
Y
* *
*
*
* *
* **
*
(d) Perfect Negative
Correlation:
r = –1
X
Y
*
*
*
*
* *
*
*
*
(b) Positive
Correlation:
0 < r < 1
X
Y
*
*
*
*
*
*
*
Figure 4.3
4-26
Using Computer Software
for Regression
Program 4.1A
Accessing the Regression Option in Excel 2010
4-27
Using Computer Software
for Regression
Program 4.1B
Data Input for Regression in Excel
4-28
Using Computer Software
for Regression
Program 4.1C
Excel Output for the Triple A Construction Example
4-29
Assumptions of the Regression Model
1. Errors are independent.
2. Errors are normally distributed.
3. Errors have a mean of zero.
4. Errors have a constant variance.
 If we make certain assumptions about the errors
in a regression model, we can perform statistical
tests to determine if the model is useful.
 A plot of the residuals (errors) will often highlight
any glaring violations of the assumption.
4-30
Residual Plots
Pattern of Errors Indicating Randomness
Figure 4.4A
Error
X
4-31
Residual Plots
Nonconstant error variance
Figure 4.4B
Error
X
4-32
Residual Plots
Errors Indicate Relationship is not Linear
Figure 4.4C
Error
X
4-33
Estimating the Variance
 Errors are assumed to have a constant
variance (σ 2
), but we usually don’t know
this.
 It can be estimated using the meanmean
squared errorsquared error (MSEMSE), s2.
1
2
−−
==
kn
SSE
MSEs
where
n = number of observations in the sample
k = number of independent variables
4-34
Estimating the Variance
 For Triple A Construction:
71881
4
87506
116
87506
1
2
.
..
==
−−
=
−−
==
kn
SSE
MSEs
 We can estimate the standard deviation, s.
 This is also called the standard error of thestandard error of the
estimateestimate or the standard deviation of thestandard deviation of the
regression.regression.
31171881 .. === MSEs
4-35
Testing the Model for Significance
 When the sample size is too small, you
can get good values for MSE and r2
even if
there is no relationship between the
variables.
 Testing the model for significance helps
determine if the values are meaningful.
 We do this by performing a statistical
hypothesis test.
4-36
Testing the Model for Significance
 We start with the general linear model
εββ ++= XY 10
 If β1 = 0, the null hypothesis is that there is
nono relationship between X and Y.
 The alternate hypothesis is that there isis a
linear relationship (β1 ≠ 0).
 If the null hypothesis can be rejected, we
have proven there is a relationship.
 We use the F statistic for this test.
4-37
Testing the Model for Significance
 The F statistic is based on the MSE and MSR:
k
SSR
MSR =
where
k = number of independent variables in the
model
 The F statistic is:
MSE
MSR
F =
 This describes an F distribution with:
degrees of freedom for the numerator = df1 = k
degrees of freedom for the denominator = df2 = n – k – 1
4-38
Testing the Model for Significance
 If there is very little error, the MSE would
be small and the F-statistic would be large
indicating the model is useful.
 If the F-statistic is large, the significance
level (p-value) will be low, indicating it is
unlikely this would have occurred by
chance.
 So when the F-value is large, we can reject
the null hypothesis and accept that there is
a linear relationship between X and Y and
the values of the MSE and r2
are
meaningful.
4-39
Steps in a Hypothesis Test
1. Specify null and alternative hypotheses:
010 =β:H
011 ≠β:H
2. Select the level of significance (α). Common
values are 0.01 and 0.05.
3. Calculate the value of the test statistic using the
formula:
MSE
MSR
F =
4-40
Steps in a Hypothesis Test
4. Make a decision using one of the following
methods:
a) Reject the null hypothesis if the test statistic is
greater than the F-value from the table in Appendix D.
Otherwise, do not reject the null hypothesis:
21
ifReject dfdfcalculated FF ,,α>
kdf =1
12 −−= kndf
b) Reject the null hypothesis if the observed significance
level, or p-value, is less than the level of significance
(α). Otherwise, do not reject the null hypothesis:
)( statistictestcalculatedvalue- >= FPp
α<value-ifReject p
4-41
Triple A Construction
Step 1.Step 1.
H0: β1 = 0 (no linear
relationship between X and Y)
H1: β1 ≠ 0 (linear relationship
exists between X and Y)
Step 2.Step 2.
Select α = 0.05
625015
1
625015
.
.
===
k
SSR
MSR
099
71881
625015
.
.
.
===
MSE
MSR
F
Step 3.Step 3.
Calculate the value of the test
statistic.
4-42
Triple A Construction
Step 4.Step 4.
Reject the null hypothesis if the test statistic
is greater than the F-value in Appendix D.
df1 = k = 1
df2 = n – k – 1 = 6 – 1 – 1 = 4
The value of F associated with a 5% level of
significance and with degrees of freedom 1
and 4 is found in Appendix D.
F0.05,1,4 = 7.71
Fcalculated = 9.09
Reject H0 because 9.09 > 7.71
4-43
F = 7.71
0.05
9.09
Triple A Construction
Figure 4.5
 We can conclude there is a
statistically significant
relationship between X and
Y.
 The r2
value of 0.69 means
about 69% of the variability
in sales (Y) is explained by
local payroll (X).
4-44
Analysis of Variance (ANOVA) Table
 When software is used to develop a regression
model, an ANOVA table is typically created that
shows the observed significance level (p-value)
for the calculated F value.
 This can be compared to the level of significance
(α) to make a decision.
DF SS MS F SIGNIFICANCE
Regression k SSR MSR = SSR/k MSR/MSE P(F >
MSR/MSE)
Residual n - k - 1 SSE MSE =
SSE/(n - k - 1)
Total n - 1 SST
Table 4.4
4-45
ANOVA for Triple A Construction
Because this probability is less than 0.05, we reject
the null hypothesis of no linear relationship and
conclude there is a linear relationship between X
and Y.
Program 4.1C
(partial)
P(F > 9.0909) = 0.0394
4-46
Multiple Regression Analysis
 Multiple regression modelsMultiple regression models are
extensions to the simple linear model
and allow the creation of models with
more than one independent variable.
Y = β0 + β1X1 + β2X2 + … + βkXk + ε
where
Y = dependent variable (response variable)
Xi = ith
independent variable (predictor or
explanatory variable)
β0 = intercept (value of Y when all Xi = 0)
βi = coefficient of the ith
independent variable
k = number of independent variables
ε = random error
4-47
Multiple Regression Analysis
To estimate these values, a sample is taken the
following equation developed
kk XbXbXbbY ++++= ...ˆ 22110
where
= predicted value of Y
b0 = sample intercept (and is an estimate of
β0)
bi = sample coefficient of the ith variable
(and is an estimate of βi)
Yˆ
4-48
Jenny Wilson Realty
Jenny Wilson wants to develop a model to determine
the suggested listing price for houses based on the
size and age of the house.
22110
ˆ XbXbbY ++=
where
= predicted value of dependent
variable (selling price)
b0 = Y intercept
X1 and X2 = value of the two independent
variables (square footage and age)
respectively
b1 and b2 = slopes for X1 and X2
respectively
Yˆ
She selects a sample of houses that have sold
recently and records the data shown in Table 4.5
4-49
Jenny Wilson Real Estate Data
SELLING
PRICE ($)
SQUARE
FOOTAGE
AGE CONDITION
95,000 1,926 30 Good
119,000 2,069 40 Excellent
124,800 1,720 30 Excellent
135,000 1,396 15 Good
142,000 1,706 32 Mint
145,000 1,847 38 Mint
159,000 1,950 27 Mint
165,000 2,323 30 Excellent
182,000 2,285 26 Mint
183,000 3,752 35 Good
200,000 2,300 18 Good
211,000 2,525 17 Good
215,000 3,800 40 Excellent
219,000 1,740 12 MintTable 4.5
4-50
Jenny Wilson Realty
Program 4.2A
Input Screen for the Jenny Wilson Realty Multiple
Regression Example
4-51
Jenny Wilson Realty
Program 4.2B
Output for the Jenny Wilson Realty Multiple
Regression Example
4-52
Evaluating Multiple Regression Models
 Evaluation is similar to simple linear
regression models.
 The p-value for the F-test and r2
are
interpreted the same.
 The hypothesis is different because there
is more than one independent variable.
 The F-test is investigating whether all
the coefficients are equal to 0 at the
same time.
4-53
Evaluating Multiple Regression Models
 To determine which independent
variables are significant, tests are
performed for each variable.
010 =β:H
011 ≠β:H
 The test statistic is calculated and if the
p-value is lower than the level of
significance (α), the null hypothesis is
rejected.
4-54
Jenny Wilson Realty
 The model is statistically significant
 The p-value for the F-test is 0.002.
 r2
= 0.6719 so the model explains about 67% of
the variation in selling price (Y).
 But the F-test is for the entire model and we can’t
tell if one or both of the independent variables are
significant.
 By calculating the p-value of each variable, we can
assess the significance of the individual variables.
 Since the p-value for X1 (square footage) and X2
(age) are both less than the significance level of
0.05, both null hypotheses can be rejected.
4-55
Binary or Dummy Variables
 BinaryBinary (or dummydummy or indicatorindicator) variables
are special variables created for
qualitative data.
 A dummy variable is assigned a value of
1 if a particular condition is met and a
value of 0 otherwise.
 The number of dummy variables must
equal one less than the number of
categories of the qualitative variable.
4-56
Jenny Wilson Realty
 Jenny believes a better model can be developed if
she includes information about the condition of
the property.
X3 = 1 if house is in excellent condition
= 0 otherwise
X4 = 1 if house is in mint condition
= 0 otherwise
 Two dummy variables are used to describe the
three categories of condition.
 No variable is needed for “good” condition since
if both X3 and X4 = 0, the house must be in good
condition.
4-57
Jenny Wilson Realty
Program 4.3A
Input Screen for the Jenny Wilson Realty Example
with Dummy Variables
4-58
Jenny Wilson Realty
Program 4.3B
Output for the Jenny Wilson Realty Example with
Dummy Variables
4-59
Model Building
 The best model is a statistically significant
model with a high r2
and few variables.
 As more variables are added to the model,
the r2
-value usually increases.
 For this reason, the adjustedadjusted rr22
value is
often used to determine the usefulness of
an additional variable.
 The adjusted r2
takes into account the
number of independent variables in the
model.
4-60
Model Building
SST
SSE
SST
SSR
−== 12
r
 The formula for r2
 The formula for adjusted r2
)/(SST
)/(SSE
1
1
1Adjusted 2
−
−−
−=
n
kn
r
 As the number of variables increases, the
adjusted r2
gets smaller unless the increase due to
the new variable is large enough to offset the
change in k.
4-61
Model Building
 In general, if a new variable increases the
adjusted r2
, it should probably be included in the
model.
 In some cases, variables contain duplicate
information.
 When two independent variables are correlated,
they are said to be collinear.collinear.
 When more than two independent variables are
correlated, multicollinearitymulticollinearity exists.
 When multicollinearity is present, hypothesis
tests for the individual coefficients are not valid
but the model may still be useful.
4-62
Nonlinear Regression
 In some situations, variables are not linear.
 Transformations may be used to turn a
nonlinear model into a linear model.
*
* **
** *
* *
Linear relationship Nonlinear relationship
* *
** **
*
*
**
*
4-63
Colonel Motors
 Engineers at Colonel Motors want to use
regression analysis to improve fuel efficiency.
 They have been asked to study the impact of
weight on miles per gallon (MPG).
MPG
WEIGHT (1,000
LBS.) MPG
WEIGHT (1,000
LBS.)
12 4.58 20 3.18
13 4.66 23 2.68
15 4.02 24 2.65
18 2.53 33 1.70
19 3.09 36 1.95
19 3.11 42 1.92
Table 4.6
4-64
Colonel Motors
Figure 4.6A
Linear Model for MPG Data
4-65
Colonel Motors
Program 4.4 This is a useful model with a
small F-test for significance
and a good r2
value.
Excel Output for Linear Regression Model with
MPG Data
4-66
Colonel Motors
Figure 4.6B
Nonlinear Model for MPG Data
4-67
Colonel Motors
 The nonlinear model is a quadratic model.
 The easiest way to work with this model is to
develop a new variable.
2
2 weight)(=X
 This gives us a model that can be solved with
linear regression software:
22110 XbXbbY ++=ˆ
4-68
Colonel Motors
Program 4.5
A better model with a
smaller F-test for
significance and a larger
adjusted r2
value
21 43230879 XXY ...ˆ +−=
4-69
Cautions and Pitfalls
 If the assumptions are not met, the
statistical test may not be valid.
 Correlation does not necessarily mean
causation.
 Multicollinearity makes interpreting
coefficients problematic, but the model
may still be good.
 Using a regression model beyond the
range of X is questionable, as the
relationship may not hold outside the
sample data.
4-70
Cautions and Pitfalls
 A t-test for the intercept (b0) may be ignored
as this point is often outside the range of
the model.
 A linear relationship may not be the best
relationship, even if the F-test returns an
acceptable value.
 A nonlinear relationship can exist even if a
linear relationship does not.
 Even though a relationship is statistically
significant it may not have any practical
value.
4-71
Copyright
All rights reserved. No part of this publication may be
reproduced, stored in a retrieval system, or transmitted, in
any form or by any means, electronic, mechanical,
photocopying, recording, or otherwise, without the prior
written permission of the publisher. Printed in the United
States of America.

More Related Content

What's hot

Business Statistics Chapter 1
Business Statistics Chapter 1Business Statistics Chapter 1
Business Statistics Chapter 1Lux PP
 
Strategic Management: Concepts & Cases Chapter 05
Strategic Management: Concepts & Cases Chapter 05Strategic Management: Concepts & Cases Chapter 05
Strategic Management: Concepts & Cases Chapter 05KaleemSarwar2
 
Section 3.3 quadratic functions and their properties
Section 3.3 quadratic functions and their properties Section 3.3 quadratic functions and their properties
Section 3.3 quadratic functions and their properties Wong Hsiung
 
Simple regression and correlation
Simple regression and correlationSimple regression and correlation
Simple regression and correlationMary Grace
 
Bba 3274 qm week 6 part 1 regression models
Bba 3274 qm week 6 part 1 regression modelsBba 3274 qm week 6 part 1 regression models
Bba 3274 qm week 6 part 1 regression modelsStephen Ong
 
Related rates ppt
Related rates pptRelated rates ppt
Related rates pptRon Eick
 
Daniels ib15inppt 03
Daniels ib15inppt 03Daniels ib15inppt 03
Daniels ib15inppt 03Moises Cielak
 
Linear regression analysis
Linear regression analysisLinear regression analysis
Linear regression analysisNimrita Koul
 
01 ch ken black solution
01 ch ken black solution01 ch ken black solution
01 ch ken black solutionKrunal Shah
 
26 alternating series and conditional convergence x
26 alternating series and conditional convergence x26 alternating series and conditional convergence x
26 alternating series and conditional convergence xmath266
 
Ch11 establishing rewards &amp; pay plans
Ch11 establishing rewards &amp; pay plansCh11 establishing rewards &amp; pay plans
Ch11 establishing rewards &amp; pay plansSelf employed
 

What's hot (20)

Business Statistics Chapter 1
Business Statistics Chapter 1Business Statistics Chapter 1
Business Statistics Chapter 1
 
Strategic Management: Concepts & Cases Chapter 05
Strategic Management: Concepts & Cases Chapter 05Strategic Management: Concepts & Cases Chapter 05
Strategic Management: Concepts & Cases Chapter 05
 
David sm13 ppt_11
David sm13 ppt_11David sm13 ppt_11
David sm13 ppt_11
 
Lecture chapter 5
Lecture chapter 5Lecture chapter 5
Lecture chapter 5
 
Statistics for management assignment
Statistics for management assignmentStatistics for management assignment
Statistics for management assignment
 
Section 3.3 quadratic functions and their properties
Section 3.3 quadratic functions and their properties Section 3.3 quadratic functions and their properties
Section 3.3 quadratic functions and their properties
 
Ch08
Ch08Ch08
Ch08
 
Correlation and regression
Correlation and regressionCorrelation and regression
Correlation and regression
 
Simple regression and correlation
Simple regression and correlationSimple regression and correlation
Simple regression and correlation
 
Regression
RegressionRegression
Regression
 
Chap07 interval estimation
Chap07 interval estimationChap07 interval estimation
Chap07 interval estimation
 
Bba 3274 qm week 6 part 1 regression models
Bba 3274 qm week 6 part 1 regression modelsBba 3274 qm week 6 part 1 regression models
Bba 3274 qm week 6 part 1 regression models
 
Related rates ppt
Related rates pptRelated rates ppt
Related rates ppt
 
Daniels ib15inppt 03
Daniels ib15inppt 03Daniels ib15inppt 03
Daniels ib15inppt 03
 
Linear regression analysis
Linear regression analysisLinear regression analysis
Linear regression analysis
 
01 ch ken black solution
01 ch ken black solution01 ch ken black solution
01 ch ken black solution
 
PSPP software application
PSPP software applicationPSPP software application
PSPP software application
 
26 alternating series and conditional convergence x
26 alternating series and conditional convergence x26 alternating series and conditional convergence x
26 alternating series and conditional convergence x
 
Chapter 13
Chapter 13Chapter 13
Chapter 13
 
Ch11 establishing rewards &amp; pay plans
Ch11 establishing rewards &amp; pay plansCh11 establishing rewards &amp; pay plans
Ch11 establishing rewards &amp; pay plans
 

Similar to REGRESSION TITLE

Tbs910 regression models
Tbs910 regression modelsTbs910 regression models
Tbs910 regression modelsStephen Ong
 
161783709 chapter-04-answers
161783709 chapter-04-answers161783709 chapter-04-answers
161783709 chapter-04-answersBookStoreLib
 
161783709 chapter-04-answers
161783709 chapter-04-answers161783709 chapter-04-answers
161783709 chapter-04-answersFiras Husseini
 
Regression analysis
Regression analysisRegression analysis
Regression analysisSrikant001p
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.pptTanyaWadhwani4
 
Chapter 4 power point presentation Regression models
Chapter 4 power point presentation Regression modelsChapter 4 power point presentation Regression models
Chapter 4 power point presentation Regression modelsJustinXerri
 
Lecture - 8 MLR.pptx
Lecture - 8 MLR.pptxLecture - 8 MLR.pptx
Lecture - 8 MLR.pptxiris765749
 
An econometric model for Linear Regression using Statistics
An econometric model for Linear Regression using StatisticsAn econometric model for Linear Regression using Statistics
An econometric model for Linear Regression using StatisticsIRJET Journal
 
© 2008 Prentice-Hall, Inc.Regression ModelsChapter 4.docx
© 2008 Prentice-Hall, Inc.Regression ModelsChapter 4.docx© 2008 Prentice-Hall, Inc.Regression ModelsChapter 4.docx
© 2008 Prentice-Hall, Inc.Regression ModelsChapter 4.docxLynellBull52
 
Nonparametric approach to multiple regression
Nonparametric approach to multiple regressionNonparametric approach to multiple regression
Nonparametric approach to multiple regressionAlexander Decker
 
SupportVectorRegression
SupportVectorRegressionSupportVectorRegression
SupportVectorRegressionDaniel K
 
Exploring Support Vector Regression - Signals and Systems Project
Exploring Support Vector Regression - Signals and Systems ProjectExploring Support Vector Regression - Signals and Systems Project
Exploring Support Vector Regression - Signals and Systems ProjectSurya Chandra
 
Page 1 of 18Part A Multiple Choice (1–11)______1. Using.docx
Page 1 of 18Part A Multiple Choice (1–11)______1. Using.docxPage 1 of 18Part A Multiple Choice (1–11)______1. Using.docx
Page 1 of 18Part A Multiple Choice (1–11)______1. Using.docxalfred4lewis58146
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regressionMaria Theresa
 
Data Approximation in Mathematical Modelling Regression Analysis and Curve Fi...
Data Approximation in Mathematical Modelling Regression Analysis and Curve Fi...Data Approximation in Mathematical Modelling Regression Analysis and Curve Fi...
Data Approximation in Mathematical Modelling Regression Analysis and Curve Fi...Dr.Summiya Parveen
 
Data Approximation in Mathematical Modelling Regression Analysis and curve fi...
Data Approximation in Mathematical Modelling Regression Analysis and curve fi...Data Approximation in Mathematical Modelling Regression Analysis and curve fi...
Data Approximation in Mathematical Modelling Regression Analysis and curve fi...Dr.Summiya Parveen
 
Quantitative Analysis for Management new ppt
Quantitative Analysis for Management new pptQuantitative Analysis for Management new ppt
Quantitative Analysis for Management new pptkrrish242
 

Similar to REGRESSION TITLE (20)

2.1 regression
2.1 regression2.1 regression
2.1 regression
 
Tbs910 regression models
Tbs910 regression modelsTbs910 regression models
Tbs910 regression models
 
161783709 chapter-04-answers
161783709 chapter-04-answers161783709 chapter-04-answers
161783709 chapter-04-answers
 
161783709 chapter-04-answers
161783709 chapter-04-answers161783709 chapter-04-answers
161783709 chapter-04-answers
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
Chapter 4 power point presentation Regression models
Chapter 4 power point presentation Regression modelsChapter 4 power point presentation Regression models
Chapter 4 power point presentation Regression models
 
Lecture - 8 MLR.pptx
Lecture - 8 MLR.pptxLecture - 8 MLR.pptx
Lecture - 8 MLR.pptx
 
An econometric model for Linear Regression using Statistics
An econometric model for Linear Regression using StatisticsAn econometric model for Linear Regression using Statistics
An econometric model for Linear Regression using Statistics
 
© 2008 Prentice-Hall, Inc.Regression ModelsChapter 4.docx
© 2008 Prentice-Hall, Inc.Regression ModelsChapter 4.docx© 2008 Prentice-Hall, Inc.Regression ModelsChapter 4.docx
© 2008 Prentice-Hall, Inc.Regression ModelsChapter 4.docx
 
Regression
RegressionRegression
Regression
 
Nonparametric approach to multiple regression
Nonparametric approach to multiple regressionNonparametric approach to multiple regression
Nonparametric approach to multiple regression
 
SupportVectorRegression
SupportVectorRegressionSupportVectorRegression
SupportVectorRegression
 
Exploring Support Vector Regression - Signals and Systems Project
Exploring Support Vector Regression - Signals and Systems ProjectExploring Support Vector Regression - Signals and Systems Project
Exploring Support Vector Regression - Signals and Systems Project
 
Page 1 of 18Part A Multiple Choice (1–11)______1. Using.docx
Page 1 of 18Part A Multiple Choice (1–11)______1. Using.docxPage 1 of 18Part A Multiple Choice (1–11)______1. Using.docx
Page 1 of 18Part A Multiple Choice (1–11)______1. Using.docx
 
Simple linear regression
Simple linear regressionSimple linear regression
Simple linear regression
 
Data Approximation in Mathematical Modelling Regression Analysis and Curve Fi...
Data Approximation in Mathematical Modelling Regression Analysis and Curve Fi...Data Approximation in Mathematical Modelling Regression Analysis and Curve Fi...
Data Approximation in Mathematical Modelling Regression Analysis and Curve Fi...
 
Data Approximation in Mathematical Modelling Regression Analysis and curve fi...
Data Approximation in Mathematical Modelling Regression Analysis and curve fi...Data Approximation in Mathematical Modelling Regression Analysis and curve fi...
Data Approximation in Mathematical Modelling Regression Analysis and curve fi...
 
Quantitative Analysis for Management new ppt
Quantitative Analysis for Management new pptQuantitative Analysis for Management new ppt
Quantitative Analysis for Management new ppt
 
Course pack unit 5
Course pack unit 5Course pack unit 5
Course pack unit 5
 

More from Firas Husseini

More from Firas Husseini (20)

Ali M Fadel CV
Ali M Fadel  CVAli M Fadel  CV
Ali M Fadel CV
 
Transportation problems1
Transportation problems1Transportation problems1
Transportation problems1
 
Slides for ch08
Slides for ch08Slides for ch08
Slides for ch08
 
Slides for ch07
Slides for ch07Slides for ch07
Slides for ch07
 
Slides for ch06
Slides for ch06Slides for ch06
Slides for ch06
 
Rsh qam11 ch10 ge
Rsh qam11 ch10 geRsh qam11 ch10 ge
Rsh qam11 ch10 ge
 
Rsh qam11 ch09 ge
Rsh qam11 ch09 geRsh qam11 ch09 ge
Rsh qam11 ch09 ge
 
Rsh qam11 ch08 ge
Rsh qam11 ch08 geRsh qam11 ch08 ge
Rsh qam11 ch08 ge
 
Rsh qam11 ch07 ge
Rsh qam11 ch07 geRsh qam11 ch07 ge
Rsh qam11 ch07 ge
 
Rsh qam11 ch06 ge
Rsh qam11 ch06 geRsh qam11 ch06 ge
Rsh qam11 ch06 ge
 
Rsh qam11 ch05 ge
Rsh qam11 ch05 geRsh qam11 ch05 ge
Rsh qam11 ch05 ge
 
Rsh qam11 ch03
Rsh qam11 ch03Rsh qam11 ch03
Rsh qam11 ch03
 
Rsh qam11 ch03 ge
Rsh qam11 ch03 geRsh qam11 ch03 ge
Rsh qam11 ch03 ge
 
Rsh qam11 ch02
Rsh qam11 ch02Rsh qam11 ch02
Rsh qam11 ch02
 
Rsh qam11 ch01
Rsh qam11 ch01Rsh qam11 ch01
Rsh qam11 ch01
 
Render03 140622012601-phpapp02
Render03 140622012601-phpapp02Render03 140622012601-phpapp02
Render03 140622012601-phpapp02
 
Render03 140622012601-phpapp02 (1)
Render03 140622012601-phpapp02 (1)Render03 140622012601-phpapp02 (1)
Render03 140622012601-phpapp02 (1)
 
Render01edited 121120194704-phpapp02
Render01edited 121120194704-phpapp02Render01edited 121120194704-phpapp02
Render01edited 121120194704-phpapp02
 
Render 09
Render 09Render 09
Render 09
 
Quantitativeanalysisfordecisionmaking 13427543542352-phpapp02-120719222252-ph...
Quantitativeanalysisfordecisionmaking 13427543542352-phpapp02-120719222252-ph...Quantitativeanalysisfordecisionmaking 13427543542352-phpapp02-120719222252-ph...
Quantitativeanalysisfordecisionmaking 13427543542352-phpapp02-120719222252-ph...
 

Recently uploaded

Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesKeppelCorporation
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation SlidesKeppelCorporation
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMintel Group
 
Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailAriel592675
 
(Best) ENJOY Call Girls in Faridabad Ex | 8377087607
(Best) ENJOY Call Girls in Faridabad Ex | 8377087607(Best) ENJOY Call Girls in Faridabad Ex | 8377087607
(Best) ENJOY Call Girls in Faridabad Ex | 8377087607dollysharma2066
 
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadIslamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadAyesha Khan
 
Call Girls Miyapur 7001305949 all area service COD available Any Time
Call Girls Miyapur 7001305949 all area service COD available Any TimeCall Girls Miyapur 7001305949 all area service COD available Any Time
Call Girls Miyapur 7001305949 all area service COD available Any Timedelhimodelshub1
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...lizamodels9
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Pereraictsugar
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfrichard876048
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Seta Wicaksana
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCRashishs7044
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024christinemoorman
 
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCRashishs7044
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckHajeJanKamps
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfRbc Rbcua
 
International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...ssuserf63bd7
 
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...ictsugar
 
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City GurgaonCall Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaoncallgirls2057
 

Recently uploaded (20)

Annual General Meeting Presentation Slides
Annual General Meeting Presentation SlidesAnnual General Meeting Presentation Slides
Annual General Meeting Presentation Slides
 
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
Keppel Ltd. 1Q 2024 Business Update  Presentation SlidesKeppel Ltd. 1Q 2024 Business Update  Presentation Slides
Keppel Ltd. 1Q 2024 Business Update Presentation Slides
 
Market Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 EditionMarket Sizes Sample Report - 2024 Edition
Market Sizes Sample Report - 2024 Edition
 
Case study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detailCase study on tata clothing brand zudio in detail
Case study on tata clothing brand zudio in detail
 
Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)Japan IT Week 2024 Brochure by 47Billion (English)
Japan IT Week 2024 Brochure by 47Billion (English)
 
(Best) ENJOY Call Girls in Faridabad Ex | 8377087607
(Best) ENJOY Call Girls in Faridabad Ex | 8377087607(Best) ENJOY Call Girls in Faridabad Ex | 8377087607
(Best) ENJOY Call Girls in Faridabad Ex | 8377087607
 
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in IslamabadIslamabad Escorts | Call 03070433345 | Escort Service in Islamabad
Islamabad Escorts | Call 03070433345 | Escort Service in Islamabad
 
Call Girls Miyapur 7001305949 all area service COD available Any Time
Call Girls Miyapur 7001305949 all area service COD available Any TimeCall Girls Miyapur 7001305949 all area service COD available Any Time
Call Girls Miyapur 7001305949 all area service COD available Any Time
 
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
Call Girls In Sikandarpur Gurgaon ❤️8860477959_Russian 100% Genuine Escorts I...
 
Kenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith PereraKenya Coconut Production Presentation by Dr. Lalith Perera
Kenya Coconut Production Presentation by Dr. Lalith Perera
 
Innovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdfInnovation Conference 5th March 2024.pdf
Innovation Conference 5th March 2024.pdf
 
Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...Ten Organizational Design Models to align structure and operations to busines...
Ten Organizational Design Models to align structure and operations to busines...
 
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
8447779800, Low rate Call girls in Shivaji Enclave Delhi NCR
 
The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024The CMO Survey - Highlights and Insights Report - Spring 2024
The CMO Survey - Highlights and Insights Report - Spring 2024
 
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR8447779800, Low rate Call girls in Tughlakabad Delhi NCR
8447779800, Low rate Call girls in Tughlakabad Delhi NCR
 
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deckPitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
Pitch Deck Teardown: Geodesic.Life's $500k Pre-seed deck
 
APRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdfAPRIL2024_UKRAINE_xml_0000000000000 .pdf
APRIL2024_UKRAINE_xml_0000000000000 .pdf
 
International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...International Business Environments and Operations 16th Global Edition test b...
International Business Environments and Operations 16th Global Edition test b...
 
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...Global Scenario On Sustainable  and Resilient Coconut Industry by Dr. Jelfina...
Global Scenario On Sustainable and Resilient Coconut Industry by Dr. Jelfina...
 
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City GurgaonCall Us 📲8800102216📞 Call Girls In DLF City Gurgaon
Call Us 📲8800102216📞 Call Girls In DLF City Gurgaon
 

REGRESSION TITLE

  • 1. Copyright © 2012 Pearson Education 4-1 Chapter 4 To accompany Quantitative Analysis for Management, Eleventh Edition, Global Edition by Render, Stair, and Hanna Power Point slides created by Brian Peterson Regression Models
  • 2. 4-2 Learning Objectives 1. Identify variables and use them in a regression model. 2. Develop simple linear regression equations. from sample data and interpret the slope and intercept. 3. Compute the coefficient of determination and the coefficient of correlation and interpret their meanings. 4. Interpret the F-test in a linear regression model. 5. List the assumptions used in regression and use residual plots to identify problems. After completing this chapter, students will be able to:After completing this chapter, students will be able to:
  • 3. 4-3 Learning Objectives 6. Develop a multiple regression model and use it for prediction purposes. 7. Use dummy variables to model categorical data. 8. Determine which variables should be included in a multiple regression model. 9. Transform a nonlinear function into a linear one for use in regression. 10. Understand and avoid common mistakes made in the use of regression analysis. After completing this chapter, students will be able to:After completing this chapter, students will be able to:
  • 4. 4-4 Chapter Outline 4.1 Introduction 4.2 Scatter Diagrams 4.3 Simple Linear Regression 4.4 Measuring the Fit of the Regression Model 4.5 Using Computer Software for Regression 4.6 Assumptions of the Regression Model
  • 5. 4-5 Chapter Outline 4.7 Testing the Model for Significance 4.8 Multiple Regression Analysis 4.9 Binary or Dummy Variables 4.10 Model Building 4.11 Nonlinear Regression 4.12 Cautions and Pitfalls in Regression Analysis
  • 6. 4-6 Introduction  Regression analysisRegression analysis is a very valuable tool for a manager.  Regression can be used to:  Understand the relationship between variables.  Predict the value of one variable based on another variable.  Simple linear regression models have only two variables.  Multiple regression models have more variables.
  • 7. 4-7 Introduction  The variable to be predicted is called the dependent variabledependent variable.  This is sometimes called the responseresponse variable.variable.  The value of this variable depends on the value of the independent variable.independent variable.  This is sometimes called the explanatoryexplanatory or predictor variable.predictor variable. Independent variable Dependent variable Independent variable = +
  • 8. 4-8 Scatter Diagram  A scatter diagramscatter diagram or scatter plotscatter plot is often used to investigate the relationship between variables.  The independent variable is normally plotted on the X axis.  The dependent variable is normally plotted on the Y axis.
  • 9. 4-9 Triple A Construction  Triple A Construction renovates old homes.  Managers have found that the dollar volume of renovation work is dependent on the area payroll. TRIPLE A’S SALES ($100,000s) LOCAL PAYROLL ($100,000,000s) 6 3 8 4 9 6 5 4 4.5 2 9.5 5 Table 4.1
  • 10. 4-10 Triple A Construction Figure 4.1 Scatter Diagram of Triple A Construction Company Data
  • 11. 4-11 Simple Linear Regression where Y = dependent variable (response) X = independent variable (predictor or explanatory) β0 = intercept (value of Y when X = 0) β1 = slope of the regression line ε = random error  Regression models are used to test if there is a relationship between variables.  There is some random error that cannot be predicted. εββ ++= XY 10
  • 12. 4-12 Simple Linear Regression  True values for the slope and intercept are not known so they are estimated using sample data. XbbY 10 +=ˆ where Y = predicted value of Y b0 = estimate of β0, based on sample results b1 = estimate of β1, based on sample results ^
  • 13. 4-13 Triple A Construction Triple A Construction is trying to predict sales based on area payroll. Y = Sales X = Area payroll The line chosen in Figure 4.1 is the one that minimizes the errors. Error = (Actual value) – (Predicted value) YYe ˆ−=
  • 14. 4-14 Triple A Construction For the simple linear regression model, the values of the intercept and slope can be calculated using the formulas below. XbbY 10 +=ˆ valuesof(mean)average X n X X == ∑ valuesof(mean)average Y n Y Y == ∑ ∑ ∑ − −− = 21 )( ))(( XX YYXX b XbYb 10 −=
  • 15. 4-15 Triple A Construction Y X (X – X)2 (X – X)(Y – Y) 6 3 (3 – 4)2 = 1 (3 – 4)(6 – 7) = 1 8 4 (4 – 4)2 = 0 (4 – 4)(8 – 7) = 0 9 6 (6 – 4)2 = 4 (6 – 4)(9 – 7) = 4 5 4 (4 – 4)2 = 0 (4 – 4)(5 – 7) = 0 4.5 2 (2 – 4)2 = 4 (2 – 4)(4.5 – 7) = 5 9.5 5 (5 – 4)2 = 1 (5 – 4)(9.5 – 7) = 2.5 ΣY = 42 Y = 42/6 = 7 ΣX = 24 X = 24/6 = 4 Σ(X – X)2 = 10 Σ(X – X)(Y – Y) = 12.5 Table 4.2 Regression calculations for Triple A Construction
  • 16. 4-16 Triple A Construction 4 6 24 6 === ∑ X X 7 6 42 6 === ∑Y Y 251 10 512 21 . . )( ))(( == − −− = ∑ ∑ XX YYXX b 24251710 =−=−= ))(.(XbYb Regression calculations XY 2512 .ˆ +=Therefore
  • 17. 4-17 Triple A Construction 4 6 24 6 === ∑ X X 7 6 42 6 === ∑Y Y 251 10 512 21 . . )( ))(( == − −− = ∑ ∑ XX YYXX b 24251710 =−=−= ))(.(XbYb Regression calculations XY 2512 .ˆ +=Therefore sales = 2 + 1.25(payroll) If the payroll next year is $600 million 000950$or5962512 ,.)(.ˆ =+=Y
  • 18. 4-18 Measuring the Fit of the Regression Model  Regression models can be developed for any variables X and Y.  How do we know the model is actually helpful in predicting Y based on X?  We could just take the average error, but the positive and negative errors would cancel each other out.  Three measures of variability are:  SST – Total variability about the mean.  SSE – Variability about the regression line.  SSR – Total variability that is explained by the model.
  • 19. 4-19 Measuring the Fit of the Regression Model  Sum of the squares total: 2 )(∑ −= YYSST  Sum of the squared error: ∑ ∑ −== 22 )ˆ( YYeSSE  Sum of squares due to regression: ∑ −= 2 )ˆ( YYSSR  An important relationship: SSESSRSST +=
  • 20. 4-20 Measuring the Fit of the Regression Model Y X (Y – Y)2 Y (Y – Y)2 (Y – Y)2 6 3 (6 – 7)2 = 1 2 + 1.25(3) = 5.75 0.0625 1.563 8 4 (8 – 7)2 = 1 2 + 1.25(4) = 7.00 1 0 9 6 (9 – 7)2 = 4 2 + 1.25(6) = 9.50 0.25 6.25 5 4 (5 – 7)2 = 4 2 + 1.25(4) = 7.00 4 0 4.5 2 (4.5 – 7)2 = 6.25 2 + 1.25(2) = 4.50 0 6.25 9.5 5 (9.5 – 7)2 = 6.25 2 + 1.25(5) = 8.25 1.5625 1.563 ∑(Y – Y)2 = 22.5 ∑(Y – Y)2 = 6.875 ∑(Y – Y)2 = 15.625 Y = 7 SST = 22.5 SSE = 6.875 SSR = 15.625 ^ ^^ ^^ Table 4.3 Sum of Squares for Triple A Construction
  • 21. 4-21  Sum of the squares total 2 )(∑ −= YYSST  Sum of the squared error ∑ ∑ −== 22 )ˆ( YYeSSE  Sum of squares due to regression ∑ −= 2 )ˆ( YYSSR  An important relationship SSESSRSST += Measuring the Fit of the Regression Model For Triple A Construction SST = 22.5 SSE = 6.875 SSR = 15.625
  • 22. 4-22 Measuring the Fit of the Regression Model Figure 4.2 Deviations from the Regression Line and from the Mean
  • 23. 4-23 Coefficient of Determination  The proportion of the variability in Y explained by the regression equation is called the coefficientcoefficient of determination.of determination.  The coefficient of determination is r2 . SST SSE SST SSR r −== 12  For Triple A Construction: 69440 522 625152 . . . ==r  About 69% of the variability in Y is explained by the equation based on payroll (X).
  • 24. 4-24 Correlation Coefficient  The correlation coefficientcorrelation coefficient is an expression of the strength of the linear relationship.  It will always be between +1 and –1.  The correlation coefficient is r. 2 rr ±=  For Triple A Construction: 8333069440 .. ==r
  • 25. 4-25 Four Values of the Correlation Coefficient * * * * (a) Perfect Positive Correlation: r = +1 X Y * * * * (c) No Correlation: r = 0 X Y * * * * * * * ** * (d) Perfect Negative Correlation: r = –1 X Y * * * * * * * * * (b) Positive Correlation: 0 < r < 1 X Y * * * * * * * Figure 4.3
  • 26. 4-26 Using Computer Software for Regression Program 4.1A Accessing the Regression Option in Excel 2010
  • 27. 4-27 Using Computer Software for Regression Program 4.1B Data Input for Regression in Excel
  • 28. 4-28 Using Computer Software for Regression Program 4.1C Excel Output for the Triple A Construction Example
  • 29. 4-29 Assumptions of the Regression Model 1. Errors are independent. 2. Errors are normally distributed. 3. Errors have a mean of zero. 4. Errors have a constant variance.  If we make certain assumptions about the errors in a regression model, we can perform statistical tests to determine if the model is useful.  A plot of the residuals (errors) will often highlight any glaring violations of the assumption.
  • 30. 4-30 Residual Plots Pattern of Errors Indicating Randomness Figure 4.4A Error X
  • 31. 4-31 Residual Plots Nonconstant error variance Figure 4.4B Error X
  • 32. 4-32 Residual Plots Errors Indicate Relationship is not Linear Figure 4.4C Error X
  • 33. 4-33 Estimating the Variance  Errors are assumed to have a constant variance (σ 2 ), but we usually don’t know this.  It can be estimated using the meanmean squared errorsquared error (MSEMSE), s2. 1 2 −− == kn SSE MSEs where n = number of observations in the sample k = number of independent variables
  • 34. 4-34 Estimating the Variance  For Triple A Construction: 71881 4 87506 116 87506 1 2 . .. == −− = −− == kn SSE MSEs  We can estimate the standard deviation, s.  This is also called the standard error of thestandard error of the estimateestimate or the standard deviation of thestandard deviation of the regression.regression. 31171881 .. === MSEs
  • 35. 4-35 Testing the Model for Significance  When the sample size is too small, you can get good values for MSE and r2 even if there is no relationship between the variables.  Testing the model for significance helps determine if the values are meaningful.  We do this by performing a statistical hypothesis test.
  • 36. 4-36 Testing the Model for Significance  We start with the general linear model εββ ++= XY 10  If β1 = 0, the null hypothesis is that there is nono relationship between X and Y.  The alternate hypothesis is that there isis a linear relationship (β1 ≠ 0).  If the null hypothesis can be rejected, we have proven there is a relationship.  We use the F statistic for this test.
  • 37. 4-37 Testing the Model for Significance  The F statistic is based on the MSE and MSR: k SSR MSR = where k = number of independent variables in the model  The F statistic is: MSE MSR F =  This describes an F distribution with: degrees of freedom for the numerator = df1 = k degrees of freedom for the denominator = df2 = n – k – 1
  • 38. 4-38 Testing the Model for Significance  If there is very little error, the MSE would be small and the F-statistic would be large indicating the model is useful.  If the F-statistic is large, the significance level (p-value) will be low, indicating it is unlikely this would have occurred by chance.  So when the F-value is large, we can reject the null hypothesis and accept that there is a linear relationship between X and Y and the values of the MSE and r2 are meaningful.
  • 39. 4-39 Steps in a Hypothesis Test 1. Specify null and alternative hypotheses: 010 =β:H 011 ≠β:H 2. Select the level of significance (α). Common values are 0.01 and 0.05. 3. Calculate the value of the test statistic using the formula: MSE MSR F =
  • 40. 4-40 Steps in a Hypothesis Test 4. Make a decision using one of the following methods: a) Reject the null hypothesis if the test statistic is greater than the F-value from the table in Appendix D. Otherwise, do not reject the null hypothesis: 21 ifReject dfdfcalculated FF ,,α> kdf =1 12 −−= kndf b) Reject the null hypothesis if the observed significance level, or p-value, is less than the level of significance (α). Otherwise, do not reject the null hypothesis: )( statistictestcalculatedvalue- >= FPp α<value-ifReject p
  • 41. 4-41 Triple A Construction Step 1.Step 1. H0: β1 = 0 (no linear relationship between X and Y) H1: β1 ≠ 0 (linear relationship exists between X and Y) Step 2.Step 2. Select α = 0.05 625015 1 625015 . . === k SSR MSR 099 71881 625015 . . . === MSE MSR F Step 3.Step 3. Calculate the value of the test statistic.
  • 42. 4-42 Triple A Construction Step 4.Step 4. Reject the null hypothesis if the test statistic is greater than the F-value in Appendix D. df1 = k = 1 df2 = n – k – 1 = 6 – 1 – 1 = 4 The value of F associated with a 5% level of significance and with degrees of freedom 1 and 4 is found in Appendix D. F0.05,1,4 = 7.71 Fcalculated = 9.09 Reject H0 because 9.09 > 7.71
  • 43. 4-43 F = 7.71 0.05 9.09 Triple A Construction Figure 4.5  We can conclude there is a statistically significant relationship between X and Y.  The r2 value of 0.69 means about 69% of the variability in sales (Y) is explained by local payroll (X).
  • 44. 4-44 Analysis of Variance (ANOVA) Table  When software is used to develop a regression model, an ANOVA table is typically created that shows the observed significance level (p-value) for the calculated F value.  This can be compared to the level of significance (α) to make a decision. DF SS MS F SIGNIFICANCE Regression k SSR MSR = SSR/k MSR/MSE P(F > MSR/MSE) Residual n - k - 1 SSE MSE = SSE/(n - k - 1) Total n - 1 SST Table 4.4
  • 45. 4-45 ANOVA for Triple A Construction Because this probability is less than 0.05, we reject the null hypothesis of no linear relationship and conclude there is a linear relationship between X and Y. Program 4.1C (partial) P(F > 9.0909) = 0.0394
  • 46. 4-46 Multiple Regression Analysis  Multiple regression modelsMultiple regression models are extensions to the simple linear model and allow the creation of models with more than one independent variable. Y = β0 + β1X1 + β2X2 + … + βkXk + ε where Y = dependent variable (response variable) Xi = ith independent variable (predictor or explanatory variable) β0 = intercept (value of Y when all Xi = 0) βi = coefficient of the ith independent variable k = number of independent variables ε = random error
  • 47. 4-47 Multiple Regression Analysis To estimate these values, a sample is taken the following equation developed kk XbXbXbbY ++++= ...ˆ 22110 where = predicted value of Y b0 = sample intercept (and is an estimate of β0) bi = sample coefficient of the ith variable (and is an estimate of βi) Yˆ
  • 48. 4-48 Jenny Wilson Realty Jenny Wilson wants to develop a model to determine the suggested listing price for houses based on the size and age of the house. 22110 ˆ XbXbbY ++= where = predicted value of dependent variable (selling price) b0 = Y intercept X1 and X2 = value of the two independent variables (square footage and age) respectively b1 and b2 = slopes for X1 and X2 respectively Yˆ She selects a sample of houses that have sold recently and records the data shown in Table 4.5
  • 49. 4-49 Jenny Wilson Real Estate Data SELLING PRICE ($) SQUARE FOOTAGE AGE CONDITION 95,000 1,926 30 Good 119,000 2,069 40 Excellent 124,800 1,720 30 Excellent 135,000 1,396 15 Good 142,000 1,706 32 Mint 145,000 1,847 38 Mint 159,000 1,950 27 Mint 165,000 2,323 30 Excellent 182,000 2,285 26 Mint 183,000 3,752 35 Good 200,000 2,300 18 Good 211,000 2,525 17 Good 215,000 3,800 40 Excellent 219,000 1,740 12 MintTable 4.5
  • 50. 4-50 Jenny Wilson Realty Program 4.2A Input Screen for the Jenny Wilson Realty Multiple Regression Example
  • 51. 4-51 Jenny Wilson Realty Program 4.2B Output for the Jenny Wilson Realty Multiple Regression Example
  • 52. 4-52 Evaluating Multiple Regression Models  Evaluation is similar to simple linear regression models.  The p-value for the F-test and r2 are interpreted the same.  The hypothesis is different because there is more than one independent variable.  The F-test is investigating whether all the coefficients are equal to 0 at the same time.
  • 53. 4-53 Evaluating Multiple Regression Models  To determine which independent variables are significant, tests are performed for each variable. 010 =β:H 011 ≠β:H  The test statistic is calculated and if the p-value is lower than the level of significance (α), the null hypothesis is rejected.
  • 54. 4-54 Jenny Wilson Realty  The model is statistically significant  The p-value for the F-test is 0.002.  r2 = 0.6719 so the model explains about 67% of the variation in selling price (Y).  But the F-test is for the entire model and we can’t tell if one or both of the independent variables are significant.  By calculating the p-value of each variable, we can assess the significance of the individual variables.  Since the p-value for X1 (square footage) and X2 (age) are both less than the significance level of 0.05, both null hypotheses can be rejected.
  • 55. 4-55 Binary or Dummy Variables  BinaryBinary (or dummydummy or indicatorindicator) variables are special variables created for qualitative data.  A dummy variable is assigned a value of 1 if a particular condition is met and a value of 0 otherwise.  The number of dummy variables must equal one less than the number of categories of the qualitative variable.
  • 56. 4-56 Jenny Wilson Realty  Jenny believes a better model can be developed if she includes information about the condition of the property. X3 = 1 if house is in excellent condition = 0 otherwise X4 = 1 if house is in mint condition = 0 otherwise  Two dummy variables are used to describe the three categories of condition.  No variable is needed for “good” condition since if both X3 and X4 = 0, the house must be in good condition.
  • 57. 4-57 Jenny Wilson Realty Program 4.3A Input Screen for the Jenny Wilson Realty Example with Dummy Variables
  • 58. 4-58 Jenny Wilson Realty Program 4.3B Output for the Jenny Wilson Realty Example with Dummy Variables
  • 59. 4-59 Model Building  The best model is a statistically significant model with a high r2 and few variables.  As more variables are added to the model, the r2 -value usually increases.  For this reason, the adjustedadjusted rr22 value is often used to determine the usefulness of an additional variable.  The adjusted r2 takes into account the number of independent variables in the model.
  • 60. 4-60 Model Building SST SSE SST SSR −== 12 r  The formula for r2  The formula for adjusted r2 )/(SST )/(SSE 1 1 1Adjusted 2 − −− −= n kn r  As the number of variables increases, the adjusted r2 gets smaller unless the increase due to the new variable is large enough to offset the change in k.
  • 61. 4-61 Model Building  In general, if a new variable increases the adjusted r2 , it should probably be included in the model.  In some cases, variables contain duplicate information.  When two independent variables are correlated, they are said to be collinear.collinear.  When more than two independent variables are correlated, multicollinearitymulticollinearity exists.  When multicollinearity is present, hypothesis tests for the individual coefficients are not valid but the model may still be useful.
  • 62. 4-62 Nonlinear Regression  In some situations, variables are not linear.  Transformations may be used to turn a nonlinear model into a linear model. * * ** ** * * * Linear relationship Nonlinear relationship * * ** ** * * ** *
  • 63. 4-63 Colonel Motors  Engineers at Colonel Motors want to use regression analysis to improve fuel efficiency.  They have been asked to study the impact of weight on miles per gallon (MPG). MPG WEIGHT (1,000 LBS.) MPG WEIGHT (1,000 LBS.) 12 4.58 20 3.18 13 4.66 23 2.68 15 4.02 24 2.65 18 2.53 33 1.70 19 3.09 36 1.95 19 3.11 42 1.92 Table 4.6
  • 65. 4-65 Colonel Motors Program 4.4 This is a useful model with a small F-test for significance and a good r2 value. Excel Output for Linear Regression Model with MPG Data
  • 67. 4-67 Colonel Motors  The nonlinear model is a quadratic model.  The easiest way to work with this model is to develop a new variable. 2 2 weight)(=X  This gives us a model that can be solved with linear regression software: 22110 XbXbbY ++=ˆ
  • 68. 4-68 Colonel Motors Program 4.5 A better model with a smaller F-test for significance and a larger adjusted r2 value 21 43230879 XXY ...ˆ +−=
  • 69. 4-69 Cautions and Pitfalls  If the assumptions are not met, the statistical test may not be valid.  Correlation does not necessarily mean causation.  Multicollinearity makes interpreting coefficients problematic, but the model may still be good.  Using a regression model beyond the range of X is questionable, as the relationship may not hold outside the sample data.
  • 70. 4-70 Cautions and Pitfalls  A t-test for the intercept (b0) may be ignored as this point is often outside the range of the model.  A linear relationship may not be the best relationship, even if the F-test returns an acceptable value.  A nonlinear relationship can exist even if a linear relationship does not.  Even though a relationship is statistically significant it may not have any practical value.
  • 71. 4-71 Copyright All rights reserved. No part of this publication may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic, mechanical, photocopying, recording, or otherwise, without the prior written permission of the publisher. Printed in the United States of America.