SlideShare a Scribd company logo
1 of 99
11-1Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11-2Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Statistics for Business and
Economics
Chapter 11
Simple Linear Regression
11-3Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Contents
1. Probabilistic Models
2. Fitting the Model: The Least Squares
Approach
3. Model Assumptions
4. Assessing the Utility of the Model:
Making Inferences about the Slope β1
11-4Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Contents
5. The Coefficients of Correlation and
Determination
6. Using the Model for Estimation and
Prediction
7. A Complete Example
11-5Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Learning Objectives
• Introduce the straight-line (simple linear
regression) model as a means of
relating one quantitative variable to
another quantitative variable
• Assess how well the simple linear
regression model fits the sample data
11-6Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Learning Objectives
• Introduce the correlation coefficient as
a means of relating one quantitative
variable to another quantitative variable
• Employ the simple linear regression
model for predicting the value of one
variable from a specified value of
another variable
11-7Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11.1
Probabilistic Models
11-8Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Models
• Representation of some phenomenon
• Mathematical model is a mathematical
expression of some phenomenon
• Often describe relationships between
variables
• Types
– Deterministic models
– Probabilistic models
11-9Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Deterministic Models
• Hypothesize exact relationships
• Suitable when prediction error is
negligible
• Example: force is exactly mass times
acceleration
– F = m·a
© 1984-1994 T/Maker Co.
11-10Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Probabilistic Models
• Hypothesize two components
– Deterministic
– Random error
• Example: sales volume (y) is 10 times
advertising spending (x) + random error
– y = 10x + ε
– Random error may be due to factors
other than advertising
11-11Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
General Form of Probabilistic
Models
y = Deterministic component + Random error
where y is the variable of interest. We always
assume that the mean value of the random
error equals 0. This is equivalent to assuming
that the mean value of y, E(y), equals the
deterministic component of the model; that is,
E(y) = Deterministic component
11-12Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
A First-Order (Straight Line)
Probabilistic Model
y = β0 + β1x +ε
where
y = Dependent or response variable
(variable to be modeled)
x = Independent or predictor variable
(variable used as a predictor of y)
E(y) = β0 + β1x = Deterministic component
ε (epsilon) = Random error component
11-13Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
A First-Order (Straight Line)
Probabilistic Model
y = β0 + β1x +ε
β0 (beta zero) = y-intercept of the line, that is, the
point at which the line intercepts
or cuts through the y-axis
β1 (beta one) = slope of the line, that is, the
change (amount of increase or
decrease) in the deterministic
component of y for every 1-unit
increase in x
11-14Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
[Note: A positive slope implies that E(y)
increases by the amount β1 for each unit
increase in x. A negative slope implies that
E(y) decreases by the amount β1.]
A First-Order (Straight Line)
Probabilistic Model
11-15Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Five-Step Procedure
Step 1: Hypothesize the deterministic component
of the model that relates the mean, E(y),
to the independent variable x.
Step 2: Use the sample data to estimate unknown
parameters in the model.
Step 3: Specify the probability distribution of the
random error term and estimate the
standard deviation of this distribution.
Step 4: Statistically evaluate the usefulness of the
model.
Step 5: When satisfied that the model is useful,
use it for prediction, estimation, and other
purposes.
11-16Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11.2
Fitting the Model:
The Least Squares Approach
11-17Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Scatterplot
1. Plot of all (xi, yi) pairs
2. Suggests how well model will fit
0
20
40
60
0 20 40 60
x
y
11-18Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Thinking Challenge
0
20
40
60
0 20 40 60
x
y
• How would you draw a line through the
points?
• How do you determine which line ‘fits best’?
11-19Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
The least squares line is one
that has the following two properties:
1. The sum of the errors equals 0,
i.e., mean error = 0.
2. The sum of squared errors (SSE) is
smaller than for any other straight-line
model, i.e., the error variance is
minimum.
Least Squares Line
0 1
ˆ ˆˆ β β= +y x
11-20Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Formula for the Least
Squares Estimates
0 1
ˆ ˆ: β β− = −y intercept y x
1
ˆ: β = xy
xx
SS
Slope
SS
where SSxy
= xi
− x( )∑ yi
− y( )
SSxx
= xi
− x( )
2
∑
n = Sample size
11-21Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
y-intercept: represents the predicted value
of y when x = 0 (Caution: This value
will not be meaningful if the value x =
0 is nonsensical or outside the range
of the sample data.)
slope: represents the increase (or
decrease) in y for every 1-unit
increase in x (Caution: This
interpretation is valid only for x-values
within the range of the sample data.)
Interpreting the Estimates of β0 and
β1 in Simple Liner Regression
1
ˆβ
0
ˆβ
11-22Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Least Squares Graphically
ε2
y
x
ε1 ε3
ε4
^^
^
^
2 0 1 2 2
ˆ ˆ ˆy xβ β ε= + +
0 1
ˆ ˆˆi iy xβ β= +
2 2 2 2 2
1 2 3 4
1
ˆ ˆ ˆ ˆ ˆLS minimizes
n
i
i
ε ε ε ε ε
=
= + + +∑
11-23Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Least Squares Example
You’re a marketing analyst for Hasbro
Toys. You gather the following data:
Ad Expenditure (100$) Sales (Units)
1 1
2 1
3 2
4 2
5 4
Find the least squares line relating
sales and advertising.
11-24Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Scatterplot
Sales vs. Advertising
0
1
2
3
4
0 1 2 3 4 5
Sales
Advertising
11-25Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Parameter Estimation
Solution
15
3
5 5
= = =∑x
x
10
2
5 5
= = =∑y
y
( )( )
( )( )3 2 7
= − −
= − − =
∑
∑
xySS x x y y
x y
( )
( )
2
2
3 10
= −
= − =
∑
∑
xxSS x x
x
11-26Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Parameter Estimation
Solution
ˆ .1 .7y x= − +
( ) ( )0 1
ˆ ˆ 2 .70 3 .10y xβ β= − = − = −
1
7ˆ .7
10
= = =xy
xx
SS
B
SS
The slope of the least squares line is:
11-27Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Parameter Estimation
Computer Output
Parameter Estimates
Parameter Standard T for H0:
Variable DF Estimate Error Param=0 Prob>|T|
INTERCEP 1 -0.1000 0.6350 -0.157 0.8849
ADVERT 1 0.7000 0.1914 3.656 0.0354
β0
^
β1
^
ˆ .1 .7y x= − +
11-28Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient Interpretation
Solution
1. Slope (β1)
• Sales Volume (y) is expected to increase by
$700 for each $100 increase in advertising
(x), over the sampled range of advertising
expenditures from $100 to $500
^
2. y-Intercept (β0)
• Since 0 is outside of the range of the
sampled values of x, the y-intercept has no
meaningful interpretation
^^
11-29Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11.3
Model Assumptions
11-30Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Basic Assumptions of the
Probability Distribution
Assumption 1:
The mean of the probability distribution of ε is
0 – that is, the average of the values of ε over
an infinitely long series of experiments is 0 for
each setting of the independent variable x.
This assumption implies that the mean value
of y, E(y), for a given value of x is
E(y) = β0 + β1x.
11-31Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Basic Assumptions of the
Probability Distribution
Assumption 2:
The variance of the probability distribution of
ε is constant for all settings of the
independent variable x. For our straight-line
model, this assumption means that the
variance of ε is equal to a constant, say σ 2
,
for all values of x.
11-32Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Basic Assumptions of the
Probability Distribution
Assumption 3:
The probability distribution of ε is normal.
Assumption 4:
The values of ε associated with any two
observed values of y are independent–that is,
the value of ε associated with one value of y
has no effect on the values of ε associated
with other y values.
11-33Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Basic Assumptions of the
Probability Distribution
.
11-34Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
To estimate the standard deviation σ of ε,
we calculate
We will refer to s as the estimated
standard error of the regression model.
Estimation of σ 2
for a (First-
Order) Straight-Line Model
2 SSE SSE
Degrees of freedom for error 2
= =
−
s
n
( )
( )
2
1
2
ˆˆwhere SSE β= − = −
= −
∑
∑
i i yy xy
yy i
y y SS SS
SS y y
s = s2
=
SSE
n − 2
11-35Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Calculating SSE, s2
, s
Example
You’re a marketing analyst for Hasbro
Toys. You gather the following data:
Ad Expenditure (100$) Sales (Units)
1 1
2 1
3 2
4 2
5 4
Find SSE, s2
, and s.
11-36Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Calculating s2
and s Solution
2 1.1
.36667
2 5 2
SSE
s
n
= = =
− −
.36667 .6055s = =
11-37Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11.4
Assessing the Utility of the
Model: Making Inferences
about the Slope β1
11-38Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
If we make the four assumptions about ε,
the sampling distribution of the least
squares estimator of the slope will be
normal with mean β1 (the true slope) and
standard deviation
Sampling Distribution of
1
ˆ
SSβ
σ
σ =
xx
1
ˆβ
1
ˆβ
11-39Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
We estimate by and refer to
this quantity as the estimated standard
error of the least squares slope .
Sampling Distribution of
1
ˆ
SSβ
=
xx
s
s
1
ˆβ
σ
1
ˆβ
1
ˆβ
11-40Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
A Test of Model Utility: Simple
Linear Regression
11-41Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Interpreting p-Values for β
Coefficients in Regression
Almost all statistical computer software
packages report a two-tailed p-value for each
of the β parameters in the regression model.
For example, in simple linear regression, the
p-value for the two-tailed test H0: β1 = 0
versus Ha: β1 ≠ 0 is given on the printout. If
you want to conduct a one-tailed test of
hypothesis, you will need to adjust the p-
value reported on the printout as follows:
11-42Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Interpreting p-Values for β
Coefficients in Regression
where p is the p-value reported on the printout and
t is the value of the test statistic.
11-43Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11-44Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Test of Slope Coefficient
Example
You’re a marketing analyst for Hasbro Toys.
You find β0 = –.1, β1 = .7 and s = .6055.
Ad Expenditure (100$) Sales (Units)
1 1
2 1
3 2
4 2
5 4
Is the relationship significant
at the .05 level of significance?
^^
11-45Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Test of Slope Coefficient
Solution
• H0:
• Ha:
• α =
• df =
• Critical Value(s):
t0 3.182-3.182
.025
Reject H0 Reject H0
.025
β1 = 0
β1 ≠ 0
.05
5 – 2 = 3
11-46Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Test Statistic
Solution
söβ1
=
s
SSxx
=
.6055
55−
15( )
2
5
= .1914
t =
öβ1
Söβ1
=
.70
.1914
= 3.657
11-47Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Test of Slope Coefficient
Solution
• H0:
• Ha:
• α =
• df =
• Critical Value(s):
t0 3.182-3.182
.025
Reject H0 Reject H0
.025
β1 = 0
β1 ≠ 0
.05
5 – 2 = 3
Test Statistic:
Decision:
Conclusion:
t = 3.657
Reject at α = .05
There is evidence of a
relationship
11-48Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Test of Slope Coefficient
Computer Output
Parameter Estimates
Parameter Standard T for H0:
Variable DF Estimate Error Param=0 Prob>|T|
INTERCEP 1 -0.1000 0.6350 -0.157 0.8849
ADVERT 1 0.7000 0.1914 3.656 0.0354
t = β1 / Sβ
P-Value
Sββ1
1 1
^^^^
11-49Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11.5
The Coefficients of Correlation
and Determination
11-50Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Correlation Models
• Answers ‘How strong is the linear
relationship between two variables?’
• Coefficient of correlation
– Sample correlation coefficient denoted r
– Values range from –1 to +1
– Measures degree of association
– Does not indicate cause–effect
relationship
11-51Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of Correlation
xy
xx yy
SS
r
SS SS
=
SSxy
= x − x( ) y − y( )∑
SSxx
= x − x( )
2
∑
SSyy
= y − y( )
2
∑
where
11-52Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of Correlation
11-53Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of Correlation
11-54Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of Correlation
11-55Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of Correlation
Example
You’re a marketing analyst for Hasbro Toys.
Ad Expenditure (100$) Sales (Units)
1 1
2 1
3 2
4 2
5 4
Calculate the coefficient of
correlation.
11-56Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of Correlation
Solution
SSxy
= x − x( )∑ y − y( )= 7
SSyy
= y − y( )∑
2
= 6
SSxx
= x − x( )
2
∑ = 10
7
.904
10 6
xy
xx yy
SS
r
SS SS
= = =
×
11-57Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
A Test for Linear Correlation
11-58Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Condition Required for a Valid
Test of Correlation
• The sample of (x, y) values is randomly
selected from a normal population.
11-59Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of Correlation
Thinking Challenge
You’re an economist for the county
cooperative. You gather the following data:
Fertilizer (lb.) Yield (lb.)
4 3.0
6 5.5
10 6.5
12 9.0
Find the coefficient of correlation.
© 1984-1994 T/Maker Co.
11-60Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of Correlation
Solution
SSxy
= x − x( )∑ y − y( )= 26
SSyy
= y − y( )∑
2
= 18.5
SSxx
= x − x( )
2
∑ = 40
26
.956
40 18.5
xy
xx yy
SS
r
SS SS
= = =
×
11-61Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of Determination
It represents the proportion of the total sample
variability around y that is explained by the
linear relationship between y and x.
r2
=
Explained Variation
Total Variation
=
SSyy
− SSE
SSyy
= 1−
SSE
SSyy
0 ≤ r2
≤ 1
r2
= (coefficient of correlation)2
11-62Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of
Determination Example
You’re a marketing analyst for Hasbro
Toys. You know r = .904.
Ad Expenditure (100$) Sales (Units)
1 1
2 1
3 2
4 2
5 4
Calculate and interpret the
coefficient of determination.
11-63Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Coefficient of
Determination Solution
r2
= (coefficient of correlation)2
r2
= (.904)2
r2
= .817
Interpretation: About 81.7% of the sample
variation in Sales (y) can be explained by using
Ad $ (x) to predict Sales (y) in the linear model.
11-64Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
r2
Computer Output
Root MSE 0.60553 R-square 0.8167
Dep Mean 2.00000 Adj R-sq 0.7556
C.V. 30.27650
r2
adjusted for number of
explanatory variables &
sample size
r2
11-65Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11.6
Using the Model for Estimation
and Determination
11-66Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Probabilistic Model
• Used to make inferences
– Estimate the mean value of y, E(y) for a
specific x
 Estimate the mean sales for all months during
which $400 (x = 4) is expended on advertising
– Predict a new individual y value for given x
 If we expend $400 in advertising next month, we
want to predict the sales revenue for that month
11-67Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11-68Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
A 100(1 – α)% Confidence
Interval for the Mean Value of
y at x = xp
( )
2
/2
1
ˆ
SS
α
−
± +
p
xx
x x
y t s
n
df = n – 2
ˆ(Estimated standard error of )/2
ˆ α± yy t
11-69Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
A 100(1 – α)% Prediction
Interval for an Individual New
Value of y at x = xp
( )
2
/2
1
ˆ 1
SS
α
−
± + +
p
xx
x x
y t s
n
df = n – 2
(Estimated standard error of prediction)/2
ˆ α±y t
11-70Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Error of estimating the mean
value of y for a given value of x
11-71Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Error of predicting a future
value of y for a given value of x
11-72Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Confidence Interval
Example
You’re a marketing analyst for Hasbro Toys.
You find β0 = –.1, β 1 = .7 and s = .6055.
Ad Expenditure (100$) Sales (Units)
1 1
2 1
3 2
4 2
5 4
Find a 95% confidence interval for
the mean sales when advertising is $4.
^^
11-73Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Confidence Interval Solution
( )
( ) ( )
( ) ( )
( )
2
/2
2
1
ˆ
SS
ˆ .1 .7 4 2.7
4 31
2.7 3.182 .6055
5 10
1.645 ( ) 3.755
α
−
± +
= − + =
−
± +
≤ ≤
p
xx
x x
y t s
n
y
E Y
x to be predicted
11-74Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
A 100(1 – α)% Prediction
Interval for an Individual New
Value of y at x = xp
( )
2
/2
1
ˆ 1
SS
α
−
± + +
p
xx
x x
y t s
n
Note!
df = n – 2
11-75Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Why the Extra ‘S’?
Expected
(Mean) y
y
y we're trying to
predict
Prediction, y^
x
xp
ε
E(y) = β0
+ β1x
^
^
^
yi
= β 0
+ β 1
xi
11-76Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Prediction Interval
Example
You’re a marketing analyst for Hasbro Toys.
You find β0 = –.1, β 1 = .7 and s = .6055.
Ad Expenditure (1000$) Sales (Units)
1 1
2 1
3 2
4 2
5 4
Predict the sales when advertising
is $400. Use a 95% prediction interval.
^^
11-77Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Prediction Interval Solution
( )
( ) ( )
( ) ( )
( )
2
/2
2
4
1
ˆ 1
SS
ˆ .1 .7 4 2.7
4 31
2.7 3.182 .6055 1
5 10
.503 4.897
α
−
± + +
= − + =
−
± + +
≤ ≤
p
xx
x x
y t s
n
y
y
x to be predicted
11-78Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Interval Estimate
Computer Output
Dep Var Pred Std Err Low95% Upp95% Low95% Upp95%
Obs SALES Value Predict Mean Mean Predict Predict
1 1.000 0.600 0.469 -0.892 2.092 -1.837 3.037
2 1.000 1.300 0.332 0.244 2.355 -0.897 3.497
3 2.000 2.000 0.271 1.138 2.861 -0.111 4.111
4 2.000 2.700 0.332 1.644 3.755 0.502 4.897
5 4.000 3.400 0.469 1.907 4.892 0.962 5.837
Predicted y
when x = 4
Confidence
Interval
SY^
Prediction
Interval
11-79Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Confidence intervals for mean
values and prediction intervals
for new values
11-80Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
11.7
A Complete Example
11-81Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
Suppose a fire insurance company wants to
relate the amount of fire damage in major
residential fires to the distance between the
burning house and the nearest fire station.
The study is to be conducted in a large
suburb of a major city; a sample of 15 recent
fires in this suburb is selected. The amount
of damage, y, and the distance between the
fire and the nearest fire station, x, are
recorded for each fire.
11-82Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
11-83Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
Step 1: First, we hypothesize a model to
relate fire damage, y, to the distance from
the nearest fire station, x. We hypothesize a
straight-line probabilistic model:
y = β0 + β1x + ε
11-84Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
Step 2: Use a statistical software package to
estimate the unknown parameters in the
deterministic component of the hypothesized
model. The Excel printout for the simple
linear regression analysis is shown on the
next slide. The least squares estimates of
the slope β1 and intercept β0, highlighted on
the printout, are
1
0
ˆ 4.919331
ˆ 10.277929
β
β
=
=
11-85Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
ˆLeast Squares Equation: 10.278 4.919= +y x
11-86Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
This prediction equation is graphed in the
Minitab scatterplot.
11-87Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
The least squares estimate of the slope,
implies that the estimated mean
damage increases by $4,919 for each
additional mile from the fire station. This
interpretation is valid over the range of x, or
from .7 to 6.1 miles from the station. The
estimated y-intercept, , has the
interpretation that a fire 0 miles from the fire
station has an estimated mean damage of
$10,278.
1
ˆ 4.919β =
0
ˆ 10.278β =
11-88Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
Step 3: Specify the probability distribution of
the random error component ε. The estimate
of the standard deviation σ of ε, highlighted
on the Excel printout is
s = 2.31635
This implies that most of the observed fire
damage (y) values will fall within
approximately 2σ = 4.64 thousand dollars of
their respective predicted values when using
the least squares line.
11-89Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
Step 4: First, test the null hypothesis that the
slope β1 is 0 –that is, that there is no linear
relationship between fire damage and the
distance from the nearest fire station,
against the alternative hypothesis that fire
damage increases as the distance
increases. We test
H0: β1 = 0
Ha: β1 > 0
The two-tailed observed significance level
for testing is approximately 0.
11-90Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
The 95% confidence interval yields (4.070,
5.768).
We estimate (with 95% confidence) that the
interval from $4,070 to $5,768 encloses the
mean increase (β1) in fire damage per
additional mile distance from the fire station.
The coefficient of determination, is r2
= .9235,
which implies that about 92% of the sample
variation in fire damage (y) is explained by the
distance (x) between the fire and the fire
station.
11-91Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
The coefficient of correlation, r, that measures
the strength of the linear relationship between
y and x is not shown on the Excel printout and
must be
calculated. We find
The high correlation confirms our conclusion
that β1 is greater than 0; it appears that fire
damage and distance from the fire station are
positively correlated. All signs point to a
strong linear relationship between y and x.
r = + r2
= .9235 = .96
11-92Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
Step 5: We are now prepared to use the least
squares model. Suppose the insurance
company wants to predict the fire damage if a
major residential fire were to occur 3.5 miles
from the nearest fire station. A 95%
confidence interval for E(y) and prediction
interval for y when x = 3.5 are shown on the
Minitab printout on the next slide.
11-93Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
Step 5: We are now prepared to use the least
11-94Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Example
The predicted value (highlighted on the
printout) is , while the 95% prediction
interval (also highlighted) is (22.3239,
32.6672). Therefore, with 95% confidence we
predict fire damage in a major residential fire
3.5 miles from the nearest station to be
between $22,324 and $32,667.
ˆ 27.496=y
11-95Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Key Ideas
Simple Linear Regression Variables
y = Dependent variable (quantitative)
x = Independent variable (quantitative)
Method of Least Squares Properties
1. average error of prediction = 0
2. sum of squared errors is minimum
11-96Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Key Ideas
Practical Interpretation of y-intercept
predicted y value when x = 0
(no practical interpretation if x = 0 is either
nonsensical or outside range of sample data)
Practical Interpretation of Slope
Increase or decrease in y for every 1-unit increase
in x
11-97Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Key Ideas
First-Order (Straight Line) Model
E(y) = β0 + β1x
where E(y) = mean of y
β0 = y-intercept of line (point where line
intercepts the y-axis)
β1 = slope of line (change in y for every 1-
unit change in x)
11-98Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Key Ideas
Coefficient of Correlation, r
1. Ranges between –1 and 1
2. Measures strength of linear relationship
between y and x
Coefficient of Determination, r2
1. Ranges between 0 and 1
2. Measures proportion of sample variation in y
explained by the model
11-99Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
Key Ideas
Practical Interpretation of Model
Standard Deviation, s
Ninety-five percent of y-values fall within 2s
of their respected predicted values
Width of confidence interval for E(y) will
always be narrower than width of
prediction interval for y

More Related Content

What's hot

Moshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenfaceMoshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenfacewolf
 
A New Approach for Ranking Shadowed Fuzzy Numbers and its Application
A New Approach for Ranking Shadowed Fuzzy Numbers and its ApplicationA New Approach for Ranking Shadowed Fuzzy Numbers and its Application
A New Approach for Ranking Shadowed Fuzzy Numbers and its ApplicationAIRCC Publishing Corporation
 
Regression: A skin-deep dive
Regression: A skin-deep diveRegression: A skin-deep dive
Regression: A skin-deep diveabulyomon
 
161783709 chapter-04-answers
161783709 chapter-04-answers161783709 chapter-04-answers
161783709 chapter-04-answersFiras Husseini
 
Econometrics chapter 5-two-variable-regression-interval-estimation-
Econometrics chapter 5-two-variable-regression-interval-estimation-Econometrics chapter 5-two-variable-regression-interval-estimation-
Econometrics chapter 5-two-variable-regression-interval-estimation-Alamin Milton
 
06 cv mil_learning_and_inference
06 cv mil_learning_and_inference06 cv mil_learning_and_inference
06 cv mil_learning_and_inferencezukun
 
Minghui Conference Cross-Validation Talk
Minghui Conference Cross-Validation TalkMinghui Conference Cross-Validation Talk
Minghui Conference Cross-Validation TalkWei Wang
 
Fisherfaces Face Recognition Algorithm
Fisherfaces Face Recognition AlgorithmFisherfaces Face Recognition Algorithm
Fisherfaces Face Recognition AlgorithmVishnu K N
 
Dummy variable
Dummy variableDummy variable
Dummy variableAkram Ali
 
Diagnostic methods for Building the regression model
Diagnostic methods for Building the regression modelDiagnostic methods for Building the regression model
Diagnostic methods for Building the regression modelMehdi Shayegani
 
Polynomial regression
Polynomial regressionPolynomial regression
Polynomial regressionnaveedaliabad
 
Simple linear regression analysis
Simple linear  regression analysisSimple linear  regression analysis
Simple linear regression analysisNorma Mingo
 
Spurious correlation (updated)
Spurious correlation (updated)Spurious correlation (updated)
Spurious correlation (updated)jemille6
 

What's hot (20)

Chap03 probability
Chap03 probabilityChap03 probability
Chap03 probability
 
Moshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenfaceMoshe Guttmann's slides on eigenface
Moshe Guttmann's slides on eigenface
 
A New Approach for Ranking Shadowed Fuzzy Numbers and its Application
A New Approach for Ranking Shadowed Fuzzy Numbers and its ApplicationA New Approach for Ranking Shadowed Fuzzy Numbers and its Application
A New Approach for Ranking Shadowed Fuzzy Numbers and its Application
 
Regression Analysis
Regression AnalysisRegression Analysis
Regression Analysis
 
Regression: A skin-deep dive
Regression: A skin-deep diveRegression: A skin-deep dive
Regression: A skin-deep dive
 
Distribusi Teoritis
Distribusi TeoritisDistribusi Teoritis
Distribusi Teoritis
 
MUMS: Bayesian, Fiducial, and Frequentist Conference - Including Factors in B...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Including Factors in B...MUMS: Bayesian, Fiducial, and Frequentist Conference - Including Factors in B...
MUMS: Bayesian, Fiducial, and Frequentist Conference - Including Factors in B...
 
161783709 chapter-04-answers
161783709 chapter-04-answers161783709 chapter-04-answers
161783709 chapter-04-answers
 
Econometrics chapter 5-two-variable-regression-interval-estimation-
Econometrics chapter 5-two-variable-regression-interval-estimation-Econometrics chapter 5-two-variable-regression-interval-estimation-
Econometrics chapter 5-two-variable-regression-interval-estimation-
 
06 cv mil_learning_and_inference
06 cv mil_learning_and_inference06 cv mil_learning_and_inference
06 cv mil_learning_and_inference
 
Seattle.Slides.7
Seattle.Slides.7Seattle.Slides.7
Seattle.Slides.7
 
Minghui Conference Cross-Validation Talk
Minghui Conference Cross-Validation TalkMinghui Conference Cross-Validation Talk
Minghui Conference Cross-Validation Talk
 
Fisherfaces Face Recognition Algorithm
Fisherfaces Face Recognition AlgorithmFisherfaces Face Recognition Algorithm
Fisherfaces Face Recognition Algorithm
 
Dummy variable
Dummy variableDummy variable
Dummy variable
 
Diagnostic methods for Building the regression model
Diagnostic methods for Building the regression modelDiagnostic methods for Building the regression model
Diagnostic methods for Building the regression model
 
Polynomial regression
Polynomial regressionPolynomial regression
Polynomial regression
 
Simple linear regression analysis
Simple linear  regression analysisSimple linear  regression analysis
Simple linear regression analysis
 
MUMS Opening Workshop - Model Uncertainty and Uncertain Quantification - Merl...
MUMS Opening Workshop - Model Uncertainty and Uncertain Quantification - Merl...MUMS Opening Workshop - Model Uncertainty and Uncertain Quantification - Merl...
MUMS Opening Workshop - Model Uncertainty and Uncertain Quantification - Merl...
 
Spurious correlation (updated)
Spurious correlation (updated)Spurious correlation (updated)
Spurious correlation (updated)
 
A new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbersA new approach for ranking of intuitionistic fuzzy numbers
A new approach for ranking of intuitionistic fuzzy numbers
 

Viewers also liked

BSE TASIS SHARIAH 50 INDEX
BSE TASIS SHARIAH 50 INDEXBSE TASIS SHARIAH 50 INDEX
BSE TASIS SHARIAH 50 INDEXMisab P.T
 
Stock market indices
Stock market indicesStock market indices
Stock market indicesSakshi Ji
 
Bombaby stock exchange
Bombaby  stock exchangeBombaby  stock exchange
Bombaby stock exchangeManohar Gupta
 
Bse sme exchange business ppt
Bse sme exchange   business pptBse sme exchange   business ppt
Bse sme exchange business pptpriyanshu4jan
 
7. stock market indices
7. stock market indices7. stock market indices
7. stock market indicesAkash Bakshi
 
Bombay stock exchange Advanced Study Guide
Bombay stock exchange Advanced Study GuideBombay stock exchange Advanced Study Guide
Bombay stock exchange Advanced Study GuideJobi Mathai
 
Calculation of Sensex & Nifty
Calculation of Sensex & NiftyCalculation of Sensex & Nifty
Calculation of Sensex & NiftyDinesh VNS
 
bombay stock exchange by Zabihlucky
bombay stock exchange by Zabihlucky bombay stock exchange by Zabihlucky
bombay stock exchange by Zabihlucky Zabih Lucky
 
BSE- Bombay Stock Exchange
BSE- Bombay Stock ExchangeBSE- Bombay Stock Exchange
BSE- Bombay Stock ExchangeAnkur Pandey
 
The Indian Stock Market Journey 1996 to 2016
The Indian Stock Market Journey 1996 to 2016The Indian Stock Market Journey 1996 to 2016
The Indian Stock Market Journey 1996 to 2016Swati Shrivastava
 
BSE and NSE Stock Exchange
BSE and NSE Stock ExchangeBSE and NSE Stock Exchange
BSE and NSE Stock ExchangeDipak Senapati
 
Bombay stock exchange
Bombay stock exchangeBombay stock exchange
Bombay stock exchangeOmkar More
 
STOCK MARKET INDICES
STOCK MARKET INDICESSTOCK MARKET INDICES
STOCK MARKET INDICESRiyas Pk
 
Study: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsStudy: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsLinkedIn
 

Viewers also liked (18)

BSE TASIS SHARIAH 50 INDEX
BSE TASIS SHARIAH 50 INDEXBSE TASIS SHARIAH 50 INDEX
BSE TASIS SHARIAH 50 INDEX
 
BSE & NSE
BSE & NSEBSE & NSE
BSE & NSE
 
Stock market indices
Stock market indicesStock market indices
Stock market indices
 
Bombaby stock exchange
Bombaby  stock exchangeBombaby  stock exchange
Bombaby stock exchange
 
Bse sme exchange business ppt
Bse sme exchange   business pptBse sme exchange   business ppt
Bse sme exchange business ppt
 
7. stock market indices
7. stock market indices7. stock market indices
7. stock market indices
 
Bombay stock exchange Advanced Study Guide
Bombay stock exchange Advanced Study GuideBombay stock exchange Advanced Study Guide
Bombay stock exchange Advanced Study Guide
 
Calculation of Sensex & Nifty
Calculation of Sensex & NiftyCalculation of Sensex & Nifty
Calculation of Sensex & Nifty
 
bombay stock exchange by Zabihlucky
bombay stock exchange by Zabihlucky bombay stock exchange by Zabihlucky
bombay stock exchange by Zabihlucky
 
Bse ppt[2]
Bse ppt[2]Bse ppt[2]
Bse ppt[2]
 
BSE- Bombay Stock Exchange
BSE- Bombay Stock ExchangeBSE- Bombay Stock Exchange
BSE- Bombay Stock Exchange
 
The Indian Stock Market Journey 1996 to 2016
The Indian Stock Market Journey 1996 to 2016The Indian Stock Market Journey 1996 to 2016
The Indian Stock Market Journey 1996 to 2016
 
BSE and NSE Stock Exchange
BSE and NSE Stock ExchangeBSE and NSE Stock Exchange
BSE and NSE Stock Exchange
 
NSE & NIFTY -calculations
NSE & NIFTY -calculationsNSE & NIFTY -calculations
NSE & NIFTY -calculations
 
Nse & Bse
Nse & Bse Nse & Bse
Nse & Bse
 
Bombay stock exchange
Bombay stock exchangeBombay stock exchange
Bombay stock exchange
 
STOCK MARKET INDICES
STOCK MARKET INDICESSTOCK MARKET INDICES
STOCK MARKET INDICES
 
Study: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving CarsStudy: The Future of VR, AR and Self-Driving Cars
Study: The Future of VR, AR and Self-Driving Cars
 

Similar to Simple Linear Regression Model Assumptions and Utility Assessment

Airlines.xlsxDATADestinationSouthwest Fare ($)US Airways Fare .docx
Airlines.xlsxDATADestinationSouthwest Fare ($)US Airways Fare .docxAirlines.xlsxDATADestinationSouthwest Fare ($)US Airways Fare .docx
Airlines.xlsxDATADestinationSouthwest Fare ($)US Airways Fare .docxdaniahendric
 
Newbold_chap14.ppt
Newbold_chap14.pptNewbold_chap14.ppt
Newbold_chap14.pptcfisicaster
 
Newbold_chap12.ppt
Newbold_chap12.pptNewbold_chap12.ppt
Newbold_chap12.pptcfisicaster
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.pptTanyaWadhwani4
 
end behavior.....pptx
end behavior.....pptxend behavior.....pptx
end behavior.....pptxLunaLedezma3
 
ch04sdsdsdsdsdsdsdsdsdsdswewrerertrtr.ppt
ch04sdsdsdsdsdsdsdsdsdsdswewrerertrtr.pptch04sdsdsdsdsdsdsdsdsdsdswewrerertrtr.ppt
ch04sdsdsdsdsdsdsdsdsdsdswewrerertrtr.pptTushar Chaudhari
 
simple regression-1.pdf
simple regression-1.pdfsimple regression-1.pdf
simple regression-1.pdfBabulShaikh1
 
11.2 graphing linear equations in two variables
11.2 graphing linear equations in two variables11.2 graphing linear equations in two variables
11.2 graphing linear equations in two variablesGlenSchlee
 
multiple regression model building
 multiple regression model building multiple regression model building
multiple regression model buildingYesica Adicondro
 
2. polynomial interpolation
2. polynomial interpolation2. polynomial interpolation
2. polynomial interpolationEasyStudy3
 
Chapter 10 Section 3.ppt
Chapter 10 Section 3.pptChapter 10 Section 3.ppt
Chapter 10 Section 3.pptManoloTaquire
 
Chapter 1 linear equations and straight lines
Chapter 1   linear equations and straight linesChapter 1   linear equations and straight lines
Chapter 1 linear equations and straight linessarkissk
 
Econ 103 Homework 2Manu NavjeevanAugust 15, 2022S
Econ 103 Homework 2Manu NavjeevanAugust 15, 2022SEcon 103 Homework 2Manu NavjeevanAugust 15, 2022S
Econ 103 Homework 2Manu NavjeevanAugust 15, 2022SEvonCanales257
 
Two-sample Hypothesis Tests
Two-sample Hypothesis Tests Two-sample Hypothesis Tests
Two-sample Hypothesis Tests mgbardossy
 
Data Analysison Regression
Data Analysison RegressionData Analysison Regression
Data Analysison Regressionjamuga gitulho
 

Similar to Simple Linear Regression Model Assumptions and Utility Assessment (20)

Chap11 simple regression
Chap11 simple regressionChap11 simple regression
Chap11 simple regression
 
2.1 regression
2.1 regression2.1 regression
2.1 regression
 
Airlines.xlsxDATADestinationSouthwest Fare ($)US Airways Fare .docx
Airlines.xlsxDATADestinationSouthwest Fare ($)US Airways Fare .docxAirlines.xlsxDATADestinationSouthwest Fare ($)US Airways Fare .docx
Airlines.xlsxDATADestinationSouthwest Fare ($)US Airways Fare .docx
 
Unit 1.1
Unit 1.1Unit 1.1
Unit 1.1
 
Newbold_chap14.ppt
Newbold_chap14.pptNewbold_chap14.ppt
Newbold_chap14.ppt
 
Newbold_chap12.ppt
Newbold_chap12.pptNewbold_chap12.ppt
Newbold_chap12.ppt
 
Multiple Regression.ppt
Multiple Regression.pptMultiple Regression.ppt
Multiple Regression.ppt
 
end behavior.....pptx
end behavior.....pptxend behavior.....pptx
end behavior.....pptx
 
ch04sdsdsdsdsdsdsdsdsdsdswewrerertrtr.ppt
ch04sdsdsdsdsdsdsdsdsdsdswewrerertrtr.pptch04sdsdsdsdsdsdsdsdsdsdswewrerertrtr.ppt
ch04sdsdsdsdsdsdsdsdsdsdswewrerertrtr.ppt
 
simple regression-1.pdf
simple regression-1.pdfsimple regression-1.pdf
simple regression-1.pdf
 
11.2 graphing linear equations in two variables
11.2 graphing linear equations in two variables11.2 graphing linear equations in two variables
11.2 graphing linear equations in two variables
 
multiple regression model building
 multiple regression model building multiple regression model building
multiple regression model building
 
2. polynomial interpolation
2. polynomial interpolation2. polynomial interpolation
2. polynomial interpolation
 
Chapter 10 Section 3.ppt
Chapter 10 Section 3.pptChapter 10 Section 3.ppt
Chapter 10 Section 3.ppt
 
Simple Regression
Simple RegressionSimple Regression
Simple Regression
 
Chapter 1 linear equations and straight lines
Chapter 1   linear equations and straight linesChapter 1   linear equations and straight lines
Chapter 1 linear equations and straight lines
 
Econ 103 Homework 2Manu NavjeevanAugust 15, 2022S
Econ 103 Homework 2Manu NavjeevanAugust 15, 2022SEcon 103 Homework 2Manu NavjeevanAugust 15, 2022S
Econ 103 Homework 2Manu NavjeevanAugust 15, 2022S
 
Lecture 1.pdf
Lecture 1.pdfLecture 1.pdf
Lecture 1.pdf
 
Two-sample Hypothesis Tests
Two-sample Hypothesis Tests Two-sample Hypothesis Tests
Two-sample Hypothesis Tests
 
Data Analysison Regression
Data Analysison RegressionData Analysison Regression
Data Analysison Regression
 

More from Subas Nandy

Les7e ppt ada_0103
Les7e ppt ada_0103Les7e ppt ada_0103
Les7e ppt ada_0103Subas Nandy
 
Les7e ppt ada_0102
Les7e ppt ada_0102Les7e ppt ada_0102
Les7e ppt ada_0102Subas Nandy
 
Les7e ppt ada_0101
Les7e ppt ada_0101Les7e ppt ada_0101
Les7e ppt ada_0101Subas Nandy
 
Week1 newtons laws_of_motion
Week1 newtons laws_of_motionWeek1 newtons laws_of_motion
Week1 newtons laws_of_motionSubas Nandy
 
052516 week11 quantum computers
052516 week11 quantum computers052516 week11 quantum computers
052516 week11 quantum computersSubas Nandy
 
052516 week11 quantum3
052516 week11 quantum3052516 week11 quantum3
052516 week11 quantum3Subas Nandy
 
052516 week11 quantum2
052516 week11 quantum2052516 week11 quantum2
052516 week11 quantum2Subas Nandy
 
052516 week11 quantum1
052516 week11 quantum1052516 week11 quantum1
052516 week11 quantum1Subas Nandy
 
0511 week10 resonance
0511 week10 resonance0511 week10 resonance
0511 week10 resonanceSubas Nandy
 
0511 week10 second_reflection
0511 week10 second_reflection0511 week10 second_reflection
0511 week10 second_reflectionSubas Nandy
 
0511 week10 reflection and refraction
0511 week10  reflection and refraction0511 week10  reflection and refraction
0511 week10 reflection and refractionSubas Nandy
 
Sound&resonance
Sound&resonanceSound&resonance
Sound&resonanceSubas Nandy
 
051116 week9 waves
051116 week9 waves051116 week9 waves
051116 week9 wavesSubas Nandy
 
050316 week8 c12-electromagnetic_waves
050316 week8 c12-electromagnetic_waves050316 week8 c12-electromagnetic_waves
050316 week8 c12-electromagnetic_wavesSubas Nandy
 
050316 week 8_electromagnetism (1)
050316 week 8_electromagnetism (1)050316 week 8_electromagnetism (1)
050316 week 8_electromagnetism (1)Subas Nandy
 
0503 week8 electromagnetism
0503 week8 electromagnetism0503 week8 electromagnetism
0503 week8 electromagnetismSubas Nandy
 

More from Subas Nandy (20)

Les7e ppt ada_0103
Les7e ppt ada_0103Les7e ppt ada_0103
Les7e ppt ada_0103
 
Les7e ppt ada_0102
Les7e ppt ada_0102Les7e ppt ada_0102
Les7e ppt ada_0102
 
Les7e ppt ada_0101
Les7e ppt ada_0101Les7e ppt ada_0101
Les7e ppt ada_0101
 
Ge273.u10.pp1
Ge273.u10.pp1Ge273.u10.pp1
Ge273.u10.pp1
 
Week1 newtons laws_of_motion
Week1 newtons laws_of_motionWeek1 newtons laws_of_motion
Week1 newtons laws_of_motion
 
Les5e ppt 05
Les5e ppt 05Les5e ppt 05
Les5e ppt 05
 
052516 week11 quantum computers
052516 week11 quantum computers052516 week11 quantum computers
052516 week11 quantum computers
 
052516 week11 quantum3
052516 week11 quantum3052516 week11 quantum3
052516 week11 quantum3
 
052516 week11 quantum2
052516 week11 quantum2052516 week11 quantum2
052516 week11 quantum2
 
052516 week11 quantum1
052516 week11 quantum1052516 week11 quantum1
052516 week11 quantum1
 
Ge273.u9.pp1
Ge273.u9.pp1Ge273.u9.pp1
Ge273.u9.pp1
 
0511 week10 resonance
0511 week10 resonance0511 week10 resonance
0511 week10 resonance
 
0511 week10 second_reflection
0511 week10 second_reflection0511 week10 second_reflection
0511 week10 second_reflection
 
0511 week10 reflection and refraction
0511 week10  reflection and refraction0511 week10  reflection and refraction
0511 week10 reflection and refraction
 
Ge273.u8.pp1
Ge273.u8.pp1Ge273.u8.pp1
Ge273.u8.pp1
 
Sound&resonance
Sound&resonanceSound&resonance
Sound&resonance
 
051116 week9 waves
051116 week9 waves051116 week9 waves
051116 week9 waves
 
050316 week8 c12-electromagnetic_waves
050316 week8 c12-electromagnetic_waves050316 week8 c12-electromagnetic_waves
050316 week8 c12-electromagnetic_waves
 
050316 week 8_electromagnetism (1)
050316 week 8_electromagnetism (1)050316 week 8_electromagnetism (1)
050316 week 8_electromagnetism (1)
 
0503 week8 electromagnetism
0503 week8 electromagnetism0503 week8 electromagnetism
0503 week8 electromagnetism
 

Recently uploaded

Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxpboyjonauth
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Celine George
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxAnaBeatriceAblay2
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfadityarao40181
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 

Recently uploaded (20)

Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Introduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptxIntroduction to AI in Higher Education_draft.pptx
Introduction to AI in Higher Education_draft.pptx
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
Incoming and Outgoing Shipments in 1 STEP Using Odoo 17
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptxENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
ENGLISH5 QUARTER4 MODULE1 WEEK1-3 How Visual and Multimedia Elements.pptx
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Biting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdfBiting mechanism of poisonous snakes.pdf
Biting mechanism of poisonous snakes.pdf
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 

Simple Linear Regression Model Assumptions and Utility Assessment

  • 1. 11-1Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
  • 2. 11-2Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Statistics for Business and Economics Chapter 11 Simple Linear Regression
  • 3. 11-3Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Contents 1. Probabilistic Models 2. Fitting the Model: The Least Squares Approach 3. Model Assumptions 4. Assessing the Utility of the Model: Making Inferences about the Slope β1
  • 4. 11-4Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Contents 5. The Coefficients of Correlation and Determination 6. Using the Model for Estimation and Prediction 7. A Complete Example
  • 5. 11-5Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Learning Objectives • Introduce the straight-line (simple linear regression) model as a means of relating one quantitative variable to another quantitative variable • Assess how well the simple linear regression model fits the sample data
  • 6. 11-6Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Learning Objectives • Introduce the correlation coefficient as a means of relating one quantitative variable to another quantitative variable • Employ the simple linear regression model for predicting the value of one variable from a specified value of another variable
  • 7. 11-7Copyright © 2014, 2011, and 2008 Pearson Education, Inc. 11.1 Probabilistic Models
  • 8. 11-8Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Models • Representation of some phenomenon • Mathematical model is a mathematical expression of some phenomenon • Often describe relationships between variables • Types – Deterministic models – Probabilistic models
  • 9. 11-9Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Deterministic Models • Hypothesize exact relationships • Suitable when prediction error is negligible • Example: force is exactly mass times acceleration – F = m·a © 1984-1994 T/Maker Co.
  • 10. 11-10Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Probabilistic Models • Hypothesize two components – Deterministic – Random error • Example: sales volume (y) is 10 times advertising spending (x) + random error – y = 10x + ε – Random error may be due to factors other than advertising
  • 11. 11-11Copyright © 2014, 2011, and 2008 Pearson Education, Inc. General Form of Probabilistic Models y = Deterministic component + Random error where y is the variable of interest. We always assume that the mean value of the random error equals 0. This is equivalent to assuming that the mean value of y, E(y), equals the deterministic component of the model; that is, E(y) = Deterministic component
  • 12. 11-12Copyright © 2014, 2011, and 2008 Pearson Education, Inc. A First-Order (Straight Line) Probabilistic Model y = β0 + β1x +ε where y = Dependent or response variable (variable to be modeled) x = Independent or predictor variable (variable used as a predictor of y) E(y) = β0 + β1x = Deterministic component ε (epsilon) = Random error component
  • 13. 11-13Copyright © 2014, 2011, and 2008 Pearson Education, Inc. A First-Order (Straight Line) Probabilistic Model y = β0 + β1x +ε β0 (beta zero) = y-intercept of the line, that is, the point at which the line intercepts or cuts through the y-axis β1 (beta one) = slope of the line, that is, the change (amount of increase or decrease) in the deterministic component of y for every 1-unit increase in x
  • 14. 11-14Copyright © 2014, 2011, and 2008 Pearson Education, Inc. [Note: A positive slope implies that E(y) increases by the amount β1 for each unit increase in x. A negative slope implies that E(y) decreases by the amount β1.] A First-Order (Straight Line) Probabilistic Model
  • 15. 11-15Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Five-Step Procedure Step 1: Hypothesize the deterministic component of the model that relates the mean, E(y), to the independent variable x. Step 2: Use the sample data to estimate unknown parameters in the model. Step 3: Specify the probability distribution of the random error term and estimate the standard deviation of this distribution. Step 4: Statistically evaluate the usefulness of the model. Step 5: When satisfied that the model is useful, use it for prediction, estimation, and other purposes.
  • 16. 11-16Copyright © 2014, 2011, and 2008 Pearson Education, Inc. 11.2 Fitting the Model: The Least Squares Approach
  • 17. 11-17Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Scatterplot 1. Plot of all (xi, yi) pairs 2. Suggests how well model will fit 0 20 40 60 0 20 40 60 x y
  • 18. 11-18Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Thinking Challenge 0 20 40 60 0 20 40 60 x y • How would you draw a line through the points? • How do you determine which line ‘fits best’?
  • 19. 11-19Copyright © 2014, 2011, and 2008 Pearson Education, Inc. The least squares line is one that has the following two properties: 1. The sum of the errors equals 0, i.e., mean error = 0. 2. The sum of squared errors (SSE) is smaller than for any other straight-line model, i.e., the error variance is minimum. Least Squares Line 0 1 ˆ ˆˆ β β= +y x
  • 20. 11-20Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Formula for the Least Squares Estimates 0 1 ˆ ˆ: β β− = −y intercept y x 1 ˆ: β = xy xx SS Slope SS where SSxy = xi − x( )∑ yi − y( ) SSxx = xi − x( ) 2 ∑ n = Sample size
  • 21. 11-21Copyright © 2014, 2011, and 2008 Pearson Education, Inc. y-intercept: represents the predicted value of y when x = 0 (Caution: This value will not be meaningful if the value x = 0 is nonsensical or outside the range of the sample data.) slope: represents the increase (or decrease) in y for every 1-unit increase in x (Caution: This interpretation is valid only for x-values within the range of the sample data.) Interpreting the Estimates of β0 and β1 in Simple Liner Regression 1 ˆβ 0 ˆβ
  • 22. 11-22Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Least Squares Graphically ε2 y x ε1 ε3 ε4 ^^ ^ ^ 2 0 1 2 2 ˆ ˆ ˆy xβ β ε= + + 0 1 ˆ ˆˆi iy xβ β= + 2 2 2 2 2 1 2 3 4 1 ˆ ˆ ˆ ˆ ˆLS minimizes n i i ε ε ε ε ε = = + + +∑
  • 23. 11-23Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Least Squares Example You’re a marketing analyst for Hasbro Toys. You gather the following data: Ad Expenditure (100$) Sales (Units) 1 1 2 1 3 2 4 2 5 4 Find the least squares line relating sales and advertising.
  • 24. 11-24Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Scatterplot Sales vs. Advertising 0 1 2 3 4 0 1 2 3 4 5 Sales Advertising
  • 25. 11-25Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Parameter Estimation Solution 15 3 5 5 = = =∑x x 10 2 5 5 = = =∑y y ( )( ) ( )( )3 2 7 = − − = − − = ∑ ∑ xySS x x y y x y ( ) ( ) 2 2 3 10 = − = − = ∑ ∑ xxSS x x x
  • 26. 11-26Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Parameter Estimation Solution ˆ .1 .7y x= − + ( ) ( )0 1 ˆ ˆ 2 .70 3 .10y xβ β= − = − = − 1 7ˆ .7 10 = = =xy xx SS B SS The slope of the least squares line is:
  • 27. 11-27Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Parameter Estimation Computer Output Parameter Estimates Parameter Standard T for H0: Variable DF Estimate Error Param=0 Prob>|T| INTERCEP 1 -0.1000 0.6350 -0.157 0.8849 ADVERT 1 0.7000 0.1914 3.656 0.0354 β0 ^ β1 ^ ˆ .1 .7y x= − +
  • 28. 11-28Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient Interpretation Solution 1. Slope (β1) • Sales Volume (y) is expected to increase by $700 for each $100 increase in advertising (x), over the sampled range of advertising expenditures from $100 to $500 ^ 2. y-Intercept (β0) • Since 0 is outside of the range of the sampled values of x, the y-intercept has no meaningful interpretation ^^
  • 29. 11-29Copyright © 2014, 2011, and 2008 Pearson Education, Inc. 11.3 Model Assumptions
  • 30. 11-30Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Basic Assumptions of the Probability Distribution Assumption 1: The mean of the probability distribution of ε is 0 – that is, the average of the values of ε over an infinitely long series of experiments is 0 for each setting of the independent variable x. This assumption implies that the mean value of y, E(y), for a given value of x is E(y) = β0 + β1x.
  • 31. 11-31Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Basic Assumptions of the Probability Distribution Assumption 2: The variance of the probability distribution of ε is constant for all settings of the independent variable x. For our straight-line model, this assumption means that the variance of ε is equal to a constant, say σ 2 , for all values of x.
  • 32. 11-32Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Basic Assumptions of the Probability Distribution Assumption 3: The probability distribution of ε is normal. Assumption 4: The values of ε associated with any two observed values of y are independent–that is, the value of ε associated with one value of y has no effect on the values of ε associated with other y values.
  • 33. 11-33Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Basic Assumptions of the Probability Distribution .
  • 34. 11-34Copyright © 2014, 2011, and 2008 Pearson Education, Inc. To estimate the standard deviation σ of ε, we calculate We will refer to s as the estimated standard error of the regression model. Estimation of σ 2 for a (First- Order) Straight-Line Model 2 SSE SSE Degrees of freedom for error 2 = = − s n ( ) ( ) 2 1 2 ˆˆwhere SSE β= − = − = − ∑ ∑ i i yy xy yy i y y SS SS SS y y s = s2 = SSE n − 2
  • 35. 11-35Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Calculating SSE, s2 , s Example You’re a marketing analyst for Hasbro Toys. You gather the following data: Ad Expenditure (100$) Sales (Units) 1 1 2 1 3 2 4 2 5 4 Find SSE, s2 , and s.
  • 36. 11-36Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Calculating s2 and s Solution 2 1.1 .36667 2 5 2 SSE s n = = = − − .36667 .6055s = =
  • 37. 11-37Copyright © 2014, 2011, and 2008 Pearson Education, Inc. 11.4 Assessing the Utility of the Model: Making Inferences about the Slope β1
  • 38. 11-38Copyright © 2014, 2011, and 2008 Pearson Education, Inc. If we make the four assumptions about ε, the sampling distribution of the least squares estimator of the slope will be normal with mean β1 (the true slope) and standard deviation Sampling Distribution of 1 ˆ SSβ σ σ = xx 1 ˆβ 1 ˆβ
  • 39. 11-39Copyright © 2014, 2011, and 2008 Pearson Education, Inc. We estimate by and refer to this quantity as the estimated standard error of the least squares slope . Sampling Distribution of 1 ˆ SSβ = xx s s 1 ˆβ σ 1 ˆβ 1 ˆβ
  • 40. 11-40Copyright © 2014, 2011, and 2008 Pearson Education, Inc. A Test of Model Utility: Simple Linear Regression
  • 41. 11-41Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Interpreting p-Values for β Coefficients in Regression Almost all statistical computer software packages report a two-tailed p-value for each of the β parameters in the regression model. For example, in simple linear regression, the p-value for the two-tailed test H0: β1 = 0 versus Ha: β1 ≠ 0 is given on the printout. If you want to conduct a one-tailed test of hypothesis, you will need to adjust the p- value reported on the printout as follows:
  • 42. 11-42Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Interpreting p-Values for β Coefficients in Regression where p is the p-value reported on the printout and t is the value of the test statistic.
  • 43. 11-43Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
  • 44. 11-44Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Test of Slope Coefficient Example You’re a marketing analyst for Hasbro Toys. You find β0 = –.1, β1 = .7 and s = .6055. Ad Expenditure (100$) Sales (Units) 1 1 2 1 3 2 4 2 5 4 Is the relationship significant at the .05 level of significance? ^^
  • 45. 11-45Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Test of Slope Coefficient Solution • H0: • Ha: • α = • df = • Critical Value(s): t0 3.182-3.182 .025 Reject H0 Reject H0 .025 β1 = 0 β1 ≠ 0 .05 5 – 2 = 3
  • 46. 11-46Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Test Statistic Solution söβ1 = s SSxx = .6055 55− 15( ) 2 5 = .1914 t = öβ1 Söβ1 = .70 .1914 = 3.657
  • 47. 11-47Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Test of Slope Coefficient Solution • H0: • Ha: • α = • df = • Critical Value(s): t0 3.182-3.182 .025 Reject H0 Reject H0 .025 β1 = 0 β1 ≠ 0 .05 5 – 2 = 3 Test Statistic: Decision: Conclusion: t = 3.657 Reject at α = .05 There is evidence of a relationship
  • 48. 11-48Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Test of Slope Coefficient Computer Output Parameter Estimates Parameter Standard T for H0: Variable DF Estimate Error Param=0 Prob>|T| INTERCEP 1 -0.1000 0.6350 -0.157 0.8849 ADVERT 1 0.7000 0.1914 3.656 0.0354 t = β1 / Sβ P-Value Sββ1 1 1 ^^^^
  • 49. 11-49Copyright © 2014, 2011, and 2008 Pearson Education, Inc. 11.5 The Coefficients of Correlation and Determination
  • 50. 11-50Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Correlation Models • Answers ‘How strong is the linear relationship between two variables?’ • Coefficient of correlation – Sample correlation coefficient denoted r – Values range from –1 to +1 – Measures degree of association – Does not indicate cause–effect relationship
  • 51. 11-51Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Correlation xy xx yy SS r SS SS = SSxy = x − x( ) y − y( )∑ SSxx = x − x( ) 2 ∑ SSyy = y − y( ) 2 ∑ where
  • 52. 11-52Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Correlation
  • 53. 11-53Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Correlation
  • 54. 11-54Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Correlation
  • 55. 11-55Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Correlation Example You’re a marketing analyst for Hasbro Toys. Ad Expenditure (100$) Sales (Units) 1 1 2 1 3 2 4 2 5 4 Calculate the coefficient of correlation.
  • 56. 11-56Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Correlation Solution SSxy = x − x( )∑ y − y( )= 7 SSyy = y − y( )∑ 2 = 6 SSxx = x − x( ) 2 ∑ = 10 7 .904 10 6 xy xx yy SS r SS SS = = = ×
  • 57. 11-57Copyright © 2014, 2011, and 2008 Pearson Education, Inc. A Test for Linear Correlation
  • 58. 11-58Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Condition Required for a Valid Test of Correlation • The sample of (x, y) values is randomly selected from a normal population.
  • 59. 11-59Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Correlation Thinking Challenge You’re an economist for the county cooperative. You gather the following data: Fertilizer (lb.) Yield (lb.) 4 3.0 6 5.5 10 6.5 12 9.0 Find the coefficient of correlation. © 1984-1994 T/Maker Co.
  • 60. 11-60Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Correlation Solution SSxy = x − x( )∑ y − y( )= 26 SSyy = y − y( )∑ 2 = 18.5 SSxx = x − x( ) 2 ∑ = 40 26 .956 40 18.5 xy xx yy SS r SS SS = = = ×
  • 61. 11-61Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Determination It represents the proportion of the total sample variability around y that is explained by the linear relationship between y and x. r2 = Explained Variation Total Variation = SSyy − SSE SSyy = 1− SSE SSyy 0 ≤ r2 ≤ 1 r2 = (coefficient of correlation)2
  • 62. 11-62Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Determination Example You’re a marketing analyst for Hasbro Toys. You know r = .904. Ad Expenditure (100$) Sales (Units) 1 1 2 1 3 2 4 2 5 4 Calculate and interpret the coefficient of determination.
  • 63. 11-63Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Coefficient of Determination Solution r2 = (coefficient of correlation)2 r2 = (.904)2 r2 = .817 Interpretation: About 81.7% of the sample variation in Sales (y) can be explained by using Ad $ (x) to predict Sales (y) in the linear model.
  • 64. 11-64Copyright © 2014, 2011, and 2008 Pearson Education, Inc. r2 Computer Output Root MSE 0.60553 R-square 0.8167 Dep Mean 2.00000 Adj R-sq 0.7556 C.V. 30.27650 r2 adjusted for number of explanatory variables & sample size r2
  • 65. 11-65Copyright © 2014, 2011, and 2008 Pearson Education, Inc. 11.6 Using the Model for Estimation and Determination
  • 66. 11-66Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Probabilistic Model • Used to make inferences – Estimate the mean value of y, E(y) for a specific x  Estimate the mean sales for all months during which $400 (x = 4) is expended on advertising – Predict a new individual y value for given x  If we expend $400 in advertising next month, we want to predict the sales revenue for that month
  • 67. 11-67Copyright © 2014, 2011, and 2008 Pearson Education, Inc.
  • 68. 11-68Copyright © 2014, 2011, and 2008 Pearson Education, Inc. A 100(1 – α)% Confidence Interval for the Mean Value of y at x = xp ( ) 2 /2 1 ˆ SS α − ± + p xx x x y t s n df = n – 2 ˆ(Estimated standard error of )/2 ˆ α± yy t
  • 69. 11-69Copyright © 2014, 2011, and 2008 Pearson Education, Inc. A 100(1 – α)% Prediction Interval for an Individual New Value of y at x = xp ( ) 2 /2 1 ˆ 1 SS α − ± + + p xx x x y t s n df = n – 2 (Estimated standard error of prediction)/2 ˆ α±y t
  • 70. 11-70Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Error of estimating the mean value of y for a given value of x
  • 71. 11-71Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Error of predicting a future value of y for a given value of x
  • 72. 11-72Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Confidence Interval Example You’re a marketing analyst for Hasbro Toys. You find β0 = –.1, β 1 = .7 and s = .6055. Ad Expenditure (100$) Sales (Units) 1 1 2 1 3 2 4 2 5 4 Find a 95% confidence interval for the mean sales when advertising is $4. ^^
  • 73. 11-73Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Confidence Interval Solution ( ) ( ) ( ) ( ) ( ) ( ) 2 /2 2 1 ˆ SS ˆ .1 .7 4 2.7 4 31 2.7 3.182 .6055 5 10 1.645 ( ) 3.755 α − ± + = − + = − ± + ≤ ≤ p xx x x y t s n y E Y x to be predicted
  • 74. 11-74Copyright © 2014, 2011, and 2008 Pearson Education, Inc. A 100(1 – α)% Prediction Interval for an Individual New Value of y at x = xp ( ) 2 /2 1 ˆ 1 SS α − ± + + p xx x x y t s n Note! df = n – 2
  • 75. 11-75Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Why the Extra ‘S’? Expected (Mean) y y y we're trying to predict Prediction, y^ x xp ε E(y) = β0 + β1x ^ ^ ^ yi = β 0 + β 1 xi
  • 76. 11-76Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Prediction Interval Example You’re a marketing analyst for Hasbro Toys. You find β0 = –.1, β 1 = .7 and s = .6055. Ad Expenditure (1000$) Sales (Units) 1 1 2 1 3 2 4 2 5 4 Predict the sales when advertising is $400. Use a 95% prediction interval. ^^
  • 77. 11-77Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Prediction Interval Solution ( ) ( ) ( ) ( ) ( ) ( ) 2 /2 2 4 1 ˆ 1 SS ˆ .1 .7 4 2.7 4 31 2.7 3.182 .6055 1 5 10 .503 4.897 α − ± + + = − + = − ± + + ≤ ≤ p xx x x y t s n y y x to be predicted
  • 78. 11-78Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Interval Estimate Computer Output Dep Var Pred Std Err Low95% Upp95% Low95% Upp95% Obs SALES Value Predict Mean Mean Predict Predict 1 1.000 0.600 0.469 -0.892 2.092 -1.837 3.037 2 1.000 1.300 0.332 0.244 2.355 -0.897 3.497 3 2.000 2.000 0.271 1.138 2.861 -0.111 4.111 4 2.000 2.700 0.332 1.644 3.755 0.502 4.897 5 4.000 3.400 0.469 1.907 4.892 0.962 5.837 Predicted y when x = 4 Confidence Interval SY^ Prediction Interval
  • 79. 11-79Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Confidence intervals for mean values and prediction intervals for new values
  • 80. 11-80Copyright © 2014, 2011, and 2008 Pearson Education, Inc. 11.7 A Complete Example
  • 81. 11-81Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example Suppose a fire insurance company wants to relate the amount of fire damage in major residential fires to the distance between the burning house and the nearest fire station. The study is to be conducted in a large suburb of a major city; a sample of 15 recent fires in this suburb is selected. The amount of damage, y, and the distance between the fire and the nearest fire station, x, are recorded for each fire.
  • 82. 11-82Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example
  • 83. 11-83Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example Step 1: First, we hypothesize a model to relate fire damage, y, to the distance from the nearest fire station, x. We hypothesize a straight-line probabilistic model: y = β0 + β1x + ε
  • 84. 11-84Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example Step 2: Use a statistical software package to estimate the unknown parameters in the deterministic component of the hypothesized model. The Excel printout for the simple linear regression analysis is shown on the next slide. The least squares estimates of the slope β1 and intercept β0, highlighted on the printout, are 1 0 ˆ 4.919331 ˆ 10.277929 β β = =
  • 85. 11-85Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example ˆLeast Squares Equation: 10.278 4.919= +y x
  • 86. 11-86Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example This prediction equation is graphed in the Minitab scatterplot.
  • 87. 11-87Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example The least squares estimate of the slope, implies that the estimated mean damage increases by $4,919 for each additional mile from the fire station. This interpretation is valid over the range of x, or from .7 to 6.1 miles from the station. The estimated y-intercept, , has the interpretation that a fire 0 miles from the fire station has an estimated mean damage of $10,278. 1 ˆ 4.919β = 0 ˆ 10.278β =
  • 88. 11-88Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example Step 3: Specify the probability distribution of the random error component ε. The estimate of the standard deviation σ of ε, highlighted on the Excel printout is s = 2.31635 This implies that most of the observed fire damage (y) values will fall within approximately 2σ = 4.64 thousand dollars of their respective predicted values when using the least squares line.
  • 89. 11-89Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example Step 4: First, test the null hypothesis that the slope β1 is 0 –that is, that there is no linear relationship between fire damage and the distance from the nearest fire station, against the alternative hypothesis that fire damage increases as the distance increases. We test H0: β1 = 0 Ha: β1 > 0 The two-tailed observed significance level for testing is approximately 0.
  • 90. 11-90Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example The 95% confidence interval yields (4.070, 5.768). We estimate (with 95% confidence) that the interval from $4,070 to $5,768 encloses the mean increase (β1) in fire damage per additional mile distance from the fire station. The coefficient of determination, is r2 = .9235, which implies that about 92% of the sample variation in fire damage (y) is explained by the distance (x) between the fire and the fire station.
  • 91. 11-91Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example The coefficient of correlation, r, that measures the strength of the linear relationship between y and x is not shown on the Excel printout and must be calculated. We find The high correlation confirms our conclusion that β1 is greater than 0; it appears that fire damage and distance from the fire station are positively correlated. All signs point to a strong linear relationship between y and x. r = + r2 = .9235 = .96
  • 92. 11-92Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example Step 5: We are now prepared to use the least squares model. Suppose the insurance company wants to predict the fire damage if a major residential fire were to occur 3.5 miles from the nearest fire station. A 95% confidence interval for E(y) and prediction interval for y when x = 3.5 are shown on the Minitab printout on the next slide.
  • 93. 11-93Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example Step 5: We are now prepared to use the least
  • 94. 11-94Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Example The predicted value (highlighted on the printout) is , while the 95% prediction interval (also highlighted) is (22.3239, 32.6672). Therefore, with 95% confidence we predict fire damage in a major residential fire 3.5 miles from the nearest station to be between $22,324 and $32,667. ˆ 27.496=y
  • 95. 11-95Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Key Ideas Simple Linear Regression Variables y = Dependent variable (quantitative) x = Independent variable (quantitative) Method of Least Squares Properties 1. average error of prediction = 0 2. sum of squared errors is minimum
  • 96. 11-96Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Key Ideas Practical Interpretation of y-intercept predicted y value when x = 0 (no practical interpretation if x = 0 is either nonsensical or outside range of sample data) Practical Interpretation of Slope Increase or decrease in y for every 1-unit increase in x
  • 97. 11-97Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Key Ideas First-Order (Straight Line) Model E(y) = β0 + β1x where E(y) = mean of y β0 = y-intercept of line (point where line intercepts the y-axis) β1 = slope of line (change in y for every 1- unit change in x)
  • 98. 11-98Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Key Ideas Coefficient of Correlation, r 1. Ranges between –1 and 1 2. Measures strength of linear relationship between y and x Coefficient of Determination, r2 1. Ranges between 0 and 1 2. Measures proportion of sample variation in y explained by the model
  • 99. 11-99Copyright © 2014, 2011, and 2008 Pearson Education, Inc. Key Ideas Practical Interpretation of Model Standard Deviation, s Ninety-five percent of y-values fall within 2s of their respected predicted values Width of confidence interval for E(y) will always be narrower than width of prediction interval for y

Editor's Notes

  1. As a result of this class, you will be able to...
  2. As a result of this class, you will be able to...
  3. As a result of this class, you will be able to...
  4. As a result of this class, you will be able to...
  5. :1, 1, 3
  6. .
  7. :1, 1, 3
  8. :1, 1, 3
  9. :1, 1, 3
  10. ‘Standard Error’ is the estimated standard deviation of the sampling distribution, sbP.
  11. :1, 1, 3
  12. :1, 1, 3
  13. Note the 1 under the radical in the standard error formula. The effect of the extra Syx is to increase the width of the interval. This will be seen in the interval bands.
  14. The error in predicting some future value of Y is the sum of 2 errors: 1. the error of estimating the mean Y, E(Y|X) 2. the random error that is a component of the value of Y to be predicted. Even if we knew the population regression line exactly, we would still make  error.
  15. :1, 1, 3
  16. As a result of this class, you will be able to...
  17. As a result of this class, you will be able to...
  18. As a result of this class, you will be able to...
  19. As a result of this class, you will be able to...
  20. As a result of this class, you will be able to...