SlideShare a Scribd company logo
1 of 42
Section 10.3-1
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Lecture Slides
Elementary Statistics
Twelfth Edition
and the Triola Statistics Series
by Mario F. Triola
Section 10.3-2
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Chapter 10
Correlation and Regression
10-1 Review and Preview
10-2 Correlation
10-3 Regression
10-4 Prediction Intervals and Variation
10-5 Multiple Regression
10-6 Nonlinear Regression
Section 10.3-3
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Key Concept
In Part 1 of this section we find the equation of the straight
line that best fits the paired sample data. That equation
algebraically describes the relationship between two
variables.
The best-fitting straight line is called a regression line and
its equation is called the regression equation.
In Part 2, we discuss marginal change, influential points,
and residual plots as tools for analyzing correlation and
regression results.
Section 10.3-4
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Part 1: Basic Concepts of Regression
Section 10.3-5
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Regression
The typical equation of a straight line y = mx + b is
expressed in the form ŷ = b0 + b1x, where b0 is the y-
intercept and b1 is the slope.
The regression equation expresses a relationship
between x (called the explanatory variable, predictor
variable or independent variable), and ŷ (called the
response variable or dependent variable).
Section 10.3-6
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Definitions
Regression Equation:
Given a collection of paired sample data, the regression
line (or line of best fit, or least-squares line) is the
straight line that “best” fits the scatterplot of data.
The regression equation ŷ = b0 + b1x algebraically
describes the regression line.
Section 10.3-7
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Notation for
Regression Equation
Population
Parameter
Sample
Statistic
y-Intercept of
regression equation
β0 b0
Slope of regression
equation
β1 b1
Equation of the
regression line
y = β0 + β1x ŷ = b0 + b1x
Section 10.3-8
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Requirements
1. The sample of paired (x, y) data is a random sample
of quantitative data.
2. Visual examination of the scatterplot shows that the
points approximate a straight-line pattern.
3. Any outliers must be removed if they are known to be
errors. Consider the effects of any outliers that are
not known errors.
Section 10.3-9
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Formulas for b1 and b0
Technology will compute these values.
1
Slope: y
x
s
b r
s

0 1
y-intercept: b y b x
 
Section 10.3-10
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example
Let us return to the example from Section 10.2. We
would like to use the explanatory variable, x, shoe print
length, to predict the response variable, y, height.
The data are listed below:
Section 10.3-11
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
Requirement Check:
1. The data are assumed to be a simple random
sample.
2. The scatterplot showed a roughly straight-line
pattern.
3. There are no outliers.
The use of technology is recommended for finding the
equation of a regression line.
Section 10.3-12
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example – Continued
Section 10.3-13
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
All these technologies show that the regression equation
can be expressed as:
Now we use the formulas to determine the regression
equation (technology is recommended).
ˆ 125 1.73
y x
 
Section 10.3-14
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example
Recall from the previous section that r = 0.591269.
Technology can be used to find the values of the sample
means and sample standard deviations used below.
(These are the same coefficients found using
technology)
  
1
0 1
4.87391
0.591269 1.72745
1.66823
177.3 1.72745 30.04 125.40740
y
x
s
b r
s
b y b x
  
    
Section 10.3-15
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example
Graph the regression equation on a scatterplot:
ˆ 125 1.73
y x
 
Section 10.3-16
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
1. Use the regression equation for predictions only if the
graph of the regression line on the scatterplot confirms
that the regression line fits the points reasonably well.
2. Use the regression equation for predictions only if the
linear correlation coefficient r indicates that there is a
linear correlation between the two variables (as
described in Section 10-2).
Using the Regression
Equation for Predictions
Section 10.3-17
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
3. Use the regression line for predictions only if the data do
not go much beyond the scope of the available sample
data. (Predicting too far beyond the scope of the
available sample data is called extrapolation, and it
could result in bad predictions.)
4. If the regression equation does not appear to be useful
for making predictions, the best predicted value of a
variable is its sample mean.
Using the Regression
Equation for Predictions
Section 10.3-18
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Strategy for Predicting Values of y
Section 10.3-19
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
If the regression equation is not a good model, the best
predicted value of y is simply , the mean of the y
values.
Remember, this strategy applies to linear patterns of
points in a scatterplot.
If the scatterplot shows a pattern that is not a straight-line
pattern, other methods apply, as described in Section 10-
6.
Using the Regression
Equation for Predictions
y
Section 10.3-20
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example
Use the 5 pairs of shoe print lengths and heights to
predict the height of a person with a shoe print length of
29 cm.
The regression line does not fit the points well. The
correlation is r = 0.591, which suggests there is not a
linear correlation (the P-value was 0.294).
The best predicted height is simply the mean of the
sample heights:
177.3 cm
y 
Section 10.3-21
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example
Use the 40 pairs of shoe print lengths from Data Set 2 in
Appendix B to predict the height of a person with a shoe
print length of 29 cm.
Now, the regression line does fit the points well, and the
correlation of r = 0.813 suggests that there is a linear
correlation (the P-value is 0.000).
Section 10.3-22
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
Using technology we obtain the regression equation and
scatterplot:
Section 10.3-23
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
The given shoe length of 29 cm is not beyond the scope
of the available data, so substitute in 29 cm into the
regression model:
A person with a shoe length of 29 cm is predicted to be
174.3 cm tall.
 
ˆ 80.9 3.22
80.9 3.22 29
174.3 cm
y x
 
 

Section 10.3-24
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Part 2: Beyond the Basics of Regression
Section 10.3-25
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Definition
In working with two variables related by a regression
equation, the marginal change in a variable is the amount
that it changes when the other variable changes by
exactly one unit.
The slope b1 in the regression equation represents the
marginal change in y that occurs when x changes by one
unit.
Section 10.3-26
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example
For the 40 pairs of shoe print lengths and heights, the
regression equation was:
The slope of 3.22 tells us that if we increase shoe print
length by 1 cm, the predicted height of a person
increases by 3.22 cm.
ˆ 80.9 3.22
y x
 
Section 10.3-27
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Definition
In a scatterplot, an outlier is a point lying far away from
the other data points.
Paired sample data may include one or more influential
points, which are points that strongly affect the graph of
the regression line.
Section 10.3-28
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example
For the 40 pairs of shoe prints and heights, observe what
happens if we include this additional data point:
x = 35 cm and y = 25 cm
Section 10.3-29
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
The additional point is an influential point because the
graph of the regression line because the graph of the
regression line did change considerably.
The additional point is also an outlier because it is far
from the other points.
Section 10.3-30
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
For a pair of sample x and y values, the residual is the
difference between the observed sample value of y and
the y-value that is predicted by using the regression
equation.
That is:
Definition
ˆ
residual observed predicted
y y y y
   
Section 10.3-31
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Residuals
Section 10.3-32
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
A straight line satisfies the least-squares property if the
sum of the squares of the residuals is the smallest sum
possible.
Definition
Section 10.3-33
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
A residual plot is a scatterplot of the (x, y) values after
each of the y-coordinate values has been replaced by
the residual value y – ŷ (where ŷ denotes the predicted
value of y).
That is, a residual plot is a graph of the points (x, y – ŷ).
Definition
Section 10.3-34
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Residual Plot Analysis
When analyzing a residual plot, look for a pattern in the
way the points are configured, and use these criteria:
The residual plot should not have any obvious patterns
(not even a straight line pattern). This confirms that the
scatterplot of the sample data is a straight-line pattern.
The residual plot should not become thicker (or thinner)
when viewed from left to right. This confirms the
requirement that for different fixed values of x, the
distributions of the corresponding y values all have the
same standard deviation.
Section 10.3-35
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example
The shoe print and height data are used to generate the
following residual plot:
Section 10.3-36
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
The residual plot becomes thicker, which suggests that
the requirement of equal standard deviations is violated.
Section 10.3-37
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
On the following slides are three residual plots.
Observe what is good or bad about the individual
regression models.
Section 10.3-38
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
Regression model is a good model:
Section 10.3-39
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
Distinct pattern: sample data may not follow a straight-
line pattern.
Section 10.3-40
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Example - Continued
Residual plot becoming thicker: equal standard
deviations violated.
Section 10.3-41
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Complete Regression Analysis
1. Construct a scatterplot and verify that the pattern of
the points is approximately a straight-line pattern
without outliers. (If there are outliers, consider their
effects by comparing results that include the outliers
to results that exclude the outliers.)
2. Construct a residual plot and verify that there is no
pattern (other than a straight-line pattern) and also
verify that the residual plot does not become thicker
(or thinner).
Section 10.3-42
Copyright © 2014, 2012, 2010 Pearson Education, Inc.
Complete Regression Analysis
3. Use a histogram and/or normal quantile plot to
confirm that the values of the residuals have a
distribution that is approximately normal.
4. Consider any effects of a pattern over time.

More Related Content

Similar to Chapter 10 Section 3.ppt

Chapter 8 Section 5.ppt
Chapter 8 Section 5.pptChapter 8 Section 5.ppt
Chapter 8 Section 5.pptManoloTaquire
 
Newbold_chap14.ppt
Newbold_chap14.pptNewbold_chap14.ppt
Newbold_chap14.pptcfisicaster
 
Regression Analysis.pptx
Regression Analysis.pptxRegression Analysis.pptx
Regression Analysis.pptxAjka3
 
An econometric model for Linear Regression using Statistics
An econometric model for Linear Regression using StatisticsAn econometric model for Linear Regression using Statistics
An econometric model for Linear Regression using StatisticsIRJET Journal
 
Multiple Linear Regression.pptx
Multiple Linear Regression.pptxMultiple Linear Regression.pptx
Multiple Linear Regression.pptxBHUSHANKPATEL
 
Diagnostic methods for Building the regression model
Diagnostic methods for Building the regression modelDiagnostic methods for Building the regression model
Diagnostic methods for Building the regression modelMehdi Shayegani
 
Pat05 ppt 0105
Pat05 ppt 0105Pat05 ppt 0105
Pat05 ppt 0105wzuri
 
Curve_Fitting.pdf
Curve_Fitting.pdfCurve_Fitting.pdf
Curve_Fitting.pdfIrfan Khan
 
Comparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distributionComparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distributionIOSRJM
 
Chap14 analysis of categorical data
Chap14 analysis of categorical dataChap14 analysis of categorical data
Chap14 analysis of categorical dataJudianto Nugroho
 
Lecture 13 Regression & Correlation.ppt
Lecture 13 Regression & Correlation.pptLecture 13 Regression & Correlation.ppt
Lecture 13 Regression & Correlation.pptshakirRahman10
 

Similar to Chapter 10 Section 3.ppt (20)

Chapter 8 Section 5.ppt
Chapter 8 Section 5.pptChapter 8 Section 5.ppt
Chapter 8 Section 5.ppt
 
Newbold_chap14.ppt
Newbold_chap14.pptNewbold_chap14.ppt
Newbold_chap14.ppt
 
Business statistics homework help
Business statistics homework helpBusiness statistics homework help
Business statistics homework help
 
Chap11 simple regression
Chap11 simple regressionChap11 simple regression
Chap11 simple regression
 
Regression Analysis.pptx
Regression Analysis.pptxRegression Analysis.pptx
Regression Analysis.pptx
 
An econometric model for Linear Regression using Statistics
An econometric model for Linear Regression using StatisticsAn econometric model for Linear Regression using Statistics
An econometric model for Linear Regression using Statistics
 
Multiple Linear Regression.pptx
Multiple Linear Regression.pptxMultiple Linear Regression.pptx
Multiple Linear Regression.pptx
 
Correlation and regression in r
Correlation and regression in rCorrelation and regression in r
Correlation and regression in r
 
Diagnostic methods for Building the regression model
Diagnostic methods for Building the regression modelDiagnostic methods for Building the regression model
Diagnostic methods for Building the regression model
 
Chapter 12
Chapter 12Chapter 12
Chapter 12
 
Pat05 ppt 0105
Pat05 ppt 0105Pat05 ppt 0105
Pat05 ppt 0105
 
Ijetcas14 546
Ijetcas14 546Ijetcas14 546
Ijetcas14 546
 
Curve_Fitting.pdf
Curve_Fitting.pdfCurve_Fitting.pdf
Curve_Fitting.pdf
 
Comparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distributionComparing the methods of Estimation of Three-Parameter Weibull distribution
Comparing the methods of Estimation of Three-Parameter Weibull distribution
 
Engineering mathematics 1
Engineering mathematics 1Engineering mathematics 1
Engineering mathematics 1
 
Ch08(2)
Ch08(2)Ch08(2)
Ch08(2)
 
Ch08(1)
Ch08(1)Ch08(1)
Ch08(1)
 
Chap14 analysis of categorical data
Chap14 analysis of categorical dataChap14 analysis of categorical data
Chap14 analysis of categorical data
 
Lecture 13 Regression & Correlation.ppt
Lecture 13 Regression & Correlation.pptLecture 13 Regression & Correlation.ppt
Lecture 13 Regression & Correlation.ppt
 
Ijetcas14 608
Ijetcas14 608Ijetcas14 608
Ijetcas14 608
 

More from ManoloTaquire

Chapter 2 Section 3.ppt
Chapter 2 Section 3.pptChapter 2 Section 3.ppt
Chapter 2 Section 3.pptManoloTaquire
 
Chapter 4 Section 3.ppt
Chapter 4 Section 3.pptChapter 4 Section 3.ppt
Chapter 4 Section 3.pptManoloTaquire
 
Chapter 3 Section 2.ppt
Chapter 3 Section 2.pptChapter 3 Section 2.ppt
Chapter 3 Section 2.pptManoloTaquire
 
Chapter 2 Section 4.ppt
Chapter 2 Section 4.pptChapter 2 Section 4.ppt
Chapter 2 Section 4.pptManoloTaquire
 
Chapter 6 Section 7.ppt
Chapter 6 Section 7.pptChapter 6 Section 7.ppt
Chapter 6 Section 7.pptManoloTaquire
 
Chapter 4 Section 2.ppt
Chapter 4 Section 2.pptChapter 4 Section 2.ppt
Chapter 4 Section 2.pptManoloTaquire
 
Chapter 1 Section 4.ppt
Chapter 1 Section 4.pptChapter 1 Section 4.ppt
Chapter 1 Section 4.pptManoloTaquire
 
Chapter 3 Section 4.ppt
Chapter 3 Section 4.pptChapter 3 Section 4.ppt
Chapter 3 Section 4.pptManoloTaquire
 
Chapter 2 Section 1.ppt
Chapter 2 Section 1.pptChapter 2 Section 1.ppt
Chapter 2 Section 1.pptManoloTaquire
 
Chapter 3 Section 1.ppt
Chapter 3 Section 1.pptChapter 3 Section 1.ppt
Chapter 3 Section 1.pptManoloTaquire
 
Chapter 3 Section 3.ppt
Chapter 3 Section 3.pptChapter 3 Section 3.ppt
Chapter 3 Section 3.pptManoloTaquire
 
Chapter 6 Section 5.ppt
Chapter 6 Section 5.pptChapter 6 Section 5.ppt
Chapter 6 Section 5.pptManoloTaquire
 
Chapter 4 Section 1.ppt
Chapter 4 Section 1.pptChapter 4 Section 1.ppt
Chapter 4 Section 1.pptManoloTaquire
 
Chapter 4 Section 5.ppt
Chapter 4 Section 5.pptChapter 4 Section 5.ppt
Chapter 4 Section 5.pptManoloTaquire
 
Chapter 1 Section 3.ppt
Chapter 1 Section 3.pptChapter 1 Section 3.ppt
Chapter 1 Section 3.pptManoloTaquire
 
Chapter 6 Section 3.ppt
Chapter 6 Section 3.pptChapter 6 Section 3.ppt
Chapter 6 Section 3.pptManoloTaquire
 
Chapter 6 Section 1.ppt
Chapter 6 Section 1.pptChapter 6 Section 1.ppt
Chapter 6 Section 1.pptManoloTaquire
 
Chapter 5 Section (5).ppt
Chapter 5 Section (5).pptChapter 5 Section (5).ppt
Chapter 5 Section (5).pptManoloTaquire
 
Chapter 5 Section (3).ppt
Chapter 5 Section (3).pptChapter 5 Section (3).ppt
Chapter 5 Section (3).pptManoloTaquire
 
Chapter 1 Section 2.ppt
Chapter 1 Section 2.pptChapter 1 Section 2.ppt
Chapter 1 Section 2.pptManoloTaquire
 

More from ManoloTaquire (20)

Chapter 2 Section 3.ppt
Chapter 2 Section 3.pptChapter 2 Section 3.ppt
Chapter 2 Section 3.ppt
 
Chapter 4 Section 3.ppt
Chapter 4 Section 3.pptChapter 4 Section 3.ppt
Chapter 4 Section 3.ppt
 
Chapter 3 Section 2.ppt
Chapter 3 Section 2.pptChapter 3 Section 2.ppt
Chapter 3 Section 2.ppt
 
Chapter 2 Section 4.ppt
Chapter 2 Section 4.pptChapter 2 Section 4.ppt
Chapter 2 Section 4.ppt
 
Chapter 6 Section 7.ppt
Chapter 6 Section 7.pptChapter 6 Section 7.ppt
Chapter 6 Section 7.ppt
 
Chapter 4 Section 2.ppt
Chapter 4 Section 2.pptChapter 4 Section 2.ppt
Chapter 4 Section 2.ppt
 
Chapter 1 Section 4.ppt
Chapter 1 Section 4.pptChapter 1 Section 4.ppt
Chapter 1 Section 4.ppt
 
Chapter 3 Section 4.ppt
Chapter 3 Section 4.pptChapter 3 Section 4.ppt
Chapter 3 Section 4.ppt
 
Chapter 2 Section 1.ppt
Chapter 2 Section 1.pptChapter 2 Section 1.ppt
Chapter 2 Section 1.ppt
 
Chapter 3 Section 1.ppt
Chapter 3 Section 1.pptChapter 3 Section 1.ppt
Chapter 3 Section 1.ppt
 
Chapter 3 Section 3.ppt
Chapter 3 Section 3.pptChapter 3 Section 3.ppt
Chapter 3 Section 3.ppt
 
Chapter 6 Section 5.ppt
Chapter 6 Section 5.pptChapter 6 Section 5.ppt
Chapter 6 Section 5.ppt
 
Chapter 4 Section 1.ppt
Chapter 4 Section 1.pptChapter 4 Section 1.ppt
Chapter 4 Section 1.ppt
 
Chapter 4 Section 5.ppt
Chapter 4 Section 5.pptChapter 4 Section 5.ppt
Chapter 4 Section 5.ppt
 
Chapter 1 Section 3.ppt
Chapter 1 Section 3.pptChapter 1 Section 3.ppt
Chapter 1 Section 3.ppt
 
Chapter 6 Section 3.ppt
Chapter 6 Section 3.pptChapter 6 Section 3.ppt
Chapter 6 Section 3.ppt
 
Chapter 6 Section 1.ppt
Chapter 6 Section 1.pptChapter 6 Section 1.ppt
Chapter 6 Section 1.ppt
 
Chapter 5 Section (5).ppt
Chapter 5 Section (5).pptChapter 5 Section (5).ppt
Chapter 5 Section (5).ppt
 
Chapter 5 Section (3).ppt
Chapter 5 Section (3).pptChapter 5 Section (3).ppt
Chapter 5 Section (3).ppt
 
Chapter 1 Section 2.ppt
Chapter 1 Section 2.pptChapter 1 Section 2.ppt
Chapter 1 Section 2.ppt
 

Recently uploaded

Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Disha Kariya
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.pptRamjanShidvankar
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxVishalSingh1417
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17Celine George
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfSanaAli374401
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.christianmathematics
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingTeacherCyreneCayanan
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactPECB
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin ClassesCeline George
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfAdmir Softic
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxheathfieldcps1
 

Recently uploaded (20)

Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..Sports & Fitness Value Added Course FY..
Sports & Fitness Value Added Course FY..
 
Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Application orientated numerical on hev.ppt
Application orientated numerical on hev.pptApplication orientated numerical on hev.ppt
Application orientated numerical on hev.ppt
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
An Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdfAn Overview of Mutual Funds Bcom Project.pdf
An Overview of Mutual Funds Bcom Project.pdf
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
fourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writingfourth grading exam for kindergarten in writing
fourth grading exam for kindergarten in writing
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Beyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global ImpactBeyond the EU: DORA and NIS 2 Directive's Global Impact
Beyond the EU: DORA and NIS 2 Directive's Global Impact
 
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17  How to Extend Models Using Mixin ClassesMixin Classes in Odoo 17  How to Extend Models Using Mixin Classes
Mixin Classes in Odoo 17 How to Extend Models Using Mixin Classes
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
The basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptxThe basics of sentences session 2pptx copy.pptx
The basics of sentences session 2pptx copy.pptx
 

Chapter 10 Section 3.ppt

  • 1. Section 10.3-1 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Lecture Slides Elementary Statistics Twelfth Edition and the Triola Statistics Series by Mario F. Triola
  • 2. Section 10.3-2 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Chapter 10 Correlation and Regression 10-1 Review and Preview 10-2 Correlation 10-3 Regression 10-4 Prediction Intervals and Variation 10-5 Multiple Regression 10-6 Nonlinear Regression
  • 3. Section 10.3-3 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Key Concept In Part 1 of this section we find the equation of the straight line that best fits the paired sample data. That equation algebraically describes the relationship between two variables. The best-fitting straight line is called a regression line and its equation is called the regression equation. In Part 2, we discuss marginal change, influential points, and residual plots as tools for analyzing correlation and regression results.
  • 4. Section 10.3-4 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Part 1: Basic Concepts of Regression
  • 5. Section 10.3-5 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Regression The typical equation of a straight line y = mx + b is expressed in the form ŷ = b0 + b1x, where b0 is the y- intercept and b1 is the slope. The regression equation expresses a relationship between x (called the explanatory variable, predictor variable or independent variable), and ŷ (called the response variable or dependent variable).
  • 6. Section 10.3-6 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Definitions Regression Equation: Given a collection of paired sample data, the regression line (or line of best fit, or least-squares line) is the straight line that “best” fits the scatterplot of data. The regression equation ŷ = b0 + b1x algebraically describes the regression line.
  • 7. Section 10.3-7 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Notation for Regression Equation Population Parameter Sample Statistic y-Intercept of regression equation β0 b0 Slope of regression equation β1 b1 Equation of the regression line y = β0 + β1x ŷ = b0 + b1x
  • 8. Section 10.3-8 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Requirements 1. The sample of paired (x, y) data is a random sample of quantitative data. 2. Visual examination of the scatterplot shows that the points approximate a straight-line pattern. 3. Any outliers must be removed if they are known to be errors. Consider the effects of any outliers that are not known errors.
  • 9. Section 10.3-9 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Formulas for b1 and b0 Technology will compute these values. 1 Slope: y x s b r s  0 1 y-intercept: b y b x  
  • 10. Section 10.3-10 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example Let us return to the example from Section 10.2. We would like to use the explanatory variable, x, shoe print length, to predict the response variable, y, height. The data are listed below:
  • 11. Section 10.3-11 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued Requirement Check: 1. The data are assumed to be a simple random sample. 2. The scatterplot showed a roughly straight-line pattern. 3. There are no outliers. The use of technology is recommended for finding the equation of a regression line.
  • 12. Section 10.3-12 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example – Continued
  • 13. Section 10.3-13 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued All these technologies show that the regression equation can be expressed as: Now we use the formulas to determine the regression equation (technology is recommended). ˆ 125 1.73 y x  
  • 14. Section 10.3-14 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example Recall from the previous section that r = 0.591269. Technology can be used to find the values of the sample means and sample standard deviations used below. (These are the same coefficients found using technology)    1 0 1 4.87391 0.591269 1.72745 1.66823 177.3 1.72745 30.04 125.40740 y x s b r s b y b x        
  • 15. Section 10.3-15 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example Graph the regression equation on a scatterplot: ˆ 125 1.73 y x  
  • 16. Section 10.3-16 Copyright © 2014, 2012, 2010 Pearson Education, Inc. 1. Use the regression equation for predictions only if the graph of the regression line on the scatterplot confirms that the regression line fits the points reasonably well. 2. Use the regression equation for predictions only if the linear correlation coefficient r indicates that there is a linear correlation between the two variables (as described in Section 10-2). Using the Regression Equation for Predictions
  • 17. Section 10.3-17 Copyright © 2014, 2012, 2010 Pearson Education, Inc. 3. Use the regression line for predictions only if the data do not go much beyond the scope of the available sample data. (Predicting too far beyond the scope of the available sample data is called extrapolation, and it could result in bad predictions.) 4. If the regression equation does not appear to be useful for making predictions, the best predicted value of a variable is its sample mean. Using the Regression Equation for Predictions
  • 18. Section 10.3-18 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Strategy for Predicting Values of y
  • 19. Section 10.3-19 Copyright © 2014, 2012, 2010 Pearson Education, Inc. If the regression equation is not a good model, the best predicted value of y is simply , the mean of the y values. Remember, this strategy applies to linear patterns of points in a scatterplot. If the scatterplot shows a pattern that is not a straight-line pattern, other methods apply, as described in Section 10- 6. Using the Regression Equation for Predictions y
  • 20. Section 10.3-20 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example Use the 5 pairs of shoe print lengths and heights to predict the height of a person with a shoe print length of 29 cm. The regression line does not fit the points well. The correlation is r = 0.591, which suggests there is not a linear correlation (the P-value was 0.294). The best predicted height is simply the mean of the sample heights: 177.3 cm y 
  • 21. Section 10.3-21 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example Use the 40 pairs of shoe print lengths from Data Set 2 in Appendix B to predict the height of a person with a shoe print length of 29 cm. Now, the regression line does fit the points well, and the correlation of r = 0.813 suggests that there is a linear correlation (the P-value is 0.000).
  • 22. Section 10.3-22 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued Using technology we obtain the regression equation and scatterplot:
  • 23. Section 10.3-23 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued The given shoe length of 29 cm is not beyond the scope of the available data, so substitute in 29 cm into the regression model: A person with a shoe length of 29 cm is predicted to be 174.3 cm tall.   ˆ 80.9 3.22 80.9 3.22 29 174.3 cm y x     
  • 24. Section 10.3-24 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Part 2: Beyond the Basics of Regression
  • 25. Section 10.3-25 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Definition In working with two variables related by a regression equation, the marginal change in a variable is the amount that it changes when the other variable changes by exactly one unit. The slope b1 in the regression equation represents the marginal change in y that occurs when x changes by one unit.
  • 26. Section 10.3-26 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example For the 40 pairs of shoe print lengths and heights, the regression equation was: The slope of 3.22 tells us that if we increase shoe print length by 1 cm, the predicted height of a person increases by 3.22 cm. ˆ 80.9 3.22 y x  
  • 27. Section 10.3-27 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Definition In a scatterplot, an outlier is a point lying far away from the other data points. Paired sample data may include one or more influential points, which are points that strongly affect the graph of the regression line.
  • 28. Section 10.3-28 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example For the 40 pairs of shoe prints and heights, observe what happens if we include this additional data point: x = 35 cm and y = 25 cm
  • 29. Section 10.3-29 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued The additional point is an influential point because the graph of the regression line because the graph of the regression line did change considerably. The additional point is also an outlier because it is far from the other points.
  • 30. Section 10.3-30 Copyright © 2014, 2012, 2010 Pearson Education, Inc. For a pair of sample x and y values, the residual is the difference between the observed sample value of y and the y-value that is predicted by using the regression equation. That is: Definition ˆ residual observed predicted y y y y    
  • 31. Section 10.3-31 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Residuals
  • 32. Section 10.3-32 Copyright © 2014, 2012, 2010 Pearson Education, Inc. A straight line satisfies the least-squares property if the sum of the squares of the residuals is the smallest sum possible. Definition
  • 33. Section 10.3-33 Copyright © 2014, 2012, 2010 Pearson Education, Inc. A residual plot is a scatterplot of the (x, y) values after each of the y-coordinate values has been replaced by the residual value y – ŷ (where ŷ denotes the predicted value of y). That is, a residual plot is a graph of the points (x, y – ŷ). Definition
  • 34. Section 10.3-34 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Residual Plot Analysis When analyzing a residual plot, look for a pattern in the way the points are configured, and use these criteria: The residual plot should not have any obvious patterns (not even a straight line pattern). This confirms that the scatterplot of the sample data is a straight-line pattern. The residual plot should not become thicker (or thinner) when viewed from left to right. This confirms the requirement that for different fixed values of x, the distributions of the corresponding y values all have the same standard deviation.
  • 35. Section 10.3-35 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example The shoe print and height data are used to generate the following residual plot:
  • 36. Section 10.3-36 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued The residual plot becomes thicker, which suggests that the requirement of equal standard deviations is violated.
  • 37. Section 10.3-37 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued On the following slides are three residual plots. Observe what is good or bad about the individual regression models.
  • 38. Section 10.3-38 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued Regression model is a good model:
  • 39. Section 10.3-39 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued Distinct pattern: sample data may not follow a straight- line pattern.
  • 40. Section 10.3-40 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Example - Continued Residual plot becoming thicker: equal standard deviations violated.
  • 41. Section 10.3-41 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Complete Regression Analysis 1. Construct a scatterplot and verify that the pattern of the points is approximately a straight-line pattern without outliers. (If there are outliers, consider their effects by comparing results that include the outliers to results that exclude the outliers.) 2. Construct a residual plot and verify that there is no pattern (other than a straight-line pattern) and also verify that the residual plot does not become thicker (or thinner).
  • 42. Section 10.3-42 Copyright © 2014, 2012, 2010 Pearson Education, Inc. Complete Regression Analysis 3. Use a histogram and/or normal quantile plot to confirm that the values of the residuals have a distribution that is approximately normal. 4. Consider any effects of a pattern over time.