SlideShare a Scribd company logo
1 of 75
Regression Analysis
Multiple Regression
[ Cross-Sectional Data ]
Learning Objectives
Explain the linear multiple regression
model [for cross-sectional data]
Interpret linear multiple regression
computer output
Explain multicollinearity
Describe the types of multiple regression
models
Regression Modeling Steps
Define problem or question
Specify model
Collect data
Do descriptive data analysis
Estimate unknown parameters
Evaluate model
Use model for prediction
Simple vs. Multiple
  represents the
unit change in Y
per unit change in
X .
 Does not take into
account any other
variable besides
single independent
variable.
 i represents the unit
change in Y per unit
change in Xi.
 Takes into account
the effect of other
i s.
 “Net regression
coefficient.”
Assumptions
Linearity - the Y variable is linearly related
to the value of the X variable.
Independence of Error - the error
(residual) is independent for each value of X.
Homoscedasticity - the variation around
the line of regression be constant for all values
of X.
Normality - the values of Y be normally
distributed at each value of X.
Goal
Develop a statistical model that
can predict the values of a
dependent (response) variable
based upon the values of the
independent (explanatory)
variables.
Simple Regression
A statistical model that utilizes
one quantitative independent
variable “X” to predict the
quantitative dependent
variable “Y.”
Multiple Regression
A statistical model that utilizes two
or more quantitative and
qualitative explanatory variables
(x1,..., xp) to predict a quantitative
dependent variable Y.
Caution: have at least two or more quantitative
explanatory variables (rule of thumb)
Multiple Regression Model
X2
X1
Y
e
Hypotheses
 H0: 1 = 2 = 3 = ... = P = 0
 H1: At least one regression
coefficient is not equal to zero
Hypotheses (alternate format)
H0: i = 0
H1: i  0
Types of Models
Positive linear relationship
Negative linear relationship
No relationship between X and Y
Positive curvilinear relationship
U-shaped curvilinear
Negative curvilinear relationship
Multiple Regression Models
Multiple
Regression
Models
Linear
Dummy
Variable
Linear
Non-
Linear
Inter-
action
Poly-
Nomial
Square
Root
Log Reciprocal Exponential
Multiple Regression Equations
This is too
complicated! You’ve got to
be kiddin’!
Multiple Regression Models
Multiple
Regression
Models
Linear
Dummy
Variable
Linear
Non-
Linear
Inter-
action
Poly-
Nomial
Square
Root
Log Reciprocal Exponential
Linear Model
Relationship between one dependent & two
or more independent variables is a linear
function




 




 P
P X
X
X
Y 
2
2
1
1
0
Dependent
(response)
variable
Independent
(explanatory)
variables
Population
slopes
Population
Y-intercept
Random
error
Method of Least Squares
The straight line that best fits the data.
Determine the straight line for which the
differences between the actual values (Y)
and the values that would be predicted
from the fitted line of regression (Y-hat)
are as small as possible.
Measures of Variation
Explained variation (sum of
squares due to regression)
Unexplained variation (error sum
of squares)
Total sum of squares
Coefficient of Multiple Determination
When null hypothesis
is rejected, a
relationship between Y
and the X variables
exists.
Strength measured by
R2 [ several types ]
Coefficient of Multiple
Determination
R2
y.123- - -P
The proportion of Y that is
explained by the set of
explanatory variables selected
Standard Error of the Estimate
sy.x
the measure of
variability
around the
line of
regression
Confidence interval estimates
»True mean
Y.X
»Individual
Y-hati
Interval Bands [from simple regression]
X
Y
X
Yi
= b0
+ b1
X
^
Xgiven
_
Multiple Regression Equation
Y-hat = 0 + 1x1 + 2x2 + ... + PxP + 
where:
0 = y-intercept {a constant value}
1 = slope of Y with variable x1 holding the
variables x2, x3, ..., xP effects constant
P = slope of Y with variable xP holding all
other variables’ effects constant
Who is in Charge?
Mini-Case
Predict the consumption of home
heating oil during January for
homes located around Screne Lakes.
Two explanatory variables are
selected - - average daily
atmospheric temperature (oF) and
the amount of attic insulation (“).
O il (G a l) Te m p Insula tion
275.30 40 3
363.80 27 3
164.30 40 10
40.80 73 6
94.30 64 6
230.90 34 6
366.70 9 6
300.60 8 10
237.80 23 10
121.40 63 3
31.40 65 10
203.50 41 6
441.10 21 3
323.00 38 3
52.50 58 10
Mini-Case
(0F)
Develop a model for
estimating heating oil
used for a single family
home in the month of
January based on average
temperature and amount
of insulation in inches.
Mini-Case
What preliminary conclusions can home
owners draw from the data?
What could a home owner expect heating
oil consumption (in gallons) to be if the
outside temperature is 15 oF when the
attic insulation is 10 inches thick?
Multiple Regression Equation
[mini-case]
Dependent variable: Gallons Consumed
-------------------------------------------------------------------------------------
Standard T
Parameter Estimate Error Statistic P-Value
--------------------------------------------------------------------------------------
CONSTANT 562.151 21.0931 26.6509 0.0000
Insulation -20.0123 2.34251 -8.54313 0.0000
Temperature -5.43658 0.336216 -16.1699 0.0000
--------------------------------------------------------------------------------------
R-squared = 96.561 percent
R-squared (adjusted for d.f.) = 95.9879 percent
Standard Error of Est. = 26.0138
+
Multiple Regression Equation
[mini-case]
Y-hat = 562.15 - 5.44x1 - 20.01x2
where: x1 = temperature [degrees F]
x2 = attic insulation [inches]
Multiple Regression Equation
[mini-case]
Y-hat = 562.15 - 5.44x1 - 20.01x2
thus:
 For a home with zero inches of attic
insulation and an outside temperature
of 0 oF, 562.15 gallons of heating oil
would be consumed.
[ caution .. data boundaries .. extrapolation ]
Extrapolation
Y
Interpolation
X
Extrapolation Extrapolation
Relevant Range
Multiple Regression Equation
[mini-case]
Y-hat = 562.15 - 5.44x1 - 20.01x2
 For a home with zero attic insulation and an outside
temperature of zero, 562.15 gallons of heating oil would be
consumed. [ caution .. data boundaries .. extrapolation ]
 For each incremental increase in
degree F of temperature, for a given
amount of attic insulation, heating oil
consumption drops 5.44 gallons.
+
Multiple Regression Equation
[mini-case]
Y-hat = 562.15 - 5.44x1 - 20.01x2
 For a home with zero attic insulation and an outside temperature of zero,
562 gallons of heating oil would be consumed. [ caution … ]
 For each incremental increase in degree F of temperature, for a given
amount of attic insulation, heating oil consumption drops 5.44 gallons.
For each incremental increase in inches
of attic insulation, at a given temperature,
heating oil consumption drops 20.01
gallons.
Multiple Regression Prediction
[mini-case]
Y-hat = 562.15 - 5.44x1 - 20.01x2
with x1 = 15oF and x2 = 10 inches
Y-hat = 562.15 - 5.44(15) - 20.01(10)
= 280.45 gallons consumed
Coefficient of Multiple Determination
[mini-case]
R2
y.12 = .9656
96.56 percent of the variation in
heating oil can be explained by
the variation in temperature and
insulation.
Coefficient of Multiple Determination
Proportion of variation in Y ‘explained’ by all
X variables taken together
R2
Y.12 = Explained variation = SSR
Total variation SST
Never decreases when new X variable is added
to model
– Only Y values determine SST
– Disadvantage when comparing models
Proportion of variation in Y ‘explained’ by all
X variables taken together
Reflects
– Sample size
– Number of independent variables
Smaller [more conservative] than R2
Y.12
Used to compare models
Coefficient of Multiple Determination
Adjusted
Coefficient of Multiple Determination
(adjusted)
R2
(adj) y.123- - -P
The proportion of Y that is explained by the
set of independent [explanatory] variables
selected, adjusted for the number of
independent variables and the sample size.
Coefficient of Multiple Determination
(adjusted) [Mini-Case]
R2
adj = 0.9599
95.99 percent of the variation in
heating oil consumption can be
explained by the model - adjusted
for number of independent variables
and the sample size
Coefficient of Partial Determination
Proportion of variation in Y ‘explained’ by
variable XP holding all others constant
Must estimate separate models
Denoted R2
Y1.2 in two X variables case
– Coefficient of partial determination of X1 with Y
holding X2 constant
Useful in selecting X variables
Coefficient of Partial
Determination [p. 878]
R2
y1.234 --- P
The coefficient of partial variation of
variable Y with x1 holding constant
the effects of variables x2, x3, x4, ... xP.
Coefficient of Partial Determination
[Mini-Case]
R2
y1.2 = 0.9561
For a fixed (constant) amount of
insulation, 95.61 percent of the variation
in heating oil can be explained by the
variation in average atmospheric
temperature. [p. 879]
Coefficient of Partial Determination
[Mini-Case]
R2
y2.1 = 0.8588
For a fixed (constant) temperature,
85.88 percent of the variation in
heating oil can be explained by the
variation in amount of insulation.
Testing Overall Significance
Shows if there is a linear relationship
between all X variables together & Y
Uses p-value
Hypotheses
– H0: 1 = 2 = ... = P = 0
»No linear relationship
– H1: At least one coefficient is not 0
»At least one X variable affects Y
Examines the contribution of a set of X
variables to the relationship with Y
Null hypothesis:
– Variables in set do not improve significantly
the model when all other variables are included
Must estimate separate models
Used in selecting X variables
Testing Model Portions
Diagnostic Checking
H0 retain or reject
If reject - {p-value  0.05}
R2
adj
Correlation matrix
Partial correlation matrix
Multicollinearity
High correlation between X variables
Coefficients measure combined effect
Leads to unstable coefficients depending on
X variables in model
Always exists; matter of degree
Example: Using both total number of rooms
and number of bedrooms as explanatory
variables in same model
Detecting Multicollinearity
Examine correlation matrix
– Correlations between pairs of X variables are
more than with Y variable
Few remedies
– Obtain new sample data
– Eliminate one correlated X variable
Evaluating Multiple Regression Model Steps
Examine variation measures
Do residual analysis
Test parameter significance
– Overall model
– Portions of model
– Individual coefficients
Test for multicollinearity
Multiple Regression Models
Multiple
Regression
Models
Linear
Dummy
Variable
Linear
Non-
Linear
Inter-
action
Poly-
Nomial
Square
Root
Log Reciprocal Exponential
Dummy-Variable Regression Model
Involves categorical X variable with
two levels
– e.g., female-male, employed-not employed, etc.
Dummy-Variable Regression Model
Involves categorical X variable with
two levels
– e.g., female-male, employed-not employed, etc.
Variable levels coded 0 & 1
Dummy-Variable Regression Model
Involves categorical X variable with
two levels
– e.g., female-male, employed-not employed, etc.
Variable levels coded 0 & 1
Assumes only intercept is different
– Slopes are constant across categories
Dummy-Variable Model Relationships
Y
X1
0
0
Same slopes b1
b0
b0 + b2
Females
Males
Dummy Variables
 Permits use of
qualitative data
(e.g.: seasonal, class
standing, location,
gender).
 0, 1 coding
(nominative data)
 As part of Diagnostic
Checking;
incorporate outliers
(i.e.: large residuals)
and influence
measures.
Multiple Regression Models
Multiple
Regression
Models
Linear
Dummy
Variable
Linear
Non-
Linear
Inter-
action
Poly-
Nomial
Square
Root
Log Reciprocal Exponential
Interaction Regression Model
Hypothesizes interaction between pairs of X
variables
– Response to one X variable varies at different
levels of another X variable
Contains two-way cross product terms
Y = 0 + 1x1 + 2x2 + 3x1x2 + 
Can be combined with other models
e.g. dummy variable models
Effect of Interaction
Given:
Without interaction term, effect of X1 on Y
is measured by 1
With interaction term, effect of X1 on
Y is measured by 1 + 3X2
– Effect increases as X2i increases
Y X X X X
i i i i i i
    
    
0 1 1 2 2 3 1 2
Interaction Example
X1
4
8
12
0
0 1
0.5 1.5
Y Y = 1 + 2X1 + 3X2 + 4X1X2
Interaction Example
X1
4
8
12
0
0 1
0.5 1.5
Y Y = 1 + 2X1 + 3X2 + 4X1X2
Y = 1 + 2X1 + 3(0) + 4X1(0) = 1 + 2X1
Interaction Example
Y
X1
4
8
12
0
0 1
0.5 1.5
Y = 1 + 2X1 + 3X2 + 4X1X2
Y = 1 + 2X1 + 3(1) + 4X1(1) = 4 + 6X1
Y = 1 + 2X1 + 3(0) + 4X1(0) = 1 + 2X1
Interaction Example
Effect (slope) of X1 on Y does depend on X2 value
X1
4
8
12
0
0 1
0.5 1.5
Y Y = 1 + 2X1 + 3X2 + 4X1X2
Y = 1 + 2X1 + 3(1) + 4X1(1) = 4 + 6X1
Y = 1 + 2X1 + 3(0) + 4X1(0) = 1 + 2X1
Multiple Regression Models
Multiple
Regression
Models
Linear
Dummy
Variable
Linear
Non-
Linear
Inter-
action
Poly-
Nomial
Square
Root
Log Reciprocal Exponential
Inherently Linear Models
Non-linear models that can be expressed in
linear form
– Can be estimated by least square in linear form
Require data transformation
Y
X1
Curvilinear Model Relationships
Y
X1
Y
X1
Y
X1
Logarithmic Transformation
Y
X1
1 > 0
1 < 0
Y =  + 1 lnx1 + 2 lnx2 + 
Square-Root Transformation
Y
X1
Y X X
i i i i
   
   
0 1 1 2 2
1 > 0
1 < 0
Reciprocal Transformation
Y
X1
1 > 0
1 < 0
i
i
i
i
X
X
Y 


 



2
2
1
1
0
1
1
Asymptote
Exponential Transformation
Y
X1
1 > 0
1 < 0
Y e
i
X X
i
i i
  
  

0 1 1 2 2
Overview
Explained the linear multiple regression
model
Interpreted linear multiple regression
computer output
Explained multicollinearity
Described the types of multiple regression
models
Source of Elaborate Slides
Prentice Hall, Inc
Levine, et. all, First Edition
Regression Analysis
[Multiple Regression]
*** End of Presentation ***
Questions?
multiple.ppt
multiple.ppt

More Related Content

Similar to multiple.ppt

Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inference
Kemal İnciroğlu
 
15 ch ken black solution
15 ch ken black solution15 ch ken black solution
15 ch ken black solution
Krunal Shah
 

Similar to multiple.ppt (20)

Multiple Regression
Multiple RegressionMultiple Regression
Multiple Regression
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
An Overview of Simple Linear Regression
An Overview of Simple Linear RegressionAn Overview of Simple Linear Regression
An Overview of Simple Linear Regression
 
Simple lin regress_inference
Simple lin regress_inferenceSimple lin regress_inference
Simple lin regress_inference
 
Simple & Multiple Regression Analysis
Simple & Multiple Regression AnalysisSimple & Multiple Regression Analysis
Simple & Multiple Regression Analysis
 
15 ch ken black solution
15 ch ken black solution15 ch ken black solution
15 ch ken black solution
 
Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help
 
Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Correlation and Reg...
Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Correlation and Reg...Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Correlation and Reg...
Javier Garcia - Verdugo Sanchez - Six Sigma Training - W2 Correlation and Reg...
 
Regression
RegressionRegression
Regression
 
Multivariate time series
Multivariate time seriesMultivariate time series
Multivariate time series
 
Solution2.pdf
Solution2.pdfSolution2.pdf
Solution2.pdf
 
Regression Analysis , A statistical approch to analysis data.pptx
Regression Analysis , A statistical  approch to analysis data.pptxRegression Analysis , A statistical  approch to analysis data.pptx
Regression Analysis , A statistical approch to analysis data.pptx
 
Linear regression and correlation analysis ppt @ bec doms
Linear regression and correlation analysis ppt @ bec domsLinear regression and correlation analysis ppt @ bec doms
Linear regression and correlation analysis ppt @ bec doms
 
Regression Presentation.pptx
Regression Presentation.pptxRegression Presentation.pptx
Regression Presentation.pptx
 
01_SLR_final (1).pptx
01_SLR_final (1).pptx01_SLR_final (1).pptx
01_SLR_final (1).pptx
 
604_multiplee.ppt
604_multiplee.ppt604_multiplee.ppt
604_multiplee.ppt
 
introduction to measurements.pptx
introduction to measurements.pptxintroduction to measurements.pptx
introduction to measurements.pptx
 
linear Regression, multiple Regression and Annova
linear Regression, multiple Regression and Annovalinear Regression, multiple Regression and Annova
linear Regression, multiple Regression and Annova
 
OLS chapter
OLS chapterOLS chapter
OLS chapter
 
Chapter6
Chapter6Chapter6
Chapter6
 

Recently uploaded

SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
CaitlinCummins3
 
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
EADTU
 
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
中 央社
 
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes GuàrdiaPersonalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
EADTU
 

Recently uploaded (20)

SURVEY I created for uni project research
SURVEY I created for uni project researchSURVEY I created for uni project research
SURVEY I created for uni project research
 
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
Transparency, Recognition and the role of eSealing - Ildiko Mazar and Koen No...
 
The Liver & Gallbladder (Anatomy & Physiology).pptx
The Liver &  Gallbladder (Anatomy & Physiology).pptxThe Liver &  Gallbladder (Anatomy & Physiology).pptx
The Liver & Gallbladder (Anatomy & Physiology).pptx
 
Major project report on Tata Motors and its marketing strategies
Major project report on Tata Motors and its marketing strategiesMajor project report on Tata Motors and its marketing strategies
Major project report on Tata Motors and its marketing strategies
 
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community PartnershipsSpring gala 2024 photo slideshow - Celebrating School-Community Partnerships
Spring gala 2024 photo slideshow - Celebrating School-Community Partnerships
 
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading RoomSternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
Sternal Fractures & Dislocations - EMGuidewire Radiology Reading Room
 
diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....diagnosting testing bsc 2nd sem.pptx....
diagnosting testing bsc 2nd sem.pptx....
 
Observing-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptxObserving-Correct-Grammar-in-Making-Definitions.pptx
Observing-Correct-Grammar-in-Making-Definitions.pptx
 
Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"Mattingly "AI and Prompt Design: LLMs with NER"
Mattingly "AI and Prompt Design: LLMs with NER"
 
UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024UChicago CMSC 23320 - The Best Commit Messages of 2024
UChicago CMSC 23320 - The Best Commit Messages of 2024
 
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptxCOMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
COMMUNICATING NEGATIVE NEWS - APPROACHES .pptx
 
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文會考英文
 
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjjStl Algorithms in C++ jjjjjjjjjjjjjjjjjj
Stl Algorithms in C++ jjjjjjjjjjjjjjjjjj
 
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes GuàrdiaPersonalisation of Education by AI and Big Data - Lourdes Guàrdia
Personalisation of Education by AI and Big Data - Lourdes Guàrdia
 
male presentation...pdf.................
male presentation...pdf.................male presentation...pdf.................
male presentation...pdf.................
 
OSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & SystemsOSCM Unit 2_Operations Processes & Systems
OSCM Unit 2_Operations Processes & Systems
 
Including Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdfIncluding Mental Health Support in Project Delivery, 14 May.pdf
Including Mental Health Support in Project Delivery, 14 May.pdf
 
The Story of Village Palampur Class 9 Free Study Material PDF
The Story of Village Palampur Class 9 Free Study Material PDFThe Story of Village Palampur Class 9 Free Study Material PDF
The Story of Village Palampur Class 9 Free Study Material PDF
 
Rich Dad Poor Dad ( PDFDrive.com )--.pdf
Rich Dad Poor Dad ( PDFDrive.com )--.pdfRich Dad Poor Dad ( PDFDrive.com )--.pdf
Rich Dad Poor Dad ( PDFDrive.com )--.pdf
 
How to Manage Website in Odoo 17 Studio App.pptx
How to Manage Website in Odoo 17 Studio App.pptxHow to Manage Website in Odoo 17 Studio App.pptx
How to Manage Website in Odoo 17 Studio App.pptx
 

multiple.ppt