SlideShare a Scribd company logo
1 of 11
MULTIPLE
REGRESSION
Multiple regression generally explains the relationship between
multiple independent or predictor variables and one dependent or
criterion variable.
𝑦 = 𝛽0 + 𝛽1𝑥1 + 𝛽2𝑥2 + 𝛽3𝑥3 + ⋯ . +𝛽𝑘𝑥𝑘 + 𝑢
𝛽0 is the intercept
𝛽1 is the parameter associated with x1.
𝛽2is the parameter associated with x2, and so on
Assumptions for multiple regression
1) normality
assumption
for any specific value of the independent
variable, the values of the y variable are
normally distributed
2) equal-variance
assumption
(assumption of
homoscedasticity)
The variances (or standard deviations) for the
y variables are the same for each value of the
independent variable.
If the errors do not have a constant
variance, they are said to be
heteroscedastic.
There are a number of formal statistical tests
for heteroscedasticity: the Goldfeld-Quandt
test, White’s general test
Assumptions for multiple regression
3) the linearity
assumption
there is a linear relationship between the
dependent variable and the independent
variables
The covariance between the error terms over time is zero. In other
words, it is assumed that the errors are uncorrelated with one
another.
The formal method to detect autocorrelation is Durbin and Watson
test, Breusch–Godfrey test
Durbin-Watson (DW) is a test for first order
autocorrelation - it tests only for a relationship between
an error and its immediately previous value
 the DW test statistic is approximately equal to 2(1−𝜌).
 Since 𝜌 is a correlation, it implies that −1≤ 𝜌 ≤1
 𝜌 is bounded to lie between −1 and +1.
The corresponding limits for DW as 0≤DW ≤4.
Consider now the implication of DW taking one of three
important values (0, 2, and 4):
 𝜌=0, DW =2 This is the case where there is no autocorrelation in the
residuals. So roughly speaking, the null hypothesis would not be
rejected if DW is near 2→i.e. there is little evidence of autocorrelation.
 𝜌=1, DW =0 This corresponds to the case where there is perfect
positive autocorrelation in the residuals.
 𝜌 =− 1, DW =4 This corresponds to the case where there is perfect
negative autocorrelation in the residual
The rejection, non-
rejection, and
inconclusive
regions for DW test
Autocorrelation can be violated in two ways
model misspecification
If an important
independent variable is
omitted or if an incorrect
functional form is used,
the residuals may not be
independent. The
solution to this dilemma
is to find the proper
functional form or to
include the proper
independent variables
time-sequenced data
Regression analysis is performed on data
taken over time, the residuals are often
correlated (serial correlation or
autocorrelation).
Positive autocorrelation means that the
residual in time period j tends to have the
same sign as the residual in time period (j-
k), where k is the lag in time periods.
Negative autocorrelation means that the
residual in time period j tends to have the
opposite sign as the residual in time period
(j-k).
Assumptions for multiple regression
4) the nonmulti-
collinearity
assumption
the independent variables are not correlated
5) the
independence
assumption
The values for the y variables are independent
The strength of the relationship between the
independent variables and the dependent
variable is measured by a correlation
coefficient. This multiple correlation
coefficient is symbolized by R. The value of
R can range from 0 to +1. The closer to +1,
the stronger the relationship; the closer to
0, the weaker the relationship.
Multicollinearity occurs when there are high correlations
between two or more predictor variables
Data-based multicollinearity Structural multicollinearity
caused by poorly designed
experiments, data that is
100% observational, or data
collection methods that
cannot be manipulated
caused by the researcher,
creating new predictor variables
Causes for multicollinearity
 Insufficient data. In some cases, collecting more data can resolve
the issue.
 Dummy variables may be incorrectly used. For example, the
researcher may fail to exclude one category, or add a dummy
variable for every category (e.g. spring, summer, autumn, winter).
 Including a variable in the regression that is actually a
combination of two other variables. For example, including “total
investment income” when total investment income = income from
stocks and bonds + income from savings interest.
 Including two identical (or almost identical) variables. For
example, weight in pounds and weight in kilos, or investment
income and savings/bond income.
Solutions to the problem of multicollinearity
 Ignore it, if the model is otherwise adequate, i.e. statistically
and in terms of each coefficient being of a plausible
magnitude and having an appropriate sign;
 Drop one of the collinear variables, so that the problem
disappears;
 Transform the highly correlated variables into a ratio and
include only the ratio and not the individual variables in the
regression;
 A problem with the data than with the model, so that there is
insufficient information in the sample to obtain estimates for
all of the coefficients

More Related Content

Similar to Multiple regression .pptx

Heteroscedasticity
HeteroscedasticityHeteroscedasticity
HeteroscedasticityMuhammad Ali
 
Regression with Time Series Data
Regression with Time Series DataRegression with Time Series Data
Regression with Time Series DataRizano Ahdiat R
 
Tempest in teapot
Tempest in teapotTempest in teapot
Tempest in teapotGaetan Lion
 
Research Methodology Module-06
Research Methodology Module-06Research Methodology Module-06
Research Methodology Module-06Kishor Ade
 
Outlying and Influential Data In Regression Diagnostics .docx
Outlying and Influential Data In Regression Diagnostics .docxOutlying and Influential Data In Regression Diagnostics .docx
Outlying and Influential Data In Regression Diagnostics .docxkarlhennesey
 
Logistic regression
Logistic regressionLogistic regression
Logistic regressionAyurdata
 
Introduction to Limited Dependent variable
Introduction to Limited Dependent variableIntroduction to Limited Dependent variable
Introduction to Limited Dependent variableAshok Dsouza
 
ML4 Regression.pptx
ML4 Regression.pptxML4 Regression.pptx
ML4 Regression.pptxDayal Sati
 
IDBR-DP-TaverneLambert
IDBR-DP-TaverneLambertIDBR-DP-TaverneLambert
IDBR-DP-TaverneLambertCedric Taverne
 
Scale development khalid-key concepts
Scale development khalid-key conceptsScale development khalid-key concepts
Scale development khalid-key conceptsKhalid Mahmood
 
Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help HelpWithAssignment.com
 
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5Daniel Katz
 
A Topic on REGRESSION Analysis conducted pptx
A Topic on REGRESSION Analysis conducted pptxA Topic on REGRESSION Analysis conducted pptx
A Topic on REGRESSION Analysis conducted pptxzeusrex4815162342
 

Similar to Multiple regression .pptx (20)

Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
 
Regression with Time Series Data
Regression with Time Series DataRegression with Time Series Data
Regression with Time Series Data
 
Simple linear regressionn and Correlation
Simple linear regressionn and CorrelationSimple linear regressionn and Correlation
Simple linear regressionn and Correlation
 
Tempest in teapot
Tempest in teapotTempest in teapot
Tempest in teapot
 
Research Methodology Module-06
Research Methodology Module-06Research Methodology Module-06
Research Methodology Module-06
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
 
Outlying and Influential Data In Regression Diagnostics .docx
Outlying and Influential Data In Regression Diagnostics .docxOutlying and Influential Data In Regression Diagnostics .docx
Outlying and Influential Data In Regression Diagnostics .docx
 
Logistic regression
Logistic regressionLogistic regression
Logistic regression
 
Introduction to Limited Dependent variable
Introduction to Limited Dependent variableIntroduction to Limited Dependent variable
Introduction to Limited Dependent variable
 
stats notes.doc
stats notes.docstats notes.doc
stats notes.doc
 
ML4 Regression.pptx
ML4 Regression.pptxML4 Regression.pptx
ML4 Regression.pptx
 
M8.logreg.ppt
M8.logreg.pptM8.logreg.ppt
M8.logreg.ppt
 
M8.logreg.ppt
M8.logreg.pptM8.logreg.ppt
M8.logreg.ppt
 
Modelo Generalizado
Modelo GeneralizadoModelo Generalizado
Modelo Generalizado
 
Logistics regression
Logistics regressionLogistics regression
Logistics regression
 
IDBR-DP-TaverneLambert
IDBR-DP-TaverneLambertIDBR-DP-TaverneLambert
IDBR-DP-TaverneLambert
 
Scale development khalid-key concepts
Scale development khalid-key conceptsScale development khalid-key concepts
Scale development khalid-key concepts
 
Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help Get Multiple Regression Assignment Help
Get Multiple Regression Assignment Help
 
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
Quantitative Methods for Lawyers - Class #22 - Regression Analysis - Part 5
 
A Topic on REGRESSION Analysis conducted pptx
A Topic on REGRESSION Analysis conducted pptxA Topic on REGRESSION Analysis conducted pptx
A Topic on REGRESSION Analysis conducted pptx
 

Recently uploaded

Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...PsychoTech Services
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introductionMaksud Ahmed
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024Janet Corral
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeThiyagu K
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 

Recently uploaded (20)

Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
IGNOU MSCCFT and PGDCFT Exam Question Pattern: MCFT003 Counselling and Family...
 
microwave assisted reaction. General introduction
microwave assisted reaction. General introductionmicrowave assisted reaction. General introduction
microwave assisted reaction. General introduction
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
General AI for Medical Educators April 2024
General AI for Medical Educators April 2024General AI for Medical Educators April 2024
General AI for Medical Educators April 2024
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"Mattingly "AI & Prompt Design: The Basics of Prompt Design"
Mattingly "AI & Prompt Design: The Basics of Prompt Design"
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Advance Mobile Application Development class 07
Advance Mobile Application Development class 07Advance Mobile Application Development class 07
Advance Mobile Application Development class 07
 
Measures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and ModeMeasures of Central Tendency: Mean, Median and Mode
Measures of Central Tendency: Mean, Median and Mode
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 

Multiple regression .pptx

  • 2. Multiple regression generally explains the relationship between multiple independent or predictor variables and one dependent or criterion variable. 𝑦 = 𝛽0 + 𝛽1𝑥1 + 𝛽2𝑥2 + 𝛽3𝑥3 + ⋯ . +𝛽𝑘𝑥𝑘 + 𝑢 𝛽0 is the intercept 𝛽1 is the parameter associated with x1. 𝛽2is the parameter associated with x2, and so on
  • 3. Assumptions for multiple regression 1) normality assumption for any specific value of the independent variable, the values of the y variable are normally distributed 2) equal-variance assumption (assumption of homoscedasticity) The variances (or standard deviations) for the y variables are the same for each value of the independent variable. If the errors do not have a constant variance, they are said to be heteroscedastic. There are a number of formal statistical tests for heteroscedasticity: the Goldfeld-Quandt test, White’s general test
  • 4. Assumptions for multiple regression 3) the linearity assumption there is a linear relationship between the dependent variable and the independent variables The covariance between the error terms over time is zero. In other words, it is assumed that the errors are uncorrelated with one another. The formal method to detect autocorrelation is Durbin and Watson test, Breusch–Godfrey test
  • 5. Durbin-Watson (DW) is a test for first order autocorrelation - it tests only for a relationship between an error and its immediately previous value  the DW test statistic is approximately equal to 2(1−𝜌).  Since 𝜌 is a correlation, it implies that −1≤ 𝜌 ≤1  𝜌 is bounded to lie between −1 and +1.
  • 6. The corresponding limits for DW as 0≤DW ≤4. Consider now the implication of DW taking one of three important values (0, 2, and 4):  𝜌=0, DW =2 This is the case where there is no autocorrelation in the residuals. So roughly speaking, the null hypothesis would not be rejected if DW is near 2→i.e. there is little evidence of autocorrelation.  𝜌=1, DW =0 This corresponds to the case where there is perfect positive autocorrelation in the residuals.  𝜌 =− 1, DW =4 This corresponds to the case where there is perfect negative autocorrelation in the residual The rejection, non- rejection, and inconclusive regions for DW test
  • 7. Autocorrelation can be violated in two ways model misspecification If an important independent variable is omitted or if an incorrect functional form is used, the residuals may not be independent. The solution to this dilemma is to find the proper functional form or to include the proper independent variables time-sequenced data Regression analysis is performed on data taken over time, the residuals are often correlated (serial correlation or autocorrelation). Positive autocorrelation means that the residual in time period j tends to have the same sign as the residual in time period (j- k), where k is the lag in time periods. Negative autocorrelation means that the residual in time period j tends to have the opposite sign as the residual in time period (j-k).
  • 8. Assumptions for multiple regression 4) the nonmulti- collinearity assumption the independent variables are not correlated 5) the independence assumption The values for the y variables are independent The strength of the relationship between the independent variables and the dependent variable is measured by a correlation coefficient. This multiple correlation coefficient is symbolized by R. The value of R can range from 0 to +1. The closer to +1, the stronger the relationship; the closer to 0, the weaker the relationship.
  • 9. Multicollinearity occurs when there are high correlations between two or more predictor variables Data-based multicollinearity Structural multicollinearity caused by poorly designed experiments, data that is 100% observational, or data collection methods that cannot be manipulated caused by the researcher, creating new predictor variables
  • 10. Causes for multicollinearity  Insufficient data. In some cases, collecting more data can resolve the issue.  Dummy variables may be incorrectly used. For example, the researcher may fail to exclude one category, or add a dummy variable for every category (e.g. spring, summer, autumn, winter).  Including a variable in the regression that is actually a combination of two other variables. For example, including “total investment income” when total investment income = income from stocks and bonds + income from savings interest.  Including two identical (or almost identical) variables. For example, weight in pounds and weight in kilos, or investment income and savings/bond income.
  • 11. Solutions to the problem of multicollinearity  Ignore it, if the model is otherwise adequate, i.e. statistically and in terms of each coefficient being of a plausible magnitude and having an appropriate sign;  Drop one of the collinear variables, so that the problem disappears;  Transform the highly correlated variables into a ratio and include only the ratio and not the individual variables in the regression;  A problem with the data than with the model, so that there is insufficient information in the sample to obtain estimates for all of the coefficients