Multiple Regression Analysis for Business Lecture

Quantitative Analysis for Business Lecture 5 August 9th, 2010

Multiple Regression Analysis ,[object Object],Y = 0 + 1X1 + 2X2 + … + kXk +  where Y = dependent variable (response variable) Xi = ith independent variable (predictor or explanatory variable) 0 = intercept (value of Y when all Xi= 0) I = coefficient of the ith independent variable k = number of independent variables  = random error

where = predicted value of Y b0 = sample intercept (and is an estimate of 0) bi= sample coefficient of the ith variable (and is an estimate of i) Multiple Regression Analysis ,[object Object],[object Object]

Interpreting example 10-year real earnings growth of S&P500 (EG10) Intercept term If dividend payout ratio (PR) is zero and the slope of the yield curve (YC) is zero, we would expect the subsequent 10-year real earnings growth rate to be -11.6%  intercept Slope coefficient of PR If they payout ratio increases by 1%, we would expect the subsequent 10-year earnings growth rate to increase by 0.25%, holding YC constant Slope coefficient of YC If the yield curve slope increases by 1%, we would expect the subsequent 10-year earnings growth rate to increase by 0.14%, holding PR constant

Hypothesis testing of regression coefficients t-statistic – used to test the significance of the individual coefficient in a multiple regression t-statistic has n-k-1 degrees of freedom Estimated regression coefficient – hypothesized value Coefficient standard error of bj

Ex: testing the statistical significance of a regression coefficient Test the statistical significance of the independent variable PR in the real earnings growth example at the 10% significance level. Data based on 46 observations

Ex: testing the statistical significance of a regression coefficient We are testing the following hypothesis: The 10% two-tailed critical t-value with 43 degree of freedom (46-2-1) is approximately 1.68 We should reject the hypothesis if the t-statistic is greater than 1.68 or less than -1.68 Greater than 1.68, we can reject the null hypothesis and conclude that PR regression coefficient is statistically significant a the 10% significant level

where = predicted value of dependent variable (selling price) b0 = Y intercept X1 and X2 = value of the two independent variables (square footage and age) respectively b1 andb2 = slopes for X1 and X2 respectively Example: Jenny Wilson Realty ,[object Object]

She selects a sample of houses that have sold recently and records the data shown in Table 4.5,[object Object]

Evaluating Multiple Regression Models ,[object Object]

The p-value for the F-test and r2 are interpreted the same

The hypothesis is different because there is more than one independent variable

The F-test is investigating whether all the coefficients are equal to 0

p-value – the smallest level of significance for which the null hypothesis can be rejected

Null hypothesis cannot be rejected,[object Object]

The test statistic is calculated and if the p-value is lower than the level of significance (), the null hypothesis is rejected,[object Object]

F-statistic F-test assesses how well the set of independent variables, as a group, explains the variation of the dependent variable F-statistic is used to test whether at least one of the independent variables explains a significant portion of the variation of the dependent variable

F-statistic F-statistic is calculated as Where: SSR = Sum of Square of Regression SSE = Sum of Square of Errors MSR = Mean Regression Sum of Squares MSE = Mean Squared Error Reject H0 if F-statistic > Fc (critical value)

EX: calculating and interpreting f-statistic An analyst runs a regression of monthly value-stock returns on five independent variables over 60 months. The total sum of squares is 460, and the sum of squared errors is 170. Test the null hypothesis at the 5% significance level that all five of the independent variables are equal to zero The critical F-value for 5 and 54 degrees of freedom at 5% significance level is approximately 2.40

EX: calculating and interpreting f-statistic The null and alternative hypothesis are Calculations F-statistic > F-critical We reject null hypothesis! At least one independent variable is significantly different than zero

EX: Jenny Wilson Realty ,[object Object]

The p-value for the F-test is 0.002

r2 = 0.6719 so the model explains about 67% of the variation in selling price (Y)

But the F-test is for the entire model and we can’t tell if one or both of the independent variables are significant

By calculating the p-value of each variable, we can assess the significance of the individual variables

Since the p-value for X1 (square footage) and X2 (age) are both less than the significance level of 0.05, both null hypotheses can be rejected,[object Object]

Adjusted R2 Unfortunately, R2 by itself may not be a reliable measure of the multiple regression model R2almost always increases as variables are added to the model We need to take new variables into account Where n = number of observations k = number of independent variables Ra2 = adjusted R2

Adjusted R2 Whenever there is more than 1 independent variable Ra2 is less than or equal to R2 So adding new variables to the model will increase R2 but may increase or decrease the Ra2 Ra2 maybe less than 0 if R2 is low enough

EX: adjusted R2 ,[object Object]

The R2 of 63% suggests that the five independent variables together explain 63% of the variation in monthly value-stock returns,[object Object]

The analyst would prefer the first model because the adjusted Ra2 is higher and the model has five independent variables as opposed to nine,[object Object]

A dummy variable is assigned a value of 1 if a particular condition is met and a value of 0 otherwise

The number of dummy variables must equal one less than the number of categories of the qualitative variable,[object Object]

Jenny Wilson Realty ,[object Object],X3 = 1 if house is in excellent condition = 0 otherwise X4 = 1 if house is in mint condition = 0 otherwise ,[object Object]

No variable is needed for “good” condition since if both X3 and X4 = 0, the house must be in good condition,[object Object]

Multiple Regression Analysis for Business Lecture

Recommended

Recommended

More Related Content

What's hot

What's hot (18)

Viewers also liked

Viewers also liked (8)

Similar to Multiple Regression Analysis for Business Lecture

Similar to Multiple Regression Analysis for Business Lecture (20)

More from saark

More from saark (16)

Recently uploaded

Recently uploaded (20)

Multiple Regression Analysis for Business Lecture