SlideShare a Scribd company logo
1 of 21
U.S. GASOLINE PRICE MARKET
1953-2004
ECONOMETRICS
PROJECT
PRESENTED BY:-
1) SAKSHI ARORA
2) SIMRAN TANWAR
3) SHUBHAM JOON
4) GAURISH KANT SHUKLA
INTRODUCTION
The objective of the project is to study the relationship
between Gasoline Expenditure, Gasoline Price Index,
Per capita disposable income, Price Index for new
cars, Price index for old cars and price index for
public transports in the US. There may be other factors
which may affect the Gasoline Expenditure but the factors
taken here represent the expenditure of gasoline reasonably.
METHODOLOGY
The model taken here is multivariate i.e. it has more than two variables.
We have used Multiple linear regression technique.
Multiple regression is an extension of simple linear regression. It is used
when we want to predict the value of a variable based on the value of two
or more other variables.
The variable we want to predict is called the dependent variable (or
sometimes, the outcome, target or criterion variable). The variables we
are using to predict the value of the dependent variable are called the
independent variables (or sometimes, the predictor, explanatory or
repressor variables). Multiple regression analysis is applied here to study
the relationship between the dependent variable and all the factors
involved. The data taken into consideration is a time series data (1953-
2004).
DATA SOURCING
The data taken into consideration (U.S. Gasoline
market) is a time series data (1953-2004).
We collected it from a very reliable source.
It was compiled by Prof. Chris Bell, Department of
Economics,
University of North Carolina, Asheville.
www.bea.gov and www.bls.gov
VARIABLES
.
The variables we considered for the gasoline market are defined below:
GasExp- Total gasoline expenditure in U.S. in billions of dollars
(dependent variable)
Gasp-Price Index for gasoline.
Income- per capita Disposable Income
PNC- Price index for new cars Independent
variables
PUC- Price index for used cars
PPT- Price Index for public transportation.
REGRESSION STATISTICS
We fit the regression model to our data and the following results are
witnessed:
These are the “Goodness of Fit” measures. They tell you how well the
calculated linear regression equation fits your data.
The coefficient of determination of the model comes out to be 0.996 i.e.
99.6% of the variations in the gasoline expenditure are explained by the
factors taken into consideration.
ANOVA TABLE
The linear regression's F-test has the null
hypothesis that there is no linear relationship
between the variables
Ho: Bo=B1=B2=B3=B4=B5=0
H1: at least one of the Bi’s are not 0.
Here, The significance value for F-Test is 0.000 which is
less than 0.05 therefore we reject the null hypothesis
which states that there is no linear relationship between
the variables.
Thus we can assume that there is a linear relationship
between the variables in our model.
Which also indicates that, overall, the regression model
statistically, significantly predicts the outcome variable
(i.e., it is a good fit for the data).
COEFFICIENTS
The coefficients for every variable is statistically significant because their p-
values are smaller than 0.05. So, the model becomes:
IN MULTIPLE REGRESSION,EACH COEFFECIENT IS INTERPRETED AS THE
ESTIMATED CHANGE IN Y CORRESPONDING TO A UNIT CHANGE IN A
VARIABLE,WHEN ALL OTHER VARIALBES ARE HELD CONSTANT.
As per our a priori expectations we see that the coefficients of the regression are
same as expected:
 We have the coefficient of gasoline price index positive as it is an price index it
can have a positive relation with consumption and thus expenditure on
gasoline.
 As the per capita income increases, so should the expenditure on gasoline. The
coefficient obtained here depicts that with increasing income people tend to
spend more on gasoline directly or indirectly.
 As the price index of new cars increases we witness negative relation between
the gasoline expenditure and the price index which is as expected. The
technology improvement in the new cars does not imply that the consumption
will be low and indirectly the expenditure, even if it is true, the negative effect of
price increase of new car on gasoline expenditure and the positive effect exist at
the same time, but the former is much larger than the latter.
 Coefficient of price index of used cars is also negative with the meaning that the
increasing price of used cars decrease the demand of used cars and thus
decreasing the expenditure on gasoline.
 As the price index of public transportation increases, total gasoline expenditure
should also increase, because cost of travelling in public transportation has
become costlier. This is also depicted by the results.
AUTOCORRELATION
A key assumption in regression is that the error terms are independent of
each other. In this section, we present a simple test to determine whether
there is autocorrelation i.e. whether there is a (linear) correlation
between the error term for one observation and the next.
Now , we will detect the presence of autocorrelation using Durbin Watson
‘s D test.
The Durbin-Watson test uses the following statistic:
Since most regression problems involving time series data show a positive
autocorrelation, we usually test the null hypothesis
H0: No autocorrelation ,versus the alternative hypothesis ,H1: ρ >0.
Using SPSS we calculated the Durbin Watson value and
it came out to be:
We checked the Durbin Watson table for 52 observations
and k=6 and got dL=1.35124 and dU=1.76942 and the
durbin-watson value from the above table is 0.733 which
lies between 0-dL.
So from the graph above, we conclude that some positive
autocorrelation is present.
We can also remove this positive auto-correlation by using
COCHRAN ORCUTT iterative method.
HOMOSCEDASTICITY
It is the violation of the assumption of homoscedasticity ( equally spread
variance) i.e. It is a problem of unequal variance of the error term.
On plotting the standardized residuals(y-axis) against the standardized
predicted values, we find that the error terms are evenly spread out over
all values implying homogeneity in the data.
But to confirm this, we use Spearman Rank Correlation test to detect if there
is heteroscedasticity in data or not.
For that we have to calculate Unstandardized Predicted value and
Unstandardized residual and then check the correlation between them.
We usually test the null hypothesis H0: No heteroscedasticity versus the
alternative hypothesis H1: heteroscedasticity is present.
Here, the significant value is 0.388 which is greater than 0.05 therefore we accept
the null hypothesis i.e. there is no heteroscedasticity present in the data.
That means we can say that there is homoscedasticity in the data.
MULTICOLLINEARITY
In statistics, Multicollinearity (also collinearity) is a
phenomenon in which two or more predictor variables in a
multiple regression model are highly correlated, meaning that
one can be linearly predicted from the others with a
substantial degree of accuracy. In this situation the coefficient
estimates of the multiple regression may change erratically in
response to small changes in the model or the data.
We are checking collinearity statistics i.e. variance inflation
factor (VIF) and tolerance level (ToL) to detect the presence
of multicollinearity in our data.
The value of VIF > 2.5 indicates that Multicollinearity is present in the
data.
Here, every independent variable has VIF value greater than 2.5 that
means there is STRONG MULTICOLLINEARITY present in the data.
Also, the value of ToL < 0.4 indicates that Multicollinearity is present in
the data.
And here every variable has ToL value less than 0.4 indicating the
presence of strong multicollinearity in the data.
REMOVAL OF MULTICOLLINEARITY
We can remove multicollinearity from the data by following methods:
Collecting additional data
If there is multicollinearity present in the data, Add some more
data(observations) to reduce/remove multicollinearity.
Removing redundant variables
Remove some redundant variables to remove/reduce
multicollinearity.
Combining variables
Define another variable which is simply the combination of two
variables which are causing multicollinearity.
CONCLUSION
As discussed earlier The objective of the project is to study the
relationship between Gasoline Expenditure, Gasoline Price
Index, Income, Price Indices for new cars, old cars and public
transport in the US.
We calculated the measures of goodness of fit for the model, i.e.
R = 0.998 , R2 = 0.996 , adjusted R square = 0.996 and
standard error = 3.7758 for N = 52.
From the ANOVA table, the statistical significance of the
regression model i.e. our p value 0.000 ( which is less than
0.05), indicates that the overall regression model is statistically
significant and predicts the dependent variable (i.e., it is a good
fit for the data).
Finally, after running regression we tested for auto-correlation,
heteroscedasticity and multi-collinearity in the model. We found that
there is strong multicollinearity and positive auto-collinearity is
present in the data.
As multicollinearity is not a problem we may ignore it and consider all
the variables to be important as they have a significant effect on the
gasoline expenditure
Heteroscedasticity is not present in the data so it does not affect our
model in any case.
Thus we conclude that our best (fitted) regression model is:
THANK
YOU

More Related Content

What's hot

Correlation and Regression Analysis using SPSS and Microsoft Excel
Correlation and Regression Analysis using SPSS and Microsoft ExcelCorrelation and Regression Analysis using SPSS and Microsoft Excel
Correlation and Regression Analysis using SPSS and Microsoft ExcelSetia Pramana
 
Autocorrelation
AutocorrelationAutocorrelation
AutocorrelationADITHYAT1
 
Granger causality testing
Granger causality testingGranger causality testing
Granger causality testingThomasReader
 
Chapter8
Chapter8Chapter8
Chapter8Vu Vo
 
Cointegration and error correction model
Cointegration and error correction modelCointegration and error correction model
Cointegration and error correction modelAditya KS
 
Econometrics chapter 5-two-variable-regression-interval-estimation-
Econometrics chapter 5-two-variable-regression-interval-estimation-Econometrics chapter 5-two-variable-regression-interval-estimation-
Econometrics chapter 5-two-variable-regression-interval-estimation-Alamin Milton
 
Lecture 7B Panel Econometrics I 2011
Lecture 7B Panel Econometrics I 2011Lecture 7B Panel Econometrics I 2011
Lecture 7B Panel Econometrics I 2011Moses sichei
 
Dummy variable
Dummy variableDummy variable
Dummy variableAkram Ali
 
Autocorrelation- Detection- part 1- Durbin-Watson d test
Autocorrelation- Detection- part 1- Durbin-Watson d testAutocorrelation- Detection- part 1- Durbin-Watson d test
Autocorrelation- Detection- part 1- Durbin-Watson d testShilpa Chaudhary
 
Auto Correlation Presentation
Auto Correlation PresentationAuto Correlation Presentation
Auto Correlation PresentationIrfan Hussain
 
DUMMY VARIABLE REGRESSION MODEL
DUMMY VARIABLE REGRESSION MODELDUMMY VARIABLE REGRESSION MODEL
DUMMY VARIABLE REGRESSION MODELArshad Ahmed Saeed
 
regression assumption by Ammara Aftab
regression assumption by Ammara Aftabregression assumption by Ammara Aftab
regression assumption by Ammara AftabUniversity of Karachi
 

What's hot (20)

Correlation and Regression Analysis using SPSS and Microsoft Excel
Correlation and Regression Analysis using SPSS and Microsoft ExcelCorrelation and Regression Analysis using SPSS and Microsoft Excel
Correlation and Regression Analysis using SPSS and Microsoft Excel
 
Autocorrelation
AutocorrelationAutocorrelation
Autocorrelation
 
Granger causality testing
Granger causality testingGranger causality testing
Granger causality testing
 
Chapter 10
Chapter 10Chapter 10
Chapter 10
 
Chapter8
Chapter8Chapter8
Chapter8
 
Ols
OlsOls
Ols
 
Cointegration and error correction model
Cointegration and error correction modelCointegration and error correction model
Cointegration and error correction model
 
Multicollinearity PPT
Multicollinearity PPTMulticollinearity PPT
Multicollinearity PPT
 
Heteroscedasticity
HeteroscedasticityHeteroscedasticity
Heteroscedasticity
 
Econometrics chapter 5-two-variable-regression-interval-estimation-
Econometrics chapter 5-two-variable-regression-interval-estimation-Econometrics chapter 5-two-variable-regression-interval-estimation-
Econometrics chapter 5-two-variable-regression-interval-estimation-
 
Lecture 7B Panel Econometrics I 2011
Lecture 7B Panel Econometrics I 2011Lecture 7B Panel Econometrics I 2011
Lecture 7B Panel Econometrics I 2011
 
Econometrics- lecture 10 and 11
Econometrics- lecture 10 and 11Econometrics- lecture 10 and 11
Econometrics- lecture 10 and 11
 
Dummy variable
Dummy variableDummy variable
Dummy variable
 
Autocorrelation
AutocorrelationAutocorrelation
Autocorrelation
 
Panel data
Panel dataPanel data
Panel data
 
Autocorrelation- Detection- part 1- Durbin-Watson d test
Autocorrelation- Detection- part 1- Durbin-Watson d testAutocorrelation- Detection- part 1- Durbin-Watson d test
Autocorrelation- Detection- part 1- Durbin-Watson d test
 
Auto Correlation Presentation
Auto Correlation PresentationAuto Correlation Presentation
Auto Correlation Presentation
 
DUMMY VARIABLE REGRESSION MODEL
DUMMY VARIABLE REGRESSION MODELDUMMY VARIABLE REGRESSION MODEL
DUMMY VARIABLE REGRESSION MODEL
 
Distributed lag model koyck
Distributed lag model koyckDistributed lag model koyck
Distributed lag model koyck
 
regression assumption by Ammara Aftab
regression assumption by Ammara Aftabregression assumption by Ammara Aftab
regression assumption by Ammara Aftab
 

Similar to Econometrics project

Applications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationshipApplications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationshipRithish Kumar
 
Regression and Classification Analysis
Regression and Classification AnalysisRegression and Classification Analysis
Regression and Classification AnalysisYashIyengar
 
Multiple Linear Regression Applications Automobile Pricing
Multiple Linear Regression Applications Automobile PricingMultiple Linear Regression Applications Automobile Pricing
Multiple Linear Regression Applications Automobile Pricinginventionjournals
 
Stats ca report_18180485
Stats ca report_18180485Stats ca report_18180485
Stats ca report_18180485sarthakkhare3
 
Marketing Engineering Notes
Marketing Engineering NotesMarketing Engineering Notes
Marketing Engineering NotesFelipe Affonso
 
Covariance and correlation
Covariance and correlationCovariance and correlation
Covariance and correlationRashid Hussain
 
Logistic regression and analysis using statistical information
Logistic regression and analysis using statistical informationLogistic regression and analysis using statistical information
Logistic regression and analysis using statistical informationAsadJaved304231
 
Consumer Spending causing unemployment analysis
Consumer Spending causing unemployment analysisConsumer Spending causing unemployment analysis
Consumer Spending causing unemployment analysisGaetan Lion
 
REG.pptx
REG.pptxREG.pptx
REG.pptxRaniT16
 
Multivariate data analysis regression, cluster and factor analysis on spss
Multivariate data analysis   regression, cluster and factor analysis on spssMultivariate data analysis   regression, cluster and factor analysis on spss
Multivariate data analysis regression, cluster and factor analysis on spssAditya Banerjee
 
elasticityThe ratio of the percentagechange in a depende.docx
elasticityThe ratio of the percentagechange in a depende.docxelasticityThe ratio of the percentagechange in a depende.docx
elasticityThe ratio of the percentagechange in a depende.docxSALU18
 
Multinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfMultinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfAlemAyahu
 
X18125514 ca2-statisticsfor dataanalytics
X18125514 ca2-statisticsfor dataanalyticsX18125514 ca2-statisticsfor dataanalytics
X18125514 ca2-statisticsfor dataanalyticsShantanu Deshpande
 

Similar to Econometrics project (20)

Applications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationshipApplications of regression analysis - Measurement of validity of relationship
Applications of regression analysis - Measurement of validity of relationship
 
X18136931 statistics ca2_updated
X18136931 statistics ca2_updatedX18136931 statistics ca2_updated
X18136931 statistics ca2_updated
 
Group5
Group5Group5
Group5
 
Regression and Classification Analysis
Regression and Classification AnalysisRegression and Classification Analysis
Regression and Classification Analysis
 
Multiple Linear Regression Applications Automobile Pricing
Multiple Linear Regression Applications Automobile PricingMultiple Linear Regression Applications Automobile Pricing
Multiple Linear Regression Applications Automobile Pricing
 
Stats ca report_18180485
Stats ca report_18180485Stats ca report_18180485
Stats ca report_18180485
 
Marketing Engineering Notes
Marketing Engineering NotesMarketing Engineering Notes
Marketing Engineering Notes
 
X18145922 statistics ca2 final
X18145922   statistics ca2 finalX18145922   statistics ca2 final
X18145922 statistics ca2 final
 
Covariance and correlation
Covariance and correlationCovariance and correlation
Covariance and correlation
 
Correlation 2
Correlation 2Correlation 2
Correlation 2
 
Quantity Demand Analysis
Quantity Demand AnalysisQuantity Demand Analysis
Quantity Demand Analysis
 
Logistic regression and analysis using statistical information
Logistic regression and analysis using statistical informationLogistic regression and analysis using statistical information
Logistic regression and analysis using statistical information
 
Ch14 multiple regression
Ch14 multiple regressionCh14 multiple regression
Ch14 multiple regression
 
Consumer Spending causing unemployment analysis
Consumer Spending causing unemployment analysisConsumer Spending causing unemployment analysis
Consumer Spending causing unemployment analysis
 
REG.pptx
REG.pptxREG.pptx
REG.pptx
 
Multivariate data analysis regression, cluster and factor analysis on spss
Multivariate data analysis   regression, cluster and factor analysis on spssMultivariate data analysis   regression, cluster and factor analysis on spss
Multivariate data analysis regression, cluster and factor analysis on spss
 
elasticityThe ratio of the percentagechange in a depende.docx
elasticityThe ratio of the percentagechange in a depende.docxelasticityThe ratio of the percentagechange in a depende.docx
elasticityThe ratio of the percentagechange in a depende.docx
 
Multinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdfMultinomial Logistic Regression.pdf
Multinomial Logistic Regression.pdf
 
X18125514 ca2-statisticsfor dataanalytics
X18125514 ca2-statisticsfor dataanalyticsX18125514 ca2-statisticsfor dataanalytics
X18125514 ca2-statisticsfor dataanalytics
 
Demand Estimation
Demand EstimationDemand Estimation
Demand Estimation
 

Recently uploaded

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computationsit20ad004
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...Florian Roscheck
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPramod Kumar Srivastava
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts ServiceSapana Sha
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfSocial Samosa
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...soniya singh
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingNeil Barnes
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 

Recently uploaded (20)

Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Data Warehouse , Data Cube Computation
Data Warehouse   , Data Cube ComputationData Warehouse   , Data Cube Computation
Data Warehouse , Data Cube Computation
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
Russian Call Girls Dwarka Sector 15 💓 Delhi 9999965857 @Sabina Modi VVIP MODE...
 
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...From idea to production in a day – Leveraging Azure ML and Streamlit to build...
From idea to production in a day – Leveraging Azure ML and Streamlit to build...
 
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptxPKS-TGC-1084-630 - Stage 1 Proposal.pptx
PKS-TGC-1084-630 - Stage 1 Proposal.pptx
 
Call Girls In Mahipalpur O9654467111 Escorts Service
Call Girls In Mahipalpur O9654467111  Escorts ServiceCall Girls In Mahipalpur O9654467111  Escorts Service
Call Girls In Mahipalpur O9654467111 Escorts Service
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdfKantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
Kantar AI Summit- Under Embargo till Wednesday, 24th April 2024, 4 PM, IST.pdf
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改
 
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
High Class Call Girls Noida Sector 39 Aarushi 🔝8264348440🔝 Independent Escort...
 
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...
 
E-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptxE-Commerce Order PredictionShraddha Kamble.pptx
E-Commerce Order PredictionShraddha Kamble.pptx
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Brighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data StorytellingBrighton SEO | April 2024 | Data Storytelling
Brighton SEO | April 2024 | Data Storytelling
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 

Econometrics project

  • 1. U.S. GASOLINE PRICE MARKET 1953-2004 ECONOMETRICS PROJECT PRESENTED BY:- 1) SAKSHI ARORA 2) SIMRAN TANWAR 3) SHUBHAM JOON 4) GAURISH KANT SHUKLA
  • 2. INTRODUCTION The objective of the project is to study the relationship between Gasoline Expenditure, Gasoline Price Index, Per capita disposable income, Price Index for new cars, Price index for old cars and price index for public transports in the US. There may be other factors which may affect the Gasoline Expenditure but the factors taken here represent the expenditure of gasoline reasonably.
  • 3. METHODOLOGY The model taken here is multivariate i.e. it has more than two variables. We have used Multiple linear regression technique. Multiple regression is an extension of simple linear regression. It is used when we want to predict the value of a variable based on the value of two or more other variables. The variable we want to predict is called the dependent variable (or sometimes, the outcome, target or criterion variable). The variables we are using to predict the value of the dependent variable are called the independent variables (or sometimes, the predictor, explanatory or repressor variables). Multiple regression analysis is applied here to study the relationship between the dependent variable and all the factors involved. The data taken into consideration is a time series data (1953- 2004).
  • 4. DATA SOURCING The data taken into consideration (U.S. Gasoline market) is a time series data (1953-2004). We collected it from a very reliable source. It was compiled by Prof. Chris Bell, Department of Economics, University of North Carolina, Asheville. www.bea.gov and www.bls.gov
  • 5. VARIABLES . The variables we considered for the gasoline market are defined below: GasExp- Total gasoline expenditure in U.S. in billions of dollars (dependent variable) Gasp-Price Index for gasoline. Income- per capita Disposable Income PNC- Price index for new cars Independent variables PUC- Price index for used cars PPT- Price Index for public transportation.
  • 6. REGRESSION STATISTICS We fit the regression model to our data and the following results are witnessed: These are the “Goodness of Fit” measures. They tell you how well the calculated linear regression equation fits your data. The coefficient of determination of the model comes out to be 0.996 i.e. 99.6% of the variations in the gasoline expenditure are explained by the factors taken into consideration.
  • 7. ANOVA TABLE The linear regression's F-test has the null hypothesis that there is no linear relationship between the variables Ho: Bo=B1=B2=B3=B4=B5=0 H1: at least one of the Bi’s are not 0.
  • 8. Here, The significance value for F-Test is 0.000 which is less than 0.05 therefore we reject the null hypothesis which states that there is no linear relationship between the variables. Thus we can assume that there is a linear relationship between the variables in our model. Which also indicates that, overall, the regression model statistically, significantly predicts the outcome variable (i.e., it is a good fit for the data).
  • 9. COEFFICIENTS The coefficients for every variable is statistically significant because their p- values are smaller than 0.05. So, the model becomes: IN MULTIPLE REGRESSION,EACH COEFFECIENT IS INTERPRETED AS THE ESTIMATED CHANGE IN Y CORRESPONDING TO A UNIT CHANGE IN A VARIABLE,WHEN ALL OTHER VARIALBES ARE HELD CONSTANT.
  • 10. As per our a priori expectations we see that the coefficients of the regression are same as expected:  We have the coefficient of gasoline price index positive as it is an price index it can have a positive relation with consumption and thus expenditure on gasoline.  As the per capita income increases, so should the expenditure on gasoline. The coefficient obtained here depicts that with increasing income people tend to spend more on gasoline directly or indirectly.  As the price index of new cars increases we witness negative relation between the gasoline expenditure and the price index which is as expected. The technology improvement in the new cars does not imply that the consumption will be low and indirectly the expenditure, even if it is true, the negative effect of price increase of new car on gasoline expenditure and the positive effect exist at the same time, but the former is much larger than the latter.  Coefficient of price index of used cars is also negative with the meaning that the increasing price of used cars decrease the demand of used cars and thus decreasing the expenditure on gasoline.  As the price index of public transportation increases, total gasoline expenditure should also increase, because cost of travelling in public transportation has become costlier. This is also depicted by the results.
  • 11. AUTOCORRELATION A key assumption in regression is that the error terms are independent of each other. In this section, we present a simple test to determine whether there is autocorrelation i.e. whether there is a (linear) correlation between the error term for one observation and the next. Now , we will detect the presence of autocorrelation using Durbin Watson ‘s D test. The Durbin-Watson test uses the following statistic: Since most regression problems involving time series data show a positive autocorrelation, we usually test the null hypothesis H0: No autocorrelation ,versus the alternative hypothesis ,H1: ρ >0.
  • 12. Using SPSS we calculated the Durbin Watson value and it came out to be: We checked the Durbin Watson table for 52 observations and k=6 and got dL=1.35124 and dU=1.76942 and the durbin-watson value from the above table is 0.733 which lies between 0-dL.
  • 13. So from the graph above, we conclude that some positive autocorrelation is present. We can also remove this positive auto-correlation by using COCHRAN ORCUTT iterative method.
  • 14. HOMOSCEDASTICITY It is the violation of the assumption of homoscedasticity ( equally spread variance) i.e. It is a problem of unequal variance of the error term. On plotting the standardized residuals(y-axis) against the standardized predicted values, we find that the error terms are evenly spread out over all values implying homogeneity in the data.
  • 15. But to confirm this, we use Spearman Rank Correlation test to detect if there is heteroscedasticity in data or not. For that we have to calculate Unstandardized Predicted value and Unstandardized residual and then check the correlation between them. We usually test the null hypothesis H0: No heteroscedasticity versus the alternative hypothesis H1: heteroscedasticity is present. Here, the significant value is 0.388 which is greater than 0.05 therefore we accept the null hypothesis i.e. there is no heteroscedasticity present in the data. That means we can say that there is homoscedasticity in the data.
  • 16. MULTICOLLINEARITY In statistics, Multicollinearity (also collinearity) is a phenomenon in which two or more predictor variables in a multiple regression model are highly correlated, meaning that one can be linearly predicted from the others with a substantial degree of accuracy. In this situation the coefficient estimates of the multiple regression may change erratically in response to small changes in the model or the data. We are checking collinearity statistics i.e. variance inflation factor (VIF) and tolerance level (ToL) to detect the presence of multicollinearity in our data.
  • 17. The value of VIF > 2.5 indicates that Multicollinearity is present in the data. Here, every independent variable has VIF value greater than 2.5 that means there is STRONG MULTICOLLINEARITY present in the data. Also, the value of ToL < 0.4 indicates that Multicollinearity is present in the data. And here every variable has ToL value less than 0.4 indicating the presence of strong multicollinearity in the data.
  • 18. REMOVAL OF MULTICOLLINEARITY We can remove multicollinearity from the data by following methods: Collecting additional data If there is multicollinearity present in the data, Add some more data(observations) to reduce/remove multicollinearity. Removing redundant variables Remove some redundant variables to remove/reduce multicollinearity. Combining variables Define another variable which is simply the combination of two variables which are causing multicollinearity.
  • 19. CONCLUSION As discussed earlier The objective of the project is to study the relationship between Gasoline Expenditure, Gasoline Price Index, Income, Price Indices for new cars, old cars and public transport in the US. We calculated the measures of goodness of fit for the model, i.e. R = 0.998 , R2 = 0.996 , adjusted R square = 0.996 and standard error = 3.7758 for N = 52. From the ANOVA table, the statistical significance of the regression model i.e. our p value 0.000 ( which is less than 0.05), indicates that the overall regression model is statistically significant and predicts the dependent variable (i.e., it is a good fit for the data).
  • 20. Finally, after running regression we tested for auto-correlation, heteroscedasticity and multi-collinearity in the model. We found that there is strong multicollinearity and positive auto-collinearity is present in the data. As multicollinearity is not a problem we may ignore it and consider all the variables to be important as they have a significant effect on the gasoline expenditure Heteroscedasticity is not present in the data so it does not affect our model in any case. Thus we conclude that our best (fitted) regression model is: