Brief notes on heteroscedasticity, very helpful for those who are bigners to econometrics. i thought this course to the students of BS economics, these notes include all the necessary proofs.
Brief notes on heteroscedasticity, very helpful for those who are bigners to econometrics. i thought this course to the students of BS economics, these notes include all the necessary proofs.
We can define heteroscedasticity as the condition in which the variance of the error term or the residual term in a regression model varies. As you can see in the above diagram, in the case of homoscedasticity, the data points are equally scattered while in the case of heteroscedasticity, the data points are not equally scattered.
Two Conditions:
1] Known Variance
2] Unknown Variance
Identification problem in simultaneous equations modelGarimaGupta229
In this presentation, identification problem is explained with the example of Supply and Demand equilibrium and why identification problem arises. In addition, the rank and order conditions are also introduced.
For further explanation, checkout the youtube link:
https://youtu.be/PyU_RJJspfE
We can define heteroscedasticity as the condition in which the variance of the error term or the residual term in a regression model varies. As you can see in the above diagram, in the case of homoscedasticity, the data points are equally scattered while in the case of heteroscedasticity, the data points are not equally scattered.
Two Conditions:
1] Known Variance
2] Unknown Variance
Identification problem in simultaneous equations modelGarimaGupta229
In this presentation, identification problem is explained with the example of Supply and Demand equilibrium and why identification problem arises. In addition, the rank and order conditions are also introduced.
For further explanation, checkout the youtube link:
https://youtu.be/PyU_RJJspfE
Page 1 of 18Part A Multiple Choice (1–11)______1. Using.docxalfred4lewis58146
Page 1 of 18
Part A: Multiple Choice (1–11)
______1. Using the “eyeball” method, the regression line = 2+2x has been fitted to the data points (x = 2, y = 1), (x = 3, y = 8), and (x = 4, y = 7). The sum of the squared residuals will be
a. 7 b. 19 c. 34 d. 8
______2. A computer statistical package has included the following quantities in its output: SST = 50, SSR = 35, and SSE = 15. How much of the variation in y is explained by the regression equation?
a. 49% b. 70% c. 35% d. 15%
______3. In testing the significance of b, the null hypothesis is generally that
a. β = b b. β 0 c. β = 0 d. β = r
______4. Testing whether the slope of the population regression line could be zero is equivalent to testing whether the population _____________ could be zero.
a. standard error of estimate c. y-intercept
b. prediction interval d. coefficient of correlation
______5. A multiple regression equation includes 4 independent variables, and the coefficient of multiple determination is 0.64. How much of the variation in y is explained by the regression equation?
a. 80% b. 16% c. 32% d. 64%
______6. A multiple regression analysis results in the following values for the sum-of-squares terms: SST = 50.0, SSR = 35.0, and SSE = 15.0. The coefficient of multiple determination will be
a. = 0.35 b. = 0.30 c. = 0.70 d. = 0.50
______7. In testing the overall significance of a multiple regression equation in which there are three independent variables, the null hypothesis is
a. :
b. :
c. :
d. :
______8. In a multiple regression analysis involving 25 data points and 4 independent variables, the sum-of-squares terms are calculated as SSR = 120, SSE = 80, and SST = 200. In testing the overall significance of the regression equation, the calculated value of the test statistic will be
a. F = 1.5 c. F = 5.5
b. F = 2.5 d. F = 7.5
______9. For a set of 15 data points, a computer statistical package has found the multiple regression equation to be = -23 + 20+ 5 + 25 and has listed the t-ratio for testing the significance of each partial regression coefficient. Using the 0.05 level in testing whether = 20 differs significantly from zero, the critical t values will be
a. t = -1.960 and t= +1.960
b. t = -2.132 and t = +2.132
c. t = -2.201 and t = +2.201
d. t = -1.796 and t = +1.796
______10. Computer analyses typically provide a p-Value for each partial regression coefficient. In the case of , this is the probability that
a. = 0
b. =
c. the absolute value of could be this large if = 0
d. the absolute value of could be this large if 1
______11. In the multiple regression equation, = 20,000 + 0.05+ 4500 , is the estimated household income, is the amount of life insurance held by the head of the household, and is a dummy variable ( = 1 if the family owns mutual funds, 0 if it doesn’t). The interpretation of = 4500 is that
a. owing mutual funds increases the estimated income by $4500
b. the average value of a mut.
Isotonic Regression is a statistical technique of fitting a free-form line to a sequence of observations such that the fitted line is non-decreasing (or non-increasing) everywhere, and lies as close to the observations as possible. Isotonic Regression is limited to predicting numeric output so the dependent variable must be numeric in nature…
I am Hannah Lucy. Currently associated with excelhomeworkhelp.com as excel homework helper. After completing my master's from Kean University, USA, I was in search of an opportunity that expands my area of knowledge hence I decided to help students with their homework. I have written several excel homework till date to help students overcome numerous difficulties they face.
Any business and economic applications of forecasting involve time series data. Re-gression models can be fit to monthly, quarterly, or yearly data using the techniques de-scribed in previous chapters. However, because data collected over time tend to exhibit trends, seasonal patterns, and so forth, observations in different time periods are re¬lated or autocorrelated. That is, for time series data, the sample of observations cannot be regarded as a random sample. Problems of interpretation can arise when standard regression methods are applied to observations that are related to one another over time. Fitting regression models to time series data must be done with considerable care.
Linear regression is an approach for modeling the relationship between one dependent variable and one or more independent variables.
Algorithms to minimize the error are
OLS (Ordinary Least Square)
Gradient Descent and much more.
Let me know if anything is required. Ping me at google #bobrupakroy
June 3, 2024 Anti-Semitism Letter Sent to MIT President Kornbluth and MIT Cor...Levi Shapiro
Letter from the Congress of the United States regarding Anti-Semitism sent June 3rd to MIT President Sally Kornbluth, MIT Corp Chair, Mark Gorenberg
Dear Dr. Kornbluth and Mr. Gorenberg,
The US House of Representatives is deeply concerned by ongoing and pervasive acts of antisemitic
harassment and intimidation at the Massachusetts Institute of Technology (MIT). Failing to act decisively to ensure a safe learning environment for all students would be a grave dereliction of your responsibilities as President of MIT and Chair of the MIT Corporation.
This Congress will not stand idly by and allow an environment hostile to Jewish students to persist. The House believes that your institution is in violation of Title VI of the Civil Rights Act, and the inability or
unwillingness to rectify this violation through action requires accountability.
Postsecondary education is a unique opportunity for students to learn and have their ideas and beliefs challenged. However, universities receiving hundreds of millions of federal funds annually have denied
students that opportunity and have been hijacked to become venues for the promotion of terrorism, antisemitic harassment and intimidation, unlawful encampments, and in some cases, assaults and riots.
The House of Representatives will not countenance the use of federal funds to indoctrinate students into hateful, antisemitic, anti-American supporters of terrorism. Investigations into campus antisemitism by the Committee on Education and the Workforce and the Committee on Ways and Means have been expanded into a Congress-wide probe across all relevant jurisdictions to address this national crisis. The undersigned Committees will conduct oversight into the use of federal funds at MIT and its learning environment under authorities granted to each Committee.
• The Committee on Education and the Workforce has been investigating your institution since December 7, 2023. The Committee has broad jurisdiction over postsecondary education, including its compliance with Title VI of the Civil Rights Act, campus safety concerns over disruptions to the learning environment, and the awarding of federal student aid under the Higher Education Act.
• The Committee on Oversight and Accountability is investigating the sources of funding and other support flowing to groups espousing pro-Hamas propaganda and engaged in antisemitic harassment and intimidation of students. The Committee on Oversight and Accountability is the principal oversight committee of the US House of Representatives and has broad authority to investigate “any matter” at “any time” under House Rule X.
• The Committee on Ways and Means has been investigating several universities since November 15, 2023, when the Committee held a hearing entitled From Ivory Towers to Dark Corners: Investigating the Nexus Between Antisemitism, Tax-Exempt Universities, and Terror Financing. The Committee followed the hearing with letters to those institutions on January 10, 202
Francesca Gottschalk - How can education support child empowerment.pptxEduSkills OECD
Francesca Gottschalk from the OECD’s Centre for Educational Research and Innovation presents at the Ask an Expert Webinar: How can education support child empowerment?
The Roman Empire A Historical Colossus.pdfkaushalkr1407
The Roman Empire, a vast and enduring power, stands as one of history's most remarkable civilizations, leaving an indelible imprint on the world. It emerged from the Roman Republic, transitioning into an imperial powerhouse under the leadership of Augustus Caesar in 27 BCE. This transformation marked the beginning of an era defined by unprecedented territorial expansion, architectural marvels, and profound cultural influence.
The empire's roots lie in the city of Rome, founded, according to legend, by Romulus in 753 BCE. Over centuries, Rome evolved from a small settlement to a formidable republic, characterized by a complex political system with elected officials and checks on power. However, internal strife, class conflicts, and military ambitions paved the way for the end of the Republic. Julius Caesar’s dictatorship and subsequent assassination in 44 BCE created a power vacuum, leading to a civil war. Octavian, later Augustus, emerged victorious, heralding the Roman Empire’s birth.
Under Augustus, the empire experienced the Pax Romana, a 200-year period of relative peace and stability. Augustus reformed the military, established efficient administrative systems, and initiated grand construction projects. The empire's borders expanded, encompassing territories from Britain to Egypt and from Spain to the Euphrates. Roman legions, renowned for their discipline and engineering prowess, secured and maintained these vast territories, building roads, fortifications, and cities that facilitated control and integration.
The Roman Empire’s society was hierarchical, with a rigid class system. At the top were the patricians, wealthy elites who held significant political power. Below them were the plebeians, free citizens with limited political influence, and the vast numbers of slaves who formed the backbone of the economy. The family unit was central, governed by the paterfamilias, the male head who held absolute authority.
Culturally, the Romans were eclectic, absorbing and adapting elements from the civilizations they encountered, particularly the Greeks. Roman art, literature, and philosophy reflected this synthesis, creating a rich cultural tapestry. Latin, the Roman language, became the lingua franca of the Western world, influencing numerous modern languages.
Roman architecture and engineering achievements were monumental. They perfected the arch, vault, and dome, constructing enduring structures like the Colosseum, Pantheon, and aqueducts. These engineering marvels not only showcased Roman ingenuity but also served practical purposes, from public entertainment to water supply.
Acetabularia Information For Class 9 .docxvaibhavrinwa19
Acetabularia acetabulum is a single-celled green alga that in its vegetative state is morphologically differentiated into a basal rhizoid and an axially elongated stalk, which bears whorls of branching hairs. The single diploid nucleus resides in the rhizoid.
The French Revolution, which began in 1789, was a period of radical social and political upheaval in France. It marked the decline of absolute monarchies, the rise of secular and democratic republics, and the eventual rise of Napoleon Bonaparte. This revolutionary period is crucial in understanding the transition from feudalism to modernity in Europe.
For more information, visit-www.vavaclasses.com
Introduction to AI for Nonprofits with Tapp NetworkTechSoup
Dive into the world of AI! Experts Jon Hill and Tareq Monaur will guide you through AI's role in enhancing nonprofit websites and basic marketing strategies, making it easy to understand and apply.
Synthetic Fiber Construction in lab .pptxPavel ( NSTU)
Synthetic fiber production is a fascinating and complex field that blends chemistry, engineering, and environmental science. By understanding these aspects, students can gain a comprehensive view of synthetic fiber production, its impact on society and the environment, and the potential for future innovations. Synthetic fibers play a crucial role in modern society, impacting various aspects of daily life, industry, and the environment. ynthetic fibers are integral to modern life, offering a range of benefits from cost-effectiveness and versatility to innovative applications and performance characteristics. While they pose environmental challenges, ongoing research and development aim to create more sustainable and eco-friendly alternatives. Understanding the importance of synthetic fibers helps in appreciating their role in the economy, industry, and daily life, while also emphasizing the need for sustainable practices and innovation.
Instructions for Submissions thorugh G- Classroom.pptxJheel Barad
This presentation provides a briefing on how to upload submissions and documents in Google Classroom. It was prepared as part of an orientation for new Sainik School in-service teacher trainees. As a training officer, my goal is to ensure that you are comfortable and proficient with this essential tool for managing assignments and fostering student engagement.
Honest Reviews of Tim Han LMA Course Program.pptxtimhan337
Personal development courses are widely available today, with each one promising life-changing outcomes. Tim Han’s Life Mastery Achievers (LMA) Course has drawn a lot of interest. In addition to offering my frank assessment of Success Insider’s LMA Course, this piece examines the course’s effects via a variety of Tim Han LMA course reviews and Success Insider comments.
Adversarial Attention Modeling for Multi-dimensional Emotion Regression.pdf
Ali, Redescending M-estimator
1. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
1
Autocorrelation
Definition
The classical assumptions in the linear regression are that the errors terms i have zero mean and
constant variance and are uncorrelated [E( i) = 0, Var( i) = δ2
, and E( i j ) = 0 ]. For the
construction of Confidence Interval, and Testing of hypothesis about the regression coefficients
we add the assumption of normality. so that i are NID(0, δ2
). Some applications of regression
involve regressor and response variables that have a natural sequential order over time. Such data
are called time series data. Regression models using time series data occur relatively often in
economics, business, and some fields of engineering. The assumption of uncorrelated or
independent errors for time series data is often not appropriate. Usually the errors in time series
data exhibit serial correlation, that is, E( i j ) ≠ 0. Such error terms are said to be
autocorrelated. Autocorrelation sometimes called "lagged correlation or "serial correlation".
Causes of Autocorrelation
Specification Bias:
a) Excluded Variables Case
There are several causes of autocorrelation. Perhaps the primary cause of
autocorrelation in regression problems involving time series data is failure to include one
or more important regressors in the model. For example suppose that we wish to regress
2. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
2
annual sales of a soft drink company against the annual advertising expenditure for that
product. Now the growth in population over the period of time used in the study will also
influence the product sales. If population size is not included in the model, this may cause
the errors in the model to be positively autocorrelated, because population size is
positively correlated with product sales.
Consider the true model:
Sale (Yt) = β0 + β1X1t + β2X2t + εt ---------------------- ( I )
Where Y is the sale, X1 is the advertising expenditure, X2 is the population size.
However for some reason we run the following regression:
Sale (Yt) = β0 + β1X1t + υt ---------------------- ( II )
As model ( I ) is a true model and we run model ( II ), and hence the error or disturbance
term υ will be autocorrelated.
b) Incorrect Functional Form:
Consider the following cost and output model:
Yt = β1 + β2 X1 + β3 X2
2
+ υt ------------------- ( III )
3. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
3
Instead of using the above form which is considered to be correct, if we fit the
following model:
Yt = β1 + β2 X1 + β3 X2 + υt ----------------( IV)
In this case, υ will reflect autocorrelation because of the use of an incorrect
functional form.
Theoretical consequences of autocorrelation
The presence of autocorrelation in the errors has several effects on the ordinary least-squares
regression procedures. These are summarized as follows:
1. Ordinary least-squares regression coefficients are still unbiased.
2. OLS regression coefficients are no longer efficient i..e. they are no longer minimum
variance estimates. We say that these estimates are inefficient.
3. The residual mean square MSres may seriously underestimate δ2
. Consequently, the
standard errors of the regression coefficients may be too small. Thus, confidence intervals
are shorter than they really should be, and tests of hypothesis on individual regression
coefficients may indicate that one or more regression contribute significantly to the
model when they really do not. Generally, underestimating δ2
gives the researcher a false
impression of accuracy.
4. The confidence intervals and tests of hypothesis based on the t and F distributions are no
longer appropriate.
4. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
4
OLS estimates in presence of autocorrelation
There are three main consequences of autocorrelation on the ordinary least squares estimates.
1. Ordinary least squares regression coefficients are still unbiased even if the disturbance
term is autocorrelated. i.e.
We know that
( )
( ) ( )
( )
εβ
εβ
εβ
εβεβ
β
XXX
XXXI
XXXXXXX
XYXXXX
YXXX
′′+=
′′+=
′′+′′=
+=∴+′′=
′′=
−
−
−−
−
−
1
1
11
1
1
)(
)(
)()(
ˆ
Taking expectation on both sides of the above equation #1, assuming that E(ε) = 0 i.e.
β
β
εββ
=
+=
′+= −
0
)()()ˆ( 1
XEXXE
Hence in the presence of autocorrelation the OLS estimates are still unbiased.
5. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
5
2. The residual mean square underestimate δ2
result in small standard errors of the
regression coefficients.
We know that variance of the OLS estimate is:
)(ˆ
Putˆ
(B)equationinvaluesthese
1and0
)(
/
)(
)(
;
ereWh;
0)(X
)(
)(
)(
)()(
)(
))((ˆ
)(]ˆ[)ˆ(
11
2
2
10
10
2
2
i2
22
2
cw
w
Putting
Xw
x
XX
xxwSince
BwXww
Xw
x
x
wYw
XXx
x
xY
X
XX
XXY
XX
XXYXXY
XX
YYXX
Since
AEVar
ii
ii
ii
i
i
iii
iiiii
iii
i
i
iii
ii
i
ii
i
ii
i
iii
i
ii
−−−−−−−−∑=−
=∑+=
=∑=
∑
−∑
=∑∑=∑
−−−−−−−∑+∑+∑=
++∑=
∑
=∴∑=
−=
∑
∑
=
=−∑∴
−∑
−∑
=
−∑
−∑−−∑
=
−∑
−−∑
=
−−−−−−=
εββ
ββεββ
εββ
εββ
β
βββ
6. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
6
Putting the values of equation ( c) in equation (A).
[ ] [ ]
[ ]
[ ]
∑ ∑= <
−−
−−
+=
+++++++=
+++++++=
+++=∑=
n
i
n
ji
jijiii
nnnnnn
nnnnnn
iniiii
wwEwE
wwwwwwwwwE
wwwwwwwwwE
wwwEwEVar
1
22
1131312121
222
2
2
2
2
1
2
1
1131312121
222
2
2
2
2
1
2
1
2
22
2
][2][
...[2]...
2...22...
...)ˆ(
εεε
εεεεεεεεε
εεεεεεεεε
εεεεβ
∑ ∑= <
−−−−−−+=
n
i
n
ji
jijiii DEwwEw
1
22
)()(2)( εεε
If there is no correlation between error terms i.e. E( 0) =jiεε then equation (D)
becomes:
( )
)(
)(
/
)(
0)()ˆ(
2
2
22
22
2
2
2
2
222
22
1
22
E
XX
x
x
x
x
x
w
wEwVar
i
i
i
i
i
i
i
i
n
i
ii
−−−−−
−∑
=∑=
∑
∑
=
∑
∑=∑=
∑=+= ∑=
δ
δδ
δδ
δεβ
Now we have to find ( )βˆVar when the errors are autocorrected. i.e. errors are AR(1).
7. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
7
Under AR (1)
= + (0,
( = ( + ( = 0
( = ( + (
= +
− =
=
1 −
As ( = ( = ( = ( =
Now ( = ( + ( = ( + ( =
!
"!
#$%&'(1 = )
∑ +
∑ +
, = )
∑ + + 2∑+ +
(∑ +
, =
∑ +
+
2∑+ +
(∑ + 1 −
=
1
∑ + 1 −
+
2∑+ +
(∑ + 1 −
=
1 −
1
∑ +
)1 +
2∑+ +
∑ +
, − − − − − (.
It is clear from equation (E) and (F) that 1)ˆ()ˆ( 22 ARVarVar ββ < . Therefore, if we use )ˆ( 2βVar ,
we shall inflate the precision or accuracy of the estimator 2
ˆβ . As a result, the t ratio will be
overestimated.
Methods of detection of Autocorrelation
Following are the methods of detecting the problem of autocorrelation:
1. Residual plots: Residuals plots can be useful for the detection of autocorrelation. The
most meaningful display is the plot of residuals versus time. If there is positive
8. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
8
autocorrelation, residuals of identical sign occur in clusters. That is, there are not
enough changes of sign in the pattern of residuals. On the other hand, if there is negative
autocorrelation, the residuals will alternate signs too rapidly.
2. The Runs Test: The Run test having the following steps.
Step1. Write down the plus and minus sign's of the residuals.
Step2. Count the number of plus signs, negative signs and total number of runs( A run is
a sequence of either positive or negative signs without interruption). Now let
N= Total number of observations= N1+N2
N1=Number of + symbols (i.e. + residuals)
N2=Number of - symbols (i.e. - residuals)
R= number of runs
Step3. Compare the value of 'R' with that of the tabulated value, if it is less than the
smaller tabulated value or greater that the larger tabulated value then we have to reject
the hypothesis that pattern of errors are random i.e. H0; The sequence of errors are
random. In other words, the residuals exhibit autocorrelation.
Special case: If N1 > 20 or N2 > 20 or both then the number of runs is asymptotically
normally distributed with
Mean: 1
2 21
+=
N
NN
Rµ & Variance:
)1()(
)2(2
2
21212
−
−
=
NN
NNNNN
Rδ
Testing of hypothesis procedure is the same as of the Z-test.
Note that the run test sometimes also known as the Geary test, a nonparametric test.
9. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
9
3. Duban-Watson d Test: This test is based on the assumption that the errors in the
regression model are generated by a first-order autoregressive process observed at
equally spaced time periods, i.e.
= + --------------( i )
where is the error term in the model at time t, is an NID(0,/2
ε) random variable,
and ρ is the autocorrelation parameter. Thus, a simple linear regression model with first-
order autoregressive errors would be
yt = β0 + β1 xt + εt---------------( ii )
Where yt and xt are the observations on the response and regressor variables at time
period t. The hypothesis usually considered in the Dubrin-Watson test are
Ho: ρ = 0
H1: ρ > 0
The test statistic is
( )
2
2
2
1
t
n
t
tt
e
ee
d
∑
−
=
∑=
−
The value of the d statistic lies between two bounds, say dL and dU, such that if d is
outside these limits, a conclusion regarding the hypothesis can be reached. The decision
procedure is as follows:
Reject H0 if:
0 ≤ d ≤ dL Evidence of positive autocorrelation
4−dL ≤ d ≤ 4 Evidence of negative autocorrelation
10. Muhammad Ali
Lecturer in Statistics
GPGC Mardan.
10
Do not reject H0 if:
dU ≤ d ≤ 4−dU
Zone of indecision if:
dL ≤ d ≤ dU or 4−dU ≤ d ≤ 4− dL
Values of dL and dU can be obtained from the Durban-Watson table.
Situation where negative autocorrelation occurs are not often encountered. However, if a
test for negative autocorrelation is desired, one can use the statistic 4−d, where d is
defined above, then the decision rules for H0 : ρ = 0 versus H1 : ρ < 0 are the same as
those used in testing for positive autocorrelation. It is also possible to conduct a two-
sided ( H0 : ρ = 0 versus H1 : ρ ≠ 0 ) by using both one-sided tests simultaneously. If this
is done, the two-sided procedure has Type I error 2α, where α is the Type I error used for
each one-sided test.