This document summarizes an empirical study by Robert Lucas examining the relationship between output and inflation using annual time series data from 18 countries between 1951-1967. Lucas develops an economic model based on the natural rate hypothesis, where supply behavior depends on expected relative prices. The model yields solutions showing inflation and output as distributed lags of nominal output changes. Estimating the model across countries allows testing the natural rate theory's implications regarding the output-inflation relationship.
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
American Economic Association Some International Evid.docx
1. American Economic Association
Some International Evidence on Output-Inflation Tradeoffs
Author(s): Robert E. Lucas, Jr.
Source: The American Economic Review, Vol. 63, No. 3 (Jun.,
1973), pp. 326-334
Published by: American Economic Association
Stable URL: http://www.jstor.org/stable/1914364
Accessed: 08-02-2017 17:06 UTC
JSTOR is a not-for-profit service that helps scholars,
researchers, and students discover, use, and build upon a wide
range of content in a trusted
digital archive. We use information technology and tools to
increase productivity and facilitate new forms of scholarship.
For more information about
JSTOR, please contact [email protected]
Your use of the JSTOR archive indicates your acceptance of the
Terms & Conditions of Use, available at
http://about.jstor.org/terms
American Economic Association is collaborating with JSTOR to
digitize, preserve and extend access to The
American Economic Review
2. This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
Some International Evidence on
Output-Inflation Tradeofs
By ROBERT E. LUCAS, JR.*
This paper reports the results of an
empirical study of real output-inflation
tradeoffs, based on annual time-series from
eighteen countries over the years 1951-67.
These data are examined from the point
of view of the hypothesis that average
real output levels are invariant under
changes in the time pattern of the rate of
inflation, or that there exists a "natural
rate" of real output. That is, we are con-
cerned with the questions (i) does the
natural rate theory lead to expressions of
the output-inflation relationship which
perform satisfactorily in an econometric
sense for all, or most, of the countries in
the sample, (ii) what testable restrictions
does the theory impose on this relation-
ship, and (iii) are these restrictions con-
sistent with recent experience?
Since the term "'natural rate theory"
refers to varied aggregation of models and
verbal developments,' it may be helpful
to sketch the key elements of the particular
version used in this paper. The first
3. essential presumption is that nominal out-
put is determined on the aggregate demand
side of the economy, with the division
into real output and the price level largely
dependent on the behavior of suppliers of
labor and goods. The second is that the
partial "rigidities" which dominate short-
run supply behavior result from suppliers'
lack of information on some of the prices
relevant to their decisions. The third
presumption is that inferences on these
relevant, unobserved prices are made
optimally (or "rationally") in light of the
stochastic character of the economy.
As I have argued elsewhere (1972),
theories developed along these lines will
not place testable restrictions on the co-
efficients of estimated Phillips curves or
other single equation expressions of the
tradeoff. They will not, for example, imply
that money wage changes are linked to
price level changes with a unit coefficient,
or that {"long-run"' (in the usual distrib-
uted lag sense) Phillips curves must be
vertical. They will (as we shall see below)
link supply parameters to parameters
governing the stochastic nature of demand
shifts. The fact that the implications of the
natural rate theory come in this form sug-
gests an attempt to test it using a sample,
such as the one employed in this study, in
which a wide variety of aggregate demand
behavior is exhibited.
4. In the following section, a simple ag-
gregative model will be constructed using
the elements sketched above. Results
based on this model are reported in Sec-
tion II, followed by a discussion and con-
clusions.
1. An Economic Model
The general structure of the model de-
veloped in this section may be described
very simply. First, the aggregate price-
quantity observations are viewed as inter-
section points of an aggregate demand and
an aggregate supply schedule. The former
is drawn up under the assumption of a
cleared money market and represents the
output-price level relationship implicit in
* Graduate School of Industrial Administration, Car-
negie-Mellon University.
1 The most useful, general statements are those of .
Milton Friedman (1968) and Edmund Phelps. Specific
illustrative examples are provided by Donald Gordon
and Allan Hynes and Lucas (April 1972).
326
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
VOL. 63 NO. 3 LUCAS: OUTPUT-INFLATION TRADEOFF
5. 327
the standard IS-LM diagram. It is viewed
as being shifted by the usual set of
demand-shift variables: monetary and
fiscal policies and variation in export de-
mands. The supply schedule is drawn un-
der the assumption of a cleared labor
market; its slope therefore reflects labor
and product market "rigidities."
The structure of this model, which is
essentially that suggested in Lucas and
Leonard Rapping (1969), will be greatly
simplified by an additional special assump-
tion: that the aggregate demand curve is
unit elastic.2 In this case, the level of
nominal output can be treated as an
"exogenous"' variable with respect to the
goods market, and the entire burden of ac-
counting for the breakdown of nominal
income into real output and price is placed
on the aggregate supply side. In the next
subsection, A, a supply model designed to
serve this purpose is developed. In subsec-
tion B, solutions to the full (demand and
supply) model are obtained.
A. Aggregate Supply
All formulations of the natural rate
theory postulate rational agents, whose
decisions depend on relative prices only,
placed in an economic setting in which
they cannot distinguish relative from gen-
eral price movements. Obviously, there is
6. no limit to the number of models one can
construct where agents are placed in this
situation of imperfect information; the
trick is to find tractable schemes with this
feature. One such model is developed
below.
We imagine suppliers as located in a
large number of scattered, competitive
markets. Demand for goods in each period
is distributed unevenly over markets,
leading to relative as well as general price
movements. As a consequence, the situa-
tion as perceived by individual suppliers
will be quite different from the aggregate
situation as seen by an outside observer.
Accordingly, we shall attempt to keep these
two points of view separate, turning first
to the situation faced by individual sup-
pliers.
Quantity supplied in each market will be
viewed as the product of a normal (or
secular) component common to all markets
and a cyclical component which varies
from market to market. Letting z index
markets, and using ynt and yet to denote
the logs of these components, supply in
market z is:
( 1 ) yt(Z) = Ynt + yet(Z)
The secular component, reflecting capital
7. accumulation and population change, fol-
lows the trend line:
(2) y.t a +? t
The cyclical component varies with per-
ceived, relative prices and with its own
lagged value:
(3) yet(z) y [Pt(z) - E(Pt It(Z))]
+ NyC_t-1(z)
where Pt(z) is the actual price in z at t and
E(PtI It(z)) is the mean current, general
price level, conditioned on information
available in z at t, It(z).3 Since yet is a
deviation from trend, f I < 1.
2 An explicit derivation of the price-output relation-
ship from the IS-LM framework is given by Frederic
Raines. Of course, this framework does not imply an
elasticity of unity, though it is consistent with it. Since
the unit elasticity hypothesis is primarily a matter of
convenience in the present study, I shall comment below
on the probable consequences of relaxing it.
3 A supply function for labor which varies with the
ratio of actual to expected prices is developed and veri-
fied empirically by Lucas and Rapping (1969). The
effect of lagged on actual employment is also shown.
In our 1972 paper, in response to Albert Rees's criti-
cism, we found that this persistence in employment
cannot be fully explained by price expectations behav-
ior. Both these effects-an expectations and a persis-
tence effect-will be transmitted by firms to the goods
8. market. In addition, they are probably augmented by
speculative behavior on the part of firms (as analyzed
for example, by Paul Taubman and Maurice Wilkinson).
For a general equilibrium model in which suppliers
behave essentially as given by (3), see my 1972 papers.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
328 THE AMERICAN ECONOMIC REVIEW JUNE 1973
The information available to suppliers
in z at t comes from two sources. First,
traders enter period t with knowledge of
the past course of demand shifts, of normal
supply ynt, and of past deviations ye,t-i,
yc,t-2 . While this information does
not permit exact inference of the log of the
current general price level, Pt, it does de-
termine a "prior" distribution on Pt, com-
mon to traders in all markets. We assume
that this distribution is known to be nor-
mal, with mean Pt (depending in a known
way on the above history) and a constant
variance a2.
Second, we suppose that the actual price
deviates from the (geometric) economy-
wide average by an amount which is dis-
tributed independently of Pt. Specifically,
9. let the percentage deviation of the price in
z from the average Pt be denoted by z (so
that markets are indexed by their price
deviations from average) where z is nor-
mally distributed, independent of Pt, with
mean zero and variance r2. Then the ob-
served price in z, Pt(z) (in logs) is the sum
of independent, normal variates
(4) Pt(z) = Pt + z
The information It(z) relevant for estima-
tion of the unobserved (by suppliers in z
at t) Pt, consists then of the observed
price Pt(z) and the history summarized
in P.t
To utilize this information, suppliers use
(4) to calculate the distribution of Pt,
conditional on Pt(z) and Pt. This distribu-
tion is (by straightforward calculation)
normal with mean:
E(Pt I(z)) = E(Pt Pt(z), iPt)
(~ ~ ~~~ ) = (1-)Pt (z) + 07Pt
where 0 = i2/(u2+i2) , and variance Ooa2.
Combining (1), (3), and (5) yields the
supply function for market z:
(6) yt(Z) = Ynt + [email protected][Pt(z) - PTt]
+ XYc,t-1(z)
Averaging over markets (integrating with
respect to the distribution of z) gives the
10. aggregate supply function:
(7) = yYnt + G7y(Pt - Pt)
+ xLyt-1 - yn,t-1I
The slope of the aggregate supply func-
tion (7) thus varies with the fraction 0 of
total individual price variance, a2+r2,
which is due to relative price variation,, In
cases where r2 is relatively small, so that
individual price changes are virtually cer-
tain to reflect general price changes, the
supply curve is nearly vertical. At the
other extreme when general prices are
stable (a2 is relatively small) the slope of
the supply curve approaches the limiting
value of y.4
B. Completion and
Solution
of the Model
A central assumption in the develop-
ment above is that supply behavior is
based on the correct distribution of the
unobserved current price level, Pt. To
proceed, then, it is necessary to determine
what this correct distribution is, a step
11. which requires the completion of the
model by inclusion of an aggregate de-
mand side.
As suggested earlier, this will be done by
postulating a demand function for goods of
the form:
(8) yt + Pt = xt
where xt is an exogenous shift variable-
equal to the observable log of nominal
GNP. Further, let Axt } be a sequence of
independent, normal variates with mean 6
and variane2 r ande var'lance ax 0
4This predicted relationship between a supply elas-
ticity and the variance of a component of the price series
is analogous to the link between the income elasticity of
consumption demand and the variances of permanent
and transitory income components which Friedman
(1957) observes. As will be seen in Section II, it works
in empirical testing in much the same way as well.
12. This particular characterization of the "shocks" to
the economy is not central to the theory, but to discuss
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
VOL. 63 NO. 3 LUCAS: OUTPUT-INFLATION TRADEOFF
329
The relevant history of the economy
then consists (at most) of ynt (which fixes
calendar time), the demand shifts xt,
xt1,... , and past actual real outputs
Yt-1, Yt-2. * Since the model is linear in
logs, it is reasonable to conjecture a price
solution of the form: 6
(9) Pt= 7rO+rlXt+7r2Xt_l+7r3Xt_2+ *
+l1 Yt_1+?12Yty2+ +tOynt
Then 7Pt will be the expectation of Pt,
13. based on all information except Xt (the
current demand level) or:
Pt = 750 + 7rl(Xt-1 + 5) + 72Xt-l
(10) + 7r3Xt-2 + ... + ?lyt-l
+ n72yt.-2 + + tOynt
To solve for the unknown parameters
7ri, Ij and 4o we first eliminate yt between
(7) and (8), or equate quantity demanded
and supplied. Then inserting the right
sides of (9) and (10) in place of Pt and Pt,
one obtains an identity in Xt }, {yt , and
ynt, which is then used to obtain the
parameter values. The resulting solutions
for price and output are:'
Pt= - + X- ,
1+GY 1 +OY
+ - Xt-1 - (1 - X)Y.t
1 + O-Y
14. +X0 + -Axt I8 1- + O + 1 + 8 AX
+ Xyt-1 + (1 - X)ynt
In terms of APt and yet, and letting
7r= y/(1+G'y), the solutions are:
(11) Yet = - irb + 7rAxt + Xye,t_1
(12) APt = - d + (1 - r)AXt + 7rAXt-i
Let us review these solutions for internal
consistency. Evidently, Pt is normally dis-
tributed about Pt. The conditional vari-
ance of Pt will have the constant (as
assumed) variance 1/(1+0Gy)2o-,. Thus
those features of the behavior of prices
which were assumed "known" by suppliers
in subsection A are, in fact, true in this
economy.
To review, equations (11) and (12) are
the equilibrium values of the inflation rate
15. and real output (as a percentage deviation
from trend). They give the intersection
points of an aggregate demand schedule,
shifted by changes in xt, and an aggregate
supply schedule shifted by variables
(lagged prices) which determine expecta-
tions. In order to avoid the introduction of
an additional, spurious "expectations pa-
rameter," one cannot solve for this inter-
section on a period-by-period basis; 4c-
cordingly, we have adopted a method
which yields equilibrium "paths" of prices
and output. Otherwise, the interpretation
of (11) and (12) is entirely conventional.
Not surprisingly, the solution values of
inflation and the cyclical component of
real output are indicated by (11) and (12)
to be distributed lags of current and past
changes in nominal output. A change in the
nominal expansion rate, Axt, has an im-
mediate effect on real output, and lagged
effects which decay geometrically. The
16. rational expectations formation at all, some explicit
stochastic description is clearly required. Independence
is used here partly for simplicity, partly because it is
empirically roughly accurate for most countries in the
sample. The effect of autocorrelation in the shocks
would, as can be easily traced out, be to add higher order
lag terms to the solutions found below.
6 This solution method is adapted from Lucas (1972),
which is in turn based on the ideas of John Muth.
7 If a demand function of the form Yt=-Pt+xt had
been used, these solutions would assume the same form,
with different expressions for the coefficients. If t $1,
however, xt is an unobserved shock, unequal in general
to observed nominal income. In this case, the model still
predicts the time-series structure (moments and lagged
moments) of the series Yet and APt and is thus, in princi-
ple, testable. I have found empirical experimenting
along these lines suggestive, but the series used are
simply too short to yield results of any reliability.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
17. 330 THE AMERICAN ECONOMIC REVIEW JUNE 1973
immediate effect on prices is one minus the
real output effect, with the remainder of
the impact coming in the succeeding pe-
riod. We note in particular that this lag
pattern may well produce periods of simul-
taneous inflation and below average real
output. Though these periods arise be-
cause of supply shifts, the shifts result
from lagged perception of demand changes,
and not from autonomous changes in the
cost structure of suppliers.
In addition to these features, the model
does indeed assert the existence of a nat-
ural rate of output: the average rate of de-
mand expansion, 5, appears in (11) with a
coefficient equal in magnitude to the co-
efficient of the current rate, and with the
opposite sign. Thus changes in the average
rate of nominal income growth will have no
18. effect on average real output. On the other
hand, unanticipated demand shifts do
have output effects, with magnitude given
by the parameter ir. Since this effect de-
pends on "fooling" suppliers (in the sense
of subsection A), one expects that 7r will be
larger the smaller the variance of the
demand shifts. We next develop this im-
plication explicitly.
From the definition of ir in terms of 0
and y, and the definition of 6 in terms of
q2 and r2 we have
r27
a2 + r2(1 + y)
Combining with the expression for u2 ob-
tained above, this gives
(13) r =-
*, (1 - )2a2 + r2(1 + y)
For fixed r2 and y, then, ir takes the value
19. y/(1 +y) at ax= 0 and tends monotonically
to zero as o2 tends to infinity.
The prediction that the average devia-
tion of output from trend, E(y,,), is in-
variant under demand policies is not, of
course, subject to test: the deviations from
a fitted trend line must average to zero.
Accordingly, we must base tests of the
natural rate hypothesis (in this context)
on (13): a relationship between an ob-
servable variance and a slope parameter.
II. Test Results
Testing the hypothesis advanced above
involves two steps. First, within each
country (11) and (12) should perform
reasonably well. In particular, under the
presumption that demand fluctuations are
the major source of variation in APt and
20. yct, the fits should be "good." The esti-
mated values of 7r and X should be between
zero and one. Finally, since (11) and (12)
involve five slope parameters but only two
theoretical ones, the estimated ir and X
values obtained from fitting (11) should
work reasonably well in explaining varia-
tions in APt.
The main object of this study, however,
is not to "explain" output and price level
movements within a given country, but
rather to see whether the terms of the
output-inflation "tradeoff'' vary across
countries in the way predicted by the nat-
ural rate theory. For this purpose, we shall
utilize the theoretical relationship (13)
and the estimated values of r and a'2.
Under the assumption that r2 and y are
relatively stable across countries, the esti-
mated r values should decline as the sam-
ple variance of Axt increases.
Descriptive statistics for the eighteen
countries in the sample are given in Table
21. 1., As is evident, there is no association
8 The raw data on real and nominal GNP are from
Yearbook of National Accounts Statistics, where series
from many countries are collected and put on a uniform
basis. The choice of countries is by no means random:
the eighteen used are all the countries from which con-
tinuous series are available. The sample could thus be
broadened considerably by use of sources from indi-
vidual countries. To obtain the variables used in the
tests, the logs of real and nominal output, Yt and xt, are
logs of the series in the source. The log of the price level,
Pt, is the difference xt-yt; yt is the residual from the
trend line yt= a+bt, fit by least squares from the sample
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
VOL. 63 NO. 3 LUCAS: OUTPUT-INFLATION TRADEOFF
331
TABLE 1-DESCRIPTIVE STATISTICS, 1952-67
22. Mean Mean Variance. Variance Variance
Cyt Apt Yet Axt
Argentina .026 .220 .00096 .01998 .01555
Austria .048 .038 .00104 .00113 .00124
Belgium .034 .021 .00075 .00033 .00072
Canada .043 .024 .00109 .00018 .00139
Denmark .039 .041 .00082 .00038 .00084
West Germany .056 .026 .00147 .00026 .00073
Guatemala .046 .004 .00111 .00079 .00096
Honduras .044 .012 .00042 .00084 .00109
Ireland .025 .038 .00139 .00060 .00111
Italy .053 .032 .00022 .00044 .00040
Netherlands .047 .036 .00055 .00043 .00101
Norway .038 .034 .00092 .00033 .00098
Paraguay .054 .157 .00488 .03192 .03450
Puerto Rico .058 .024 .00205 .00021 .00077
Sweden .039 .036 .00030 .00043 .00041
United Kingdom .028 .034 .00022 .00037 .00014
United States .036 .019 .00105 .00007 .00064
Venezuela .060 .016 .00175 .00068 .00127
between average real growth rates and
average rates of inflation: this fact seems
23. to be consistent with both the conventional
and natural rate views of the tradeoff.
Since our interest is in comparing real out-
put and price behavior under different time
patterns of nominal income, these statistics
are somewhat disappointing. Essentially
two types of nominal income behavior are
observed: the highly volatile and expansive
policies of Argentina and Paraguay, and
the relatively smooth and moderately ex-
pansive policies of the remaining sixteen
countries. But if the sample provides only
two "points," they are indeed widely
separated: the estimated variance of de-
mand in the high inflation countries is on
the order of 10 times that in the stable
price countries.
The first three columns of Table 2 sum-
marize the performance of equation (11)
in accounting for movements in yet. The
estimated values for ir all lie between zero
and one; with the exceptions of Argentina
24. and Puerto Rico, so do the estimated X
values. The R's indicate that for many, or
perhaps most countries, important out-
put-determining variables have been omit-
ted from the model. The R s for the infla-
tion rate equation, (12), are given in
column (4) of Table 2. In general, these
tend to be lower than for equation (11),
and not surprisingly the estimated co-
efficients from (12) (which are not shown)
tend to behave erratically. Column (5) of
Table 2 gives the fraction of the variance
of AP' explained by (12) when the co-
efficient estimates from (11) are imposed.
(A "-" indicates a negative value.)9
With respect to its performance as an
intracountry model of income and price
determination, then, the system (11)-(12)
passes the formal tests of significance. On
the other hand, the goodness-of-fit statis-
period. The moments given in Table 1 are maximum
likelihood estimates based on these series. The estimates
reported in Table 2 are by ordinary least squares.
25. 9 The loss of explanatory power when these coeffi-
cients are imposed on (12) can be assessed formally by
an approximate Chi-square test. By this measure, the
loss is significant at the .05 level for Paraguay only. As
Table 2 shows, however, this test is somewhat decep-
tive: for several countries the least squares estimates of
(12) are so poor that there is little explanatory power to
lose, and the test is "passed" vacuously.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
332 THE AMERICAN ECONOMIC REVIEW JUNE 1973
TABLE 2-SUMMARY STATISTICS BY COUNTRY, 1953-67
Country 7r R R R 2 R2
Argentina .011 -.126 .018 .929 .914
(.070) (.258)
28. (.183) (.148)
tics are generally considerably poorer than
we have come to expect from annual time-
series models.
In contrast to these somewhat mixed re-
sults, the behavior of the estimated 7r
values across countries is in striking con-
formity with the natural rate hypothesis.
For the sixteen stable price countries, *
ranges from .287 to .910; for the two
volatile price countries, this estimate is
smaller by a factor of 10! To illustrate this
order-of-magnitude effect more sharply,
let us examine the complete results for two
countries: the United States and Argen-
tina. For the United States, the fitted ver-
sions of (11) and (12) are:
yct = - .049 + (.910),Axt + (.887)yc,t-l
APt = - .028 + (.119) Axt + (.758) Axt_
29. - (.637) Aycet_1
The comparable results for Argentina are:
yct = - .006 + (.011)Axt - (.126)y,.,t_j
APt = - .047 + (1.140)Axt - (.083) Axt-
+ (.102)Ayc,t-1
In a stable price country like the United
States, then, policies which increase nomi-
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
VOL. 63 NO. 3 LUCAS: OUTPUT-INFLATION TRADEOFF
333
nal income tend to have a large initial
30. effect on real output, together with a small,
positive initial effect on the rate of infla-
tion. Thus the apparent short-term trade-
off is favorable, as long as it remains un-
used. In contrast, in a volatile price
country like Argentina, nominal income
changes are associated with equal, con-
temporaneous price movements with no
discernible effect on real output. These
results are, of course, inconsistent with the
existence of even moderately stable Phillips
curves. On the other hand, they follow
directly from the view that inflation
stimulates real output if, and only if, it
succeeds in "fooling" suppliers of labor
and goods into thinking relative prices are
moving in their favor.
III. Concluding Remarks
The basic idea underlying the tests re-
ported above is extremely simple, yet I am
afraid it may have become obscured by the
rather special model in which it is em-
31. bodied. In this section, I shall try to re-
state this idea in a way which, though not
quite accurate enough to form the basis for
econometric work, conveys its essential
feature more directly.
The propositions to be compared empiri-
cally refer to the effects of aggregate de-
mand policies which tend to move infla-
tion rates and output (relative to trend) in
the same direction, or alternatively, un-
employment and inflation in opposite
directions. The conventional Phillips curve
account of this observed co-movement says
that the terms of the tradeoff arise from
relatively stable structural features of the
economy, and are thus independent of the
nature of the aggregate demand policy
pursued. The alternative explanation of
the same observed tradeoff is that the
positive association of price changes and
output arises because suppliers misinter-
pret general price movements for relative
price changes. It follows from this view,
32. first, that changes in average inflation
rates will not increase average output, and
secondly, that the higher the variance in
average prices, the less "favorable" will be
the observed tradeoff.
The most natural cross-national com-
parison of these propositions would seem
to be a direct examination of the associa-
tion of average inflation rates and average
output, relative to "normal" or "full em-
ployment." Unfortunately, there seems to
be no satisfactory way to measure normal
output. The deviation-from-fitted-trend
method I have used defines normal output
to be average output. The use of unem-
ployment series suffers from the same diffi-
culty, since one must somehow select the
(obviously positive) rate to be denoted full
employment.
33. Thus although the issue revolves around
the relation between means of inflation
and output rates, it cannot be resolved by
examination of sample averages. Fortu-
nately, the existence of a stable tradeoff
also implies a relationship between vari-
ances of inflation and output rates, as
illustrated in Figure 1. With a stable
tradeoff, policies which lead to wide varia-
tion in prices must also induce comparable
variation in real output.' If these sample
variances do not tend to move together
(and, as Table 1 shows, they do not) one
A Pt'
Var(yct) 'a2 Varf(APt) 0 0
00
0
0 Stable Price Country
0 Volatile Price Country
34. Yct
FIGURE 1
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
334 THE AMERICAN ECONOMIC REVIEW JUNE 1973
can only conclude that the tradeoff tends
to fade away the more frequently it is
used, or abused.
This simple argument leads to a formal
test if the output-inflation association is
entirely contemporaneous. In fact, how-
ever, it involves lagged effects which make
a direct comparison of variances, as just
suggested, difficult in short time-series.
Accordingly, it has been necessary to im-
pose a specific, simple structure on the
35. data. As we have seen, this structure ac-
counts for output and inflation rate move-
ments only moderately well, but well
enough to capture the main phenomenon
predicted by the natural rate theory: the
higher the variance of demand, the more
unfavorable are the terms of the Phillips
tradeoff.
REFERENCES
M. Friedman, A Theory of the Consumption
Function, Princeton 1957.
, "The Role of Monetary Policy,"
Amer. Econ. Rev., Mar. 1968, 58, 1-17.
D. F. Gordon and A. Hynes, "On the Theory
of Price Dynamics," in E. S. Phelps et al.,
Micro-economics of Inflation and Employ-
ment Theory, New York 1969.
R. E. Lucas, Jr., "Expectations and the Neu-
trality of Money," J. Econ. Theor., Apr.
36. 1972, 4, 103-24.
, "Econometric Testing of the Natural
Rate Hypothesis," Conference on the Econ-
ometrics of Price Determination, Washing-
ton 1972, 50-59.
and L. A. Rapping, "Real Wages,
Employment and the Price Level," J. Polit.
Econ., Sept./Oct. 1969, 77, 721-54.
_ and , "Unemployment in the
Great Depression: Is There a Full Expla-
nation?", J. Polit. Econ., Jan./Feb. 1972, 80,
186-91.
J. F. Muth, "Rational Expectations and the
Theory of Price Movements," Economet-
rica, July 1961, 29, 315-35.
E. S. Phelps, introductory chapter in E. S.
Phelps et al., Micro-economics of Inflation
and Employment Theory, New York 1969.
37. F. Raines, "Macroeconomic Demand and Sup-
ply: an Integrative Approach," Washing-
ton Univ. working paper, Apr. 1971.
A. Rees, "On Equilibrium in Labor Markets,"
J. Polit. Econ., Mar./Apr. 1970, 78, 306-10.
P. Taubman and M. Wilkinson, "User Cost,
Capital Utilization and Investment
Theory," Int. Econ. Rev., June 1970, 11,
209-15.
United Nations, Department of Economic
and Social Affairs, United Nations Statisti-
cal Office, Yearbook of National Accounts
Statistics, 66 and 68, New York 1958.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:06:22 UTC
All use subject to http://about.jstor.org/terms
Contentsimage 1image 2image 3image 4image 5image 6image
7image 8image 9Issue Table of ContentsThe American
Economic Review, Vol. 63, No. 3, Jun., 1973[Photograph]: Carl
S. Shoup: Distinguished Fellow 1972Front MatterModern
38. Economic Growth: Findings and Reflections [pp. 247 - 258]The
Economics of the Network-Affiliate Relationship in the
Television Broadcasting Industry [pp. 259 - 268]On The
Theory of Optimal Investment, Dividends, and Growth in the
Firm [pp. 269 - 279]Private Demands for Public Goods [pp.
280 - 296]Intermediate Products and the Pure Theory of
International Trade: A Neo-Hecksher-Ohlin Framework [pp.
297 - 311]The "F-Twist" and the Methodology of Paul
Samuelson [pp. 312 - 325]Some International Evidence on
Output-Inflation Tradeoffs [pp. 326 - 334]A Note on the
Concept and Measure of Consumer's Surplus [pp. 335 -
344]Monetary and Fiscal Policy in a Two-Sector Aggregative
Model [pp. 345 - 365]The Demand for State and Local
Government Employees [pp. 366 - 379]Quarterly Models of
Wage Determination: Some New Efficient Estimates [pp. 380 -
389]Quit Rates Over Time: A Search and Information Approach
[pp. 390 - 401]Employment Impacts of Textile Imports and
Investment: A Vintage-Capital Model [pp. 402 - 416]A
Generalization of the Pure Theory of Public Goods [pp. 417 -
432]The Supply of Rental Housing: Comment [pp. 433 -
436]The Supply of Rental Housing: Reply [pp. 437 -
438]Property Crime and Economic Behavior: Some Empirical
Results [pp. 439 - 446]A Skeptical Note on "The Optimality" of
Wage Subsidy Programs [pp. 447 - 453]Optimal Mechanisms
for Income Transfer: Note [pp. 454 - 457]On Terms of Foreign
39. Borrowing [pp. 458 - 461]Recent Exercises in Growth
Accounting: New Understanding or Dead End? [pp. 462 -
468]Faculty Salaries, Promotions, and Productivity at a Large
University [pp. 469 - 477]Making Inferences from Controlled
Income Maintenance Experiments [pp. 478 - 483]The
Production Function and the Theory of Distributive Shares [pp.
484 - 489]The Neoclassical Theory of Technical Progress: Note
[pp. 490 - 493]International Trade and the Forward Exchange
Market [pp. 494 - 503]Announcement [p. 504]Notes [pp. 505 -
508]
Macroeconomics and Reality
Author(s): Christopher A. Sims
Source: Econometrica, Vol. 48, No. 1 (Jan., 1980), pp. 1-48
Published by: The Econometric Society
Stable URL: http://www.jstor.org/stable/1912017
Accessed: 08-02-2017 17:04 UTC
JSTOR is a not-for-profit service that helps scholars,
researchers, and students discover, use, and build upon a wide
40. range of content in a trusted
digital archive. We use information technology and tools to
increase productivity and facilitate new forms of scholarship.
For more information about
JSTOR, please contact [email protected]
Your use of the JSTOR archive indicates your acceptance of the
Terms & Conditions of Use, available at
http://about.jstor.org/terms
The Econometric Society is collaborating with JSTOR to
digitize, preserve and extend access to
Econometrica
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
E C O N O M E T R I C A
VOLUME 48 JANUARY, 1980 NUMBER 1
41. MACROECONOMICS AND REALITY'
BY CHRISTOPHER A. SIMS
Existing strategies for econometric analysis related to
macroeconomics are subject to a
number of serious objections, some recently formulated, some
old. These objections are
summarized in this paper, and it is argued that taken together
they make it unlikely that
macroeconomic models are in fact over identified, as the
existing statistical theory usually
assumes. The implications of this conclusion are explored, and
an example of econometric
work in a non-standard style, taking account of the objections
to the standard style, is
presented.
THE STUDY OF THE BUSINESS cycle, fluctuations in
aggregate measures of
economic activity and prices over periods from one to ten years
or so, constitutes
or motivates a large part of what we call macroeconomics.
Most economists would
42. agree that there are many macroeconomic variables whose
cyclical fluctuations
are of interest, and would agree further that fluctuations in
these series are
interrelated. It would seem to follow almost tautologically that
statistical models
involving large numbers of macroeconomic variables ought to
be the arena within
which macroeconomic theories confront reality and thereby
each other.
Instead, though large-scale statistical macroeconomic models
exist and are by
some criteria successful, a deep vein of skepticism about the
value of these models
runs through that part of the economics profession not actively
engaged in
constructing or using them. It is still rare for empirical research
in macro-
economics to be planned and executed within the framework of
one of the large
models. In this lecture I intend to discuss some aspects of this
situation, attempting
both to offer some explanations and to suggest some means for
improvement.
43. I will argue that the style in which their builders construct
claims for a
connection between these models and reality-the style in which
"identification"
is achieved for these models-is inappropriate, to the point at
which claims for
identification in these models cannot be taken seriously. This is
a venerable
assertion; and there are some good old reasons for believing
it;2 but there are also
some reasons which have been more recently put forth. After
developing the
conclusion that the identification claimed for existing large-
scale models is
incredible, I will discuss what ought to be done in
consequence. The line of
argument is: large-scale models do perform useful forecasting
and policy-analysis
functions despite their incredible identification; the restrictions
imposed in the
usual style of identification are neither essential to
constructing a model which can
perform these functions nor innocuous; an alternative style of
identification is
44. available and practical.
Finally we will look at some empirical work based on an
alternative style of
macroeconometrics. A six-variable dynamic system is
estimated without using
1 Research for this paper was supported by NSF Grant Soc-76-
02482. Lars Hansen executed the
computations. The paper has benefited from comments by many
people, especially Thomas J. Sargent
and Finn Kydland.
2 T. C. Liu [15] presented convincing arguments for the
assertion in a classic article.
1
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
2 CHRISTOPHER A. SIMS
45. theoretical perspectives. Under a sophisticated neo-monetarist
interpretation, a
restriction on the system which implies that monetary policy
shocks could explain
nearly all cyclical variation in real variables in the economy is
tested and rejected.
Under a more standard macroeconometric interpretation, a
restriction which is
treated as a maintained hypothesis in econometric work with
Phillips curve "wage
equations" or paired wage and price equations is also rejected.
1. INCREDIBLE IDENTIFICATION
A. The Genesis of "A Priori Restrictions"
When discussing statistical theory, we say a model is identified
if distinct points
46. in the model's parameter space imply observationally distinct
patterns of behavior
for the model's variables. If a parameterization we derive from
economic theory
(which is usually what we mean by a "structural form" for a
model) fails to be
identified, we can always transform the parameter space so that
all points in the
original parameter space which imply equivalent behavior are
mapped into the
same point in the new parameter space. This is called
normalization. The obvious
example is the case where, not having an identified
simultaneous equation model
in structural form, we estimate a reduced form instead. Having
achieved
identification by normalization in this example, we admit that
the individual
equations of the model are not products of distinct exercises in
economic theory.
47. Instead of using a reduced form, we could normalize by
requiring the residuals to
be orthogonal across equations and the coefficient matrix of
current endogenous
variables to be triangular. The resulting normalization into
Wold causal chain
form is identified, but results in equations which are linear
combinations of the
reduced form equations. Nobody is disturbed by this situation
of multiple possible
normalizations.
Similarly, when we estimate a complete system of demand
equations, we
recognize that the set of equations, in which each quantity
appears only once, on
the left-hand-side of one equation in the system, and all prices
appear on the right
of each equation, is no more than one of many possible
normalizations for a system
48. of equations describing demand behavior. In principle, we
realize that it does not
make sense to regard "demand for meat" and "demand for
shoes" as the products
of distinct categories of behavior, any more than it would make
sense to regard
"price of meat" and "price of shoes" equations as products of
distinct categories
of behavior if we normalized so as to reverse the place of
prices and quantities in
the system. Nonetheless we do sometimes estimate a small part
of a complete
demand system together with part of a complete supply system-
supply and
demand for meat, say. In doing this, it is common and
reasonable practice to make
shrewd aggregations and exclusion restrictions so that our
small partial-equili-
brium system omits most of the many prices we know enter the
demand relation in
principle and possibly includes a shrewdly selected set of
exogenous variables we
expect to be especially important in explaining variation in
meat demand (e.g., an
49. Easter dummy in regions where many people buy hams for
Easter dinner).
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
MACROECONOMICS AND REALITY 3
While individual demand equations developed for partial
equilibrium use may
quite reasonably involve an array of restrictions appropriate to
that use, it is
evident that a system of demand equations built up
incrementally from such
partial-equilibrium models may display very undesirable
properties. In effect,
the shrewd restrictions which are useful for partial equilibrium
purposes,
50. when concatenated across many categories of demand, yield a
bad system of
restrictions.
This point is far from new, having been made, e.g., by Zvi
Griliches [9] in his
criticism of the consumption equations of the first version of
the Brookings model
and by Brainard and Tobin [3] in relation to financial sector
models in general.
And of course this same point motivates the extensive work
which has been done
on econometrically usable functional forms for complete
systems of demand
equations and factor demand equations.
The reason for re-emphasizing the dangers of one-equation-at-
a-time
specification of a large model here is that the extent to which
the distinctions
among equations in large macromodels are normalizations,
51. rather than truly
structural distinctions, has not received much emphasis. In the
version of the
FRB-MIT model reported in [2], for example, a substantial part
of the interesting
behavioral equations of the financial sector are demand
equations for particular
assets. Consumption, of course, is represented by demand
equations, and the
supply of labor and demand for housing also in principle
represent components of
a system of equations describing the public's allocations of
their resources. Thus
the strictures against one-equation-at-a-time specification
which are ordinarily
applied to the financial or consumption equations of a model as
a subgroup, really
apply to this whole set of equations.
If large blocks of equations, running across "sectors" of the
model which are
ordinarily treated as separate specification problems, are in fact
distinguished
52. from one another only by normalization, what "economic
theory" tells us about
them is mainly that any variable which appears on the right-
hand-side of one of
these equations belongs in principle on the right-hand-side of
all of them. To the
extent that models end up with very different sets of variables
on the right-hand-
sides of these equations, they do so not by invoking economic
theory, but (in the
case of demand equations) by invoking an intuitive,
econometrician's version of
psychological and sociological theory, since constraining
utility functions is what is
involved here. Furthermore, unless these sets of equations are
considered as a
system in the process of specification, the behavioral
implications of the restric-
tions on all equations taken together may be much less
reasonable than the
restrictions on any one equation taken by itself.
The textbook paradigm for identification of a simultaneous
53. equation system is
supply and demand for an agricultural product. There we are
apt to speak of the
supply equation as reflecting the behavior of farmers, and the
demand equation as
reflecting the behavior of consumers. A similar use of
language, in which labor
supply equations are taken to apply to "workers," consumption
equations to
''consumers,' asset demand equations to ''savers,'' sometimes
obscures the
distinction in macromodels between normalized and
structurally identified
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
4 CHRISTOPHER A. SIMS
54. equations.3 On the other hand, the distinction between
"employers" and
"investigators" on the one hand, and "consumers" and
"workers" on the other,
does have some structural justification. There certainly are
policies which can
drive a wedge between supply and demand prices for
transactions between the
"business" and "household" sectors, which is roughly the
distinction with which
we are concerned. Furthermore, if business behavior is taken to
be competitive,
the business sector simply traces out the efficient envelope of
available technology
in response to demand shifts. Then the distinction between
business and house-
holds becomes the distinction between "nature" and "tastes" on
55. which
identification in the supply-demand paradigm rests. The idea
that weather affects
grain supply and not (much) grain demand, while the ethnic and
demographic
structure of the population affects grain demand but not (much)
grain supply, is a
powerful source of identifying restrictions. The same nature-
tastes distinction is a
source of powerful identifying restrictions in large
macromodels, but the number
of such restrictions available is not large relative to the number
of equations and
variables in large macromodels.
B. Dynamics
The fact that large macroeconomic models are dynamic is a
rich source of
spurious "a priori" restrictions, as we shall see below, but it
56. also weakens the few
legitimate bases for generating identifying restrictions alluded
to in the previous
section. If we accept the modern anti-interventionist school's
argument that
dynamic macroeconomic models ought not to violate the
principle that markets
clear,4 then dynamics do not raise new problems in this
respect; the business
sector singlemindedly pursues profit, according to the
directions of the observable
price vector, so that the difference between the business sector
of a dynamic model
and that of a static model is only in whether the efficiency
frontier traced out has
dynamic elements. If instead we take the view that prices
themselves may adjust
sluggishly, we enter the wilderness of "disequilibrium
57. economics." This phrase
must, it seems to me, denote a situation in which we cannot
suppose that business
behavior is invariant under changes in the public's tastes. The
reason is that
business behavior, when markets don't clear, must depend not
only on hypotheti-
cal business demands and supplies given current prices, but
also on the nature of
whatever rationing is currently going on-e.g. on the excess
demand of Walrasian
theory. If the degree of excess demand or supply in the labor
market enters
3In Hurwicz's [12] abstract discussion of structural systems it
is apparent that an equation system
identified by normalization is not an identified structure. An
identified structural equation is one which
uniquely remains invariant under a certain class of
"interventions" in the system. In the supply and
demand paradigm, the natural class of interventions to consider
58. is excise taxes. It is not impossible that
a system of demand equations be structural in Hurwicz's sense-
McFadden [19] has provided an
instance of a structural interpretation of a sort of demand
equation, in which the identifying
interventions are deletions or additions in the list of available
commodities. But nothing like
McFadden's analysis exists or is likely to be developed to
justify structural distinction between labor
supply and consumption, for example.
4 This position is set forth persuasively by Lucas [16].
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
MACROECONOMICS AND REALITY 5
employer behavior, then by that route any variable which we
think of as connected
to labor supply decisions enters the dynamic labor demand
59. equation.
J. D. Sargan [25] several years ago considered the problem of
simultaneous-
equation identification in models containing both lagged
dependent variables and
serially correlated residuals. He came to the reassuring
conclusion that, if a few
narrow-looking special cases are ruled out, the usual rules for
checking
identification in models with serially uncorrelated residuals
apply equally well to
models with serially correlated residuals. In particular, it
would ordinarily be
reasonable to lump lagged dependent variables with strictly
exogenous variables
in checking the order condition for identification, despite the
fact that a consistent
estimation method must take account of the presence of
correlation between
lagged dependent variables and the serially correlated
residuals. Though consis-
60. tent estimation of such models poses formidable problems,
Sargan's analysis
suggested that identification is not likely to be undermined by
the combination of
lagged dependent variables and serial correlation.
Recent work by Michio Hatanaka [11], however, makes it clear
that this
sanguine conclusion rests on the supposition that exact lag
lengths and orders of
serial correlation are known a priori. On the evidently more
reasonable assump-
tion that lag lengths and shapes of lag distributions are not
known a priori,
Hatanaka shows that the order condition takes on an altered
form: we must in this
case cease to count repeat occurrences of the same variable,
with different lags, in
a single equation. In effect, this rule prevents lagged dependent
variables from
playing the same kind of formal role as strictly exogenous
variables in
identification; we must expect that to identify an equation we
will have to locate
61. in other equations of the system at least one strictly exogenous
variable to serve
as an instrument for each right-hand-side endogenous variable
in the given
equation.
Application of Hatanaka's criterion to large-scale macromodels
would prob-
ably not suggest that they are formally unidentified. The
version of the FRB-MIT
model laid out in [10], e.g., has over 90 variables categorized
as strictly exogenous,
while most equations contain no more than 6 or 8 variables.
However the
Hatanaka criterion, by focusing attention more sharply on the
distinction between
endogenous and strictly exogenous variables, might well result
in models being
respecified with shorter lists of exogenous variables. Many,
perhaps most, of the
62. exogenous variables in the FRB -MIT model [10] or in Fair's
model [6] are treated
as exogenous by default rather than as a result of there being
good reason to
believe them strictly exogenous. Some are variables treated as
exogenous only
SBy saying that it is evidently more reasonable to assume we
do not know lag lengths and shapes a
priori, I do not mean to suggest that one should not impose
restrictions of a reasonable form on lag
lengths and shapes in the process of estimation. However, we
should recognize that truncating lag
distributions is part of the process of estimaton-lag length is
itself estimated one way or another-and
that when our model is not identified without the pretense that
we know lag length to begin with, it is
just not identified. A similar point applies to "identifying"
simultaneous equations models by
imposing "a priori" constraints that coefficients which prove
statistically insignificant are zero. Setting
such coefficients to zero may be a justifiable part of the
estimation process, but it does not aid in
identification.
63. This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
6 CHRISTOPHER A. SIMS
because seriously explaining them would require an extensive
modeling effort in
areas away from the main interests of the model-builders.
Agricultural price and
output variables, the price of imported raw materials, and the
volume of exports
are in this category in the FRB-MIT model. Other variables are
treated as
exogenous because they are policy variables, even though they
evidently have a
substantial endogenous component. In this category are the
64. Federal Reserve
discount rat., federal government expenditures on goods and
services, and other
variables. It appears to me that if the list of exogenous
variables were carefully
reconsidered and tested in cases where exogeneity is doubtful,
the identification of
6
these models might well, by Hatanaka's criterion, fail, and
would at best be weak,
even if the several other sources of doubt about identifying
restrictions in
macromodels listed in this paper are discounted.
C. Expectations
It used to be that when expected future values of a variable
were thought to be
important in a behavioral equation, they were replaced by a
65. distributed lag on that
same variable. Whatever else may be said for or against it, this
practice had the
advantage of producing uncomplicated effects on
identification. As the basis in
economic theory for such simple treatments of expectations has
been examined
more critically, however, it has become apparent that they are
unsound, and that
sound treatments of expectations complicate identification
substantially. Whether
or not one agrees that economic models ought always to assume
rational behavior
under uncertainty, i.e. "rational expectations," one must agree
that any sensible
treatment of expectations is likely to undermine many of the
exclusion restrictions
66. econometricians had been used to thinking of as most reliable.
However certain
we are that the tastes of consumers in the U.S. are unaffected
by the temperature
in Brazil, we must admit that it is possible that U.S. consumers,
upon reading of a
frost in Brazil in the newspapers, might attempt to stockpile
coffee in anticipation
of the frost's effect on price. Thus variables known to affect
supply enter the
demand equation, and vice versa, through terms in expected
price.
But though analysis of rational expectations raises this problem
for us, by
carrying through with that analysis we may achieve
identification again by a new
route. The rational expectations hypothesis tells us
expectations ought to be
67. formed optimally; by restricting temperature in Brazil to enter
U.S. demand for
coffee only through its effect on the optimal forecast of price,
we may again
identify the demand equation. Wallis [33] and Sargent [26]
(among others) have
shown how this can be done. Lucas [16] in fact suggested that
this be done in some
of the earliest work on the implications of rational expectations
for macro-
economics.
6In this case of serial correlation of undetermined form and
lagged dependent variables with
undetermined lag lengths, the model is identified by the
relation between structural parameters and
the distributed lag regressions of endogenous variables on
strictly exogenous variables. When the
strictly exogenous variables have low explanatory power,
estimates of the endogenous-on-exogenous
regressions are likely to be subject to great sampling error, and
the identification may be said to be
weak.
68. This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
MACROECONOMICS AND REALITY 7
It is my view, however, that rational expectations is more
deeply subversive of
identification than has yet been recognized. When we follow
Hatanaka in remov-
ing the crutch of supposed a priori knowledge of lag lengths,
then in the absence of
expectational elements, we find the patient, though perhaps
wobbly, re-establish-
ing equilibrium. At least the classical form of identifying
restriction, the nature-
vs.-tastes distinction that identifies most supply and demand
models, is still in a
69. form likely to work under the Hatanaka criterion. In the
presence of expectations,
it turns out that the crutch of a priori knowledge of lag lengths
is indispensable,
even when we have distinct, strictly exogenous variables
shifting supply and
demand schedules.
The behavioral interpretation of this identification problem can
be displayed in
a very simple example.7 Suppose a firm is hiring an input,
subject to adjustment
costs, and that input purchase decisions have to be made one
period in advance of
actual production. Suppose further that the optimization
problem has a quadratic-
linear structure (justifying certainty-equivalence) and that the
only element of
70. uncertainty is a stochastic process shifting the demand curve.
In this situation, the
firm's hiring decisions will depend on forecasts of the demand-
shift variable. But
suppose that the demand-shift process is a martingale -
increment process-that is,
suppose that the expected value of all future demand shifts is
always the mean
value of that variable. Then the expected future demand curve
is always the same,
input hiring decisions are always the same, and we obviously
cannot hope to
estimate from observed firm behavior the parameters of the
dynamic production
function.
Special though it may seem, this example is representative of a
general problem
with models incorporating expectations. Such models will
generally imply that
behavior depends on expected values of future prices (or of
71. other variables). In
order to guarantee that we can discover from observed behavior
the nature of that
dependence on future prices, we must somehow insure that
expected future prices
have a rich enough pattern of variation to identify the
parameters of the structure.
This will ordinarily require restrictions on the form of serial
correlation in the
exogenous variables.
Of course, in a sense these problems are not fresh. If we want
to estimate a
distributed lag regression of y on x we must always restrict x
not to be identically
zero. The new element is that when we try to estimate a
distributed lag regression
of y on x and expected future x, the variation in the expected
future x will always
be less rich than that in the past x, so that the required
restrictions are likely to be
72. an order of magnitude more stringent in rational expectations
models. To take a
slightly more elaborate example, suppose our behavioral model
is
(1) c*y(t) =b-*p(t)+ p
where "*" denotes continuous-time convolution, Pt is the
stochastic process of
expected values of p given information available at time t,
b+(s) = 0 for s > 0, and
7 Robert Solow used essentially the same example in published
comments [31] on earlier work of
mine.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
8 CHRISTOPHER A. SIMS
b -(s) = 0 for s < 0, and c (s) = 0, s < O.8 To be explicit about
73. the notation, (1) could
be written as
(2) c* y(t)= Vb(s)p(t -s) ds + b +(-s) ',(t +s) ds.
o~~~~~~~
Now suppose that the only information available at t is current
and past values of
p, and suppose further that p is a stationary first-order Markov
process, i.e. that p
can be thought of as generated by the stochastic differential
equation
(3) p (t) = -rp (t) + e (t),
with e a white noise process. It then follows that
(4) At(t+s)=e rs(t), alls>0.
Therefore (1) takes the form
(5) c*y(t) = b-*p(t) + p(t)g(b+),
where the function g is given by g(b+) = lo b+(s) e-rs ds. While
74. we can expect to
recover g(b+) from the observed behavior of y and p,
knowledge of g(b+) will not
in general determine b+ itself unless we have available enough
restrictions on b+
to make it a function of a single unknown parameter. First-
order Markov
processes are widely used as examples in econometric
discussions because of their
analytic convenience, and they do not of course pose any
identification problem
for the estimation of b- the past of p will show adequately rich
variation to
identify b- even if our parameter space for b- is infinite-
dimensional. This
distinction, the need for enough restrictions to make b+ lie in a
one-dimensional
space while b- need only be subject to weak damping or
smoothness restrictions
for identification, is the order-of-magnitude difference in
stringency to which I
referred above.
75. At this point two lines of objection to the above argument-by-
example may
occur to you. In the first place, might we not be dealing with a
hairline category of
exceptional cases? For example, what if we ruled out all finite-
order Markov
processes for p in the preceding example? This is a small
subset of all stationary
processes, yet ruling it out would invalidate the dimensionality
argument used to
show b+ not to be identified. In the second place, isn't it true
that in most
applications, c, b+, and b- are not separately parameterized, so
that information
about c and b-, which we agree is available, will help us
determine b+? The latter
line of objection is correct as far as it goes, and will be
discussed below. The former
line of objection is not valid, and the next paragraphs contain
an argument that
this identification problem is present no matter what stationary
process generates
p in the example. Since the argument gets technical, readers
with powerful
76. intuition may wish to skip it.
8 Though it does not matter for our argument, in actual
examples c, b-, and b+ may be generalized
functions, so that c*y, e.g., may be a linear combination of
derivatives of y.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
MACROECONOMICS AND REALITY 9
If, say, b+ is square-integrable and p is a stationary process
with bounded
spectral density, then the term b+*pt(t) in (1) itself is a
stationary process.
Furthermore, the prediction error from using b+ in (1) when b+
is correct is a
stationary stochastic process with variance given by
s2(b+, b+) = Ilb+ - b IjR, where
77. (6) lifliR = [Jf (s)f(u)R (s, u) dsdu] and
R(s, u) =E[t(t + S) A(t + u)].
Now under fairly weak restrictions requiring some minimal rate
of damping in the
autocorrelation function of p, we will have an inequality of the
form
(7) 1 R (s, u) I < Rl(s)R2(u -s), for u > s,
where R1(s) - 0 monotonically as s -* oo and R2 is integrable.9
We define the translation operator by Tsf(t) = f(t - s). Then
from (7) and the
definition of 11 ||R in the second line of (6) we get, for f(t) = 0,
t < 0,
(8) IITsfII2 - R1(s) JJ f(v)f(u)R2(u -v) du dv.
Therefore IITsfllR - 0 as s -* 00, and we have proved the
following proposition.
PROPOSITION: If the moving average representation of p has
a weighting function
78. which is O(S-2), then no translation-invariant functional is
continuous with respect
to the norm 11 ||R defined in the second line of (6).
Obviously this means that the L2 and L1 norms are not
continuous with respect
to 11 ||R. Putting this result in somewhat more concrete terms,
we have shown that
when p meets the conditions of the proposition, we can make
the effect of
estimation error on the fit of equation (1), given by s2(b+ b+),
as small as we like,
while at the same time making the integrated squared or
absolute deviations
between b+ and b+ as large as we like. The fit of equation (1)
cannot be used to fix
the shape of b+, under these general conditions.
Somehow, then, we must use information on the relation of c
and b- to b+ or
other prior information to put substantial restrictions on b+ a
priori. Restriction
on the relation of c and b- to b+ are especially promising, since
c and b- are in
general identified without strong prior restrictions. For
79. example, a symmetry
restriction, requiring c-1 and b+ to be mirror images, which
does emerge from
some optimization problems, would be enough to identify b+.
On the other hand,
many behavioral frameworks leave parameters which
economists would not
ordinarily fix a priori dependent on the difference in shape
between b+ and c,
9The process p has the moving average representation p(t) =
a*e(t), with e white noise: Then
R(s, u)=fs a(v)a(v+u-s)dv for s<u. If we then assume, for
example, that a(s)s2 is bounded,
which would follow if p were assumed to have a spectral
density with integrable fourth derivative, then
it is not hard to verify that R1 can be taken to have the form
A(1 + s)f3 and R2 the form B(1 + u - s)-2.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
80. 10 CHRISTOPHER A. SIMS
which is precisely what will be hard to estimate. The following
example illustrates
the point.
Suppose firms maximize the expected discounted present value
of revenue,
given by
00
(9) e- P(t-s)(Q2(t) -P(t)(8K(t) + K(t)) _ 0(K(t) + 6K(t))2) dt
subject to
(10) Q(t) = aK(t) - AK2(t).
The interpretation is that P(t) is the price of the fixed factor
input K, a is the
depreciation rate, p is the interest rate, and 0 determines the
output foregone as
the rate of gross investment increases.
81. The first-order conditions for a solution to this equation give
us
(11 ) (D 2- pD - (A/0) -_p _-82)K = (3 + p - D)P/20 - a/20,
where D is the derivative operator. Firms taking P as
exogenous will, at each s,
choose a solution to (1 1) from time s onward, using Ps (t + s)
in their computations
in place of P itself. Since the Ps series and the problem's initial
conditions change
with s, (11) itself does not apply to observed K and P. If,
however, we assume that
firms have enough foresight not to choose solutions to (11)
along which K
diverges exponentially from its static optimum value, then we
will find the
following equation holding at each s:
(12) (D +MD)K(s) = (D +M2) 1(3 + p -D)P (s)/20 - (a/20M2),
where M1 and M2 are the two roots (with signs reversed) of the
polynomial in D on
the left of (11). These two roots will always be of opposite
82. sign, and M2 is negative,
so that (D +M2)-1 operates only on the future of the function to
which it is
applied. It is not hard to verify that the roots have the form
(13) Mi =M /p2+4((A/O)+p+2)-p]; M2 =-p -Mi.
In the case p = = 0, we get -M1 = M2, so that from knowledge
of M1 we obtain
M2 and thereby the entire operator applied to expected future P
in (12). The only
way identification could be frustrated would be for expected
future P to show no
variation at all, so that K itself became constant. It should be
noted, that this
could, of course, happen without P being constant. If P were a
moving average
process of the form P= a*e, with a(s) = 0 for s > T, and if at
time t firms know
only the history of P up to time t - T, then Ps is identically
equal to P's
unconditional mean. In the more interesting case where the
information set
includes current P, identification problems arise only with p
83. and a 0 0.
With p and 3 non-zero, equation (12) involves five coefficients,
all functions of
the five unknown parameters of the model. If Ps were a
stationary process, we
would be justified in following our instincts in declaring all
structural parameters
identified. However, in fact identification depends on there
being sufficient
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
MACROECONOMICS AND REALITY 11
independent variation between the time path of expected future
levels of P and
expected future derivatives of P. With p or 3 nonzero, the
operator applied to PS
on the right side of (12) differs from that applied to K on the
84. left by more than a
reflection. Even if we know p a priori (by reading the financial
press), first-order
Markov behavior for P implies that a is not identified
(assuming still that past P
makes up the information set). In the-first-order Markov case
with P = -rP + e, we
have (d/dt)PS(t) = -rPs(t) for all t > s. Thus (12) becomes,
when we replace Ps by
its observable counterpart,
(14) (D + M1)K(s) = -(8 + p + r)P(s)/20(M2 + r) + (a/20M2).
The separate coefficients on Ps and its derivative in (12) have
merged into one,
leaving the structural parameters unidentifiable from the
relation of the obser-
vable variables. In particular, one can see by examining (14)
and (13) that one
could vary 8, 0, a, and A in such a way as to leave coefficients
in (14) unchanged
even for fixed p and r, so that knowing r from the data and p a
85. priori will not suffice
to identify the model.
D. Concrete Implications
Were any one of the categories of criticism of large-model
identification
outlined in the preceding three sections the only serious
criticism, it would make
sense to consider existing standard methodology as a base from
which to make
improvements. There is much good work in progress on
estimating and specifying
systems of demand relations. Some builders of large models are
moving in the
direction of specifying sectoral behavior equations as
systems.10 There is much
good work in progress on estimating dynamic systems of
equations without getting
fouled up by treating knowledge of lag lengths and orders of
serial correlation as
exact. There is much good work treating expectations as
rational and using the
86. implied constraints in small systems of equations. Rethinking
structural
macromodel specification from any one of these points of view
would be a
challenging research program. Doing all of these things at once
would be a
program which is so challenging as to be impossible in the
short run.
On the other hand, there is no immediate prospect that large-
scale
macromodels will disappear from the scene, and for good
reason: they are useful
tools in forecasting and policy analysis.
How can the assertion that macroeconomic models are
identified using false
assumptions be reconciled with the claim that they are useful
tools? The answer is
that for forecasting and policy analysis, structural
identification is not ordinarily
needed and that false restrictions may not hurt, may even help a
model to function
in these capacities.
87. Textbook discussions sometimes suggest that structural
identification is neces-
sary in order for a model to be used to analyze policy. This is
true if "structure"
10For example, Fair [6] takes this approach in principle,
though his empirical equations are
specified with a single-equation approach to forming lists of
variables. Modigliani [20] reports that the
MPS model (like the Fair model) has interest rates turning up
in many household behavioral equations.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
12 CHRISTOPHER A. SIMS
and "identification" are interpreted in a broad way. A structure
is defined (by me,
following Hurwicz [12] and Koopmans [13]) as something
88. which remains fixed
when we undertake a policy change, and the structure is
identified if we can
estimate it from the given data. But in this broad sense, when a
policy variable is an
exogenous variable in the system, the reduced form is itself a
structure and is
identified. In a supply and demand example, if we contemplate
introducing an
excise tax into a market where none has before existed, then we
need to be able
to estimate supply and demand curves separately. But if there
has previously been
an excise tax, and it has varied exogenously, reduced form
estimation will allow us
accurately to predict the effects of further changes in the tax.
Policy analysis in
macromodels is more often in the latter mode, projecting the
89. effect of a change in
a policy variable, than in the mode of projecting the effect of
changing the
parameters of a model equation.
Of course, if macroeconomic policy-makers have a clear idea
of what they are
supposed to do and set about it systematically, macroeconomic
policy variables
will not be at all exogenous. This is a big if, however, and in
fact some policy
variables are close enough to exogenous that reduced forms
treating them or their
proximate determinants as exogenous may be close to structural
in the required
sense.1" Furthermore, we may sometimes be able to separate
endogenous and
exogenous components of variance in policy variables by
careful historical
analysis, in effect using a type of instrumental variables
procedure for estimating a
90. structural relation between policy variables and the rest of the
economy.
Lucas' [16] critique of macroeconomic policy making goes
further and argues
that, since a policy is not really just one change in a policy
variable, but rather a
rule for systematically changing that variable in response to
conditions, and since
changes in policy in this sense must be expected to change the
reduced form of
existing macroeconometric models, the reduced form of
existing models is not
structural even when policy variables have historically been
exogenous-institu-
tion of a nontrivial policy would end that exogeneity and
thereby change expec-
tation formation rules and the reduced form.
There is no doubt that this position is correct, if one accepts
this definition of
policy formation. One cannot choose policy rules rationally
91. with an econometric
model in which the structure fails to include realistic
expectation formation.
However what practical men mean by policy formation is not
entirely, probably
not even mainly, choice of rules of this sort. Policy makers do
spend considerable
effort in comparing projected time paths for variables under
their control. As
Prescott and Kydland [23] have recently shown, making policy
from such pro-
jections, while ignoring the effect of policy on expectation-
formation rules, can
lead to a very bad time path for the economy, under some
assumptions. Or, as
Sargent and Wallace [29] have shown, it can on other
assumptions be merely a
charade, with the economy's real variables following a
stochastic process which
cannot be affected by any such exercises in choice of time
paths for policy
variables.
92. 1 l We shall see below, for example, that in Germany and the
U.S. money supply, while not entirely
exogenous, has an exogenous component which accounts for
much of its variance.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
MACROECONOMICS AND REALITY 13
I do not think, however, that practical exercises in conditional
projection of
effects of policy are either charades or (usually) Prescott and
Kydlund's case of
"Peter White" policy making.12 Suppose it were true that the
policy rule did make
a difference to the economy. There are many ways to argue that
this is true in the
face of Sargent's [28] or Sargent and Wallace's [29] analysis,
all being suggestions
for forms of non-neutrality of money. To be concrete, suppose
93. that the real
variables of the economy do follow a stochastic process
independent of the money
supply rule but that for some reason the rate of inflation enters
the social utility
function. 13 Then the optimal form for macropolicy will be
stabilization of the price
level.14 If we could agree on a stable model in which all forms
of shock to the
aggregate price level were specified a priori, then it would be
easy in principle to
specify an appropriate function mapping past values of
observed macrovariables
into current levels of policy variables in such a way as to
minimize price variance.
However, if disturbances in the economy can originate in a
variety of different
ways, the form of this policy reaction function may be quite
complicated.
It is much easier simply to state that policy rule is to minimize
the variance
of the price level. Furthermore, if there is uncertainty about the
structure of
94. the economy, then even with a fixed policy objective function,
widely understood,
the form of the dependence of policy on observed history will
shift over time as
more is learned about (or as opinions shift about) the structure
of the economy.
One could continually re-estimate the structure and, each
period, re-announce an
explicit relation of policy variables to history. However it is
simpler to announce
the stable objective function once and then each period solve
only for this period's
policy variable values instead of computing a complete policy
reaction function.
This is done by making conditional projections from the best
existing reduced
form model, and picking the best-looking projected future time
path. Policy
choice is then most easily and reliably carried out by
comparing the projected
effects of alternative policies and picking the policy which
most nearly holds the
95. price level constant. Accurate projections can be made from
reduced form models
fit to history because it is not proposed to change the policy
rule, only to
implement effectively the existing rule.
12 Peter White will ne'er go right/ Would you know the reason
why?/ He follows his nose where'er
he goes/ And that stands all awry.-Nursery Rhyme.
13 It is a little hard to imagine why the rate of inflation should
matter if it affects no real variables. A
more realistic and complicated scenario would suppose that
there are costs to writing contingencies
into contracts, and enforcing contracts with complicated
provisions, so that a macropolicy which
stabilizes certain macroeconomic aggregates-prices, wages,
unemployment rates, etc.-may simplify
contract-writing and thus save resources. This has been made
the basis of an argument against inflation
by Arthur Okun [22].
14 Discussion of such a policy seems particularly appropriate
in the Fisher-Schultz lecture, as Irving
Fisher supported such a policy: "The more the evidence in the
96. case is studied, the deeper will grow the
public conviction that our shifting dollar is responsible for
colossal social wrongs and is all the more at
fault because those wrongs are usually attributed to other
causes. When these who can apply the
remedy realize that our dollar is the great pickpocket, robbing
first one set of people, then another, to
the tune of billions of dollars a year, confounding business
calculations and convulsing politics, and, all
the time, keeping out of sight and unsuspected, action will
follow and we shall secure a boon for all
future generations, a true standard for contracts, a stabilized
dollar" [7].
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
14 CHRISTOPHER A. SIMS
In fact, it appears to be a mistake to assume that the economy's
real variables
97. follow a process even approximately unrelated to nominal
aggregates. Thus
stabilization of the price level alone is not likely to be the best
policy. However, it
is not clear that the existing pattern of policy in most countries,
in which there is
weight given to stabilization of inflation, unemployment, and
income distribution,
is very far from an optimal policy. Simply implementing policy
according to these
objectives in the way the public expects is a highly nontrivial
task, and one in
which reduced-form modeling may be quite useful.
To summarize the argument, it is admitted that the task of
choosing among
policy regimes requires models in which explicit account is
taken of the effect
98. of policy regime on expectations. On the other hand, it is
argued that the choice of
policy regime probably does have important consequences, and
that an optimal
regime and the present regime in most countries are both most
naturally specified
in terms of the effects of policy on the evolution of the
economy, rather than in
terms of the nature of the dependence of policy on the
economy's history.
Effectively implementing a stable optimal or existing policy
regime therefore is
likely appropriately to involve reduced-form modeling and
policy projection.
But I have argued earlier that most of the restrictions on
existing models are
false, and the models are nominally over-identified. Even if we
admit that a model
whose claimed behavioral interpretation is spurious may have a
useful reduced
99. form, isn't it true that when the spurious identification results
in restrictions on the
reduced form, the reduced form is distorted by the false
identifying restrictions?
The answer is yes and no. Yes, the reduced form will be
infected by false
restrictions and may thereby become useless as a framework
within which to do
formal statistical tests of competing macroeconomic theories.
But no, the resul-
ting infection need not distort the results of forecasting and
policy analysis with
the reduced form. Much recent theoretical work gives rigorous
foundation for a
rule of thumb that in high dimensional models restricted
estimators can easily
produce smaller forecast or projection errors than unrestricted
estimators even
when the restrictions are false. Of course very false restrictions
will make forecasts
100. worse, but in large macromodels restrictions very false in the
sense of producing
very bad reduced-form fits are probably usually detected and
eliminated. Thus
models whose self-proclaimed behavioral interpretation is
widely disbelieved
may nonetheless find satisfied users as tools of forecasting and
policy projection.
Because existing large models contain too many incredible
restrictions,
empirical research aimed at testing competing macroeconomic
theories too often
proceeds in a single- or few-equation framework.15 For this
reason alone it
appears worthwhile to investigate the possibility of building
large models in a style
which does not tend to accumulate restrictions so haphazardly.
In addition,
though, one might suspect that a more systematic approach to
imposing restric-
tions could lead to capture of empirical regularities which
remain hidden to
the standard procedures and hence lead to improved forecasts
101. and policy
projections.'6
15 Modigliani [20] has used the MPS model as an arena within
which to let macroeconomic theories
confront each other, however.
16 The work of Nelson [21] and Cooper and Nelson [5]
provides empirical support for this idea.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
MACROECONOMICS AND REALITY 15
Empirical macroeconomists sometimes express frustration at
the limited
amount of information in economic time series, and it does not
infrequently turn
out that models reflecting rather different behavioral
hypotheses fit the data about
equally well. This attitude may account for the lack of previous
102. research on the
possibility of using much less parsimoniously parameterized
multiple-equation
models. It might be expected that in such a model one would
find nothing new
except a relatively larger number of "insignificant" t statistics.
Forecasts might be
expected to be worse, and the accurate picture of the relation of
data to theory one
would obtain might be expected to be simply the conclusion
that the data cannot
discriminate between competing theories.
In the next section of this paper we discuss a general strategy
for estimating
profligately (as opposed to parsimoniously) parameterized
macromodels, and
present results for a particular relatively small-scale
application.
2. AN ALTERNATIVE STRATEGY FOR EMPIRICAL
MACROECONOMICS
It should be feasible to estimate large-scale macromodels as
unrestricted
103. reduced forms, treating all variables as endogenous. Of course,
some restrictions,
if only on lag length, are essential, so by "unrestricted" here I
mean "without
restrictions based on supposed a priori knowledge." The style I
am suggesting we
emulate is that of frequency-domain time series theory (though
it will be clear I am
not suggesting we use frequency-domain methods themselves),
in which what is
being estimated (e.g., the spectral density) is implicitly part of
an infinite-
dimensional parameter space, and the finite-parameter methods
we actually use
are justified as part of a procedure in which the number of
parameters is explicitly
a function of sample size or the data. After the arbitrary
"smoothness" or
"rate-of-damping" restrictions have been used to formulate a
model which serves
to summarize the data, hypotheses with economic content are
formulated and
tested at a second stage, with some perhaps looking attractive
enough after a test
104. to be used to further constrain the model. Besides frequency-
domain work, such
methods are implicit or explicit in much distributed lag model
estimation in
econometrics, and Amemiya [1] has proposed handling serial
correlation in time
domain regression models in this style.
The first step in developing such an approach is evidently to
develop a class of
multivariate time series models which will serve as the
unstructured first-stage
models. In the six-variable system discussed below, the data
are accepting of a
relatively stringent limit on lag length (four quarters), so that it
proves feasible to
use an otherwise unconstrained (144 parameter) vector
autoregression as the
basic model. In the larger systems one will eventually want to
study this way, some
additional form of constraint, beyond lag length or damping
rate constraints, will
be necessary. Finding the best way to do this is very much an
open problem.
105. Sargent and I [281 have published work using a class of
restricted vector time
series models we call index models in macroeconomic work,
and I am currently
working on applying those methods to systems larger than that
explored below.
Priestly, Rao, and Tong [24] in the engineering literature and
Brillinger [4] have
suggested related classes of models. All of these methods in
one way or another
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
16 CHRISTOPHER A. SIMS
aim at limiting the nature of cross-dependencies between
variables. If every
variable is allowed to influence every other variable with a
distributed lag of
106. reasonable length, without restriction, the number of
parameters grows with the
square of the number of variables and quickly exhausts degrees
of freedom.
Besides the above approaches, it seems to me worthwhile to try
to invent Bayesian
approaches along the lines of Shiller's [30] and Leamer's [14]
work on distributed
lags to accomplish similar objectives, though there is no
obvious generalization of
those methods to this sort of problem.
The foregoing brief discussion is included only to dispose of
the objection that
the kind of analysis I carry out below could not be done on
systems comparable in
size to large-scale macromodels currently existing.
What I have actually done is to fit to quarterly, postwar time
series for the U.S.
and West Germany (F.R.G.) on money, GNP, unemployment
107. rate, price level,
and import price index, an unconstrained vector autoregression.
Before descri-
bing the results in detail, I will set out the two main
conclusions, to help light the
way through the technical thickets to follow.
Phillips curve equations or wage-price systems of equations are
often estimated
treating only wages or only wages and prices jointly as
endogenous. The "price"
equation is often treated as behavioral, describing the methods
firms use to set
prices for the products, while the wage or Phillips curve
equations are often
discussed as if they describe the process of wage bargaining or
are in some way
connected to only those variables (unemployment in particular)
which we asso-
ciate with the labor market. In the estimated systems for both
108. the U.S. and F.R.G.,
the hypothesis that wage or price or the two jointly can be
treated as endogenous,
while the rest of the system is taken as exogenous, is decisively
rejected. Estimates
conditioned on this hypothesis would then be biased, if the
equations did have a
structural interpretation. On the other hand, the estimated
equations, having
been allowed to take the form the data suggests, do not take the
forms commonly
imposed on them. Unemployment is not important in the
estimated wage equa-
tions, while it is of some importance in explaining prices. The
money supply has a
direct impact on wages, but not on prices.17
Sargent [27] has recently put forward a more sophisticated
version of the
109. rational expectations macromodel he had analyzed in earlier
work. He shows that
the implication of his earlier model that a variable measuring
real aggregate labor
or output should be serially uncorrelated is not a necessary
adjunct to the main
policy implication of his earlier model: that deterministic
monetary policy rules
cannot influence the form of time path of real variables in the
economy. We shall
see that such an elaborate model can take two extreme forms,
one in which the
nature of cyclical variation is determined by the parameters of
economic
behavioral relations, the other in which unexplained serially
correlated shocks to
technology and tastes account for cyclical variation. The more
satisfying extreme
form of the model, with a behavioral explanation for the form
110. of the cycle, implies
17 Some empirical macroeconomists in the U.S. have begun to
reach similar conclusions. Wachter
[32] has introduced money supply into "wage" equations, and
R. J. Gordon [8] has taken the view that
equations of this type ought to be interpreted as reduced forms.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
MACROECONOMICS AND REALITY 17
that the real variables in the economy, including relative
prices, ought to form a
vector of jointly exogenous variables relative to the money
supply, the price level,
or any other nominal aggregate. This is very far from holding
true in the system
estimated here. For the U.S., money supply, and for the F.R.G.
the price level
111. shows strong feedback into the real economy.
A. Methodological Issues
Since the model being estimated is an autoregression, the
distribution theory on
which tests are based is asymptotic. However, for many of the
hypotheses tested
the degrees of freedom in the asymptotic x distribution for the
likelihood ratio
test statistic is not a different order of magnitude from the
degrees of freedom left
in the data after fitting the model. This makes interpretation of
the tests difficult,
for a number of reasons. Even if the model were a single
equation and not
autoregressive, we know that F statistics with similar
numerator and denominator
degrees of freedom are highly sensitive to non-normality, in
contrast to the usual
112. case of numerator degrees of freedom much smaller than
denominator degrees of
freedom, where robustness to non-normality follows from
asymptotic distribution
theory. This problem is worse in the case where some
coefficients being estimated
are not consistently estimated, as will be true when dummy
variables for specific
periods are involved. If constraints being tested involve
coefficients of such
variables (as do the tests for model stability below), even F
statistics with few
numerator degrees of freedom will be sensitive to non-
normality. In the case
which seems most likely, where distributions of residuals have
fat tails, this creates
a bias toward rejection of the null hypothesis.
There is a further problem that different, reasonable-looking,
113. asymptotically
equivalent formulas for the test statistic may give very
different significance levels
for the same data. In the single equation case where k linear
restrictions are being
tested, the usual asymptotic distribution theory suggests
treating T log (1+
kF/(T - k)) as XK(k), where F is the usual F statistic and T is
sample size. Where k
is not much less than T, significance levels of the test drawn
from asymptotic
distribution theory may differ substantially from those of the
exact F test. Of
course k times the F statistic is also asymptotically X2(k) and a
test based on k is
asymptotically equivalent to the likelihood ratio test. Since
treating kF as x
ignores the variability of the denominator of F, such a
procedure has a bias against
the null hypothesis relative to the F test. The usual likelihood
ratio test shares this
bias. Furthermore, over certain ranges of values of F, including
the modal value of
1.0, the usual likelihood ratio is larger than kF and thus even
114. further biased
against the null hypothesis.
In the statistical tests reported below, I have computed
likelihood ratios as if the
sample size were T - k, where k is the total number of
regression coefficients
estimated divided by the number of equations.18 This makes
the likelihood ratio
18 That is, the usual test statistic, T(log I DR I - log I Du I) is
replaced by (T - k)(log IDR I - log I Du 1),
where DR is the matrix of cross products of residuals when the
model is restricted; DU is the same
matrix for the unrestricted model.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
18 CHRISTOPHER A. SIMS
115. tend to be smaller than kF in the single-equation case, though
whether this
improves the applicability of the distribution theory much is
certainly debatable.
In any case we shall see that most hypotheses entertained are
rejected, so this
modification of the usual likelihood ratio test in favor of the
null hypothesis would
not change the main results.
The procedures adopted here are obviously only ad hoc
choices; and the
problem of finding the appropriate procedure in situations like
this deserves more
study.19
B. Stability Over Time and Lag Length
The six data series used in the model for each country-money,
116. real GNP,
unemployment, wages, price level, and import prices-are
defined in detail in the
data appendix. Each series except unemployment was logged,
and the regressions
all included time trends. For Germany, but not the U.S.,
seasonal dummy
variables were included. Most but not all the series were
seasonally adjusted. The
period of fit was 1958-76 for Germany, 1949-75 for the U.S.
The estimated general vector autoregressions were initially
estimated with lag
lengths of both four and eight, and the former specification was
tested as a
restriction of the latter. In both countries the shorter lag length
was acceptable.
The X2(144) = 166.09 for the U.S. and X2(144) = 142.53 for
the F.R.G. The
corresponding significance levels are about 0.20 and 0.50. In
117. all later work the
shorter lag length was used.
The sample-split tests we are about to consider were all
executed by adding a set
of dummy variables to the right-hand-side of all regressions in
the system,
accounting for all variation within the period being tested. The
likelihood ratio
statistic was then formed as described in Section 2.A,
comparing the fit of the
system with and without these dummy variables. Because non-
normal residuals
are not "averaged" in forming such a test statistic, the statistics
are probably
biased against the null hypothesis when degrees of freedom in
the test statistic are
small. On the other hand, they are probably biased in favor of
the null hypothesis
when the degrees of freedom approach half the sample size, at
least when
118. compared with the single-equation F statistic.
For both West German and U.S. data, splitting the sample at
1965 (with the
dummy variables applied to the post-65 period) shows no
significant difference
between the two parts of the sample. However, again for both
countries, splitting
19 Some readers have questioned the absence in this paper of a
list of coefficients and standard
errors, of the sort usually accompanying econometric reports of
regression estimates. The autoregres-
sive coefficients themselves are difficult to interpret, and
equivalent, more comprehensible, informa-
tion is contained in the MAR coefficients, which are presented
in the charts. Because estimated AR
coefficients are so highly correlated, standard errors on the
individual coefficients provide little of the
sort of insight into the shape of the likelihood we ordinarily try
to glean from standard errors of
regression coefficients. The various x2 tests on block
triangularity restrictions which are presented
below provide more useful information. However, it must be
119. admitted that it would be better were
there more emphasis on the shape of the sum-of-squared-
residuals function around the maximum than
is presented here. Ideally, one would like to see some sort of
error bound on the MAR plots, for
example; I have not yet worked out a practical way to do this.
This content downloaded from 131.162.129.148 on Wed, 08 Feb
2017 17:04:51 UTC
All use subject to http://about.jstor.org/terms
MACROECONOMICS AND REALITY 19
the sample at the first quarter of 1971 or 1958 (using dummy
variables for the
smaller segment of the sample) shows a significant difference
between the two
parts of the sample. For the 1971 split the marginal
significance levels of the test
are 0.003 for Germany and less than 10-4 for the U.S.
However, as can be seen
from Table I, in both countries the difference between periods
is heavily concen-
120. trated in the equation for price of imports. Testing the five
other equations in the
system, treating the import variable as predetermined, yields
marginal
significance levels of 0.07 for the U.S. and 0.15 for
Germany.20
TABLE I
TESTS FOR MODEL HOMOGENEITY: 1953-1971 vs. 1972-
1976
(Germany); 1949-1971 vs. 1972-1975 (U.S.)a
Equation U.S. Germany
M F(16, 54) 1.84 F(20,47) - 2.88
RGNP = 1.10 = 1.94
U = .92 = .76
W = .61 = .42
P = 1.75 = .74
PM = 5.10 = 4.10
Overall 2 (96) = 160.05 '2(120) = 170.76
first five