Skip to main content
Log in

Sticky information and inflation persistence: evidence from the U.S. data

  • Published:
Empirical Economics Aims and scope Submit manuscript

Abstract

This paper analyzes the relationship between sticky information and inflation persistence by implementing a novel approach to estimate the Sticky Information Phillips Curve (SIPC). The degree of sticky information is estimated using a GMM estimator that matches the covariance between inflation and the shocks that affect firms’ pricing decisions. Although the SIPC contains an infinite number of terms, the theoretical covariances derived from the model have finite dimensions, thus allowing the estimation of the structural parameters without any truncation of the original model. This work shows that sticky information is significantly different if the model is estimated by matching inflation persistence or inflation variance. Previous empirical literature found that the SIPC model does not provide an accurate representation of the US postwar inflation. This paper qualifies such a finding by demonstrating that the SIPC is able to match the inflation persistence only at the cost of mismatching the inflation variance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. Mankiw and Reis (2002) presented the SIPC as an alternative (theory of inflation) to the New Keynesian Phillips Curve (NKPC), which was criticized because of its lack of intrinsic persistence. Such criticisms pointed out that (i) the actual inflation responds gradually to monetary policy shocks, whereas the NKPC implies an immediate adjustment; (ii) output losses typically accompany a reduction in inflation, but this is not true with the NKPC; (iii) the NKPC implies that announced disinflation causes a boom, but in the actual economy the opposite is true. In general, if we assume that firms maximize profits and have rational expectations, then they would react to any exogenous shock by adjusting their prices as soon as they acquire information on the incoming shock and in each period, they would acquire all of the information available to identify new shocks. Hence, the effect of a shock on prices lasts for few periods and as a result, the only source of persistence in inflation dynamics is the one of exogenous shocks (e.g., cost push shocks, monetary policy shocks, demand shocks).

  2. Note that for \(\lambda =1\), the SIPC encompasses the rational expectations model with monopolistic competition and flexible prices, which has repeatedly been shown in the literature to generate little endogenous persistence in inflation dynamics.

  3. The whole estimation procedure is based on Molinari (2007).

  4. Following MR, the SIPC is simulated by assuming that the growth rate of nominal output, i.e., \(\Delta m_{t}=\Delta y_{t}+\pi _{t}\), is exogenous and follows an autoregressive process of order 1. As a result, inflation from the SIPC can be easily simulated using its moving average representation as function of the exogenous errors in the autoregressive process. The calibration of the autoregressive coefficient and the variance of the exogenous shocks are taken from Reis (2006).

  5. The average duration of information is \(D=\left( 1-(1-\lambda )\right) ^{-1}\). Thus, in the quarterly model used by MR, \(\lambda =0.25\) implies \(D=4\), i.e., four quarters.

  6. Equation (3) follows after multiplying (2) by \(\left( \delta \varepsilon _{t-i}\right) ^{\prime }\) and taking the expectation conditional on the information at time \(t\). This calculation uses the fact that the errors are uncorrelated, i.e., \(E\left[ \varepsilon _{t}\varepsilon _{t-i}\right] =0\) for \(i\ge 1\). In Appendix B, the orthogonality conditions (3) are referred to as \(g_{1,t}\).

  7. Given that \(Z_{t}\) is ergodic (see Proposition 1), then \(E\left[ \delta Z_{t}\cdot (\delta \varepsilon _{t-i})^{\prime }\right] =E\left[ \delta Z_{t-i}\cdot (\delta \varepsilon _{t})^{\prime }\right] \).

  8. If \(\left\{ \widehat{\varepsilon }_{t},\widehat{A}_{i},\Sigma _{T}\right\} \) are consistent estimates of \(\left\{ \varepsilon _{t},A_{i},\Sigma \right\} \), it can be shown that the sample analog:

    $$\begin{aligned} \frac{1}{T}\sum _{t=1}^{T}\left[ \left( \frac{\alpha \lambda }{1-\lambda } y_{t}+\alpha \Delta y_{t}\right) \left( \delta \widehat{\varepsilon } _{t-i}\right) ^{\prime }-\left( 1-\lambda \right) ^{i}\delta \widehat{A} _{i}\Sigma _{T}\delta ^{\prime }\right] =0 \end{aligned}$$

    converges almost surely to the population moment (3).

  9. This follows immediately from the fact that the quadratic terms in (4) are all positive definite. Note that there is only one additional task required to compute (4) beyond the GMM estimation of (3), i.e., one needs to derive \(\frac{\partial g_{1,t}}{\partial \beta ^{\prime }}\) and to evaluate it at \(\{\beta _{T}^\mathrm{VAR},\theta _{T}^{2s}\}\). All of the other terms that appear in (4) are already computed in the two-stage estimation. \(\frac{\partial g_{1,t}}{\partial \lambda }\) and \(\Sigma _{g_{1}}^{-1}\) are delivered by the GMM algorithm that computes \(\lambda _{T}^{2s}\), and \(\left( E\frac{\partial g_{2,t}^{\prime }}{\partial \beta }\Sigma _{g_{2}}^{-1} E\frac{\partial g_{2,t}}{\partial \beta ^{\prime }}\right) \), the VCV matrix of VAR coefficients, is delivered by the algorithm that estimates the VAR(p) model.

  10. See Sect. 4.2.

  11. All data used in this paper come from the FRED-II database issued by the Federal Reserve Bank of St. Louis.

  12. The output gap is the real GDP detrended with the HP filter.

  13. The GMM model has a total of seven orthogonality conditions, including the contemporaneous moment (\(i=0\)). Results using a higher number of o.c. are quite similar to those reported and are available upon request.

  14. In his baseline estimation, Coibion (2010) estimated \(\alpha \) and \(\lambda \) jointly, finding that they both are not significant. That result, however, is obtained using either subjective or out-of-sample VAR forecasts to generate the expectation terms present in the SIPC. When he uses in-sample forecasts (from a single VAR estimated over the entire period as the one estimated in this paper) instead, he finds estimates of \(\lambda \) significant and in the range [0.37, 0.42].

  15. Kiley (2006) and Korenok (2008) express the SIPC in terms of marginal cost; therefore, \(\alpha \) does not appear in their estimated specifications.

  16. See Reis (2006) and Woodford (2003), ch. 2.

  17. For a more precise comparison, I estimated \(\lambda \) using the same information as used in previous papers, which turns out to be the information contained in the covariances between inflation and the first \(L\) lags of \(\delta Z_{t-i}\). In this case, using Eq. (2) and lagged \(\delta Z_{t-i}\) as instruments, I derive and estimate the following orthogonality conditions:

    $$\begin{aligned} E\left[ \left( \frac{\alpha \lambda }{1-\lambda }y_{t}+\alpha \Delta y_{t}\right) \left( \delta Z_{t-i}\right) ^{\prime }\right] = \sum _{j=0}^{\infty }\left( 1-\lambda \right) ^{i+j}\delta A_{i+j}\Sigma A_{j}^{\prime }\delta ^{\prime } \quad \mathrm{for}\; i =1,\ldots ,L \end{aligned}$$
    (6)

    where the infinite summation on the RHS of (6) is simulated at each step of the GMM algorithm, replacing \(\infty \) with \(J_{max}=120\). In the estimation of (6), \(\lambda _{T}^{2s}\) ranges between \(\left[ 0.30,0.41\right] \), a result in line with that obtained using lagged \(\varepsilon _{t}\) as instruments and very close to the estimates of Khan and Zhu (2006), Kiley (2006) and Korenok (2008).

  18. The min RMSE VAR uses five extra variables to predict \(\pi _{t}\) and \(\Delta y_{t}\) with respect to the baseline VAR. As a result, the residuals \(\widehat{\varepsilon }_{t}\) from the min RMSE VAR are smaller than the ones of the baseline VAR.

  19. Following similar GMM estimates in the literature, e.g., Galí and Gertler (1999), I use 19 instruments: a constant, 4 lags of inflation, 4 lags of output gap, and 2 lags of unemployment rate, interest rate, marginal cost, money growth, and term spread.

  20. In this case, the GMM point estimates of \(\lambda _{T}^{2s}\) coincides with the estimate of the non-linear IV estimator, but with smaller variance.

  21. I also estimated the model with \(\alpha = 0.15\) to control for a possible non-monotonic effect of \(\alpha \) on \(\lambda _{T}^{2s}\) within the \(\left[ 0.1,0.2\right] \) interval. These results are in line with those for \(\alpha =0.1\) presented in Table 8, and are available upon request.

  22. The AR(2) model is extensively employed in the literature as a univariate benchmark model to predict inflation, e.g., in Khan and Zhu (2006) or Batchelor (1982), because it is the simplest univariate model with sufficiently rich dynamics to represent the actual inflation series.

  23. See Angeloni et al. (2006).

  24. This follows immediately from (3). If inflation persistence increases because of adaptive agents but structural \(\lambda \) remains constant, then in the misspecified o.c. (3) the RHS increases, and the estimated \(\lambda ^{2s}_{T}\) must decrease for the equality to hold.

  25. Dupor et al. (2005) developed a similar model. They proposed a model of “dual stickiness” in which producers change prices sporadically and absorb the relevant information for price setting in random periods, as in the SIPC. Basically, Dupor–Kitamura–Tsuruga nested together Calvo’s sticky price framework with the sticky information of MR. As a result, inflation in period \(t\) is a function of all past periods’ expectations of future variables indexed from \(t+1\) onwards. The econometric strategy presented in Sect. 2.2 is computationally burdensome when applied to such a model, and therefore, in this paper I do not estimate Dupor–Kitamura–Tsuruga model but rather derive and estimate an alternative model with sticky information and adaptive agents.

  26. The reduced form VAR(p) model used in Sect. 3.1 encompasses both the SIPC model and the hybrid versions derived in this section.

  27. Andrews’s test looks for structural breaks with unknown timing by cutting the tails of the sample and computing recursively for each observation in the remaining middle part which is the most likely period where a break might have occurred.

  28. The oil shock occurred in the middle of the sample and lasted sufficiently long to be included in each of the two sub-sample statistics calculated by Andrews’s test. Thus, if the effect of the oil shock were sufficiently large, any change in the inflation dynamics of one sub-sample, due to variations in the degree of sticky information, would not be detected by the test.

  29. The VAR(p) model (5) is assumed to have errors \(\varepsilon _{t}\sim i.i.d. N\left( 0,\Sigma \right) \) and is estimated least square equation by equation.

References

  • Andrews D (1993) Test for parameter instability and structural change with unknown change point. Econometrica 61(4):821–856

    Article  Google Scholar 

  • Angeloni I, Aucremanne L, Ehrmann M, Galí J, Levin A, Smets F (2006) New evidence on inflation persistence and price stickiness in the Euro area: implications for macro modeling. J Eur Econ Assoc 4(2–3):562–574

    Article  Google Scholar 

  • Batchelor RA (1982) Expectations, output and inflation: the European experience. Eur Econ Rev 17(1): 1–25. http://www.sciencedirect.com/science/article/pii/0014292182900393

  • Bayoumi T, Sgherri S (2004) Monetary magic? How the fed improved the flexibility of the U.S. economy. IMF wp 04–24

  • Christiano L, Eichenbaum M, Evans C (2005) Nominal rigidities and the dynamic effects of a shock to monetary policy. J Polit Econ 113(1):1–45

    Article  Google Scholar 

  • Cogley T, Primicieri GE, Sargent TJ (2010) Inflation-gap persistence in the US. Am Econ J: Macroecon 2(1):43–69

    Google Scholar 

  • Coibion O (2010) Testing the sticky information Phillips curve. Rev Econ Stat 92:87–101

    Article  Google Scholar 

  • Coibion O, Gorodnichenko Y (2012) What can survey forecasts tell us about information rigidities? J Polit Econ 120:116–159

    Article  Google Scholar 

  • Dupor B, Kitamura T, Tsuruga T (2005) Do sticky prices need to be replaced with sticky information? IMES Discussion Paper Series no. 2006-E-23, Bank of Japan

  • Fischer S (1977) Long-term contracts, rational expectations, and the optimal money supply rule. J Polit Econ 85:191–205

    Article  Google Scholar 

  • Galí J, Gertler M (2006) Inflation dynamics: a structural econometric analysis. J Monet Econ 44:195–222

    Article  Google Scholar 

  • Khan H, Zhu Z (2006) Estimates of the sticky-information Phillips curve for the United States. J Money Credit Bank 38(1):195–207

    Article  Google Scholar 

  • Kiley MT (2006) A quantitative comparison of sticky-price and sticky-information models of price setting. Finance and Economics Discussion series no. 2006–2045, Federal Reserve Board

  • Korenok O (2008) Empirical comparison of sticky price and sticky information models. J Macroecon 30(3):906–927

    Article  Google Scholar 

  • Laforte J-P (2007) Pricing models: a Bayesian DSGE approach for the US. J Money Credit Bank 39:127–154

    Article  Google Scholar 

  • Lucas RE Jr (1973) Some international evidence on output-inflation tradeoffs. Am Econ Rev 63:326–334

    Google Scholar 

  • Mankiw GN, Reis R (2002) Sticky information versus sticky prices: a proposal to replace the new Keynesian Phillips curve. Q J Econ 17(4):1295–1328

    Article  Google Scholar 

  • Mankiw GN, Reis R (2007) Sticky information in general equilibrium. J Eur Econ Assoc 5:603–613

    Article  Google Scholar 

  • Molinari B (2007) The role of sticky information in inflation dynamics: estimates and findings. Ente Luigi Einaudi Temi di Ricerca no. 50

  • Reis R (2006) Inattentive producers. Rev Econ Stud 73:793–821

    Article  Google Scholar 

  • Stock J, Watson M (2003) Forecasting output and inflation: the role of asset prices. J Econ Lit XLI:788–829

    Article  Google Scholar 

  • Wang P, Wen Y (2006) Solving linear difference systems with lagged expectations by a method of undetermined coefficients. Federal Reserve Bank of St. Louis, Working Paper 2006–003C

  • Woodford M (2003) Interest and prices. Princeton University Press, Princeton

    Google Scholar 

Download references

Acknowledgments

I am grateful to Albert Marcet for valuable comments and discussion regarding this paper. I also thank Klaus Adam, Luca Gambetti, Pau Rabanal, Jaume Ventura, Michael Reiter, Wouter den Haan, an anonymous referee, and the participants at the UPF macroeconomic seminars, the Asset 2007 Conference, and the 8th Annual Bank of Finland/CEPR Conference “Expectations and the Business Cycle” for useful comments. Any remaining errors are my responsibility. Financial support from Ente Luigi Einaudi (EIEF) is gratefully acknowledged.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Benedetto Molinari.

Appendices

A Proof of Proposition 1

Denoting the \(j\)-periods-ahead forecast error as \(\displaystyle \varepsilon _{t\mid t-j}^{F}=Z_{t}-E\left[ Z_{t}\mid \Omega _{t-j}\right] \), the SIPC can be written as:

$$\begin{aligned} \pi _{t}=\frac{\alpha \lambda }{1-\lambda }y_{t}+\frac{\lambda }{1-(1-\lambda )} \delta Z_{t} -\lambda \sum _{j=0}^{\infty }\left( 1-\lambda \right) ^{j}\delta \varepsilon _{t\mid t-j-1}^{F} \end{aligned}$$
(14)

where \(Z_{t}\) is any covariance stationary vector of variables that includes inflation and output gap, and \(\delta \) is a \(\left( 1\times n\right) \) row vector of zeros and constants that picks \(\left( \pi _{t}+\alpha \Delta y_{t}\right) \) within \(Z_{t}\). From the definition of \(\delta Z_{t}\), we immediately see that Eq. (14) implies

$$\begin{aligned} \frac{\alpha \lambda }{1-\lambda }y_{t}+\alpha \Delta y_{t}=\lambda \sum _{j=0}^{\infty }\left( 1-\lambda \right) ^{j}\delta \varepsilon _{t\mid t-j-1}^{F} \end{aligned}$$
(15)

Using the Wold decomposition of \(Z_{t}\), each forecast error can be written as:

$$\begin{aligned} \varepsilon _{t\mid t-j-1}^{F}=\sum _{i=0}^{j}A_{i}\varepsilon _{t-i} \end{aligned}$$
(16)

Next, using (16) to substitute for \(\varepsilon _{t\mid t-j-1}^{F}\) in (15), we obtain:

$$\begin{aligned} \frac{\alpha \lambda }{1-\lambda }y_{t}+\alpha \Delta y_{t}=\lambda \sum _{j=0}^{\infty }\left( 1-\lambda \right) ^{j}\delta \sum _{i=0}^{j}A_{i}\varepsilon _{t-i} \end{aligned}$$
(17)

Moreover, notice that:

$$\begin{aligned}&\sum _{j=0}^{\infty }\left( 1-\lambda \right) ^{j}\delta \sum _{i=0}^{j}A_{i}\varepsilon _{t-i}\nonumber \\&\quad =\left( \delta \varepsilon _{t}+\left( 1-\lambda \right) \delta \varepsilon _{t} + \left( 1-\lambda \right) ^{2}\delta \varepsilon _{t}+\ldots \right) \nonumber \\&\qquad \quad + \left( \left( 1-\lambda \right) \delta A_{1}\varepsilon _{t-1}+\left( 1-\lambda \right) ^{2} \delta A_{1}\varepsilon _{t-1}+\ldots \right) + \ldots \nonumber \\&\quad =\frac{1}{1-\left( 1-\lambda \right) }\sum _{i=0}^{\infty }\left( 1-\lambda \right) ^{i}\delta A_{i}\varepsilon _{t-i} \end{aligned}$$
(18)

Hence, by plugging (18) into (17), we obtain

$$\begin{aligned} \frac{\alpha \lambda }{1-\lambda }y_{t}+\alpha \Delta y_{t}=\sum _{i=0}^{\infty }\left( 1-\lambda \right) ^{i}\delta A_{i}\varepsilon _{t-i} \end{aligned}$$

which proves the proposition.

B Proof of Proposition 2

Let \(g_{1,t}\) be the set of orthogonality conditions (3), i.e.,

$$\begin{aligned} E[g_{1,t}]\equiv E\left[ \underset{m\times 1}{g_{1} \left( \lambda ,\beta _{T}^\mathrm{VAR},Y_{t} \right) }\right] = 0 \end{aligned}$$
(19)

where \(m=L+1\) is the number of orthogonality conditions, \(\beta \!=\!\mathrm{vec}\!\left( \left[ B_{1}^{\prime },\ldots ,B_{p}^{\prime }\right] ^{\prime }\right) , B_{j}\) are the matrices of parameters defined in (5), vec\(\left( \cdot \right) \) is the column stacking operator, and \(Y_{t}=\{y_{t},y_{t-1}\}\). Using the same notation, let \(g_{2,t}\) be the orthogonality conditions used to estimate the VAR(p) model (5), i.e.,

$$\begin{aligned} E\left[ \underset{kn\times 1}{g_{2}\left( \beta ,X_{t}\right) }\right] =0 \end{aligned}$$
(20)

where \(X_{t}\) are the \(n\) endogenous variables of the VAR(p), and \(k=np+1\).Footnote 29

The correct variance of \(\lambda _{T}^{2s}\) should take into account the variance of the stochastic regressors \(\beta _{T}^\mathrm{VAR}\), whereas the two-stage estimator mistreats \(\beta _{T}^\mathrm{VAR}\) as fixed variables, therefore failing to account for the volatility of \(X_{t}\) when computing the variance of \(\lambda _{T}^{2s}\). To overcome this problem, I first construct a GMM model that jointly estimates \(\{\lambda ,\,\beta \}\) pooling together the orthogonality conditions \(\{g_{1,t}, g_{2,t}\}\). Next, I show that the point estimates of this estimator coincide with that of the two-stage estimator, even though the variances are different because the variance of \(\lambda _{T}\) correctly incorporates the volatility of \(X_{t}\) through the effect of \(\beta \) on \(g_{1,t}\), whereas the variance of \(\lambda _{T}\) does not. Thus, I calculate the variance of \(\theta _{T}\) in the pooled model and use it as the adjusted variance of \(\theta _{T}^{2s}\).

The pooled GMM model

The vector of parameters \(\left\{ \lambda , \beta \right\} \) can be jointly estimated by GMM using the pooled vector of o.c.

$$\begin{aligned} E\left[ \begin{array}{l} \underset{m\times 1}{g_{1,t}} \\ \underset{k n\times 1}{g_{2,t}} \end{array}\right] =0 \end{aligned}$$
(21)

where (19) and (20) are stacked column-wise, with a weighting matrix \(W\) equal to:

$$\begin{aligned} W=\left( \begin{array}{ll} \Sigma _{g_{1}}^{-1} &{} O \\ O^{\prime } &{} I_{k n} \end{array}\right) \end{aligned}$$
(22)

The equivalence result

Denote the one-stage estimator of \(\lambda \) from the pooled model (21) as \(\lambda _{T}\), and note that \(\displaystyle \frac{\partial E\left[ g_{2}\left( \beta ,X_{t}\right) \right] }{\partial \lambda }=0\) by construction and that the weighting matrix (22) is block diagonal. In this case two conditions hold. First, the objective function of the GMM estimation of (21), \(\displaystyle J(\lambda _{T})=E(g_{1,t})\cdot \Sigma _{g_{1}}^{-1}\cdot E(g_{1,t}^{\prime })\), exactly coincides with the objective function of the two-stage GMM estimator \(\lambda _{T}^{2s}\) that uses the inverse of its VCV matrix, \(\Sigma _{g_{1}}^{-1}\), as the (optimal) weighting matrix. Second, the estimates \(\beta _{T}\) coincide with the VAR model estimation, i.e., the equation by equation estimation of (20), which are the regressors used to estimate \(\lambda _{T}^{2s}\) in the two-stage estimation.

Calculation of the variance of \(\lambda _{T}\)

Under the assumption that the covariance between \(g_{1,t}\) and \(g_{2,t}\) is zero, \(W\) is the optimal weighting matrix for the model (21), and accordingly, the VCV matrix of \(\theta _{T}\) is:

$$\begin{aligned} V\left( \begin{array}{l} \theta _{T} \\ \beta _{T} \end{array} \right) \!=\!\left[ T\cdot \underset{\equiv G}{\underbrace{E\left( \frac{\partial \left( \begin{array}{l} g_{1,t} \\ g_{2,t} \end{array} \right) }{\partial \left( \begin{array}{cc} \lambda&\beta ^{\prime } \end{array} \right) }\right) }}^{\prime }\underset{\equiv \Omega }{\underbrace{\left( E\left( \begin{array}{l} g_{1,t} \\ g_{2,t} \end{array}\right) \left( \begin{array}{ll} g_{1,t}^{\prime }&g_{2,t}^{\prime } \end{array} \right) \right) }}^{-1}E\left( \frac{\partial \left( \begin{array}{l} g_{1,t} \\ g_{2,t} \end{array} \right) }{\partial \left( \begin{array}{cc} \lambda&\beta ^{\prime } \end{array} \right) }\right) \right] ^{-1} \nonumber \\ \end{aligned}$$
(23)

where \(G\) is a \(\left( k n+m\right) \times \left( k n+1\right) \) matrix. The variance (23) can be written as:

$$\begin{aligned} V\left( \begin{array}{l} \lambda _{T} \\ \beta _{T} \end{array} \right) =\left[ E\left( \begin{array}{ll} \frac{\partial g_{1}^{\prime }}{\partial \lambda } &{} O^{\prime } \\ \frac{\partial g_{1}^{\prime }}{\partial \beta } &{} \frac{\partial g_{2}^{\prime }}{\partial \beta } \end{array} \right) \left( \begin{array}{ll} \Sigma _{g_{1}}^{-1} &{} O^{\prime } \\ O &{} \Sigma _{g_{2}}^{-1} \end{array} \right) E\left( \begin{array}{ll} \frac{\partial g_{1}}{\partial \lambda } &{} \frac{\partial g_{1}}{\partial \beta ^{\prime }} \\ 0 &{} \frac{\partial g_{2}}{\partial \beta ^{\prime }} \end{array} \right) \right] ^{-1}\Bigg /T \end{aligned}$$

where \(\displaystyle \Sigma _{g_{h}}=E\left( g_{h,t}\cdot g_{h,t}^{\prime }\right) \) for \(h=\{1,2\}\) and \(O\) denotes a matrix of zeros of needed dimensions. After some algebra manipulation, this variance can be written as:

$$\begin{aligned} V\left( \begin{array}{l} \lambda _{T} \\ \beta _{T} \end{array} \right) =\left( \begin{array}{cc} E\frac{\partial g_{1,t}^{\prime }}{\partial \lambda }\Sigma _{g_{1}}^{-1}E \frac{\partial g_{1,t}}{\partial \lambda } &{} E\frac{\partial g_{1,t}^{\prime }}{\partial \lambda }\Sigma _{g_{1}}^{-1}E\frac{\partial g_{1,t}}{\partial \beta ^{\prime }} \\ E\frac{\partial g_{1,t}^{\prime }}{\partial \beta }\Sigma _{g_{1}}^{-1}E \frac{\partial g_{1,t}}{\partial \lambda } &{} E\frac{\partial g_{1,t}^{\prime }}{\partial \beta }\Sigma _{g_{1}}^{-1}E\frac{\partial g_{1,t}}{\partial \beta ^{\prime }}+E\frac{\partial g_{2,t}^{\prime }}{\partial \beta }\Sigma _{g_{2}}^{-1}E\frac{\partial g_{2,t}}{\partial \beta ^{\prime }} \end{array} \right) ^{-1}\Bigg /T\nonumber \\ \end{aligned}$$
(24)

In particular, using the simplifying formula for the inverse of partitioned matrices, the variance of \(\lambda _{T}\) from Eq. (24) can be written as:

$$\begin{aligned} V(\lambda _{T})&= \left( \left( TV_\mathrm{na}\Bigg (\lambda _{T}^{2s})\right) ^{-1}-E\frac{ \partial g_{1,t}^{\prime }}{\partial \lambda }\Sigma _{g_{1}}^{-1}E\frac{ \partial g_{1,t}}{\partial \beta ^{\prime }}\left( E\frac{\partial g_{1,t}^{\prime }}{\partial \beta }\Sigma _{g_{1}}^{-1}E\frac{\partial g_{1,t}}{\partial \beta ^{\prime }}\right. \right. \nonumber \\&\left. \left. +E\frac{\partial g_{2,t}^{\prime }}{ \partial \beta }\Sigma _{g_{2}}^{-1}E\frac{\partial g_{2,t}}{\partial \beta ^{\prime }}\right) ^{-1}\cdot E\frac{\partial g_{1,t}^{\prime }}{\partial \beta }\Sigma _{g_{1}}^{-1}E\frac{\partial g_{1,t}}{\partial \lambda }\right) ^{-1}\Bigg /T \end{aligned}$$
(25)

where \(\displaystyle V_\mathrm{na}(\lambda _{T}^{2s})=\left( T\cdot E\frac{\partial g_{1,t}^{\prime }}{\partial \lambda }\Sigma _{g_{1}}^{-1}E\frac{\partial g_{1,t}}{\partial \lambda }\right) ^{-1}\) is the non-adjusted variance of \(\lambda _{T}^{2s}\) computed by the algorithm of the two-stage GMM estimation. Equation (25) proves the proposition.

Robustness analysis: tables

See Tables 8, 9, 10, 11, 12 and 13.

Table 8 GMM estimates of \(\lambda \): fixed \(\alpha = 0.1\)
Table 9 GMM estimates of \(\lambda \): QD filter
Table 10 GMM estimates of SIPC parameters: subjective expectations
Table 11 GMM estimates of the SIPC parameters: Real-time data
Table 12 GMM estimates of the hybrid SIPC model
Table 13 One-stage GMM estimator: the pooled model

Rights and permissions

Reprints and permissions

About this article

Cite this article

Molinari, B. Sticky information and inflation persistence: evidence from the U.S. data. Empir Econ 46, 903–935 (2014). https://doi.org/10.1007/s00181-013-0700-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00181-013-0700-y

Keywords

Navigation