1 Introduction

The so-called hybrid new Keynesian Phillips curve (NKPC) for inflation receives quite some attention in the recent literature. Mavroeidis et al. (2014) provide an excellent survey, and earlier substantive accounts can be found in Calvo (1983), Gali (2008) and Woodford (2003). A key feature in this literature is the inclusion of inflation expectations in empirical models. Mavroeidis et al. (2014) provide an overview of various ways to include those expectations, which range from the inclusion of various observable variables to survey-based expectations, see for example Preston (2005) for an interesting account on surveys data. To set matters more precise: when considering annual data, a basic version of a NKPC model for this year’s inflation includes as explanatory variables the one-year lagged inflation, a measure of marginal costs, and the expected value for next year’s inflation. The present paper focuses on this last variable in this basic type of model.

In one part of the literature on the NKPC, notably Lanne and Luoto (2013), Gali and Gertler (1999), and Gali et al. (2005), among others, a key assumption is rational expectations. There it is commonly assumed that the expected value of next year’s inflation is equal to the realized value of inflation in the next year plus the forecast error for next year. Substituting the latter two terms in the model renders a model for this year’s inflation being a function of one-year-ahead and one-year lagged inflation, and the measure for marginal costs. Abstaining from the latter variable, in time series terms, this model for inflation provides an exciting opportunity to include the past and the future.

In the present paper, I argue that this assumption made in the RE literature is problematic for at least two reasons. First, the well-known Wold decomposition theorem says that any time series is decomposable into a deterministic component (like a constant and a trend) and a weighted sum of all current and past shocks (or prediction errors). More precise for the case at hand, it is thus impossible that a current observation on inflation depends on a shock to inflation in the future. Of course, if an event will take place, with known impact on inflation (think of a devaluation of a currency), it can be included. Note that it is then not a shock but a deterministic event. Expert adjustment of forecasts typically involves such actions. The second reason why the assumption in the RE literature is problematic is that by construction the error term in the estimable version of the NKPC model is perfectly correlated with one of the regressors. Indeed, again Wold’s decomposition theorem tells us that the expected value of the covariance between the current observation and the current shock is equal to the variance of the shocks. To “instrument away” this nonzero covariance may be a difficult task.

In this paper, I propose an alternative approach to including the expected value of inflation. One could perhaps think of including this year’s inflation rate or past year’s inflation rate, but this leads to the un-identifiability of the parameter associated with the expectations. It seems better to resort to a MIDAS-type approach, which here entails that the forecast for next year’s inflation is created based on information that comes in as the current year proceeds. That information can be based on all kinds of monthly, or even weekly and daily sources. Think, for example, of daily observable hotel prices via booking sites, or flight ticket prices. Just as an illustration, I will simply use monthly inflation figures, inspired by Frijns and Margaritis (2008) who use early-in-the-day volatility estimates to predict overall daily volatility. The MIDAS approach means that each month, one can update the NKPC model and re-estimate its parameters, because new information comes in.

The outline of this letter is as follows. Section 2 formalizes the notions above using simple expressions. Section 3 provides an illustration for USA inflation data and shows, for example, that only using the first 4 months of the year with inflation data delivers already quite accurate predictions. Section 4 concludes.

2 An analysis

This section shows that replacing the expected value of next year’s inflation by the realized value of inflation in the next year plus the forecast error for next year might not be a good idea. Next, I provide alternative approaches, where the one based on MIDAS-type modeling seems most useful.

Denote \( \pi_{t} \) as the annual inflation rate, \( x_{t} \) as a measure of marginal costs, and \( E_{t} \pi_{t + 1} \) as the one-year-ahead expected value of inflation made at time t. The hybrid new Keynesian Phillips curve (NKPC) can be summarized as

$$ \pi_{t} = \mu + \alpha E_{t} \pi_{t + 1} + \beta \pi_{t - 1} + \gamma x_{t} $$
(1)

See equation (4) in Lanne and Luoto (2013). To estimate the parameters in this NKPC model, one needs to replace the unobserved variable \( E_{t} \pi_{t + 1} \) by an observable variable. In the rational expectations (RE) literature, it is custom to assume that

$$ E_{t} \pi_{t + 1} = \pi_{t + 1} + \omega_{t + 1} $$

See Lanne and Luoto (2013, p. 564). This results in an estimable version of the NKPC model like

$$ \pi_{t} = \mu + \alpha \pi_{t + 1} + \beta \pi_{t - 1} + \varepsilon_{t} $$
(2)

with

$$ \varepsilon_{t} = \alpha \omega_{t + 1} + \gamma x_{t} $$

The key assumption in the RE literature is that \( \alpha \omega_{t + 1} \) is distributed as independently and identically (IID) over time. With this assumption, Lanne and Luoto (2013) use Maximum Likelihood to estimate the parameters in (2).

There is, however, a problem with this approach. The IID assumption of \( \alpha \omega_{t + 1} \) may perhaps be defendable, but the key issue is that the error term in (2) is not independent from one of the regressors. In fact,

$$ E\left( {\pi_{t + 1} ,\varepsilon_{t} } \right) = \alpha E\left( {\pi_{t + 1} \omega_{t + 1} } \right) + \gamma E\left( {\pi_{t + 1} x_{t} } \right) $$

First, one may wonder whether \( E\left( {\pi_{t + 1} x_{t} } \right) \) is equal to 0, and that seems hard to verify using empirical data. Certainly, it holds that

$$ E\left( {\pi_{t + 1} \omega_{t + 1} } \right) = E(E_{t} \pi_{t + 1} - \omega_{t + 1} ,\omega_{t + 1} ) = - \sigma_{\omega }^{2} \ne 0 $$

This makes a regressor and the error term in (2) to be correlated.

This insight basically follows from the familiar Wold decomposition theorem. This theorem says that any time series \( y_{t} \) can be written as the sum of a deterministic component \( \mu_{t} \), including, for example, a constant and a trend, and a component that includes a weighted average of current and past shocks \( \varepsilon_{t} \), that is,

$$ y_{t} = \mu_{t} + \theta_{0} \varepsilon_{t} + \theta_{1} \varepsilon_{t - 1} + \theta_{2} \varepsilon_{t - 2} + \cdots = \mu_{t} + \mathop \sum \limits_{i = 0}^{\infty } \theta_{i} \varepsilon_{t - i} $$

Usually, \( \theta_{0} \) is set equal to 1. Given availability of past shocks and the parameters for these past shocks and the deterministic terms, it is clear that the Wold decomposition implies that

$$ E_{t} y_{t + 1} = \mu_{t + 1} + \mathop \sum \limits_{i = 1}^{\infty } \theta_{i} \varepsilon_{t + 1 - i} $$

Hence, as \( \mu_{t + 1} \) is deterministic and perfectly forecastable,

$$ y_{t + 1} = E_{t} y_{t + 1} + \varepsilon_{t + 1} $$

In words, a time series is the sum of a predictable part and an unpredictable part, where the latter is also called the forecast error. This expression also shows that

$$ E\left( {y_{t + 1} \varepsilon_{t + 1} } \right) = E\left( {E_{t} y_{t + 1} + \varepsilon_{t + 1} ,\varepsilon_{t + 1} } \right) = 0 + E\left( {\varepsilon_{t + 1}^{2} } \right) = \sigma_{\varepsilon }^{2} $$

So, replacing \( E_{t} \pi_{t + 1} \) in (1) by

$$ E_{t} \pi_{t + 1} = \pi_{t + 1} + \omega_{t + 1} $$

does not seem the best option.

What then could we do? Let us go back to

$$ \pi_{t} = \mu + \alpha E_{t} \pi_{t + 1} + \beta \pi_{t - 1} + \gamma x_{t} $$

One may now decide to replace \( E_{t} \pi_{t + 1} \) by \( \pi_{t} \). This is also not a good idea for two reasons. First, during a year, there is no information on \( \pi_{t} \) in that total year, and only at the end of the year, we know this year’s annual inflation rate. The second reason is that then (1) becomes

$$ \pi_{t} = \mu + \alpha \pi_{t} + \beta \pi_{t - 1} + \gamma x_{t} $$

This is equivalent to

$$ \pi_{t} = \frac{\mu }{1 - \alpha } + \frac{\beta }{1 - \alpha }\pi_{t - 1} + \frac{\gamma }{1 - \alpha }x_{t} $$

In that case, the parameters in the NKPC model are not identifiable.

The same problem arises when one decides to replace \( E_{t} \pi_{t + 1} \) by \( \pi_{t - 1} \). This leads to

$$ \pi_{t} = \mu + \left( {\alpha + \beta } \right)\pi_{t - 1} + \gamma x_{t} $$

where again the parameters are not identified.

3 A MIDAS-based solution

There is a simple solution though, and that is that you do not replace \( E_{t} \pi_{t + 1} \) by \( \pi_{t} \), but, for example, by the monthly inflation rates that come in as the year proceeds. These models are called MIDAS models, see Ghysels et al. (2006, 2007), Foroni et al. (2015), and Breitung and Roling (2015) among many possible references.

Denoting \( \pi_{s,t} \) as the inflation rate in month s of year t, relative to the same month s in the previous year t − 1, that is,

$$ \pi_{s,t} = 100(\log {\text{CPI}}_{s,t} - \log {\text{CPI}}_{s,t - 1} ) $$

where \( {\text{CPI}}_{s,t} \) is the consumer price index in month s of year t. One can now replace \( \alpha E_{t} \pi_{t + 1} \) by

$$ \begin{aligned} & \alpha_{1} \pi_{1,t} \\ & \alpha_{1} \pi_{1,t} + \alpha_{2} \pi_{2,t} \\ & \alpha_{1} \pi_{1,t} + \alpha_{2} \pi_{2,t} + \alpha_{3} \pi_{3,t} \\ \end{aligned} $$

and so on, until

$$ \alpha_{1} \pi_{1,t} + \alpha_{2} \pi_{2,t} + \alpha_{3} \pi_{3,t} + \cdots + \alpha_{12} \pi_{12,t} $$

In words, this says that the forecast for next year’s inflation is first based on the inflation rate in January, and next it is based on the inflation rates in January and February of the current year, and so on. Note that the average of the twelve \( \pi_{s,t} \) terms is not equal to \( \pi_{t} \) as

$$ 100\left( {\log {\text{CPI}}_{t} - \log {\text{CPI}}_{t - 1} } \right) \ne \frac{1}{12}\mathop \sum \limits_{s = 1}^{12} 100(\log {\text{CPI}}_{s,t} - \log {\text{CPI}}_{s,t - 1} ) $$

Hence, during the current year, one can use monthly data as input to forecasts for next year’s inflation.

4 Illustration for US annual inflation rates

Figure 1 displays the annual inflation rates for the USA, for the sample 1956–2016. Figure 2 displays the annualized inflation rate observed in January as well as the yearly data. Clearly, there is substantial common variation in the data. Figure 3 contains all the monthly inflation rates and shows that there can be sizable variation in the data within a year.

Fig. 1
figure 1

Annual CPI-based inflation, USA, 1956–2016

Fig. 2
figure 2

Annual inflation rate versus the inflation rate in January of the same year

Fig. 3
figure 3

Annualized inflation rates per month

There are various MIDAS-type models to consider. The first type assumes that each month new and relevant information might come is, and these models are

$$ \begin{aligned} & \pi_{t} = \mu + \beta \pi_{t - 1} + \varepsilon_{t} \\ & \pi_{t} = \mu + \alpha_{1} \pi_{1,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ & \pi_{t} = \mu + \alpha_{1} \pi_{1,t} + \alpha_{2} \pi_{2,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ & \ldots \\ & \pi_{t} = \mu + \alpha_{1} \pi_{1,t} + \alpha_{2} \pi_{2,t} + \cdots + \alpha_{12} \pi_{12,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ \end{aligned} $$

The parameters are estimated unrestrictedly, thereby following the format recommended in Foroni et al. (2015), which is called the unrestricted MIDAS model, or in short, UMIDAS.

Table 1 gives a selection of the estimation results.Footnote 1 It is clear that there is substantial variation across the estimated \( \alpha_{s} \) parameters across the UMIDAS models. This reinforces that imposing structure on the parameters using, for example, Almon lags, as is often done in the literature, does not make sense here.

Table 1 Estimation results for the models

Figure 4 presents the root-mean-squared prediction error and mean absolute error for the test sample 2005–2016, where the parameters are estimated for 1956–2004. The UMIDAS model includes January (1), January and February (2), …, January–December (12). The graphs in Fig. 4 show that already quite some accurate forecasts can be obtained when the model

$$ \pi_{t} = \mu + \alpha_{1} \pi_{1,t} + \alpha_{2} \pi_{2,t} + \alpha_{3} \pi_{3,t} + \alpha_{4} \pi_{4,t} + \beta \pi_{t - 1} + \varepsilon_{t} $$

is considered. This can also be learned from Fig. 5 which shows a sharp increase in the \( R^{2} \) when the months January–April are included. Figure 6 gives a graphical impression of how the parameter estimates develop.

Fig. 4
figure 4

Root-mean-squared prediction error and mean absolute error for the test sample 2005–2016, where the parameters are estimated for 1956–2004. The UMIDAS model includes January (1), January and February (2), …, January–December (12)

Fig. 5
figure 5

The \( R^{2} \) of the UMIDAS models where each time an additional month is included

Fig. 6
figure 6

Estimated parameters in UMIDAS models, where each time an additional month is included

Now that UMIDAS models are considered, it is also possible to look at alternative forecast schedules, like, for example,

$$ \begin{aligned} & \pi_{t} = \mu + \alpha_{1} \pi_{1,t} + \alpha_{2} \pi_{2,t} + \cdots + \alpha_{12} \pi_{12,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ & \pi_{t} = \mu + \alpha_{2} \pi_{2,t} + \cdots + \alpha_{12} \pi_{12,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ & \pi_{t} = \mu + \alpha_{3} \pi_{3,t} + \cdots + \alpha_{12} \pi_{12,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ & \ldots \\ & \pi_{t} = \mu + \alpha_{12} \pi_{12,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ & \pi_{t} = \mu + \beta \pi_{t - 1} + \varepsilon_{t} \\ \end{aligned} $$

A selection of estimation results is presented in Table 2. Looking at the significant parameters, it is clear that there is wide variety of possible relevant models.

Table 2 Estimation results for the models

Finally, it is also possible to see which of the monthly inflation rates can be viewed as the most informative for forecasting next year’s inflation. For that purpose, one can consider

$$ \begin{aligned} & \pi_{t} = \mu + \alpha_{1} \pi_{1,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ & \pi_{t} = \mu + \alpha_{2} \pi_{2,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ & \ldots \\ & \pi_{t} = \mu + \alpha_{12} \pi_{12,t} + \beta \pi_{t - 1} + \varepsilon_{t} \\ \end{aligned} $$

The estimation results appear in Table 3. The peak \( R^{2} \) value appears in August (0.983). On the other hand, around April and May the \( R^{2} \) values are already quite high, and also the associated parameter is close to 1, with a small standard error. In other words, predictions from the NKPC model based on data available to and including April are rather accurate for the finally to be obtained annual end-of-the-year inflation.Footnote 2

Table 3 Estimation results for

5 Conclusion

The hybrid new Keynesian Phillips curve model for annual inflation involves an expectation of next year’s inflation. The common assumption in the rational expectations literature is to include the actual next year’s inflation and prediction error. This assumption leads to two inconveniences, that is, endogeneity of one of the regressors, and it violates the Wold decomposition theorem. A simple solution is presented which relies on the MIDAS notion, that is, higher frequency data within the same year can be used to create forecasts for next year’s inflation. An illustration to US annual inflation rates with incoming monthly inflation rates showed the merits of this approach.