1 Introduction

The Great Recession of 2008–2009 represents a watershed for macroeconomics. Kernel relationships broke down, thus showing fragility of macroeconomic models and their inability to make accurate predictions. The price Phillips Curve (PC), which describes the relationship between inflation and the cyclical position of economic activity (we shall refer to this format of the PC unless otherwise stated), is on the forefront. Its failure to produce reliable forecasts of future price developments, has led many scholars to consider this instrument in need of substantial revisions.Footnote 1

A brief account of the recent empirical “puzzles” of the PC may start from the evidence of its “flattening” during the Great Moderation era put forward by Blanchard et al. (2015) who reported two widely shared stylized facts. First, the substantial reduction of the structural slope of the PC plummeting from more than 1 in the mid-1970s to about 0.3. Second, the “anchoring of expectations”, meaning that the expectational component of inflation had become more stable and tighter (less weight of lagged inflation rates), implying that both expected and actual inflation would tend to converge faster towards the official inflation target.

These features seemed to persist in the aftermath of the Great Recession, when the “missing deflation puzzle” emerged in the US, where the collapse of about 10% of GDP relative to trend was followed by a modest decline of 1.5% of inflation (Williams 2010). In the Euro Zone (EZ), however, concerns at the European Central Bank (ECB) and among scholars were of opposite sign: a deflationary drift with de-anchoring of expectations seemed under way (Draghi 2014, 2016). Research on the post-crisis EZ challenged the consensus on the worldwide flattening of the PC pointing to its “steepening” (e.g. Riggi and Venditti 2014, 2015; Oinonen and Paloviita 2014; Bank of Ireland 2014). Direct investigation of expectation forecasts showed both their downward drift and a significant underestimation of actual inflation (Riggi and Venditti 2014; Miccoli and Neri 2015). Parallelly, direct evidence of the downward de-anchoring of expectations was detected in various studies (Buono and Formai 2016; Fracasso and Probo 2017; Nautz 2017; Natoli and Sigalotti 2017).

Afterwards, since recovery was taking hold across advanced economies, the new “missing inflation puzzle” spurred researchers’ efforts as inflation remained subdued in spite of more sustained economic activity, falling unemployment and some recovery of wage dynamics. In that period, the EZ realigned with the rest of the advanced world (Ciccarelli and Osbat 2017; Fiedler et al. 2018; Bobeica and Sokol 2019).

Given the intrinsic relationship between accurate economic forecasts and the implementation of appropriate economic policies, the literature about revisions of the PC has taken two main avenues (Ehrmann et al. 2020). One is, as it were, more radical, arguing for a dismissal of the traditional macro-apparatus in favor of a revival of a micro, disaggregated approach to the unemployment-wages-prices transmission. The other remains within the macro-tradition seeking for better specifications of the PC estimation equation. Our contribution may be ascribed to this latter strand.

In particular, we locate the need for better specification of the PC in its expectational component. For instance, Coibion and Gorodnichenko (2015) show that the missing deflation puzzle in the US in the aftermath of the Great Recession can be solved by replacing the usual surveys of expert forecasts with households’ ones, as the latter better captured the impact on consumer prices of the concomitant upsurge of oil and commodity prices. Jörgensen and Lansing (2019) follow the literature about boundedly-rational formation of expectations, and assume that agents update their inflation expectations upon solving a signal extraction problem to disentangle temporary versus permanent shocks to inflation. They show that the slope of the PC may appear flatter or stepper depending on the evolution of the parameter (the Kalman-filter gain) that governs the expectation formation process. We approach the issue from a more general perspective.

Inflation expectations expressed “on the right-hand-side” of the PC are generally treated as independent variables, collected from various available forecast surveys. This practice is not satisfactory for two reasons. First, the PC should be understood as one component of a set of co-evolving macroeconomic relationships (think for instance of the New Keynesian three-equation model). Indeed, to the extent that the central bank controls inflation tightly, it is no surprise that the ex-post inflation-output relationship is seemingly flat (Bernanke 2007). Second, and consequently, the alleged explanatory variables in the PC are co-determined elsewhere in the system, and may not even be independent one of another. This is specifically the case of inflation expectations, which are arguably elaborated, in some way and to some extent, according to the evolution of the macroeconomics.

It may be said that major institutions indeed base their inflation forecasts (also) on full-blown macro-models. Nonetheless, the PC as a stand-alone equation remains a pivotal macroeconomic tool as testified by the debate summarized above. More importantly, several studies show that the inflation forecasts collected in available surveys are consistently explained by means of a standard formulation of the PC (Fendel et al. 2011; Rülke 2012; Draeger et al. 2016; Casey 2020). Therefore, in Sect. 2 we present a modified PC where inflation expectations are no longer considered to be independent of the slack variable (output gap in the specific case).Footnote 2 The idea is simply that if the PC is used worldwide to foresee inflation, then its expectational component should in some way be the result of agents using the PC itself.Footnote 3 As a consequence, the independent explanatory variables of inflation turn out to be the output gaps and the related forecast errors by agents, with notable consequences from an empirical point of view. First, standard empirical models with exogenous inflation expectations yield a distorted estimator of the output structural parameter. Second, the observed inflation-output relationship may appear flatter or steeper not (only) owing to structural factors, but (also) to the effect of output expectations and forecast errors. Third, our model allows to disentangle the true output component of inflation from the transitional expectational one. These features relate our model also to the issue of the anchoring (de-anchoring) of expectations, namely anchored (de-anchored) expectations may make the PC seemingly flatter (steeper) in recession and steeper (flatter) in recovery.Footnote 4

Section 3 will provide an empirical application of our model developed on data collected in the EZ from 1999 to 2019. The ECB Survey of Professional Forecasters is employed as proxy of output expectations. Then, an econometric linear regression model is introduced and tested through the ordinary least squares (OLS) method and the generalized method of moments (GMM). Specific tests are provided in support of this choice. Our results show that in the EZ the output component on inflation is not so flat, and the extent to which the post-crisis deflationary drift and the post-recovery missing inflation can be explained by the transitory expectational component.

The last section concludes with a view to new perspectives in terms of empirical research and possible implications regarding monetary and fiscal policies undertaken by public institutions.

2 The Phillips Curve with endogenous expectations

A baseline specification of the PC for empirical analysis may be the following (see e.g. Hooper et al. 2019):

$$\begin{aligned} \pi _t = \alpha + \beta _1\pi ^e_t + \beta _2x_t + \beta _3\Lambda (\pi ,n)+ \beta _4^{'}Z_t + u_{\pi t} \end{aligned}$$
(1)

where \(\pi _t\) is current inflation, \(\pi ^e_t\) is an expectational term as of time t to be specified, \(x_t\) is a measure of the business cycle or “economic slack”, \(\Lambda (\pi ,n)\) is the lag operator of inflation of order n, \(Z_t\) is a vector of other variables, and \(u_{\pi t}\) is a random shock. The intercept \(\alpha\) may capture an autonomous drift in inflation.

The empirical issues about the PC revolve around the estimation of coefficient \(\beta _2\), which is meant to capture the structural slope of the PC, or better the responsiveness of inflation to the business cycle. As to the relevant variable \(x_t\) , it is now common to employ the output gap, defined as the percent difference between actual output (GDP) and the maximum potential output the economy can produce (potential GDP). Looking back at the origins, some would rather use the unemployment gap, the difference between the unemployment rate and the natural rate or the NAIRU, or the Phillips’ wage PC altogether where the dependent variable is wage inflation. Researchers testing these respecifications of the PC argue that the puzzles of the price PC are less compelling or vanish altogether (Hooper et al. 2019; Nickel et al. 2019).

More radical revisions of the estimation model wish to capture nonlinearities, that is different reactions of inflation to upward or downward phases of the business cycle (Nalewaik 2016; Lindé and Trabandt 2019; Hooper et al. 2019).

In the following we shall concentrate on the expectational term \(\pi ^e_t\) of Eq. 1. By way of this term, the model accommodates the Monetarist critique to the older Keynesian PC, and is also akin to the (inverted) Lucas (1973) “surprise” aggregate supply. The debate on the nature of the expectational term is, however, still open. In the “accelerationist” PC, Friedman (1968) used adaptive expectations formed on the basis of the last-period inflation term. In the Lucas supply curve (1973) it was replaced by the expectation of current inflation. In the New Keynesian, Calvo-type PC there appears the current expectation of next period inflation. The “hybrid” PC proposed by Galì and Gertler (1999), and adopted for example by Blanchard et al. (2015), splits expectations in a forward-looking component (usually the anticipation of the current inflation) and in a complementary backward-looking one (usually the previous-period inflation, not dissimilar to Friedman). This amounts to adding the lagged inflation component \(\Lambda (\pi ,n)\) in order to allow the model to fit inflation persistence.

Our point is that the expectational term, whatever it is, is generally treated as an independent (exogenous) explanatory variable, proxied by available forecast surveys. This practice is problematic for two reasons. The first is that it isolates the PC from the rest of economy, whereas it should better be understood as co-evolving with the rest of the economy. The consequence can easily be seen, for example, through the New Keynesian three-equation model. If the central bank controls inflation tightly by means of the Taylor Rule, the ex-post PC will seemingly be flat because output fluctuates around its potential due to shocks to the IS curve whereas inflation does not deviate from its target. The second, related, reason is that inflation expectations, too, cannot be taken to be exogenous with respect to the evolution of the rest of the economy, and output in particular. Indeed, if the PC as a stand-alone tool is to survive, a proper treatment of the expectational term may provide the bridge with “the rest of the economy”.

Our proposed treatment in the first place considers the case in which \(\pi ^e_t\) is the anticipation of the current inflation, with \(x_t\) equal to the output gap, the other terms \(\Lambda (\pi ,n)\) and \(Z_t\) being dropped, that is

$$\begin{aligned} \pi _t = \alpha + \beta _1\pi ^e_t + \beta _2x_t + u_{\pi t} \end{aligned}$$
(2)

Further ingredients mentioned above may enrich the picture, but at the cost of some complexification which is not strictly necessary, at least at this first stage. If the central bank has an inflation target \(\pi ^*\), this should be consistent with the stochastic equilibrium of the inflation process (2), i.e. \(\pi ^e_t = \pi _t = \pi ^*\), \(x_t = 0\), for \(u_{\pi t} = 0\). It follows that \(\pi ^* = \alpha /(1-\beta _1)\).Footnote 5 Therefore, substituting \(\alpha = \pi ^*(1 - \beta _1)\), we can rewrite Eq. (2) as

$$\begin{aligned} \pi _t = \pi ^* + \beta _1(\pi ^e_t - \pi ^*) + \beta _2x_t + u_{\pi t} \end{aligned}$$
(3)

Where does the expectation of current inflation come from? Treatment of expectation formation is notoriously controversial, but our idea can simply be put this way. If the PC is used worldwide to foresee inflation, its expectational component should in some way come from agents using the PC itself. This idea is consistent with a broad principle of rationality of expectations (of recurrent and stable processes) as the outcome of informed understanding of how the economy works - or the “data generation process” (Evans and Honkapohja 2011; Kurz 2011). In the limit case of the (unboundedly) rational expectations hypothesis (REH)Footnote 6, \(\pi ^e_t\) should be the statistical expected value of inflation, i.e. \(\pi ^e_t = E(\pi _t)\) conditional on Eq. (3) (Woodford 2003, ch.3). Consequently,

$$\begin{aligned} E(\pi _t) = \pi ^* + \frac{\beta _2}{1-\beta _1} E(x_t) \end{aligned}$$
(4)

The PC-based rational expectation of inflation at time t can be expressed as a deviation from the target determined by the expected value of the output gap in t. Intuitively, in a price-setters’ environment, output expectations may drive economic agents’ strategic behavior, so to justify a variation in the price level.

Substituting Eq. (4) in (3), and rearranging terms, we obtain

$$\begin{aligned} \pi _t = \pi ^* + \frac{\beta _2}{1-\beta _1}[\beta _1E(x_t) + (1-\beta _1)x_t] + u_{\pi t} \end{aligned}$$
(5)

The actual inflation path around the target results to be the linear combination of the expected and the actual output gap, where \(\beta _1\) represents the weight between the two terms, plus random shocks. It may be noted that Eq. (5) nests two extreme cases. If \(\beta _1 = 0\), i.e. no expectational component of inflation, the equation reproduces the old Keynesian PC such that inflation is procyclical, given its structural slope \(\beta _2\). If \(\beta _1 = 1\), i.e. full pass-through of expected inflation, the PC becomes vertical with respect to expected nonzero values of \(x_t\). In other words, any small expected deviation of output from potential is fully transmitted to inflation. This verticalization of the PC, though, is not necessarily due to full wage-price flexibility (\(\beta _2 \rightarrow \infty\)), but to the expectational component of inflation.

Taking a step forward, we can express actual and expected output gaps in terms of forecast errors or Lucas’s “surprises”

$$\begin{aligned} \upsilon _t=x_t-E(x_t) \end{aligned}$$
(6)

By way of Eq. (6), Eq. (5) can be rewritten as

$$\begin{aligned} \pi _t = \pi ^{*} +\frac{\beta _2}{1-\beta _1}x_t -\frac{\beta _2\beta _1}{1-\beta _1}\upsilon _t + u_{\pi t} \end{aligned}$$
(7)

If the REH holds, forecast errors will be random, with zero mean, and uncorrelated, so that the expected value of (7) coincides with (4).

In the first place, we may employ our inflation equation in connection with the issue of anchoring vs. de-anchoring of expectations. In the empirical literature, after Bernanke (2007), the most common criterion used to test the anchoring of expectations is based on their responsiveness to macroeconomic news. Therefore, we may also say that agents with well-anchored expectations hold \(E(x_t) = 0\) at all times irrespective of the observed news about \(x_t\). Consequently, \(\upsilon _t = x_t\), and

$$\begin{aligned} \pi _t = \pi ^* + \beta _2x_t + u_{\pi t} \end{aligned}$$
(8)

By contrast, agents with de-anchored expectations will tend to correlate their output gap expectation to its observed values. In the limit case of fully anticipated output gaps, \(\upsilon _t = 0\), the result is:

$$\begin{aligned} \pi _t = \pi ^{*} +\frac{\beta _2}{1-\beta _1}x_t + u_{\pi t} \end{aligned}$$
(9)

We can therefore define two macroeconomic regimes depending on the state of expectations in which the PC is different (see also Nalewaik 2016; Gobbi et al. 2018; Hooper et al. 2019). When expectations are anchored (AE regime), Eq. (8) shows a seemingly old Keynesian PC whose estimation would yield the structural slope \(\beta _2\). When expectations are de-anchored (DE regime), Eq. (9) shows that the PC displays a greater slope, which is not due structural factor \(\beta _2\) but to the effect of the expectational component \((1-\beta _1)<1\).

Of course, most of the time the economy will lie in between the two pure regimes, so that inflation will be driven by Eq. (7). Consequently, in order to identify the slope of the PC correctly, it is necessary to take into account both the output gap and its forecast errors. Table 1 displays the effects on the PC of the possible combinations of output gaps and forecast errors.

Table 1 Effects of output gaps and forecast errors on the Phillips Curve, Eq. (7)

As can be seen, depending on the sign of the forecast error, the PC may appear steeper or flatter. Meaning that the observed change in inflation is greater or smaller than due to the output gap alone. This effect is due to forecast errors that shift the PC as in Eq. (7) as a consequence of agents’ behavior led by overestimated or underestimated output gaps. The PC appears steeper when output gaps are overestimated (\(|x_t| < |E(x_t)|\)), in which case forecast errors amplify the effect of output gaps. It appears flatter when output gaps are underestimated (\(|x_t| > |E(x_t)|\)), in which case forecast errors dampen the effect of output gaps.

The combinations in Table 1 can accommodate the various “puzzles” detected in the empirical literature. For instance, seemingly missing inflation in a boom as well as missing deflation in a slump may be the consequences of the smoothing effect of agents’ underestimation of output gaps or better anchored expectations. By contrast, the excess deflation detected in the course of the EZ recession may be the consequence of the amplifying effect of agents’ overestimation of negative output gaps. This vindicates the concern at the ECB for the de-anchoring of deflationary expectations running ahead of the actual downward trend of inflation (Draghi 2014, 2016).

3 Empirical analysis

In this section we present the results of the econometric tests on our theoretical PC Eq. (7),which lends itself to straightforward treatment in terms of inflation gap, \(\pi _t-\pi ^*\). The estimation linear model corresponding to Eq. (7) is as follows:

$$\begin{aligned} inf_t = \alpha + \gamma _1out_t + \gamma _2err_{t,t-i} + u_t \end{aligned}$$
(10)

where \(\alpha\) is a constant; \(inf_t\) represents the gap between actual inflation and inflation targeting (because of the reasons already explained above); \(out_t\) is the output gap; \(err_{t,t-i}\) is the forecast error. The subscript in forecast error term \((t,t-i)\) means that the output gap expectations, used to compute the errors, refer to current time t, but are formulated earlier at time \(t-i\), with \(i=4\) for short-term forecast (1-year), 8 for medium-term forecast (2-years) and 20 for long-term one (5-years). In what follows we will mainly refer to short-term forecast, unless otherwise specified. The error component \(u_t\) is assumed to be additive. Some specification tests on the residuals will be provided in support of our estimation.Footnote 7

3.1 The data

The data have been collected from Eurostat, AMECO, ECB Statistical Data Warehouse and OECD Main Economic Indicators. All the data are quarterly one-year percentage changes referred to the EZ (changing composition up to January 1, 2015) from 1999-Q3 to 2019-Q4.

Inflation data are based on the Harmonized Index of Consumer Price (HICP), from the Eurostat database. The inflation gap is then computed as a simple difference between inflation and 1.9% inflation target, as the price stability objective of the Eurosystem is “a year-on-year increase in the HICP for the EZ of below 2%”,Footnote 8 as stated by the ECB’s Governing Council in 1998. Quarterly output gaps are obtained by implementing a cubic spline interpolation on AMECO and OECD annual data.Footnote 9 For comparison, the two output gap time series are displayed in Fig. 1.

Fig. 1
figure 1

Output Gap, AMECO and OECD, 1999–2019. Data Source: AMECO database and OECD Main Economic Indicators

The expected inflation rate and expected GDP growth rate are reported as average point forecasts on a 12 months target period (1 year ahead). For our estimations we have also made use of forecasts on 24 months and 5 years target period. They are provided by the ECB Survey of Professional Forecasters (SPF). The expected inflation gap is computed as the difference of expected inflation with the target; the expected output gap is calculated as the difference between expected and potential GDP, as a percentage of potential GDP (PGDP).Footnote 10 A more detailed description of the data is reported in Table 2 (summary statistics of the actual and expected inflation gap) and Table 3 (summary statistics of the actual and expected output gap).

First, two interesting elements emerge from Table 2: the difference in the variables ranges between the 10th and 90th percentile (2.23 for actual inflation, 0.81 for the expected one) and in the variance (the variance of the actual gap is 9 times larger than for the expected gap). Both hint at the nature of agents’ expectations, which are less volatile than observed inflation and fluctuate closer to the equilibrium level. In addition, expected values are on average lower than actual ones (\(-0.28\) against \(-0.19\)), which reveals a potential pessimistic or prudential attitude of agents when building their forecasts on the inflation rate. This hypothesis is further confirmed by the fact that the median of expected values is negative (\(-0.25\)), while the median of actual values is zero.

Table 2 Summary statistics table, real and expected inflation gap

On the contrary, data on output gaps (Table 3) show a similar pattern between real and expected values. This is shown again by the variables percentile range (5.09 and 4.79) and variance (3.528 and 3.08), which do not differ significantly. Also in this case, output forecasts are on average lower than observed GDP (− 0.23 against − 0.17) but the median of expected gap is much lower (0.08 against 0.14). This could entail a pessimistic behavior of agents when making output forecasts, resulting in an overestimation of gaps, even though it is rather a weak evidence. Indeed, as already explained in the previous section, the overestimation of GDP gaps (in absolute values) should then be reflected in the inflation gap by amplifying it, as described by Eq. 7.

Table 3 Summary statistics table, real and expected output gap

To better understand the dynamics at work, let us now turn to the pattern the data exhibit over time. As shown in Fig. 2, expectations of the inflation gap have been well anchored (close to zero) from the beginning of the 2000’s to 2008 (in correspondence with the outbreak of the financial crisis). From that point on, expectations have fallen about 1 point below the target, showing a higher sensitivity with respect to the trend of observed inflation. This would confirm the hypothesis of de-anchoring of inflation expectations endorsed by a growing number of scholars in last years.

Fig. 2
figure 2

Inflation Gap and Expected Inflation Gap, 1999–2019. Data Source Inflation gap data are from Eurostat database; Expected inflation gap is calculated on expected inflation from ECB Survey of Professional Forecasters

Similarly, the expected output gap (Fig. 3) has remained quite stable around the GDP gap until 2009, when agents have dramatically failed to predict the abrupt drop in the output level of the EZ, as highlighted by the fall in the forecast error value (\(-6.5\)%). After 2010, instead, the gap between real and expected output has decreased again (in absolute value).

Some preliminary considerations can already be made here. First, a general positive co-movement of output and inflation gap, although with some lag, appears between the two series. Second, it is not negligible the role of negative forecast errors in counterbalancing the effect of output gap on inflation. This is particularly evident in the period 2011–2013 (when the “missing deflation” puzzle arises) and 2017–2019 (when we observe a “missing inflation” problem). These observations seem to be quite in line with the predictions of our theoretical model.

Fig. 3
figure 3

Output Gap, Expected Output Gap and Forecast Error, 1999–2019. Data Source: Output gap data are from AMECO (database); Expected Output gap is calculated on expected GDP data from ECB Survey of Professional Forecasters

So key to our model of the PC is the compound effect of both output gaps and agents’ forecast errors. The effects of the four combinations of these variables on the PC were summarized in Table 1. Those combinations can easily be mapped onto the data by means of the dispersion graph presented in Fig. 4 which plots forecast errors against output gaps. The labels of the four quadrants of the graph reproduce the combinations in Table 1.

Fig. 4
figure 4

Forecast errors vis-à-vis output gaps, 1999–2019

The output gaps are divided in four phases: pre-crisis (1999–2007), recession (2008–2009), depression (2010–2016), recovery (2017–2019). In the pre-crisis and recovery phases both output gaps and forecast errors are mostly positive or close to zero (output gaps are underestimated), hence the predicted result is a flatter PC. In the recession phase, both output gaps and forecast errors are mostly negative (again, output gaps are underestimated), keeping the PC flat. In the depression phase, instead, systematically negative output gaps are mostly associated with positive forecast errors, i.e. they are overestimated by agents, creating a steeper PC. These predictions of changing slopes of the PC fit remarkably well the puzzles in the literature, discussed in the previous sections, about missing inflation in the pre-crisis and recovery phases, missing deflation in the aftermath of the recession, and excess deflation in the depression phase, with a clear impact of de-anchored expectations.

Having identified the role of output gap and forecast errors in conditioning the slope of the PC, a few additional considerations may be in order. Whatever method agents use to form their expectations, forecast errors are inevitable, even in the REH limit case of agents holding the “true” statistical expectation of the (stochastic) variable. On the other hand, forecast errors are also a symptom of uncertainty, indeed they are the correct measure of uncertainty according to Jurado et al. (2015). Hence it is not by chance that in the EZ forecast errors have become critical as amplifiers of the depression phase in conjunction with the de-anchoring of expectations, that is in a spell of increasing uncertainty. The same phenomenon can be expected to materialize if greater uncertainty and de-anchoring of expectations take place in the run-up of inflation, steepening the PC and fueling the inflationary process. If these considerations are correct, we may also conclude that time-varying uncertainty, reflected into larger “unforecastability” à la Jurado et al. (2015), broadens the set of factors that impinge on the conundrum of the slope of the PC.

3.2 Econometric analysis and results

In accordance with recent literature on the empirical analysis of the expectation-augmented Phillips Curve (e.g. Paloviita 2008; Busetti et al. 2017), and assuming our theoretical Eq. (7) to take the form of a linear model, a standard estimation procedure for multiple linear regressions, ordinary least squares (OLS), is employed for the estimation.

First issue to consider in time series analysis is the degree of correlation among residuals. Both auto and partial-autocorrelation plots reveal the presence of significant autocorrelation of first order.Footnote 11 Moreover, tests for multicollinearity (Belsey’s diagnostics) and conditional heteroscedasticity (Engle’s ARCH Test ) are run on the model. For the first test the null hypothesis is not rejected; for the second it is rejected (see “Appendix”). Heteroscedasticity and autocorrelation robust standard errors (HAC) have then been employed in the OLS regression. Another possible issue which is more difficult to detect, is the endogeneity problem. In order to prevent possible sources of endogeneity (unobserved heterogeneity, simultaneous causation, measurement error), which would cause OLS to be biased, we have employed additionally a GMM estimation method,Footnote 12 which provides instead consistent results in the presence of endogeneity.

We report the results from OLS and GMM estimation of model (10) in Table 4. For robustness check, we have estimated our model using two different sources for output gap data: the AMECO and the OECD database. Results from the latter are reported in the “Appendix” (see Table 6). Estimated coefficients (all statistically significant at conventional levels) are quite reasonable and coherent with our theoretical model. In particular, an increase of 1% in the output gap, leads to an increase of the inflation gap between 0.24% (in OLS) and 0.28% (in GMM); on the contrary, a 1% increase in the forecast error implies a drop of the inflation level 0.22% with OLS estimation, while the reduction decreases to 0.11% in GMM when employing the 1 lag IV (it increases instead to 0.25 with a 2 lags IV). Moreover, when adding additional lags as IV, hence employing more than one instrument for each explanatory variable, the magnitude of both the output gap and the forecast-error coefficients (column 3) further increases.Footnote 13 Thus the sensitivity of inflation to the output gap would be far from being negligible. In addition, the “shifting effect” of forecast error (which increases or reduces the output gap impact depending on whether the sign of the error is negative or positive, respectively) is also considerable.

Table 4 Phillips Curve linear regression table

What happens instead when employing longer-term forecasts instead of usual 1-year predictions? In Fig. 5 forecast errors with different prediction horizons are illustrated. As it can be seen, short and long term forecasts do not differ significantly over time, in particular expectations with 2 and 5 years horizons (the two lines overlap for almost the entire time interval). This should not sound strange, considering that agents make predictions on different horizons in the same moment, so that the information available they rely on are identical.Footnote 14 Interestingly, long and medium term expectations depart from short ones in two precise points: once around 2009–2010, another between 2012 and 2014 (which represent even the lowest points in forecast errors series). The most credible hypothesis is that short-term expectations, although far from the real value of the output gap, as shown by the negative value of the forecast error, have benefited from the possibility of gleaning from recent information. On the contrary, longer-term expectations have suffered from agents’ inability to make correct long-term forecasts. The fact that 2 and 5-years error terms are larger (in absolute value) might be due to greater optimism that agents have with respect to output growth in the distant future.

Fig. 5
figure 5

Forecast Error, 1-year, 2-years and 5-years target period, 1999–2019. Data Source: Output gap data are from AMECO database; Expected Output gap is derived from expected GDP growth from ECB Survey of Professional Forecasters

In order to test the explanatory power of longer-term expectations, we have employed the same OLS and GMM regression as before, substituting the 1-year forecast error variable respectively with 2 and 5-years error term. Estimation results are reported in Table 5. When comparing these outcomes to those reported in Table 4, we immediately notice two facts: the first is that the output gap coefficients do not diverge significantly, whatever kind of expectation we employ in the estimation (they are slightly higher with medium and long-term expectations with respect to estimation with short-term ones); the second is that, on the contrary, the coefficient associated with the forecast error decreases significantly (in absolute term) as the target period increases. Indeed the lowest value of \(\gamma _2\) is reached for GMM estimation with \(err_{t,t-5}\)Footnote 15 (column 4 of Table 5). This result is quite reasonable. As already stated above, short-term expectation benefit from the quality (and quantity) of information available at the time professional forecasters provide their predictions in the surveys. This suggests that the more distant in time forecasts are built, the lower is their explanatory power over inflation gap (and so the higher is the output gap’s one).

Table 5 Phillips Curve Linear Regression Table, medium and long-term forecasts

It might be wise however to consider alternative approaches to our modeling, in order to enrich the analysis. It is indeed true that another crucial question may arise when employing time-series in a regression, that is the stability of parameters over time. In the presence of significant structural breaks in the \(\hat{\gamma }_s\), the estimated model might be not completely reliable. A clear representation of this issue comes from the graph in Fig. 6. Here it is represented the evolution of the output gap and forecast error estimated coefficients of Eq. (10), obtained by employing a recursive OLS regression with Newey-West robust standard errors.Footnote 16 As illustrated, the two parameters are quite unstable, as they rapidly change direction as new data are added in the process.Footnote 17 In particular, 3 main structural breaks appear around iteration 30, 40 and 50. Further evidence of coefficient instability is provided by the structural change tests “Cumulative-sum-test” (Cusum) and “Chow-test”. As a robustness check, we have tested an additional linear model in which six new variables were included: these were the result of the interaction between the two explanatory variables and three time-dummies, corresponding to the main structural breaks detected by the tests, respectively 2009-Q1, 2011-Q3 and 2013-Q4.Footnote 18 The employment of dummy variables in the regression, accounting for the structural breaks, allowed us to use a standard OLS for the estimation. It is important to note that, as revealed by the “Chow-test”, other minor breaks are present in the time series in use, but accounting for all of them would have resulted in a confused and overloaded model. Nevertheless, even the employment of those six dummies led to contradictory and not always statistically significant results. We preferred then a more structured and robust approach.

Fig. 6
figure 6

Output Gap and Forecast error Coefficients, Recursive linear regression of model (10) Notes: The graph is obtained by employing a recursive (nested-window) regression on Eq. (10). Number of iterations is reported on th x-axis

In order to address this issue, we have introduced in our analysis an alternative model specification which takes into account the evolution over time of parameters. In analyzing time series data, the assumption that coefficients are fixed throughout the entire sample period under study is not always reasonable (Harvey and Phillips 1982; Tucci 1995). One way to deal with it is then to assume that coefficients are a function of some smoothing variable at time t. Such model is called time-varying coefficient linear model, which in our case is defined as follows:

$$\begin{aligned} inf_t = \alpha + \gamma _1(z_t)out_t + \gamma _2(z_t)err_{t,t-i} + u_t \quad \text {and} \quad t=1,...,T \end{aligned}$$
(11)

where \(z_t = \tau = t/T\) is the rescaled time allowing our \(\gamma _j\) to vary over time.Footnote 19 In so doing, the time-varying coefficients turn out to be unknown functions of time, as \(\gamma _j(z_t)=f_j(\tau )\). Collecting the explanatory variables in the column vector \(\mathbf {X}_t\), the dependent variable in \({Y}_t\), and the time-varying parameters in the vector \(\mathbf {\gamma }(z_t)\), we can express the model as:

$$\begin{aligned} Y_t=\mathbf {X}_t^{'}\varvec{\gamma }(z_t)+u_t \end{aligned}$$
(12)

Then the sample estimator is proportional to an OLS estimator, net of a weighting function \({p}_{b}\); the result is a series of weighted local regressions with a specific window size b (bandwidth):

$$\begin{aligned} \hat{\varvec{\gamma }}(z_t)=&\left( {p}_{b} \mathbf {X}^{\prime } \mathbf {X}\right) ^{-1}\left( {p}_{b} \mathbf {X}^{\prime } \mathbf {Y}\right) \end{aligned}$$
(13)
$$\begin{aligned} {p}_{b} & =K\left( \frac{z_t-z}{b}\right) \end{aligned}$$
(14)

where K is a kernel function, b sets the dimension of the bandwidth. An extensive description of computational implementation of these kind of models is provided by Casas and Fernández-Casal (2019). Without prejudice to the importance of linear models, an estimation with time-varying regressors (tvReg)Footnote 20 allows us to verify how the sensitivity of the inflation gap changes over time with respect to the output and agents’ forecast error. Results from employing this procedure on our datasets are displayed in Fig. 7.

Fig. 7
figure 7

Time-varying Coefficients regression. Notes: tvReg model uses the same formula as the linear function (10); “Gaussian” type kernel employed in the coefficient estimation. The selected bandwidth for time-varying regression is 0.3. Output gap data are from AMECO database. Confidence interval at 90%; 50 resamples used in the bootstrapping calculation

The panel on the left (a) shows the time-varying output gap coefficient; similarly, the one on the right (b) shows the time-varying forecast-error coefficient. The results suggest an important increase (in absolute value) in the forecast error coefficient, which doubles from 0.15 to 0.30 at the end of the sample. For what concerns the output gap coefficient, it reaches its peak in 2010 (0.25), after a slow increase from the initial value of 0.18; then it decreases back to the initial level. According to Eq. (7), the combined behavior of the two parameters could be determined by an increase in the sensitivity of inflation gap to output slack, in the form of a higher structural term coefficient \(\beta _2\). There is however a clear and growing relative weight of output forecast errors in shaping the inflation gap.

In view of all this, the hypothesis relative to our endogenous-expectation PC can be confirmed and provide an explanation to the above-mentioned “inflation puzzles”. In particular:

  1. a

    the “missing deflation” puzzle in the aftermath of the recession (in particular between 2011 and 2013) coincides with a largely negative value of the forecast-error component, i.e. an initial underestimation of the worsening of output gaps (see Fig. 3); the negative value of the estimated \(\hat{\gamma }_2\) from Eq. (10) would have then attenuated the impact of the output gap on prices, which explains why inflation has not fallen together with output;

  2. b

    in the second half of the depression phase (between 2014 and 2016), we observe instead a rapid steepening of the curve: the positive forecast error indicates an overestimation of the negative output gaps (here between \(-2\)% and \(-1\)%), strengthening their overall impact on the inflation gap (see Table 1);

  3. c

    lastly, the “missing inflation” puzzle in the recovery phase (more evident between 2017 and 2019) is clearly explained by the positive value of the forecast error term (until 2018-Q2), which indicates a strong underestimation of the improvement of output gaps and hence a new flattening of the PC.

4 Conclusions and future perspectives

Some so-called “inflation puzzles” have led researchers to wonder whether the PC could still be a valid tool to estimate and foresee the relationship between inflation and the business cycle. In this paper we have shown that the basic expectations-augmented PC can accommodate the relevant puzzles provided that its expectational component is no longer treated as an independent explanatory variable. Our idea, broadly consistent with the principles of rational expectations, and corroborated by independent evidence about how available inflation forecasts are elaborated, is that if agents rely on the PC to foresee inflation, then its expectational component should be endogenized accordingly. The result is a PC in which inflation depends no longer on its own expectations and the output gap, but on the output gap and its forecast errors. In this new formulation, output and inflation keep moving in the same direction, while forecast errors (calculated as the difference between forecasts and observed output) act in two opposite ways: they amplify the impact of output on inflation when agents overestimate the gap (the curve appears steeper), or, on the contrary, they reduce the impact when agents underestimate the gap (the curve appears flatter).

The hypotheses generated by the model are confirmed by the data collected on the EZ from 1999 to 2019. The econometric analysis reveals not only that the structural component of the PC is not flatter than ever, but also that the model is able to explain the inflation puzzles and the various phases of apparent flattening and steepening of the PC during the life of the Euro. Of course, other explanatory variables might be added, but our purpose has been to check how well our reformulation of the basic PC may fare.

On the side of policy implications, the inflation puzzles have challenged the effectiveness of conventional and unconventional monetary policies in stimulating inflation and economic growth, particularly in the EZ in consequence of its institutional setup. Several scholars suggest a major change of direction in the policies undertaken by the ECB (e.g. Parliament European (2019) and Capolongo and Gros (2019)). One of these proposals would be that of a dual-mandate (similarly to the Federal Reserve) focusing not only on the price stability, but also on the progressive reduction of output and unemployment gap. Another one would focus on a possible range in which inflation could fluctuate, depending on current specific economic circumstances. A third view is that the low level of inflation persistently observed in the EZ is the result of a structurally flatter Phillips Curve, so that the ECB should reparametrize its target rather than struggling to push inflation up. The discrimination between the structural and transient expectational components of the PC proposed in this paper may provide valuable guidance in the search for better monetary policy making. For instance, in the case of the EZ, our finding that the missing inflation in the recovery phase (after 2016) is due to a large extent to the flattening effect of output expectations lagging behind actual developments suggests that it would be a mistake for the ECB to surrender to depressed expectations instead of being their driver.

Our empirical work is open to further developments in various directions. First, it would be important to extend the research to a country-level analysis, to investigate the impact of structural economic differences between countries in the EZ, or to test whether the model is also able to accurately predict the inflation behavior in the US economy. Furthermore, other variables could serve as a proxy for output forecasts, other than the Survey of Professional Forecasters, to better reproduce households’ and firms’ expectations along the lines of the empirical work by Coibion and Gorodnichenko (2015). In this regard, there exists the Consumer Surveys conducted by the Directorate General for Economic and Financial Affairs of the European Commission, though these are qualitative data that are difficult to integrate into an econometric model.Footnote 21 Finally, beyond the effect of the output gap and forecast errors observed at a given point in time, our model may be extended towards mechanisms whereby agents learn from errors and seek to improve their output forecast capacity, along the lines indicated for instance by Jörgensen and Lansing (2019) in the case of inflation forecasts.