Introduction

The issue of the analysis of the time series properties of the unemployment rate of different developed countries has traditionally been the subject of numerous studies since the 1980s. However, there is far less empirical analysis on the dynamic properties of unemployment distinguishing by sex and education attainment than for aggregated data. Unemployment is a social stigma that epitomises the health of an economy. Some European countries such as Spain and Greece have never managed to recover the pre-Great Recession unemployment rate levels. After the COVID-19 pandemic, things have only taken a turn for the worse.

In order to analyse the effect of shocks on different groups for the unemployment rate in a set of European countries, we analyse the order of integration of unemployment rates by gender and schooling attainment. The analysis of the order of integration I(d) is relevant for policy matters. If the variable is integrated of an order less than 0.5, the variable is considered to be stationary process with shocks only lasting temporarily. However, if the order of integration is in the interval [0.5, 1), the variable is a non-stationary process but mean reverting. This means that shocks will have long-lasting effects although the variable will eventually revert to the equilibrium; finally, if the order of integration is 1 or over, shocks will never die out, and policy intervention will be necessary to revert the effects of a shock on the variable.

This empirical testing can be linked to the traditional hypothesis of unemployment: hysteresis, persistence and the natural rate of unemployment or non-accelerating inflation rate of unemployment (NAIRU) (Friedman, 1968; Phelps, 1968). Hence, by testing the order of integration, not only are we testing the effect of shocks on the variable, but also which of the hypotheses is fulfilled by the data. Hysteresis implies that the variable does not revert to the equilibrium after a shock, meaning that the order of integration is 1 or over. If the natural rate of unemployment hypothesis is valid, the order of integration should be less than 1. Finally, the persistence hypothesis establishes that the rate of unemployment needs a long time to revert to the equilibrium after a shock, which is epitomised by an order of integration between 0.5 and 1 (Leon-Ledesma, 2002; Leon-Ledesma & McAdam, 2004; Christopoulos & León‐Ledesma, 2007; Cuestas et al., 2011; Canarella et al., 2019).

In recent years, there has been a significant increase in the number of papers analysing these hypotheses for the unemployment rate (Akdoğan, 2017; Yaya et al., 2021); however, only a few have analysed it by gender (Passinhas & Proença, 2020) and education level (Faďoš & Bohdalová, 2017; Torgovitsky, 2019).Footnote 1 In the case of Passinhas and Proença (2020), the focus is only on the case of Portugal. Faďoš and Bohdalová (2017) focus on an aggregate level for the EU, Spain and Switzerland, \as Torgovitsky (2019) on a methodological contribution analyses the unemployment rate dynamics for the USA though limited to the 2011–2013 period. For this reason, in this paper, we aim to analyse the degree of unemployment persistence over different levels of schooling segregating by gender and focussing on a larger pool of countries from the EU.

The remainder of the paper is organised as follow: the second section briefly describes the main theories dealing with unemployment dynamics, and the third section focuses on the empirical results. The fourth section deals with the discussion and policy recommendations.

A Brief Description of the Empirical Testing of Unemployment Theories

As mentioned in the previous section, the NAIRU hypothesis implies that there is an unemployment rate of equilibrium at which the variable tends to revert. This implies that the Phillips Curve is vertical, and regardless of the inflation rate, the unemployment rate does not change. Despite this fact, we may observe transitory deviations from the equilibrium. Hence, from an econometric point of view, the unemployment rate should be an I(0) process. This is of course, provided that the fundamentals which define the equilibrium rate of unemployment do not change over time (Holmes et al., 2013). More recently, we find several contributions related to our matter; Ghoshray et al. (2016) analyse how structural changes affect the evolution of adult and youth unemployment rates, finding that only adult unemployment is more sensitive to structural breaks due to events, whereas youth one is more sensible to the business cycle. Cairó and Cajner (2018) find that the fact that more educated workers enjoy greater employment stability has to do with the on-the-job training and the formal education itself. Su et al. (2022) find that the COVID-19 has had a significant effect on unemployment for a number of European countries, i.e. France, Germany, Italy, Spain and UK. For this purpose, they use unit root tests with non-linearities. In a seminal contribution, Ahn and Hamilton (2020) establish that changes in the composition of new incomers into unemployment tend to be the most significant ones in economic recessions.

However, if the fundamentals of the equilibrium unemployment rate do change over time, this equilibrium rate may also be time-varying (Phelps, 1994; Layard et al., 2005; Cuestas & Harrison, 2014). In this case, the unemployment rate should be a stationary process around a time trend, which may be linear or non-linear, with structural breaks (Meng et al., 2017).

Testing for the NAIRU hypothesis empirically has traditionally relied on unit root or stationarity tests to label the variables as I(1) or I(0). However, fractional integration methods provide us with a more flexible framework since the parameter d may be a non-integer number (see, e.g. Caporale & Gil-Alana, 2008). Fractionally integrated (or I(d)) models are as follows:

$${\left(1-L\right)}^{d}{x}_{t}={u}_{t},\quad t=1,\dots .,T,$$
(1)

where ut is a covariance stationary I(0) process, whose spectral density function is positive and finite at the zero frequency, d can be any real number, and L is the lag operator. We can re-write the above equation by using the expansion:

$${\left(1-L\right)}^{d}{x}_{t}={x}_{t}-{dx}_{t-1}+\frac{d\left(d-1\right)}{2}{x}_{t-2}-\frac{d\left(d-1\right)\left(d-2\right)}{6}{x}_{t-3}+\dots$$
(2)

provided that

$${\left(1-L\right)}^{d}=\sum_{j=0}^{\infty }\left(\begin{array}{c}d\\ j\end{array}\right){\left(-1\right)}^{j}{L}^{j}.$$
(3)

Therefore, the closer the parameter d is to 1, the more persistent the process is, and the effect of shocks on the variable will last longer. If d ∈ (0, 0.5), the series is covariance stationary and mean reverting and therefore the NAIRU hypothesis holds. However, if d ∈ [0.5, 1), the series is no longer stationary but still mean reverting, and then we are dealing with the persistence hypothesis. Finally, when d ≥ 1, the series is non-stationary and non-mean reverting, and hence, the hysteresis hypothesis is met (Blanchard & Summers, 1987). In the latter, shocks affecting the unemployment rate will never disappear from the behaviour of the variable, and it will not revert to the equilibrium. In Table 1 we summarise these hypotheses.

Table 1 Order of integration of unemployment and hypothesis fulfilled

Empirical Analysis

The data for this paper are quarterly seasonally adjusted observations for unemployment rates by sex and educational attainment in a group of European countries from 2003:Q2 until 2019:Q4 for the 15 to 74 age group. We have omitted observations in 2020 to avoid the effect of the COVID-19 pandemic. The data was extracted from Eurostat on 11 March 2021. The code of the variable is une_educ_q. The data are displayed in the Appendix. The different levels of education attainment are as follows: less than primary, primary and lower secondary education (levels 0–2); upper secondary and post-secondary non-tertiary education (levels 3 and 4); and tertiary education (levels 5–8).

The estimated model is as follows:

$$y\left(t\right)=a+\beta t+x\left(t\right);\quad {\left(1-L\right)}^{d}x\left(t\right)=u\left(t\right).$$
(4)

where y(t) is the observed time series, α and β are unknown coefficients on the intercept and the linear time trend, and x(t) is assumed to be I(d), with d being another parameter to be estimated. The estimation is based on the Whittle function expressed in the frequency domain, and for this purpose, we use a simple version of the tests of Robinson (1994) widely used in empirical applications (see, e.g. Gil-Alana & Robinson, 1997). These tests have the advantage that they do not impose stationarity and thus do not require first differentiation prior to the estimation in case of non-stationary series. In addition, it is the most efficient method in the Pitman against local departures from the null and have an asymptotic standard N(0, 1) distribution.

Across the tables, we present the estimates of d and the 95% bands of the non-rejection values of d using Robinson (1994)’s method in Eq. (4) under three different modelling specifications for the deterministic terms. Thus, in the second column, we present the results supposing that α = β = 0 a priori, so no deterministic components are included in the model; in the third column, we report the estimated values of d under the assumption that α is unknown and β = 0 a priori, i.e. including an intercept in the regression model; finally, in the last column, we estimate d along with α and β also estimated from the data. We display in the table in bold the most adequate model for each series. This is based on the corresponding t values on the estimated coefficients in the d-differenced regressions, noting that the joint representation of the two equalities in Eq. (4) produces a new regression model where the errors are I(0) by assumption so standard t values (p values) on the estimated coefficients hold.

In Tables 2, 3 and 4, we show the results for the estimated order of integration for the total unemployment by education attainment. In Table 5, we report a summary of the three tables. From Tables 2, 3, 4 and 5, we find that there is evidence of mean reversion in four countries for the unemployment at low level (Austria, Belgium, Finland and Slovenia); a single country with medium level (Belgium) and another group of four countries with high level of education (Austria, Belgium, Croatia and Denmark). In these cases, the differencing parameter is statistically significantly lower than 1, supporting thus transitory shocks and reversion to the mean. In all the other cases, the estimates of the differencing parameter are equal to or higher than 1 at the 5% level, implying lack of mean reversion. Thus, only for Belgium do we find evidence of mean reversion across the different degrees of schooling. A decreasing trend in Germany is also found for the three groups of education level. This deterministic trend is most likely related to technological progress and policy measures to make the labour markets more flexible. Surprisingly, a positive trend is found for Sweden for low levels of education but not for medium and high education levels.

Table 2 Estimates of the differencing parameter: total unemployment (levels 0–2)
Table 3 Estimates of the differencing parameter: total unemployment (levels 3–4)
Table 4 Estimates of the differencing parameter: total unemployment (levels 5–8)
Table 5 Degrees of persistence based on education level: total unemployment

In general, we observe higher degrees of integration at the medium education level compared with low and high levels. In fact, the highest values of the differencing parameter d are observed at medium education level in all countries except for Spain and Portugal. For these two countries, the highest order of integration takes place at low education level followed by medium and then high level of education. For Belgium, the mean reversion (d < 1) occurs at all education levels. Thus, for this country, shocks in unemployment rates are expected to be transitory, not requiring strong measures to recover the original long term projections. Though it is not in all cases, in many of them, the lowest are observed at the high education level (14 out of 22 countries).

In Tables 6, 7 and 8, we display the estimated orders of integration for the case of male unemployment for the different levels of education attainment. Table 9 contains a summary. Hence, from these tables, we find that, as with total unemployment series in general, we observe higher degrees of integration at the medium education level compared with low and high levels. More evidence of mean reversion for high levels of education and in general the lowest d occurring for this group of individuals. Thus, for 16 countries, the lowest d are found for high education level, while for 6, it is in the low level of education. It is again found that Germany displays a significant negative trend for low and high educational levels, as well as Poland, and Sweden has a positive trend.

Table 6 Estimates of the differencing parameter: male unemployment (levels 0–2)
Table 7 Estimates of the differencing parameter: male unemployment (levels 3–4)
Table 8 Estimates of the differencing parameter: male unemployment (levels 5–8)
Table 9 Degrees of persistence based on education level: male unemployment

In Tables 10, 11 and 12, we present the results for the case of female unemployment along with a summary of the results in Table 13.

Table 10 Estimates of the differencing parameter: female unemployment (levels 0–2)
Table 11 Estimates of the differencing parameter: female unemployment (levels 3–4)
Table 12 Estimates of the differencing parameter: female unemployment (levels 5–8)
Table 13 Degrees of persistence based on education level: female unemployment

From these tables, we find that at low levels of education, there are significant positive trends in Finland, France and Sweden, while negative ones for Germany, Poland and Slovakia. At the medium level, the five significant trends are negative (Belgium, Bulgaria, Croatia, Germany and Romania). For high levels of education, only two negative trends are detected: Belgium and Germany. In all the other cases, the time trend coefficients are found to be statistically insignificantly different from zero. As with the total unemployment series, in general, we observe higher degrees of integration at the medium education level compared with low and high levels. For 14 countries, the highest value of d is found for the medium level of education. For six countries (Bulgaria, Cyprus, Finland, Greece, Romania and Slovenia), the highest d occurs at high level of education, while for Croatia and Spain, this happens at the low level. Also, more evidence of mean reversion (d < 1) and thus transitory shocks is found in high levels of education (Table 14).

Table 14 Summary results: Averaged values

Conclusion

In this paper, we have examined the unemployment rate series corresponding to a group of 22 European countries and for the time period from 2003Q2 to 2019Q4. Using fractional integration methods, we have tried to determine if shocks in the series of unemployment disaggregated by sex and education have transitory or permanent effects.

Our results indicate high levels of persistence in all cases, with values close to 1 in the majority of the series. In general, higher degrees of integration are observed in unemployed males than in females, and looking at the level of schooling, the highest degree of persistence is obtained at the medium education level. In addition, it is also observed that females display lower degrees of integration than males; however, for females, a lower value for d is obtained at the low education level. These results indicate that in the event of shocks affecting unemployment rates, long-lasting effects are observed in all cases, as well as higher degrees of dependence in males and at medium education levels.

This article can be extended in various directions. Thus, for example, structural breaks can be taken into account. There are exit procedures for testing I(d) models in the context of known and unknown breaks (Gil-Alana, 2004, 2008; Sibbersten, 2004; Choi et al., 2010; etc.); however, structural breaks produce abrupt changes in the data, and they can be avoided by using non-linear deterministic trends like those proposed in Cuestas and Gil-Alana (2016) (and based on Chebyshev polynomials in time) or using Fourier functions in time (Gil-Alana & Yaya, 2021) or neural networks (Yaya et al., 2021). Works in these directions are now in progress.