Abstract
Cointegrating polynomial regressions (CPRs), i.e., regressions that include deterministic terms, integrated processes and powers of integrated processes as explanatory variables and stationary errors, have become prominent in several fields of applications, e.g., in the analysis of environmental Kuznets curves. A key issue, as always in cointegration analysis, is testing for the presence or absence of a cointegrating relationship. This paper discusses two complementary tests: one with the null hypothesis of cointegration and one with the null hypothesis of the absence of cointegration. It is shown that (inter alia) for the empirically most relevant case, in which only one of the integrated regressors occurs as regressor also with higher powers, critical values can be simulated and are provided for a variety of specifications. Finally, the usage of the tests is illustrated for the environmental Kuznets curve for carbon and sulfur dioxide emissions. The illustration also investigates the sensitivity of the test decisions with respect to kernel and bandwidth choices, sample size and data vintage.
Similar content being viewed by others
1 Introduction
Recent years have seen growing theoretical and empirical interest in estimating and testing for nonlinear cointegrating relationships. A very voluminous body of applied literature in this respect has been and continues to be generated within the environmental economics literature. The so-called environmental Kuznets curve (EKC) hypothesis, which postulates an inverted U-shaped relationship between the level of economic development, typically measured by (the logarithm of) GDP per capita, and (the logarithm of) pollution or emissions per capita is one of the most widely studied empirical relationships in environmental economics. Early survey papers like Stern (2004) and Yandle et al. (2004) already count more than one-hundred refereed publications. The term EKC refers by analogy to the inverted U-shaped relationship between the level of economic development and the degree of income inequality postulated by Kuznets (1955) in his 1954 presidential address to the American Economic Association. Estimation of an EKC entails, in its simplest parametric form, a regression of emissions on GDP and its square to allow for a U or inverted U shape. Given that the logarithm of GDP per capita is often found to be unit root non-stationary, this is thus a regression where as regressors a unit root process and its square are present.Footnote 1 In doing so, the empirical EKC literature in large parts fails to acknowledge the fact that if the logarithm of GDP per capita is a unit root process, the square of the logarithm of GDP per capita cannot be a unit root process and continues to use “standard” unit root and cointegration techniques. This means that a large part of the empirical EKC literature (see, e.g., Friedl and Getzner 2003, Perman and Stern 2003 or Galeotti et al. 2009 from a large and still growing list of such contributions) simply treats GDP and its square as two integrated processes and uses standard unit root and cointegration test and estimation methods.Footnote 2 This is, notwithstanding its popularity, a problematic practice since tailor-made methods designed for regressions involving integrated processes and nonlinear transformations thereof are available.Footnote 3
Wagner and Hong (2016) develop estimation and inference theory for the special case of nonlinear cointegrating relationships including deterministic components as well as unit root processes and their powers as explanatory variables. They refer to such relationships as cointegrating polynomial regressions (CPRs). More precisely, Wagner and Hong (2016) extend the FM-OLS estimator of Phillips and Hansen (1990) from cointegrating linear to cointegrating polynomial regressions.Footnote 4 In addition to hypothesis testing with respect to the estimated parameters, they furthermore also investigate specification tests based on augmented or auxiliary regressions in detail. One important aspect that is not treated exhaustively in that paper is testing for cointegration and for the absence of cointegration, respectively. To be precise, only testing the null hypothesis of cointegration is considered in Wagner and Hong (2016, Propositions 5 and 6), whereas testing the null hypothesis of no cointegration is not considered. Furthermore, Wagner and Hong (2016) do not provide critical values. The present paper considers tests for both null hypotheses in detail and also provides critical values. We (re-)consider, with this test already discussed in Wagner and Hong (2016), in detail an extension of the Shin (1994) cointegration test, itself an extension of the KPSS stationarity test of Kwiatkowski et al. (1992), from cointegrating linear to cointegrating polynomial regressions. Furthermore, we consider an extension of a variance ratio test for the absence of cointegration of Phillips and Ouliaris (1990). The null hypothesis of the latter test is that the relationship is not cointegrating but spurious. It turns out that in the general case of several integrated regressors entering the CPR with powers higher than one, neither of the two tests has a nuisance-parameter-free limiting distribution that could be tabulated. This has already been observed for the KPSS-type test in Wagner and Hong (2016). Consequently, for the Shin- or KPSS-type test Wagner and Hong (2016) and, based on D-OLS estimation rather than FM-OLS estimation, Choi and Saikkonen (2010) consider sub-sampling versions of the KPSS-type test statistic that results in nuisance-parameter-free limiting null distributions, at the expense of a drastically reduced sample size due to sub-sampling. For the relatively small samples available for typical macroeconomic applications, a sub-sampling based test is potentially only of limited practical value. For the extension of the test of Phillips and Ouliaris (1990), this sub-sampling approach does not lead to asymptotically nuisance-parameter-free limiting distribution.
However, using the terminology of Vogelsang and Wagner (2016), in case of a full design CPR relationship it can be shown that the limiting distributions of both tests can be tabulated. Full design refers to a situation in which the limiting distribution of the FM-OLS estimator—and a fortiori the limiting partial sum processes of the residuals—can be expressed as a functional of standard Brownian motions rather than of non-standard Brownian motions. The empirically most relevant case, with only one of the integrated processes entering the CPR with powers higher than one is straightforwardly seen to be of full design. “Appendix B” of the paper provides tables with critical values for this case, for up to four integrated regressors, up to power four of the one integrated regressor entering with higher powers for the usual specifications of the deterministic component (no deterministic component, intercept only, intercept and linear trend). Thus, the results of the paper allow to perform non-/cointegration testing in typical applications like EKCs, MKCs or exchange rate target-zone relationships.Footnote 5
The tests discussed are illustrated with annual carbon dioxide (\(\hbox {CO}_2\)) and sulfur dioxide (\(\hbox {SO}_2\)) emissions data for 18 early industrialized countries over the period 1870–2016, with New Zealand data starting only in 1878. For \(\hbox {CO}_2\) emissions an EKC relationship is found for six of the 18 countries, Austria, Belgium, Finland, Germany, Switzerland and the UK. For all countries but Germany a quadratic specification suffices, whereas for Germany a cubic specification is required. For \(\hbox {SO}_2\) emissions an EKC relationship is found for none of the countries. In this respect note that the logarithm of GDP per capita and its powers are the only regressors included in the simple “reduced-form” equation and the absence of an EKC for \(\hbox {SO}_2\) emissions could partly be driven by important explanatory variables and legislative changes not entirely captured by GDP. A more structural investigation of the EKC relationship is beyond the scope of this paper that is merely devoted to discussing the (non-)cointegration tests rather than to a fully fledged empirical analysis. A second observation that emerges from the illustration is that the usage of appropriate (non-)cointegration tests, designed for testing in a CPR context, reduces the evidence for a cointegrating EKC relationship compared to using (non-)cointegration tests designed for linear cointegrating relationships. By means of robustness checks we also illustrate that the results obtained with the two nonparametric tests are quite insensitive to kernel and bandwidth choices, in particular with respect to the evidence for the prevalence of an EKC. Changing the sample size and, in particular, using different data vintages lead to more variation in the test results.
The paper is organized as follows: The following section discusses and presents the tests, Sect. 3 briefly illustrates the tests with \(\hbox {CO}_2\) and \(\hbox {SO}_2\) emissions data, and Sect. 4 briefly summarizes and concludes. Two appendices follow the main text: “Appendix A” contains the proofs, and “Appendix B” contains the tables with the critical values. “Supplementary Appendix C” contains the detailed test results for the robustness checks.
2 The tests
Following Wagner and Hong (2016), we consider a cointegrating polynomial regression (CPR), i.e., an equation including a constant and polynomial time trends up to power q, integer powers of integrated regressors \(x_{jt}, j=1,\ldots ,m\) up to degrees \(p_j\) and a stationary error term \(u_t\):
with \(D_t := [1,t,t^2,\ldots ,t^q]'\), \(X_{jt} := [x_{jt},x_{jt}^2,\ldots ,x_{jt}^{p_j}]'\) and the parameter vectors \(\theta _D \in {{\mathbb {R}}}^{q+1}\) and \(\theta _{X_j} \in {{\mathbb {R}}}^{p_j}\).
The following assumption on the regressors and the errors is put in place:
Assumption 1
The process \(\left\{ \eta _t\right\} _{t \in {{\mathbb {Z}}}} := \left\{ [u_t,v_t']'\right\} _{t \in {{\mathbb {Z}}}}\) with \(\left\{ v_t\right\} _{t \in {{\mathbb {Z}}}} :=\{\Delta x_t\}_{t \in {{\mathbb {Z}}}}\), with \(x_t := [x_{1t},\ldots ,x_{mt}]'\), is generated as:
with \(\sum _{j=0}^\infty j||C_j|| < \infty \) and \(\det (C(1)) \ne 0\). Furthermore, we assume that the process \(\left\{ \eta _t^0\right\} _{t \in {{\mathbb {Z}}}}\) is a stationary and ergodic martingale difference sequence with natural filtration \(\mathcal {F}_t=\sigma \left( \left\{ \eta _s^0\right\} _{-\infty }^t\right) \), positive definite covariance matrix \(\Sigma _{\eta ^0\eta ^0}\) and \(\sup _{t\ge 1}{{\mathbb {E}}}(\Vert \eta ^0_t\Vert ^r|\mathcal {F}_{t-1})<\infty \;a.s.\) for some \(r>4\).
Assumption 1 is analogous to the corresponding assumption in Wagner and Hong (2016). Quite similar assumptions have been used in several places in the literature, e.g., Chang et al. (2001), Park and Phillips (1999, 2001) and Hong and Phillips (2010).Footnote 6 The key result that this type of assumption is required for is an invariance principle for terms of the form \(T^{-\frac{k+1}{2}}\sum _{t=1}^T x_{jt}^k u_t\). Alternative sets of assumptions that could be used instead are formulated in Ibragimov and Phillips (2008) in a martingale framework or in de Jong (2002, Assumptions 1 and 2) who combines near-epoch dependence and appropriate moment assumptions. For the present paper any such set of assumptions leading to the required invariance principle can be put in place. The assumption \(\text{ det }(C(1)) \ne 0\) together with positive definiteness of \(\Sigma _{\eta ^0\eta ^0}\) implies that \(\{x_t\}\) is an integrated but not cointegrated process. Assumption 1 implies an invariance principle for \(\left\{ \eta _t\right\} _{t \in {{\mathbb {Z}}}}\):
with \(B_v(r) := [B_{v_1}(r),\ldots ,B_{v_m}(r)]'\). It holds that \(B(r) = \Omega ^{1/2}W(r)\) with the long-run covariance matrix \(\Omega := \sum _{h=-\infty }^\infty {{\mathbb {E}}}\left( \eta ^0_0 \eta ^{0\prime }_h\right) \) and where W(r) is an \(m+1\)-dimensional vector of standard Brownian motions. For later usage define the one-sided long-run covariance \(\Delta := \sum _{h=0}^\infty {{\mathbb {E}}}\left( \eta ^0_0 \eta ^{0\prime }_h\right) \) and partition both matrices according to the partitioning of \(\eta _t\):
Similarly to the linear case (compare Phillips and Hansen 1990), the OLS estimator of the parameter vector in (1) is consistent but its limiting distribution is contaminated by second-order bias terms, the presence of which renders standard asymptotic inference based on the OLS estimator invalid. Consequently, Wagner and Hong (2016, Proposition 1) extend the FM-OLS estimation principle from the cointegrating linear to the cointegrating polynomial case. The FM-OLS estimator is based on two modifications of the OLS estimator. First, the dependent variable \(y_t\) is replaced by \(y_t^+ := y_t - \Delta x_t'\hat{\Omega }_{vv}^{-1}\hat{\Omega }_{vu}\), \(y^+ :=[y_1^+,\ldots ,y_T^+]'\) and, second, a correction term is subtracted. The modified dependent variable \(y_t^+\) is exactly as in Phillips and Hansen (1990), but the correction terms are different in the CPR case and given by:
Throughout, we rely upon consistent estimators of the required long-run variances \(\hat{\Omega }_{vv}\), \(\hat{\Omega }_{vu}\), \(\hat{\Delta }_{v_ju}\) and \(\hat{\Delta }_{v_ju}^+ := \hat{\Delta }_{v_ju} - \hat{\Omega }_{uv}\hat{\Omega }_{vv}^{-1}\hat{\Delta }_{vv_j}\). In this respect, OLS consistency is important since it allows for consistent long-run variance estimation based on the OLS residuals (it is straightforward to verify that the necessary assumptions of Jansson 2002, Corollary 3 are fulfilled).
To (re-)state the asymptotic distribution of the FM-OLS estimator developed in Wagner and Hong 2016, Proposition 1), we need to define a few more quantities: \(D(r) := [1,\ldots ,r^q]'\), \(\textbf{B}_{v_j}(r) := [B_{v_j}(r),\ldots ,B_{v_j}^{p_j}(r)]'\), \(\textbf{B}_v(r) := [\textbf{B}_{v_1}(r)',\ldots ,\textbf{B}_{v_m}(r)']'\), \(D := [D_1,\ldots ,D_T]'\), \(X_t := [X_{1t}',\ldots ,X_{mt}']'\), \(X := [X_1,\ldots ,X_T]'\), \(Z := [D, \; X]\), \(G_D := \text{ diag }(T^{-{1/2}},\ldots ,T^{-(q+1/2)})\), \(G_{X_j} := \text{ diag }(T^{-1},\ldots ,T^{-\frac{p_j+1}{2}})\), \(G_X := \text{ diag }(G_{X_1},\ldots ,G_{X_m})\), \(G := \text{ diag }(G_D,G_X)\), and \(\theta := [\theta _D',\theta _X']'\) with \(\theta _X := [\theta _{X_1}',\ldots ,\theta _{X_m}']'\).
Proposition 1
Let \(\{y_t\}\) be generated by (1) with the regressors \(\{x_t\}\) and errors \(\{u_t\}\) satisfying Assumption 1. Define the FM-OLS estimator of \(\theta \) as:
with:
and \(M^*\) as given in (3) with consistent estimators of the required long-run (co)variances. Then, \(\hat{\theta }^+\) is consistent and has a zero mean Gaussian mixture asymptotic distribution given by:
with \(J(r) := \left[ D(r)', \;\; \textbf{B}_v(r)'\right] '\) and \(B_{u \cdot v}(r) := B_u(r) - B_v(r)^\prime \Omega _{vv}^{-1}\Omega _{vu}\).
The limiting distribution given in (4) provides the basis for asymptotic standard, i.e., standard normal or chi-squared, inference on the coefficients in case the error term \(\{u_t\}\) is stationary, i.e., in case of prevalence of a CPR relationship. Thus, testing for the presence of a CPR relationship is of prime importance and as always in cointegration testing two null hypotheses are conceivable: the null hypothesis of cointegration, i.e., the null hypothesis of stationarity of \(\{u_t\}\), or the null hypothesis of no cointegration, i.e., testing whether (1) is in fact a spurious regression. The first null hypothesis can be tested by an extension of the Shin (1994) test, which itself is an extension of the stationarity test of Kwiatkowski et al. (1992), from cointegrating linear to cointegrating polynomial regressions. Since, of course, the errors \(u_t\) are not observed, the test statistic has to be based on observable residuals, in particular the FM-OLS residuals, \({\hat{u}}_t^+\) say, can be used. The KPSS-type test statistic for the null hypothesis of cointegration is defined as:
with \(\hat{\Omega }_{u \cdot v}\) a consistent estimator of the long-run variance \(\Omega _{u \cdot v} :=\Omega _{uu} -\Omega _{uv}\Omega _{vv}^{-1}\Omega _{vu}\) of \(\{{\hat{u}}_t^+\}\).
For testing the null hypothesis of no cointegration, we consider an extension of a variance ratio test statistic of Phillips and Ouliaris (1990), their \({\hat{P}}_u\) statistic, and we continue to use this name for the extension. Denoting the OLS residuals of (1) as \({\hat{u}}_t\), the denominator of the test statistic is the properly scaled estimated “variance” of \(\{{\hat{u}}_t\}\), which is an integrated process under the null hypothesis. Under the null hypothesis of a spurious regression, with \(\{y_t\}\) being an I(1) process, the stacked vector \(\{m_t\} := \{[y_t,x_t']'\}\) is a non-cointegrated I(1) vector and for later use we denote its first difference as \(\{\Delta m_t\} := \{[w_t,v_t']'\}\). The second element of \({\hat{P}}_u\) is a long-run variance estimate based on the OLS residuals of a vector autoregression (VAR) of order one for \(\{m_t\}\), taking into account the deterministic components \(D_t\) considered:
The required conditional long-run variance is computed using VAR(1) OLS residuals from (6) rather than from the vector of first differences \(\Delta m_t\). Both versions lead to the same asymptotic behavior of the test statistic under the null hypothesis, but the asymptotic behavior differs under the alternative, compare Phillips and Ouliaris (1990, Theorems 5.2 and 5.3). Denoting the long-run variance estimated from \(\{\hat{\xi }_t\}\) as \(\tilde{\Omega }= \left[ \begin{array}{cc} \tilde{\Omega }_{ww} &{} \tilde{\Omega }_{wv} \\ \tilde{\Omega }_{vw} &{} \tilde{\Omega }_{vv} \end{array}\right] \), the estimated conditional long-run variance corresponding to the first component (the conditional long-run variance of \(\{w_t\}\) given \(\{v_t\}\)) is given by \(\tilde{\Omega }_{w\cdot v} := \tilde{\Omega }_{ww} - \tilde{\Omega }_{w v}\tilde{\Omega }_{vv}^{-1}\tilde{\Omega }_{vw}\).
The test statistic for the null hypothesis of no cointegration is defined as:
The following proposition characterizes the asymptotic behavior of the two test statistics, with the CT test already discussed in Wagner and Hong (2016, Proposition 5).
Proposition 2
-
(i)
Let \(\{y_t\}\) be generated by (1) with the regressors \(\{x_t\}\) and errors \(\{u_t\}\) satisfying Assumption 1 and let \(\hat{\Omega }_{u \cdot v}\) be a consistent estimator of \(\Omega _{u \cdot v}\), then the asymptotic distribution of the test statistic (5) defined above is:
$$\begin{aligned} CT\Rightarrow & {} \int _0^1 (W_{u \cdot v}^J(r))^2dr, \end{aligned}$$(8)with:
$$\begin{aligned} W_{u \cdot v}^J(r) := W_{u \cdot v}(r) - \int _0^r J(s)^{\prime }ds\left( \int _0^1 J(s)J(s)^{\prime }ds\right) ^{-1}\int _0^1 J(s)dW_{u \cdot v}(s). \end{aligned}$$(9) -
(ii)
Let (1) be a spurious regression, i.e., let \(\{y_t\}\) be an integrated process not related to \(\{X_t\}\) in a cointegrating polynomial relationship, then the asymptotic distribution of the test statistic (7) defined above is:
$$\begin{aligned} {\hat{P}}_u\Rightarrow & {} \frac{\Omega _{w\cdot v}}{\tau '\int _0^1 J^*(r) J^*(r)'dr \tau }, \end{aligned}$$(10)with \(\tau := \left( \begin{array}{c} 1 \\ - (\int _0^1 J(r)J(r)'dr)^{-1}\int _0^1 J(r)B_w(r)dr \end{array}\right) \), \(J^*(r) := [B_w(r), J(r)']'\) and \(\tilde{\Omega }_{w\cdot v}\) as defined above.
The asymptotic distributions of the CT and \({\hat{P}}_u\) test statistics cannot—in general—be tabulated due to their nuisance parameter dependency related to the correlation structure between the variables. Wagner and Hong (2016, Proposition 6), following Choi and Saikkonen (2010), consider a sub-sample version of the CT test statistic that has a nuisance-parameter-free limiting distribution. They show, in particular, that the limiting distribution of the sub-sample CT test statistics is \(\int _0^1 W_{u\cdot v}(r)^2dr\), if the sub-samples of size b are chosen such that \(b \rightarrow \infty \) and \(b/T \rightarrow 0\), i.e., sub-sampling achieves that the second term in (9) vanishes asymptotically. Wagner and Hong (2016), in addition, discuss how the set of sub-sample test statistics can be used in conjunction with modified Bonferroni bounds to improve the performance of the sub-sample-based test. An inspection of the proof of Proposition 1 reveals that sub-sampling does not lead to similar simplifications for the \({\hat{P}}_u\) test, with this “unfortunate difference” caused by the too slow convergence rate of \(\hat{\tau }\) in the spurious regression case. Thus, for the general case only the CT test, when used in a sub-sampling fashion, is available.Footnote 7
The problem with the nuisance parameter dependency of the limiting distributions of the test statistics originates in J(r) as defined in Proposition 1. In case it is possible to write:
with a regular matrix \(\Pi \) and \(\textbf{W}_v(r)\) a functional of standard Brownian motions one can rewrite both test statistics as functions of standard Brownian motions and a scalar (long-run) variance that can be scaled out. If a transformation as just described exists, the CPR is said to exhibit full design, a terminology coined by Vogelsang and Wagner (2016). CPRs with only one of the integrated processes present as regressor also with higher powers are straightforwardly seen to be of full design, other cases include, e.g., Translog-type relationships (see the discussion in Stypka and Wagner 2020). For brevity, we provide the corresponding result below for the one integrated regressor with higher power only case, which—as mentioned before—allows to test in the empirically most relevant case covering EKCs, MKCs, intensity-of-use and exchange rate target-zone relationships. As before in Proposition 2, the result for the CT test has already been discussed in Wagner and Hong (2016).
Proposition 3
Consider the special case of (1) when \(p_1 = \cdots = p_{m-1} = 1\) and let otherwise the assumptions of Proposition 2 be fulfilled.
-
(i)
Under the null hypothesis of cointegration in (1) the limiting distribution of the CT statistic is given by:
$$\begin{aligned} CT \Rightarrow \int _0^1 (W_{u \cdot v}^{J^W}(r))^2dr, \end{aligned}$$(12)with:
$$\begin{aligned} W_{u \cdot v}^{J^W}(r) := W_{u \cdot v}(r) - \int _0^r J^{W}(s)^{\prime }ds\left( \int _0^1 J^W(s)J^{W}(s)^{\prime }ds\right) ^{-1}\int _0^1 J^W(s)dW_{u \cdot v}(s)\nonumber \\ \end{aligned}$$(13)and \(J^W(r) := [D(r)',\textbf{W}(r)',W_m(r)^2,\ldots ,W_m(r)^{p_m}]'\), where \(\textbf{W}(r) := [{W}_1(r)',\ldots ,{W}_m(r)']'\) is a vector of standard Brownian motions independent of \(W_{u \cdot v}(r)\).
-
(ii)
Under the null hypothesis of (1) being a spurious regression, the limiting distribution of the \({\hat{P}}_u\) statistic is given by:
$$\begin{aligned} {\hat{P}}_u\Rightarrow & {} \frac{1}{\int _0^1 W_{w\cdot v}^2(r)dr - \int _0^1 W_{w\cdot v}(r)J^W(r)'dr\left( \int _0^1 J^W(r)J^W(r)'dr\right) ^{-1}\int _0^1J^W(r)W_{w\cdot v}(r)dr}, \nonumber \\ \end{aligned}$$(14)with \(W_{w \cdot v}(r)\) independent of \(\textbf{W}(r)\).
The limiting distributions (12) and (14) can be tabulated because they are functionals of standard Brownian motions. The corresponding critical values depend upon the deterministic component, the number of integrated regressors and the included powers of the single integrated regressor that enters the CPR with higher powers. Critical values for the CT statistic are given in Table 6 and for the \({\hat{P}}_u\) statistic in Table 7 in “Appendix B.” These tables extend the corresponding tables provided in Shin (1994) and Phillips and Ouliaris (1990) for linear cointegrating relationships to the CPR case. The tables contain critical values for up to four integrated regressors, up to power four of the integrated regressor entering with powers for the usual deterministic components, i.e., no deterministic component, an intercept only and intercept and linear trend.Footnote 8
3 An illustration with the environmental Kuznets curve
We illustrate the tests discussed with annual data for real GDP per capita, carbon dioxide (\(\hbox {CO}_2\)) in metric tons per capita and sulfur dioxide (\(\hbox {SO}_2\)) emissions in kilograms per capita for 18 early industrialized countries, listed in Table 1, over the period 1870–2016; with the exception of New Zealand where the sample range is 1878–2016. Real GDP is measured in 2011 US dollars and calculated from the GDP and population data in the 2018 version of the Maddison Project Database (see Bolt et al. 2018). The \(\hbox {CO}_2\) per capita emissions are calculated from total \(\hbox {CO}_2\) emissions from fossil fuel usage, downloaded from the webpage of the Carbon Dioxide Information Analysis Center (CDIAC) at the Appalachian State University (see Boden et al. 2018). The \(\hbox {SO}_2\) per capita emissions are calculated from combining total \(\hbox {SO}_2\) emissions taken from the NASA Socioeconomic Data and Applications Center (see Smith et al. 2011), which provides data for 1870–2005, with OECD (2020) data for the period 2006–2016.Footnote 9
We use the data described in logarithmic specifications and test for cointegration and non-cointegration in both the quadratic and the cubic EKC relationship including an intercept and a linear trend:
with \(e_t\) denoting the logarithm of emissions (\(\hbox {CO}_2\) or \(\hbox {SO}_2\)) per capita and \(y_t\) the logarithm of GDP per capita.
Performing the usual battery of unit root tests on the logarithm of GDP per capita does not lead to a rejection of the null hypothesis for any of the 18 countries. Thus, a “necessary condition” for embarking on CPR analysis, that of an integrated regressor, appears to be fulfilled. The CT test is performed using the FM-OLS residuals (i.e., is based on using the estimator of Wagner and Hong 2016) with the Bartlett kernel and the Newey and West (1994) bandwidth rule.Footnote 10
Tables 2 and 3 contain the test results for \(\hbox {CO}_2\) and \(\hbox {SO}_2\) emissions, respectively. We use the following (quite obvious) “decision rule”: If CT does not lead to a rejection, but \({\hat{P}}_u\) does, this is evidence for the presence of a cointegrating polynomial EKC relationship. If CT does lead to a rejection, but \({\hat{P}}_u\) does not, this is evidence against the presence of an EKC. If either both or none of the tests lead to a rejection, this is regarded as conflicting evidence.Footnote 11 Given this decision rule, let us now turn to the results, first for \(\hbox {CO}_2\) emissions given in Table 2. For six of the 18 countries, Austria, Belgium, Finland, Germany, Switzerland, and the UK we find evidence for an EKC relationship. For Germany, only for the cubic specification, for the other five countries for both the quadratic and the nesting cubic specification (which is, of course, a necessary theoretical implication of the prevalence of a quadratic CPR). The test decisions are also consistent between the cubic and quadratic specifications in the sense that evidence against the presence of a cubic EKC for a certain country comes along with evidence also against a quadratic EKC for that country. The one exception here is Sweden for which there is evidence against an EKC for the cubic specification and “only” conflicting evidence for the quadratic specification. Also note that conflicting evidence for the cubic specification occurs typically in conjunction with evidence against a quadratic EKC. The results for \(\hbox {SO}_2\) emissions given in Table 3 are quite clear: There is no evidence for a quadratic or cubic EKC for any of the 18 countries considered. For 16 of the 18 countries, however, the evidence is conflicting for the cubic specification.
The limited evidence for the prevalence of an EKC is akin to an observation made earlier in Wagner (2015), i.e., that the usage of adequate (non-)cointegration tests reduces the evidence for an EKC compared to using (non-)cointegration tests (critical values) developed for linear cointegrating relationships. This is exemplified here by including also the asymptotically invalid test results obtained by applying the residual-based augmented Dickey–Fuller non-cointegration t-type test of Phillips and Ouliaris (1990) developed for cointegrating linear relationships, labeled \(PO_t\). Using this test, for \(\hbox {CO}_2\) emissions the null hypothesis of non-cointegration is rejected for twelve of the 18 countries at the 5% significance level compared to only five rejections obtained with the asymptotically valid \({\hat{P}}_u\) test. For \(\hbox {SO}_2\) emissions, the differences between the \({\hat{P}}_u\) and \(PO_t\) results are less pronounced, but point in the same direction. These findings illustrate the importance of using (non-)cointegration tests—or more precisely appropriate critical values—constructed for the CPR setting. For the CT test, this issue has already been illustrated in Wagner (2015) and is discussed more thoroughly from a theoretical perspective in Stypka et al. (2019).
3.1 Robustness checks
Both tests, being nonparametric, depend upon kernel and bandwidth choices. Consequently, the results might be sensitive to these choices. To gauge the extent of this potential sensitivity, we consider four combinations of bandwidths and kernels. The Bartlett kernel and the Newey and West (1994) bandwidth (the baseline, used above), labeled I, the Bartlett kernel and the Andrews (1991) bandwidth, labeled II, and the Quadratic Spectral kernel with these two bandwidth rules, labeled III and IV in Tables 4 for \(\hbox {CO}_2\) emissions and 5 for \(\hbox {SO}_2\) emissions. It turns out that the results are in fact highly robust with respect to these choices, most importantly the evidence for the prevalence of an EKC for \(\hbox {CO}_2\) emissions. For the five countries with evidence for a quadratic (and cubic) EKC, this evidence is present for all four combinations of kernel and bandwidth. For Germany, with evidence for the cubic specification only and mixed evidence for the quadratic specification in the baseline combination I, the evidence is scattered a bit between “y” and “o” across kernel and bandwidth choices. Looking at columns I to IV in Table 4 indicates some limited variation between “n” and “o” across combinations for some of the countries. For \(\hbox {SO}_2\) emissions, there is—similarly to \(\hbox {CO}_2\) emissions—some variability for some countries between “n“ and “o” decisions. However, for Switzerland using the Andrews (1991) bandwidth rule leads to evidence for an EKC in the cubic specification, whereas the evidence is conflicting when using the Newey and West (1994) bandwidth rule. The differences in conclusions are driven by \({\hat{P}}_u\) rejecting with the Andrews (1991) bandwidth. Altogether, however, the test results are quite insensitive—most importantly with respect to evidence for the prevalence of an EKC—to kernel and bandwidth choices.
Columns \(\hbox {N}_{\tiny {0}}\) and \(\hbox {O}_{\tiny {0}}\) of Tables 4 and 5 shed light on two other empirical dimensions of robustness of the test results: \(\hbox {N}_{\tiny {0}}\) with respect to the sample size, using the data used so far but only until 2000 and \(\hbox {O}_{\tiny {0}}\) using the same data as used in Wagner (2015). Sample size and data vintage robustness do not directly relate to robustness of the tests, but it may still be informative to see their impact on the test results for our EKC illustration. The results in columns \(\hbox {N}_{\tiny {0}}\) and \(\hbox {O}_{\tiny {0}}\) are based on using the Bartlett kernel and the Newey and West (1994) bandwidth. Comparing columns \(\hbox {N}_{\tiny {0}}\) with columns I leads to only one difference, for \(\hbox {SO}_2\) emissions of Canada in the quadratic specification. Using the full data range the evidence points against the prevalence of an EKC, whereas the reduced sample leads to conflicting evidence. Thus, it appears that the additional 16 years of data help to sharpen inference. We close the robustness check by comparing the results for the sample period until 2000 with the results obtained using the data of Wagner (2015) that also range until 2000, i.e., we compare the results in columns \(\hbox {N}_{\tiny {0}}\) and \(\hbox {O}_{\tiny {0}}\). For \(\hbox {CO}_2\) emissions, using the new data vintage leads to evidence for the prevalence of an EKC for three more countries than for the old data vintage, for Switzerland and the UK for both specifications and for Germany for the cubic specification. Additionally, the evidence for four countries is changed from negative to conflicting, with these changes occurring primarily in the cubic specification. For \(\hbox {SO}_2\) emissions the evidence changes from negative to conflicting for a number of countries in both the quadratic and the cubic specification. Additionally, the evidence changes for the UK from positive to conflicting for the cubic specification.Footnote 12 Whether these changes relate to some form of sensitivity of the tests to data with unchanged CPR characteristics or because the data revision truly changes the nature of the data is, of course, not clear. What we can, however, take home from our robustness checks is that the tests are not very sensitive to kernel and bandwidth choices.
4 Summary and conclusions
This paper discusses two nonparametric tests: one for the null hypothesis of a cointegrating polynomial regression and one for the null hypothesis of the absence of cointegration. Both tests are extensions from tests developed for cointegrating linear regressions to the cointegrating polynomial regression setting. Specifically, they are extensions of the Shin (1994) test and of a variance ratio test of Phillips and Ouliaris (1990).Footnote 13 The word extension hereby refers to deriving the null limiting distributions of the test statistics in the CPR setting. It turns out that, in the general case, neither of the two tests has a nuisance-parameter-free limiting distribution. However, in case of full design of the regression model, both tests’ limiting distributions are nuisance parameter free and can hence be tabulated. The empirically most relevant case with only one of the integrated regressors appearing also with higher powers is easily seen to be of full design. For this case, “Appendix B” provides tables with critical values for up to four integrated regressors, up to power four of the single integrated regressor entering with higher powers and the usual deterministic specifications.
The tests are briefly illustrated—which is by no means a fully fledged EKC analysis—for the environmental Kuznets curve for both \(\hbox {CO}_2\) and \(\hbox {SO}_2\) emissions for 18 early industrialized countries over, with the exception of New Zealand, the period 1870–2016. For \(\hbox {CO}_2\) emissions an EKC relationship appears to be present for six of the 18 countries, i.e., Austria, Belgium, Finland, Germany, Switzerland and the UK. For \(\hbox {SO}_2\) emissions there is no evidence for the prevalence of a quadratic or cubic cointegrating EKC relationship. Little or reduced evidence for an EKC is a typical finding when using adequate tests compared to using tests designed for linear cointegrating relationships, as is discussed in this paper and already earlier in Wagner (2015). The illustration also indicates that the test results are not (really) sensitive with respect to kernel and bandwidth choices, most importantly the results for the prevalence of an EKC. Our illustration also shows that the findings are more sensitive with respect to sample size and in particular data vintage used.
The tests discussed are important inputs for developing (non-)cointegration tests in panel (see, e.g., de Jong and Wagner 2022; Wagner and Reichold 2022) or seemingly unrelated CPR settings (see, e.g., Wagner et al. 2020).
Change history
11 May 2023
A Correction to this paper has been published: https://doi.org/10.1007/s00181-022-02343-0
Notes
Another literature where this type of relationship occurs is the so-called intensity-of-use or material Kuznets curve (MKC) literature that investigates the potentially inverted U-shaped relationship between GDP and energy or metals use (see, e.g., Labson and Crompton 1993 or Grabarczyk et al. 2018). Almost by definition, the study of inequality, put to the center of attention again by Piketty (2014), when performed with time series data can make use of methods for cointegrating polynomial regressions. The same holds true for the exchange rate target-zone literature, see, e.g., Darvas (2008) or Svensson (1992).
Bradford et al. (2005) and Wagner (2008, 2015) contain critical assessments of this practice. Stypka et al. (2019) derive and discuss the asymptotic properties of using methods, in particular FM-OLS, developed for cointegrating linear regressions in a cointegrating polynomial regression setting. The results of that paper show that “standard” cointegration and non-cointegration tests lead to asymptotically invalid inference, not least because of wrong critical values being used. This problem is overcome with the tests and proper critical values provided in this paper.
Also, e.g., Chang et al. (2001), provide extensions of FM-OLS to nonlinear cointegration settings. While these authors consider more general functions than Wagner and Hong (2016), they assume serially uncorrelated errors. Wagner and Hong (2016) allow for serially correlated errors that are allowed to be dynamically correlated with the regressors, thereby considering the same setting as commonly used in linear cointegration analysis.
A basic observation that is not always sufficiently acknowledged is the fact that nonlinear transformations change the properties of integrated processes fundamentally, depending upon the type of transformation. It is a basic mathematical property that only linear transformations “commute” with summation. Wagner (2012) or Stypka and Wagner (2019) exemplify this issue for polynomial transformations.
For further discussion concerning full design and FM-OLS type estimation for cointegrating multivariate polynomial regressions (CMPRs), that potentially include arbitrary cross-products of powers of integrated processes as explanatory variables, see Stypka and Wagner (2020). Vogelsang and Wagner (2016) effectively consider IM-OLS estimation for CMPRs.
The key difference between the assumption used here—and in Wagner and Hong (2016)—and those in the other papers mentioned is that the other papers assume that the errors \(\{u_t\}\) are serially uncorrelated, as mentioned already in Footnote 4.
An alternative route may be to develop bootstrap inference.
MATLAB code to generate (additional) critical values as well as to perform the tests is available upon request.
To be precise, the OECD provides data for \(\hbox {SO}_x\) emissions. The National Research Council (1975) estimates the share of \(\hbox {SO}_2\) in \(\hbox {SO}_x\) emissions at about 98% and this is the factor we also use to transform OECD \(\hbox {SO}_x\) to \(\hbox {SO}_2\) data.
The following subsection investigates inter alia the impact of kernel and bandwidth choices on the test results.
In those cases where one of the tests rejects at the 5% significance level, but the other only at the 10% significance level, we put higher weight on the “stronger” 5% rejection. Considering also such cases as conflicting evidence does not lead to substantial differences in the overall picture. In Tables 4 and 5 that summarize the results from the robustness checks, evidence for an EKC is labeled with “y,” evidence against an EKC with “n“ and conflicting evidence with “o.”
The UK was the sole country for which Wagner (2015) finds an EKC for \(\hbox {SO}_2\) emissions.
References
Andrews DWK (1991) Heteroskedasticity and autocorrelation consistent covariance matrix estimation. Econometrica 59:817–858
Boden TA, Marland G, Andres RJ (2018) Global, regional, and national fossil-fuel \(\text{CO}_{2}\) emissions. Carbon Dioxide Information Analysis Center at Appalachian State University, Boone North Carolina
Bolt J, Inklaar R, de Jong H, van Zanden JL (2018). Rebasing Maddison: new income comparisons and the shape of long-run economic development. Maddison Project Working Paper, 10. Maddison Project Database, version 2018
Bradford D, Fender R, Shore SH, Wagner M (2005) The environmental Kuznets curve: exploring a fresh specification. Contrib Econ Anal Policy 4(1):5
Chang Y, Park JY, Phillips PCB (2001) Nonlinear econometric models with cointegrated and deterministically trending regressors. Econom J 4:1–36
Choi I, Saikkonen P (2010) Tests for nonlinear cointegration. Econom Theory 26:682–709
Darvas Z (2008) Estimation bias and inference in overlapping autoregressions: implications for the target-zone literature. Oxf Bull Econ Stat 70:1–22
de Jong R (2002) Nonlinear estimators with integrated regressors but without exogeneity. Mimeo
de Jong R, Wagner M (2022) Panel cointegrating polynomial regression analysis and an illustration with the environmental Kuznets curve. Forthcoming in Econom Stat. https://doi.org/10.1016/j.ecosta.2022.03.005
Friedl B, Getzner M (2003) Determinants of \(\text{ CO}_2\) emissions in a small open economy. Ecol Econ 45:133–148
Galeotti M, Manera M, Lanza A (2009) On the robustness of robustness checks of the EKC hypothesis. Environ Resour Econ 42:551–574
Grabarczyk P, Wagner M, Frondel M, Sommer S (2018) A cointegrating polynomial regression analysis of the material Kuznets curve. Resour Policy 57:236–245
Grossman GM, Krueger AB (1993) Environmental impacts of a North American free trade agreement. In: Garber P (ed) The Mexico-US free trade agreement. MIT Press, Cambridge, pp 13–56
Holtz-Eakin D, Selden TM (1995) Stoking the fires? \(\text{ CO}_{{2}}\) emissions and economic growth. J Public Econ 57:85–101
Hong SH, Phillips PCB (2010) Testing linearity in cointegrating relations with an application to purchasing power parity. J Bus Econ Stat 28:96–114
Ibragimov R, Phillips PCB (2008) Regression asymptotics using martingale convergence methods. Economet Theor 24:888–947
Jansson M (2002) Consistent covariance matrix estimation for linear processes. Econom Theor 18:1449–1459
Kuznets S (1955) Economic growth and income inequality. Am Econ Rev 45:1–28
Kwiatkowski D, Phillips PCB, Schmidt P, Shin Y (1992) Testing the null hypothesis of stationarity against the alternative of a unit root: how sure are we that economic time series have a unit root? J Econom 54:159–178
Labson BS, Crompton PL (1993) Common trends in economic activity and metals demand: cointegration and the intensity of use debate. J Environ Econ Manag 25:147–161
National Research Council (1975) Air quality and stationary source emission control. The National Academies Press, Washington, DC
Newey W, West K (1994) Automatic lag selection in covariance matrix estimation. Rev Econ Stud 61:631–654
OECD (2020) Air and GHG emissions (indicator). https://doi.org/10.1787/93d10cf7-en (Accessed on 05 March 2020)
Park JY, Phillips PCB (1999) Asymptotics for nonlinear transformations of integrated time series. Econom Theor 15:269–298
Park JY, Phillips PCB (2001) Nonlinear regressions with integrated time series. Econometrica 69:117–161
Perman R, Stern DI (2003) Evidence from panel unit root and cointegration tests that the environmental Kuznets curve does not exist. Aust J Agric Resour Econ 47:325–347
Piketty T (2014) Capital in the twenty-first century. Harvard University Press, Cambridge
Phillips PCB, Hansen BE (1990) Statistical inference in instrumental variables regression with I(1) processes. Rev Econ Stud 57:99–125
Phillips PCB, Ouliaris S (1990) Asymptotic properties of residual based tests for cointegration. Econometrica 58:165–193
Shin Y (1994) A residual-based test for the null of cointegration against the alternative of no cointegration. Econom Theor 10:91–115
Smith SJ, van Aardenne J, Klimont Z, Andres RJ, Volke A, Delgado Arias S (2011) Anthropogenic sulfur dioxide emissions, 1850–2005: national and regional data set by source category, Version 2.86. Data distributed by the NASA Socioeconomic Data and Applications Center (SEDAC), CIESIN, Columbia University, Palisades, New York
Stern DI (2004) The rise and fall of the environmental Kuznets curve. World Dev 32:1419–1439
Stypka O, Wagner M (2019) The Phillips unit root tests for polynomials of integrated processes revisited. Econ Lett 176:109–113
Stypka O, Wagner M (2020) Cointegrating multivariate polynomial regressions: fully modified OLS estimation and inference. Mimeo
Stypka O, Wagner M, Grabarczyk P, Kawka R (2019) On the asymptotic validity of “standard” FM-OLS estimation and inference in cointegrating polynomial regression models. Revised and resubmitted to Econom Theor
Svensson LEO (1992) An interpretation of recent research on exchange rate target zones. J Econ Perspect 6:119–144
Vogelsang TJ, Wagner M (2016) An integrated modified OLS RESET test for cointegrating regressions. Mimeo
Wagner M (2008) The carbon Kuznets curve: a cloudy picture emitted by bad econometrics? Energy Resour Econ 30:388–408
Wagner M (2012) The Phillips unit root tests for polynomials of integrated processes. Econ Lett 114:299–303
Wagner M (2015) The environmental Kuznets curve, cointegration and nonlinearity. J Appl Economet 30:948–967
Wagner M, Hong SH (2016) Cointegrating polynomial regressions: fully modified OLS estimation and inference. Econom Theor 32:1289–1315
Wagner M, Reichold K (2022) Cointegrating polynomial regressions: fully modified OLS estimation and inference. Revised and resubmitted to Econom Rev
Wagner M, Grabarczyk P, Hong SH (2020) Fully modified OLS estimation and inference for seemingly unrelated polynomial cointegrating regressions and the environmental kuznets curve for carbon dioxide emissions. J Econom 214:216–255
Yandle B, Bjattarai M, Vijayaraghavan M (2004) Environmental Kuznets curves: a review of findings, methods, and policy implications. Research Study 02.1 update, PERC
Acknowledgements
Financial support from Deutsche Forschungsgemeinschaft via SFB823 as well as from the Jubiläumsfonds of the Oesterreichische Nationalbank via several grants is gratefully acknowledged. Furthermore, the author thanks Fabian Knorre for generous support in providing updated data and test results and two reviewers and the editor for helpful and constructive comments. The views expressed in this paper are those of the author only and do not necessarily reflect the views of the Bank of Slovenia or the European System of Central Banks. On top of this the usual disclaimer applies.
Funding
Open access funding provided by University of Klagenfurt.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author declares that he has no conflict of interest related to this research. This article does not contain any studies with human participants or animals performed by the author.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Proofs
Proof of Proposition 2
The proof of item (i) is contained in Wagner and Hong (2016, Proposition 5) and it thus only remains to show the second item. By definition, it holds that:
with the equation defining \({\tilde{Z}}_t\) and \(\hat{\tau }\) and with \(\hat{\theta }\) denoting the OLS coefficient estimate in (1), which is now a spurious regression with an integrated error process \(\{u_t\}\). The fact that we are in the spurious regression case implies that:
using again the notation \(\Delta y_t = w_t\) and \(\frac{1}{\sqrt{T}}y_{[rT]} = \frac{1}{\sqrt{T}}\sum _{t=1}^{[rT]} w_t \Rightarrow B_w(r)\). This implies that \(\frac{1}{T^2}\sum _{t=1}^T {\hat{u}}_t^2 = \frac{1}{T^2}\hat{\tau }'{\tilde{Z}}_t {\tilde{Z}}_t' \hat{\tau }=\)
converges to:
with \(\tau \) and \(J^*(r)\) as defined in the main text. Consistency of the numerator, i.e., of \(\tilde{\Omega }_{w\cdot v}\) follows as in the proof of Phillips and Ouliaris (1990, Theorem 4.1), which relies upon the consistency of \(\hat{\xi }_t \rightarrow \xi _t\) and thus of \(\tilde{\Omega }\rightarrow \Omega \) under usual assumptions on kernel and bandwidth. \(\square \)
Proof of Proposition 3
The main issue is the existence of a bijective transformation between \(\textbf{B}_v(r)\) and a vector of functions of standard Brownian motions. Denote with \(\tilde{\textbf{B}}_v(r) = [B_1(r),\ldots ,B_m(r)]'\) and let \(\Omega _{vv}^{1/2}\) be an upper triangular matrix such that \(\tilde{\textbf{B}}_v(r) = \Omega _{vv}^{1/2}{} \textbf{W}(r)\), with \(\textbf{W}(r)\) a vector of standard Brownian motions. Furthermore, denote the (m, m) element of \(\Omega _{vv}\) by \(\Omega _{vv}(m,m)\). This yields:
Inserting this expression into (9) and multiplying the terms then leads to (13) and thus to the result for the CT test.
The argument for the \({\hat{P}}_u\) also rests upon the above transformation with two differences: First, the considered vector is \(J^*(r)\), i.e., also the first component corresponding to \(y_t\) is considered. Second, the block structure of \(\tau \), with the first element equal to one, is used to simplify the limit of \({\hat{P}}_u\) from (10) to the expression (14) involving only (functions) of standard Brownian motions. The calculations are analogous, with the difference being the terms with powers (and also the deterministic components), to the calculations in the proofs of Phillips and Ouliaris 1990, Lemma 2.2 and Theorem 4.1). \(\square \)
Appendix B: Tables with critical values
Supplementary Appendix C: Robustness of results
1.1 Robustness: Kernel and bandwidth
1.2 Robustness: Sample range until 2000
1.3 Robustness: Old data vintage (sample range until 2000)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wagner, M. Residual-based cointegration and non-cointegration tests for cointegrating polynomial regressions. Empir Econ 65, 1–31 (2023). https://doi.org/10.1007/s00181-022-02332-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00181-022-02332-3
Keywords
- Cointegrating polynomial regression
- Cointegration
- Unit root
- Testing
- Environmental Kuznets curve
- Material Kuznets curve
- Exchange rate target-zone