Residual-based cointegration and non-cointegration tests for cointegrating polynomial regressions

Cointegrating polynomial regressions (CPRs), i.e., regressions that include deterministic terms, integrated processes and powers of integrated processes as explanatory variables and stationary errors, have become prominent in several fields of applications, e.g., in the analysis of environmental Kuznets curves. A key issue, as always in cointegration analysis, is testing for the presence or absence of a cointegrating relationship. This paper discusses two complementary tests: one with the null hypothesis of cointegration and one with the null hypothesis of the absence of cointegration. It is shown that (inter alia) for the empirically most relevant case, in which only one of the integrated regressors occurs as regressor also with higher powers, critical values can be simulated and are provided for a variety of specifications. Finally, the usage of the tests is illustrated for the environmental Kuznets curve for carbon and sulfur dioxide emissions. The illustration also investigates the sensitivity of the test decisions with respect to kernel and bandwidth choices, sample size and data vintage.

literature in this respect has been and continues to be generated within the environmental economics literature. The so-called environmental Kuznets curve (EKC) hypothesis, which postulates an inverted U-shaped relationship between the level of economic development, typically measured by (the logarithm of) GDP per capita, and (the logarithm of) pollution or emissions per capita is one of the most widely studied empirical relationships in environmental economics. Early survey papers like Stern (2004) and Yandle et al. (2004) already count more than one-hundred refereed publications. The term EKC refers by analogy to the inverted U-shaped relationship between the level of economic development and the degree of income inequality postulated by Kuznets (1955) in his 1954 presidential address to the American Economic Association. Estimation of an EKC entails, in its simplest parametric form, a regression of emissions on GDP and its square to allow for a U or inverted U shape. Given that the logarithm of GDP per capita is often found to be unit root non-stationary, this is thus a regression where as regressors a unit root process and its square are present. 1 In doing so, the empirical EKC literature in large parts fails to acknowledge the fact that if the logarithm of GDP per capita is a unit root process, the square of the logarithm of GDP per capita cannot be a unit root process and continues to use "standard" unit root and cointegration techniques. This means that a large part of the empirical EKC literature (see, e.g., Friedl and Getzner 2003, Perman and Stern 2003or Galeotti et al. 2009 from a large and still growing list of such contributions) simply treats GDP and its square as two integrated processes and uses standard unit root and cointegration test and estimation methods. 2 This is, notwithstanding its popularity, a problematic practice since tailor-made methods designed for regressions involving integrated processes and nonlinear transformations thereof are available. 3 Wagner and Hong (2016) develop estimation and inference theory for the special case of nonlinear cointegrating relationships including deterministic components as well as unit root processes and their powers as explanatory variables. They refer to such relationships as cointegrating polynomial regressions (CPRs). More precisely, Wagner and Hong (2016) extend the FM-OLS estimator of Phillips and Hansen (1990) from cointegrating linear to cointegrating polynomial regressions. 4 In addition to hypothesis 1 The seminal empirical study of Grossman and Krueger (1993) uses a third-order polynomial in GDP, whereas the more popular quadratic specification has been initiated by Holtz-Eakin and Selden (1995). 2 Another literature where this type of relationship occurs is the so-called intensity-of-use or material Kuznets curve (MKC) literature that investigates the potentially inverted U-shaped relationship between GDP and energy or metals use (see, e.g., Labson andCrompton 1993 or Grabarczyk et al. 2018). Almost by definition, the study of inequality, put to the center of attention again by Piketty (2014), when performed with time series data can make use of methods for cointegrating polynomial regressions. The same holds true for the exchange rate target-zone literature, see, e.g., Darvas (2008) or Svensson (1992). 3 Bradford et al. (2005) and Wagner (2008Wagner ( , 2015 contain critical assessments of this practice.  derive and discuss the asymptotic properties of using methods, in particular FM-OLS, developed for cointegrating linear regressions in a cointegrating polynomial regression setting. The results of that paper show that "standard" cointegration and non-cointegration tests lead to asymptotically invalid inference, not least because of wrong critical values being used. This problem is overcome with the tests and proper critical values provided in this paper. testing with respect to the estimated parameters, they furthermore also investigate specification tests based on augmented or auxiliary regressions in detail. One important aspect that is not treated exhaustively in that paper is testing for cointegration and for the absence of cointegration, respectively. To be precise, only testing the null hypothesis of cointegration is considered in Wagner and Hong (2016, Propositions 5 and 6), whereas testing the null hypothesis of no cointegration is not considered. Furthermore, Wagner and Hong (2016) do not provide critical values. The present paper considers tests for both null hypotheses in detail and also provides critical values. We (re-)consider, with this test already discussed in Wagner and Hong (2016), in detail an extension of the Shin (1994) cointegration test, itself an extension of the KPSS stationarity test of Kwiatkowski et al. (1992), from cointegrating linear to cointegrating polynomial regressions. Furthermore, we consider an extension of a variance ratio test for the absence of cointegration of Phillips and Ouliaris (1990). The null hypothesis of the latter test is that the relationship is not cointegrating but spurious. It turns out that in the general case of several integrated regressors entering the CPR with powers higher than one, neither of the two tests has a nuisance-parameter-free limiting distribution that could be tabulated. This has already been observed for the KPSS-type test in Wagner and Hong (2016). Consequently, for the Shin-or KPSS-type test Wagner and Hong (2016) and, based on D-OLS estimation rather than FM-OLS estimation, Choi and Saikkonen (2010) consider sub-sampling versions of the KPSStype test statistic that results in nuisance-parameter-free limiting null distributions, at the expense of a drastically reduced sample size due to sub-sampling. For the relatively small samples available for typical macroeconomic applications, a subsampling based test is potentially only of limited practical value. For the extension of the test of Phillips and Ouliaris (1990), this sub-sampling approach does not lead to asymptotically nuisance-parameter-free limiting distribution.
However, using the terminology of Vogelsang and Wagner (2016), in case of a full design CPR relationship it can be shown that the limiting distributions of both tests can be tabulated. Full design refers to a situation in which the limiting distribution of the FM-OLS estimator-and a fortiori the limiting partial sum processes of the residuals-can be expressed as a functional of standard Brownian motions rather than of non-standard Brownian motions. The empirically most relevant case, with only one of the integrated processes entering the CPR with powers higher than one is straightforwardly seen to be of full design. "Appendix B" of the paper provides tables with critical values for this case, for up to four integrated regressors, up to power four of the one integrated regressor entering with higher powers for the usual specifications of the deterministic component (no deterministic component, intercept only, intercept and linear trend). Thus, the results of the paper allow to perform non-/cointegration testing in typical applications like EKCs, MKCs or exchange rate target-zone relationships. 5 tion analysis. A basic observation that is not always sufficiently acknowledged is the fact that nonlinear transformations change the properties of integrated processes fundamentally, depending upon the type of transformation. It is a basic mathematical property that only linear transformations "commute" with summation. Wagner (2012) or  exemplify this issue for polynomial transformations. 5 For further discussion concerning full design and FM-OLS type estimation for cointegrating multivariate polynomial regressions (CMPRs), that potentially include arbitrary cross-products of powers of integrated The tests discussed are illustrated with annual carbon dioxide (CO 2 ) and sulfur dioxide (SO 2 ) emissions data for 18 early industrialized countries over the period 1870-2016, with New Zealand data starting only in 1878. For CO 2 emissions an EKC relationship is found for six of the 18 countries, Austria, Belgium, Finland, Germany, Switzerland and the UK. For all countries but Germany a quadratic specification suffices, whereas for Germany a cubic specification is required. For SO 2 emissions an EKC relationship is found for none of the countries. In this respect note that the logarithm of GDP per capita and its powers are the only regressors included in the simple "reduced-form" equation and the absence of an EKC for SO 2 emissions could partly be driven by important explanatory variables and legislative changes not entirely captured by GDP. A more structural investigation of the EKC relationship is beyond the scope of this paper that is merely devoted to discussing the (non-)cointegration tests rather than to a fully fledged empirical analysis. A second observation that emerges from the illustration is that the usage of appropriate (non-)cointegration tests, designed for testing in a CPR context, reduces the evidence for a cointegrating EKC relationship compared to using (non-)cointegration tests designed for linear cointegrating relationships. By means of robustness checks we also illustrate that the results obtained with the two nonparametric tests are quite insensitive to kernel and bandwidth choices, in particular with respect to the evidence for the prevalence of an EKC. Changing the sample size and, in particular, using different data vintages lead to more variation in the test results.
The paper is organized as follows: The following section discusses and presents the tests, Sect. 3 briefly illustrates the tests with CO 2 and SO 2 emissions data, and Sect. 4 briefly summarizes and concludes. Two appendices follow the main text: "Appendix A" contains the proofs, and "Appendix B" contains the tables with the critical values. "Supplementary Appendix C" contains the detailed test results for the robustness checks.

The tests
Following Wagner and Hong (2016), we consider a cointegrating polynomial regression (CPR), i.e., an equation including a constant and polynomial time trends up to power q, integer powers of integrated regressors x jt , j = 1, . . . , m up to degrees p j and a stationary error term u t : with D t := [1, t, t 2 , . . . , t q ] , X jt := [x jt , x 2 jt , . . . , x p j jt ] and the parameter vectors θ D ∈ R q+1 and θ X j ∈ R p j .
The following assumption on the regressors and the errors is put in place: processes as explanatory variables, see Stypka and Wagner (2020). Vogelsang and Wagner (2016) effectively consider IM-OLS estimation for CMPRs. . . . , x mt ] , is generated as:
Assumption 1 is analogous to the corresponding assumption in Wagner and Hong (2016). Quite similar assumptions have been used in several places in the literature, e.g., Chang et al. (2001), Phillips (1999, 2001) and Hong and Phillips (2010). 6 The key result that this type of assumption is required for is an invariance principle for terms of the form T − k+1 2 T t=1 x k jt u t . Alternative sets of assumptions that could be used instead are formulated in Ibragimov and Phillips (2008) in a martingale framework or in de Jong (2002, Assumptions 1 and 2) who combines near-epoch dependence and appropriate moment assumptions. For the present paper any such set of assumptions leading to the required invariance principle can be put in place. The assumption det(C(1)) = 0 together with positive definiteness of η 0 η 0 implies that {x t } is an integrated but not cointegrated process. Assumption 1 implies an invariance principle for {η t } t∈Z : Similarly to the linear case (compare Phillips and Hansen 1990), the OLS estimator of the parameter vector in (1) is consistent but its limiting distribution is contaminated by second-order bias terms, the presence of which renders standard asymptotic inference based on the OLS estimator invalid. Consequently, Wagner and Hong (2016, Proposition 1) extend the FM-OLS estimation principle from the cointegrating linear to the cointegrating polynomial case. The FM-OLS estimator is based on two modifications of the OLS estimator. First, the dependent variable y t is replaced by y + t := y t − x tˆ −1 vvˆ vu , y + := [y + 1 , . . . , y + T ] and, second, a correction term is subtracted. The modified dependent variable y + t is exactly as in Phillips and Hansen (1990), but the correction terms are different in the CPR case and given by: Throughout, we rely upon consistent estimators of the required long-run varianceŝ vv ,ˆ vu , In this respect, OLS consistency is important since it allows for consistent long-run variance estimation based on the OLS residuals (it is straightforward to verify that the necessary assumptions of Jansson 2002, Corollary 3 are fulfilled).
To (re-)state the asymptotic distribution of the FM-OLS estimator developed in Wagner and Hong 2016, Proposition 1), we need to define a few more quantities: Proposition 1 Let {y t } be generated by (1) with the regressors {x t } and errors {u t } satisfying Assumption 1. Define the FM-OLS estimator of θ as: with: and M * as given in (3) with consistent estimators of the required long-run (co)variances. Then,θ + is consistent and has a zero mean Gaussian mixture asymptotic distribution given by: The limiting distribution given in (4) provides the basis for asymptotic standard, i.e., standard normal or chi-squared, inference on the coefficients in case the error term {u t } is stationary, i.e., in case of prevalence of a CPR relationship. Thus, testing for the presence of a CPR relationship is of prime importance and as always in cointegration testing two null hypotheses are conceivable: the null hypothesis of cointegration, i.e., the null hypothesis of stationarity of {u t }, or the null hypothesis of no cointegration, i.e., testing whether (1) is in fact a spurious regression. The first null hypothesis can be tested by an extension of the Shin (1994) test, which itself is an extension of the stationarity test of Kwiatkowski et al. (1992), from cointegrating linear to cointegrating polynomial regressions. Since, of course, the errors u t are not observed, the test statistic has to be based on observable residuals, in particular the FM-OLS residuals,û + t say, can be used. The KPSS-type test statistic for the null hypothesis of cointegration is defined as: withˆ u·v a consistent estimator of the long-run variance u·v : For testing the null hypothesis of no cointegration, we consider an extension of a variance ratio test statistic of Phillips and Ouliaris (1990), theirP u statistic, and we continue to use this name for the extension. Denoting the OLS residuals of (1) aŝ u t , the denominator of the test statistic is the properly scaled estimated "variance" of {û t }, which is an integrated process under the null hypothesis. Under the null hypothesis of a spurious regression, with {y t } being an I(1) process, the stacked vector {m t } := {[y t , x t ] } is a non-cointegrated I(1) vector and for later use we denote its first difference as { m t } := {[w t , v t ] }. The second element ofP u is a long-run variance estimate based on the OLS residuals of a vector autoregression (VAR) of order one for {m t }, taking into account the deterministic components D t considered: The required conditional long-run variance is computed using VAR(1) OLS residuals from (6) The following proposition characterizes the asymptotic behavior of the two test statistics, with the CT test already discussed in Wagner and Hong (2016, Proposition 5).

Proposition 2 (i) Let {y t } be generated by (1) with the regressors {x t } and errors {u t }
satisfying Assumption 1 and letˆ u·v be a consistent estimator of u·v , then the asymptotic distribution of the test statistic (5) defined above is: with: (ii) Let (1) be a spurious regression, i.e., let {y t } be an integrated process not related to {X t } in a cointegrating polynomial relationship, then the asymptotic distribution of the test statistic (7) defined above is: and˜ w·v as defined above.
The asymptotic distributions of the CT andP u test statistics cannot-in generalbe tabulated due to their nuisance parameter dependency related to the correlation structure between the variables. Wagner and Hong (2016, Proposition 6), following Choi and Saikkonen (2010), consider a sub-sample version of the CT test statistic that has a nuisance-parameter-free limiting distribution. They show, in particular, that the limiting distribution of the sub-sample CT test statistics is 1 0 W u·v (r ) 2 dr, if the sub-samples of size b are chosen such that b → ∞ and b/T → 0, i.e., sub-sampling achieves that the second term in (9) vanishes asymptotically. Wagner and Hong (2016), in addition, discuss how the set of sub-sample test statistics can be used in conjunction with modified Bonferroni bounds to improve the performance of the sub-sample-based test. An inspection of the proof of Proposition 1 reveals that sub-sampling does not lead to similar simplifications for theP u test, with this "unfortunate difference" caused by the too slow convergence rate ofτ in the spurious regression case. Thus, for the general case only the CT test, when used in a sub-sampling fashion, is available. 7 The problem with the nuisance parameter dependency of the limiting distributions of the test statistics originates in J (r ) as defined in Proposition 1. In case it is possible to write: with a regular matrix and W v (r ) a functional of standard Brownian motions one can rewrite both test statistics as functions of standard Brownian motions and a scalar (long-run) variance that can be scaled out. If a transformation as just described exists, the CPR is said to exhibit full design, a terminology coined by Vogelsang and Wagner (2016). CPRs with only one of the integrated processes present as regressor also with higher powers are straightforwardly seen to be of full design, other cases include, e.g., Translog-type relationships (see the discussion in Stypka and Wagner 2020). For brevity, we provide the corresponding result below for the one integrated regressor with higher power only case, which-as mentioned before-allows to test in the empirically most relevant case covering EKCs, MKCs, intensity-of-use and exchange rate targetzone relationships. As before in Proposition 2, the result for the CT test has already been discussed in Wagner and Hong (2016).
Proposition 3 Consider the special case of (1) when p 1 = · · · = p m−1 = 1 and let otherwise the assumptions of Proposition 2 be fulfilled.

(i) Under the null hypothesis of cointegration in (1) the limiting distribution of the C T statistic is given by
with: and ] is a vector of standard Brownian motions independent of W u·v (r ). (ii) Under the null hypothesis of (1) being a spurious regression, the limiting distribution of theP u statistic is given by: with W w·v (r ) independent of W(r ).
The limiting distributions (12) and (14) can be tabulated because they are functionals of standard Brownian motions. The corresponding critical values depend upon the deterministic component, the number of integrated regressors and the included powers of the single integrated regressor that enters the CPR with higher powers. Critical values for the CT statistic are given in Table 6 and for theP u statistic in Table 7 in "Appendix B." These tables extend the corresponding tables provided in Shin (1994) and Phillips and Ouliaris (1990) for linear cointegrating relationships to the CPR case. The tables contain critical values for up to four integrated regressors, up to power four of the integrated regressor entering with powers for the usual deterministic components, i.e., no deterministic component, an intercept only and intercept and linear trend. 8

An illustration with the environmental Kuznets curve
We illustrate the tests discussed with annual data for real GDP per capita, carbon dioxide (CO 2 ) in metric tons per capita and sulfur dioxide (SO 2 ) emissions in kilograms per capita for 18 early industrialized countries, listed in  Boden et al. 2018). The SO 2 per capita emissions are calculated from combining total SO 2 emissions taken from the NASA Socioeconomic Data and Applications Center (see Smith et al. 2011), which provides data for 1870-2005, with OECD (2020) data for the period 2006-2016. 9 We use the data described in logarithmic specifications and test for cointegration and non-cointegration in both the quadratic and the cubic EKC relationship including an intercept and a linear trend: with e t denoting the logarithm of emissions (CO 2 or SO 2 ) per capita and y t the logarithm of GDP per capita.
Performing the usual battery of unit root tests on the logarithm of GDP per capita does not lead to a rejection of the null hypothesis for any of the 18 countries. Thus, a "necessary condition" for embarking on CPR analysis, that of an integrated regressor, appears to be fulfilled. The CT test is performed using the FM-OLS residuals (i.e., is based on using the estimator of Wagner and Hong 2016) with the Bartlett kernel and the Newey and West (1994) bandwidth rule. 10 Tables 2 and 3 contain the test results for CO 2 and SO 2 emissions, respectively. We use the following (quite obvious) "decision rule": If CT does not lead to a rejection, butP u does, this is evidence for the presence of a cointegrating polynomial EKC relationship. If CT does lead to a rejection, butP u does not, this is evidence against the presence of an EKC. If either both or none of the tests lead to a rejection, this is regarded as conflicting evidence. 11 Given this decision rule, let us now turn to the results, first for CO 2 emissions given in Table 2. For six of the 18 countries, Austria, Belgium, Finland, Germany, Switzerland, and the UK we find evidence for an EKC relationship. For Germany, only for the cubic specification, for the other five countries for both the quadratic and the nesting cubic specification (which is, of course, a necessary theoretical implication of the prevalence of a quadratic CPR). The test decisions are also consistent between the cubic and quadratic specifications in the sense that evidence against the presence of a cubic EKC for a certain country comes along with evidence also against a quadratic EKC for that country. The one exception here is Sweden for which there is evidence against an EKC for the cubic specification and "only" conflicting evidence for the quadratic specification. Also note that conflicting evidence for the cubic specification occurs typically in conjunction with evidence against a quadratic EKC. The results for SO 2 emissions given in Table 3 are quite clear: There is no evidence for a quadratic or cubic EKC for any of the 18 countries considered. For 16 of the 18 countries, however, the evidence is conflicting for the cubic specification.
The limited evidence for the prevalence of an EKC is akin to an observation made earlier in Wagner (2015), i.e., that the usage of adequate (non-)cointegration tests reduces the evidence for an EKC compared to using (non-)cointegration tests (critical values) developed for linear cointegrating relationships. This is exemplified here by including also the asymptotically invalid test results obtained by applying the residualbased augmented Dickey-Fuller non-cointegration t-type test of Phillips and Ouliaris (1990) developed for cointegrating linear relationships, labeled P O t . Using this test, for CO 2 emissions the null hypothesis of non-cointegration is rejected for twelve of the 18 countries at the 5% significance level compared to only five rejections obtained with the asymptotically validP u test. For SO 2 emissions, the differences between theP u and P O t results are less pronounced, but point in the same direction. These findings illustrate the importance of using (non-)cointegration tests-or more precisely  Newey and West (1994) appropriate critical values-constructed for the CPR setting. For the CT test, this issue has already been illustrated in Wagner (2015) and is discussed more thoroughly from a theoretical perspective in .

Robustness checks
Both tests, being nonparametric, depend upon kernel and bandwidth choices. Consequently, the results might be sensitive to these choices. To gauge the extent of this potential sensitivity, we consider four combinations of bandwidths and kernels. The Bartlett kernel and the Newey and West (1994) bandwidth (the baseline, used above), labeled I, the Bartlett kernel and the Andrews (1991) bandwidth, labeled II, and the Quadratic Spectral kernel with these two bandwidth rules, labeled III and IV in Tables 4 for CO 2 emissions and 5 for SO 2 emissions. It turns out that the results are in fact highly robust with respect to these choices, most importantly the evidence for the prevalence of an EKC for CO 2 emissions. For the five countries with evidence for a For further explanations see caption of Table 2 quadratic (and cubic) EKC, this evidence is present for all four combinations of kernel and bandwidth. For Germany, with evidence for the cubic specification only and mixed evidence for the quadratic specification in the baseline combination I, the evidence is scattered a bit between "y" and "o" across kernel and bandwidth choices. Looking at columns I to IV in Table 4 indicates some limited variation between "n" and "o" across combinations for some of the countries. For SO 2 emissions, there is-similarly to CO 2 emissions-some variability for some countries between "n" and "o" decisions. However, for Switzerland using the Andrews (1991) bandwidth rule leads to evidence for an EKC in the cubic specification, whereas the evidence is conflicting when using the Newey and West (1994) bandwidth rule. The differences in conclusions are driven bŷ P u rejecting with the Andrews (1991) bandwidth. Altogether, however, the test results are quite insensitive-most importantly with respect to evidence for the prevalence of an EKC-to kernel and bandwidth choices. Columns N 0 and O 0 of Tables 4 and 5 shed light on two other empirical dimensions of robustness of the test results: N 0 with respect to the sample size, using the data used so far but only until 2000 and O 0 using the same data as used in Wagner (2015). Sample size and data vintage robustness do not directly relate to robustness of the tests, but it may still be informative to see their impact on the test results for our EKC illustration. The results in columns N 0 and O 0 are based on using the Bartlett kernel and the Newey Table 4 Test decisions, as explained above, for CO 2 emissions, with "y" indicating evidence for the presence of an EKC, "n" evidence against and "o" indicating conflicting evidence Quadratic  Cubic   I  II  III  IV  N 0  O 0  I  II  III  IV  N  and West (1994) bandwidth. Comparing columns N 0 with columns I leads to only one difference, for SO 2 emissions of Canada in the quadratic specification. Using the full data range the evidence points against the prevalence of an EKC, whereas the reduced sample leads to conflicting evidence. Thus, it appears that the additional 16 years of data help to sharpen inference. We close the robustness check by comparing the results for the sample period until 2000 with the results obtained using the data of Wagner (2015) that also range until 2000, i.e., we compare the results in columns N 0 and O 0 . For CO 2 emissions, using the new data vintage leads to evidence for the prevalence of an EKC for three more countries than for the old data vintage, for Switzerland and the UK for both specifications and for Germany for the cubic specification. Additionally, the evidence for four countries is changed from negative to conflicting, with these changes occurring primarily in the cubic specification. For SO 2 emissions the evidence changes from negative to conflicting for a number of countries in both the quadratic and the cubic specification. Additionally, the evidence changes for the UK from positive to conflicting for the cubic specification. 12 Whether these changes relate to some form  I  II  III  IV  N 0  O 0  I  II  III  IV  N Table 4 of sensitivity of the tests to data with unchanged CPR characteristics or because the data revision truly changes the nature of the data is, of course, not clear. What we can, however, take home from our robustness checks is that the tests are not very sensitive to kernel and bandwidth choices.

Summary and conclusions
This paper discusses two nonparametric tests: one for the null hypothesis of a cointegrating polynomial regression and one for the null hypothesis of the absence of cointegration. Both tests are extensions from tests developed for cointegrating linear regressions to the cointegrating polynomial regression setting. Specifically, they are extensions of the Shin (1994) test and of a variance ratio test of Phillips and Ouliaris (1990). 13 The word extension hereby refers to deriving the null limiting distributions of the test statistics in the CPR setting. It turns out that, in the general case, neither of the two tests has a nuisance-parameter-free limiting distribution. However, in case of full design of the regression model, both tests' limiting distributions are nuisance parameter free and can hence be tabulated. The empirically most relevant case with only one of the integrated regressors appearing also with higher powers is easily seen to be of full design. For this case, "Appendix B" provides tables with critical values for up to four integrated regressors, up to power four of the single integrated regressor entering with higher powers and the usual deterministic specifications.
The tests are briefly illustrated-which is by no means a fully fledged EKC analysis-for the environmental Kuznets curve for both CO 2 and SO 2 emissions for 18 early industrialized countries over, with the exception of New Zealand, the period 1870-2016. For CO 2 emissions an EKC relationship appears to be present for six of the 18 countries, i.e., Austria, Belgium, Finland, Germany, Switzerland and the UK. For SO 2 emissions there is no evidence for the prevalence of a quadratic or cubic cointegrating EKC relationship. Little or reduced evidence for an EKC is a typical finding when using adequate tests compared to using tests designed for linear cointegrating relationships, as is discussed in this paper and already earlier in Wagner (2015). The illustration also indicates that the test results are not (really) sensitive with respect to kernel and bandwidth choices, most importantly the results for the prevalence of an EKC. Our illustration also shows that the findings are more sensitive with respect to sample size and in particular data vintage used.
The tests discussed are important inputs for developing (non-)cointegration tests in panel (see, e.g., de Jong and Wagner 2022; Wagner and Reichold 2022) or seemingly unrelated CPR settings (see, e.g., Wagner et al. 2020). Wagner and Hong (2016, Proposition 5) and it thus only remains to show the second item. By definition, it holds that:û  The symbols in the first column indicate the deterministic component: none (empty), intercept only (c) and intercept and linear trend (c.t), m indicates the number of integrated regressors, and p indicates the highest included power of the regressor entering with powers  (1), which is now a spurious regression with an integrated error process {u t }. The fact that we are in the spurious regression case implies that:

Proof of Proposition 2 The proof of item (i) is contained in
using again the notation y t = w t and 1 converges to: with τ and J * (r ) as defined in the main text. Consistency of the numerator, i.e., of w·v follows as in the proof of Phillips and Ouliaris (1990, Theorem 4.1), which relies upon the consistency ofξ t → ξ t and thus of˜ → under usual assumptions on kernel and bandwidth.  (m, m). This yields:

Proof of Proposition 3
Inserting this expression into (9) and multiplying the terms then leads to (13) and thus to the result for the CT test. The argument for theP u also rests upon the above transformation with two differences: First, the considered vector is J * (r ), i.e., also the first component corresponding to y t is considered. Second, the block structure of τ , with the first element equal to one, is used to simplify the limit ofP u from (10) to the expression (14) involving only (functions) of standard Brownian motions. The calculations are analogous, with the difference being the terms with powers (and also the deterministic components), to the calculations in the proofs of Phillips and Ouliaris 1990, Lemma 2.2 and Theorem 4.1). For further explanations see caption of Table 2 with the difference that the results in this table are based on the Bartlett kernel with bandwidth according to Andrews (1991)  For further explanations see caption of Table 2 with the difference that the results in this table are based on the Quadratic Spectral kernel with bandwidth according to Newey and West (1994)  For further explanations see caption of Table 2 with the difference that the results in this table are based on the Quadratic Spectral kernel with bandwidth according to Andrews (1991)  For further explanations see caption of Table 2 with the difference that the results in this table are based on the Bartlett kernel with bandwidth according to Andrews (1991)  For further explanations see caption of Table 2 with the difference that the results in this table are based on the Quadratic Spectral kernel with bandwidth according to Newey and West (1994)  For further explanations see caption of Table 2 with the difference that the results in this table are based on the Quadratic Spectral kernel with bandwidth according to Andrews (1991) Robustness: Sample range until 2000  For further explanations see caption of Table 2 Robustness: Old data vintage (sample range until 2000) For further explanations see caption of Table 2