Long run non-linearity in CO2 emissions: the I(2) cointegration model and the environmental Kuznets curve

Kivedal, Bjørnar Karlsen

doi:10.1007/s10663-023-09587-8

Long run non-linearity in CO₂ emissions: the I(2) cointegration model and the environmental Kuznets curve

Original Paper
Open access
Published: 30 September 2023

Volume 50, pages 899–931, (2023)
Cite this article

Download PDF

You have full access to this open access article

Empirica Aims and scope Submit manuscript

Long run non-linearity in CO₂ emissions: the I(2) cointegration model and the environmental Kuznets curve

Download PDF

Bjørnar Karlsen Kivedal ORCID: orcid.org/0000-0002-6368-5460¹

733 Accesses
Explore all metrics

Abstract

I utilize the I(2) cointegration model to assess the empirical relevance of the environmental Kuznets curve for CO₂ emissions in the US between 1960 and 2014. This takes the non-linearity of CO₂ emissions into account by directly incorporating data that are integrated of order two, I(2). As a result, it enables an extensive dynamic analysis of the relationship between emissions and economic growth, as postulated by the environmental Kuznets curve both in the short, medium, and long run. The results indicate that the primary drivers behind the non-linear shape of US CO₂ emissions in the long run are an increase in emissions caused by energy use and a decrease caused by more trade and the utilization of less polluting energy sources. GDP only exhibits short run effects. Hence, I do not find evidence in favor of a long-run relationship between economic development and the concave shape of emissions, as suggested by the environmental Kuznets curve.

Investigating Environmental Kuznets Curve for CO $$_{2}$$ Emissions: A Cointegration Analysis for Morocco

Does CO2 emissions–economic growth relationship reveal EKC in developing countries? Evidence from Kazakhstan

Article Open access 17 August 2019

The trilemma among CO2 emissions, energy use, and economic growth in Russia

Article Open access 23 June 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

One of the most widely used theoretical models on the relationship between economic activity and environmental effects such as greenhouse gas emissions or pollution, is the environmental Kuznets curve (EKC). The theory postulates a concave relationship between the two variables, where environmental degradation is increasing until a certain level of income is reached and then declines as the income level becomes higher. The theoretical framework of the EKC was first presented in the World Development Report 1992 (see World Bank 1992), based on research by Shafik (1994) and motivated by the relationship between inequality and the level of income in Kuznets (1955). Many studies have investigated the environmental Kuznets curve both theoretically and empirically since then; see e.g. Stern (2017) for a recent literature review.

The EKC suggests that $\hbox {CO}_2$ emissions should increase until a certain level of GDP per capita is reached, and then decline. This non-linearity and concave shape of $\hbox {CO}_2$ emissions over time is taken into account in empirical work on the EKC using different approaches. A common approach is to add a quadratic term for (real) GDP per capita as an explanatory variable in the model, which yields a polynomial relationship, but other approaches are also used. As pointed out by Stern (2017), a lot of the approaches in empirical work on the EKC are not statistically robust, and there is no consensus on the driver of the changes in emissions. However, there are recent works that use statistically robust methods in order to take non-linearity into account, see the next section for more information about the relevant work.

If I(2) trends are present in the standard cointegrated vector autoregressive (CVAR) model, there are some concerns that should be addressed (see Juselius (2006), p. 293). It is not possible to say anything about the number of I(2) trends in the short-run matrix of the vector error correction model (VECM), and the determination of the rank of the long-run matrix of the VECM may have poor small sample properties. The latter can be important in the investigation of the EKC because one often utilizes annual data and thereby has a relatively small sample size. Additionally, the cointegration relations cannot be interpreted in the same way as if only I(1) variables are used in the model, since these are not I(0) in the presence of I(2) trends but rather I(1). Hence, it is possible to estimate long-run relations in the presence of I(2) trends in the VECM, but the interpretation of the results is different, see Johansen (1995). By using the I(2) model, we are able to take this into account, and the estimated I(2) model will provide a rich dynamic analysis of the data, enabling us to analyze the relationship between economic activity and $\hbox {CO}_2$ emissions thoroughly. The non-linearity of $\hbox {CO}_2$ emissions may be investigated through an I(2) model if there are double unit roots, or “I(2)-ness”, in the data. Using this framework also enables estimating a system of non-linear variables, thus treating all variables as endogenous from the outset.

I am going to follow the empirical EKC literature on output, $\hbox {CO}_2$ emissions, energy consumption, and trade, by including and controlling for (the log of) all of them in the model and investigate the joint effects. Hence, I use the same per capita variables as e.g. in Halicioglu (2009), reducing the problem of omitted variable bias. This combines testing the EKC hypothesis with the nexus of investigating the relationship between economic growth and energy consumption, following Ang (2007, 2008); Soytas et al. (2007) and Soytas and Sari (2009). Including trade as an explanatory variable is also in line with the Hecksher-Ohlin trade theory (see Arrow et al. (1995) and Stern et al. (1996)), which suggest that developed countries will specialize in producing goods that are intensive in human capital as well as focusing on capital-intensive activities. This will result in pollution being reduced in developed countries and increased in developing countries, which specialize in producing labor and natural resource-intensive goods, due to trade. Hence, I test both the EKC hypothesis, the link between energy consumption and economic growth, and aspects related to the Hecksher-Ohlin trade theory such as the pollution haven hypothesis (see Cole (2004)) or carbon leakage (see Babiker (2005)). My contribution is thereby to investigate the EKC for $\hbox {CO}_2$ emissions using the I(2) model, allowing for a rich dynamic analysis of the link between economic activity and emissions. We also use other data sets and models as robustness and sensitivity analyses.

For the US, I find that $\hbox {CO}_2$ emissions show signs of being I(2). We may therefore use the I(2) model in order to take this into account, enabling the identification of common I(2) trends as the non-linear effects between the variables. I find that there is a non-linear relationship between the variables, but that the factors causing the non-linearity/concavity of $\hbox {CO}_2$ emissions in the long run seem to be related to trade and energy use and not GDP, suggesting that the pollution haven hypothesis or carbon leakage, along with changing energy sources, may have contributed to the concave shape of $\hbox {CO}_2$ emissions and the decline in emissions over the past years in the US.

The next section provides a brief motivation and literature review related to empirical research on the EKC. Section 3 presents the EKC model and the I(2) model as well as using simulated data to estimate the EKC model, Sect. 4 presents the data, estimates the unrestricted I(2) model and performs preliminary tests, while the next section shows the empirical results and analysis of the estimated I(2) model. The final section concludes. A description of the data sources, as well as robustness and sensitivity analyses, are in the Appendices.

2 Empirical assessment of the environmental Kuznets curve

By using cointegration analysis, both country specific analyzes (see e.g. Perman and Stern (2003), Halicioglu (2009) or Saboori et al. (2012)) and panel analyzes of multiple countries (see e.g. Apergis and Ozturk 2015 or Perman and Stern (2003)) have been carried out. Studies have utilized, among other methods, autoregressive distributed lag (ARDL) models (see e.g. Jalil and Mahmud 2009) and Granger non-causality tests such as those in Soytas et al. (2007) and Halicioglu (2009). Spatial effects have also been investigated (see e.g. Maddison 2006). However, as pointed out by Stern (2017), the econometric methods used for analyzing the EKC relationship are often not appropriate regarding the consideration of the properties of the data. For instance, Stern (2004) emphasizes that few econometric investigations of the EKC consider serial dependence or stochastic trends in time series, and that the empirical investigations of the EKC have been weak. For a more detailed overview, see Stern (2017) who provides a detailed recent literature review of work that assesses the empirical relevance of the EKC. See also Copeland and Taylor (2004) for a theoretical framework of the EKC.

Wagner (2008) accounts for non-linearity by replacing GDP by de-factored GDP, and Wagner (2015) extends the Fully modified OLS procedure to deal with this and estimates the EKC. See also Knorre et al. (2021) and Wagner et al. (2020) for newly developed procedures for monitoring polynomial regressions. Esteve and Tamarit (2012) and Sephton and Mann (2013) employ threshold cointegration to take the non-linear effects into account. Nonlinear cointegration is also taken into account in Sephton and Mann (2016) through the multivariate adaptive regression spline model, while Sephton (2020) and Sephton (2022) investigate mean reversion through non-linearity. Pata and Aydin (2022) utilize a wavelet unit root test in order to take non-linearity into account.

The concave behavior of $\hbox {CO}_2$ emissions can be taken into consideration by using a structural break, implying that there is one regime when $\hbox {CO}_2$ emissions are increasing and one when decreasing (or several breaks to account for multiple increases and decreases throughout the sample). See e.g. Campos et al. (1996) for using structural breaks in conjunction with cointegration, and Santos et al. (2008) and Johansen and Nielsen (2009) for impulse indicator saturation techniques to automate testing for the temporal location of these breaks. It is also possible to include multiple breaks (Castle et al. 2012). A recent modeling of UK $\hbox {CO}_2$ emissions using structural breaks has been carried out by Hendry (2020). A broken linear trend may then be a proxy for an omitted variable explaining changes in $\hbox {CO}_2$ emissions such as the composition of different types of manufacturing firms in the economy or activity in the agricultural sector in the country. Using structural breaks is in line with Narayan and Smyth (2008) who investigate how GDP depends on energy consumption by using the Westerlund (2006) cointegration test that allows the possibility of multiple structural breaks in panel regressions.

The choice between using an I(1) model with a structural break (or several structural breaks) or using the I(2) model to take non-linearity into account, is affected by whether we should have a deterministic or stochastic specification of the non-linearity. If there is no reason to expect a shift in the growth rate, the turning point of the EKC could be considered as the growth rate of emissions changing from positive to negative instead of being modeled by a deterministic shift. Furthermore, the EKC is a non-linear model, cf. the definition of non-linearity in Johansen (1997). The empirical model should thus also be non-linear in order to take this into account. A linear model with one or more shifts in the trend is not necessarily theoretically in line with the EKC model, even if it takes the non-linearity of the data into account empirically. $\hbox {CO}_2$ emissions, if found to be integrated of order two empirically and evolving according to an inverted u-shape, could show signs of a declining growth rate (first difference of its logarithm) rather than having a shift in the growth rate. This implies that the shift may be modeled stochastically, since it does not necessarily show a change in the behavior but only that the peak of $\hbox {CO}_2$ emissions has been reached and that the total emissions (per capita) is declining rather than increasing. Using the I(2) model also allows determining the number of common stochastic trends, such that we can analyze long run effects between variables in the model.

Variables that are expected to follow a linear stochastic trend and variables that are expected to have a concave shape may be I(1) and I(2), respectively. Furthermore, the results from a cointegration analysis between an I(1) and an I(2) variable without taking I(2)-ness into account may be misleading (see Juselius (2006) and the results in Appendix B2). By utilizing the I(2) model which will provide a rich dynamic analysis of the data, we can allow for I(2) variables directly in the model without using a structural break and thus have an empirical framework that is closer to non-linearity postulated by the EKC theory. This will provide information on the factors affecting emissions over the sample both in the long, medium and short run while taking non-linearity into account, in line with the EKC, while using a fully specified econometric framework.

Double unit roots are often not found in empirical literature. This is likely to be a result of using univariate unit root tests rather than multivariate tests, since univariate tests have been shown to have low power in detecting double unit root in many cases (Juselius 2014). Hence, multivariate tests such as the trace test for determining the rank performed in Sect. 4.3, should be the preferred method when testing for double unit roots (the presence of I(2)). This approach follows Juselius (1995), Bacchiocchi and Fanelli (2005), Johansen et al. (2010), Juselius and Assenmacher (2017), Hetland and Hetland (2017), Salazar (2017), Juselius and Stillwagon (2018) and Juselius and Dimelis (2019), who use the I(2) model and rely on multivariate tests. I will follow this literature and concentrate on the multivariate test. However, it is important to note that this multivariate test concerns the presence of double unit roots in the given system of variables, not just the data series for $\hbox {CO}_2$ emissions. Hence, the outcome of the test will be conditional on which variables that are included in the system. To address this, I also perform univariate tests in Appendix A2 and employ a bivariate system in Appendix B3, where only $\hbox {CO}_2$ emissions and GDP are considered. These alternatives also support the finding of double unit roots.

3 Theoretical framework and econometric approach

3.1 The EKC relationship

If we consider the EKC as an inverted U-shaped relationship between $\hbox {CO}_2$ emissions and GDP, the common representation is a polynomial equation (or one of higher order) such as

$$\begin{aligned} CO2_t = \beta _1 GDP_t + \beta _2 GDP_t^2 \text{, } \end{aligned}$$

(1)

where $CO2_t$ is emissions of $\hbox {CO}_2$ per capita, $GDP_t$ is GDP per capita and $GDP_t^2$ is squared GDP per capita in period t. We should expect $\beta _1>0$ since we are dealing with positive values for $CO2_t$ and $GDP_t$, and $\beta _2<0$ since the U shape should be inverted, allowing for a maximum value on the curve ($\beta _2$ should also be sufficiently small in relation to $\beta _1$, see Perman et al. (2011)). Control variables are also often added to (1) such as human capital, energy consumption, trade, fossil fuel consumption, investments, real capital, etc. in order to isolate the effect of GDP on environmental degradation. Micro foundations in order to explain the functional form of the EKC can be found e.g. in Andreoni and Levinson (2001).

A common empirical approach when investigating whether a long-run relationship exists between environmental pollution, such as $\hbox {CO}_2$ emissions, and economic activity, using cointegration, involves using $\hbox {CO}_2$ emissions per capita as the dependent variable and GDP per capita and squared GDP per capita as explanatory variables. Additional control variables, such as energy use per capita and international trade (relative to GDP), as motivated in the introduction and following Halicioglu (2009), are also often included. This approach allows testing the EKC hypothesis by assessing whether there is an inverted U shape between $\hbox {CO}_2$ emissions and economic growth while controlling for other effects. To test the presence of the EKC, one can then examine whether $\beta _1$>0 and $\beta _2<0$ in (1).

3.2 The I(2) model

If a variable is integrated of order two, I(2), it needs to be differenced twice in order to have a stationary representation, and the I(2) model may be used. For recent applications of the I(2) model, see e.g. Juselius and Stillwagon (2018) who investigate the relationship between interest rates, prices and the exchange rate for the UK and the US, Juselius and Assenmacher (2017) for Switzerland and the US, and Juselius (2017) for Germany and the US. Hetland and Hetland (2017) investigate the Danish housing market through the I(2) model, and the Greek crisis is analyzed in Juselius and Dimelis (2019). The presentation of the I(2) model below follows these papers closely. See also (Juselius (2006), ch. 17) and (Johansen 1995, ch. 9) for further details.

A VAR model with k lags

$$\begin{aligned} X_t = \Pi _1 X_{t-1} + \cdots \Pi _k X_{t-k} + \Phi D_t + \varepsilon _t \text{. } \end{aligned}$$

(2)

may be reformulated to a vector equilibrium correction model, such as

$$\begin{aligned} \Delta X_t = \Gamma _1 \Delta X_{t-1} +\cdots +\Gamma _{k-1}\Delta X_{t-k+1} + \alpha \tilde{\beta }^\prime \tilde{X}_{t-1}+\Phi D_t+\mu _0+\mu _1 t +\varepsilon _t \text{, } \end{aligned}$$

(3)

where for our purpose related to investigation the EKC, $X_t = [lco2_t, lgdp_t, lenergy_t, ltrade_t]^{\prime }$ where the variables in the vector $X_t$ is the natural logarithm of $\hbox {CO}_2$ emissions per capita, GDP per capita, energy use per capita and trade as share of GDP, respectively. Furthermore, $\tilde{\beta }^\prime =[\beta , \beta _0, \beta _1]$, $\tilde{X}_{t-1}=[X_{t-1}, 1, t]^\prime$, $\varepsilon _t \sim N_p(0,\Omega )$ (p is the number of variables in the information set) for $t=1,\ldots ,T$, and $X_{-1}, X_0$ is given. $D_t$ is a vector of dummy variables (if included), and $\mu _0$ and $\mu _1$ are constants. The trend is restricted to be in the cointegrating space in order to prevent quadratic trends (i.e. $\beta _1\ne 0$ and $\mu _1=0$). Below, we simplify the presentation by restricting the trend to be in $\beta ^\prime X_{t-1}$ and exempt from using the notation $\tilde{\beta }^\prime$.

We may also write (3) in acceleration rates, changes and levels, which for a lag of 2 (i.e. setting $k=2$ in (2) and (3)) yields

$$\begin{aligned} \Delta ^2X_t = \Pi X_{t-1} + \Gamma \Delta X_{t-1} +\Phi D_t + \mu _0 + \mu _1 t + \varepsilon _t \text{. } \end{aligned}$$

(4)

This provides the I(2) cointegrated vector autoregressive (CVAR) model formulated in acceleration rates, changes and levels.

By using the maximum likelihood parameterization suggested by Johansen (1997) on (4) we get

$$\begin{aligned} \Delta ^2X_t = \alpha (\beta ^\prime X_{t-1}+d^\prime \Delta X_{t-1})+\zeta \tau ^\prime \Delta X_{t-1}+\Phi D_t + \mu _0 + \mu _1 t + \varepsilon _t \text{. } \end{aligned}$$

(5)

Increasing the lag length to $k=3$, if empirically justified, results in adding the term $\Gamma _1 \Delta ^2 X_{t-1}$ with a negative sign to (4) and (5), where $\Gamma =-(I-\Gamma _1)$, providing a term for the short-run effects (see Johansen (1995)).

The hypothesis that there are unit roots in the data, i.e. that $X_t$ is I(1), is formulated as a reduced rank hypothesis on $\Pi =\alpha \beta ^\prime$, where $\alpha$ and $\beta$ are of dimension $p\times r$ where r is the rank of $\Pi$. As in the CVAR with variables that are at most I(1), the $\beta$ vector describes the long-run stationary relationships between the variables and the $\alpha$ vectors contain the adjustment parameters which describe how the system adjusts or error-corrects to a disequilibrium from $\beta ^\prime X_t$.

If $\Delta X_t\sim I(1)$, such that $X_t \sim I(2)$, this can be formulated by a linear transformation through a reduced rank hypothesis on $\Gamma$. This results in $\alpha _{\bot }^\prime \Gamma \beta _{\bot }=\xi \eta ^\prime$, where $\xi$ and $\eta$ are $(p-r)\times s_1$, and $\alpha _{\bot }$ and $\beta _{\bot }$ are the orthogonal complements of $\alpha$ and $\beta$, respectively (see Johansen 1992, 1995). As in the I(1) CVAR model, there are $(p-r)$ stochastic trends for the reduced rank r, and these are divided into $s_1$ trends of order I(1) and $s_2$ of order I(2). Hence, the I(1) reduced rank condition is associated with the levels of the variables, while the I(2) reduced rank condition is associated with the differenced variables, since the first difference is I(1) given the presence of I(2) in the levels. The trend, t, is restricted to be in the cointegrating relationship $\beta ^\prime X_{t-1}$, and the constant to be in $d^\prime \Delta X_{t-1}$ (often referred to as the multi-cointegrating relationship or dynamic equilibrium). As shown in Rahbek et al. (1999), we need a restricted linear trend in order to allow for linear trends in all linear combinations of $X_t$. The relation $\zeta {\tau }^\prime \Delta \tilde{X}_{t-1}$, where $\tau =[\beta ,\beta _{\bot 1}]$, describes the medium-run relations between the differences variables, and the cointegrating relations ${\tau }_{\bot ,1}X_t$ transforms the process from I(2) to I(1) by using the polynomial trends. It consists of r relations ${\beta }^\prime X_t$ and $s_1$ relations ${\beta }_{\bot ,1}X_t$.

As pointed out by Juselius and Assenmacher (2017), $\beta ^\prime X_t$ is generally I(1), and can be interpreted as an equilibrium error with pronounced persistence. The coefficients in the vectors $\alpha$ may then be interpreted as how the acceleration rates $\Delta ^2 X_t$ adjusts to the dynamic equilibrium relations $\beta ^\prime X_{t}+d^\prime \Delta X_{t}$, and d describes how the growth rates $\Delta X_t$ adjusts to the long-run equilibrium errors $\beta ^\prime X_t$. If $\alpha \ne 0$, d may be interpreted as a medium-run adjustment. For the variable $X_{i,t}$, (5) may be written in terms of the adjustment rates as

$$\begin{aligned} \Delta ^2 X_{i,t} = \cdots \sum ^r_{j=1}\alpha _{i,j} \sum ^p_{m=1}(\beta _{mj}X_{m,t-1}+d_{mj}\Delta X_{m,t-1})+ \sum ^r_{j=1}\zeta _{i,j} \sum ^p_{m=1}(\beta _{mj}\Delta X_{m,t-1})+\varepsilon _{i,t} \text{, } \end{aligned}$$

(6)

for $i=1,\ldots ,p$, where p is the number of variables in the vector $X_t$ and i is the ith variable. From this, we see that the signs of $\alpha$, $\beta$, and d determine whether the variable $X_{i,t}$ is error increasing or error correcting in the long run and in the medium run. If $\alpha _{ij} d_{mj}<0$ or/and $\alpha _{ij}\beta _{mj}<0$, the acceleration rate is error correcting to the changes $(\beta _j^\prime X_t+d^\prime _j \Delta X_t)$. Furthermore, if $d_{mj}\beta _{mj}>0$ (given $\alpha _{ij}\ne 0$), the change $\Delta X_{i,t}$ is error correcting to the levels $\beta ^\prime _j X_t$. Finally, if $\zeta _{i,j}\beta _{mj}<0$, then the acceleration rate $\Delta ^2 X_{i,t}$ is error correcting to $\beta ^\prime _j \Delta X_{t-1}$. In all other cases, the system is error increasing. Furthermore, error increasing behavior is offset by error correction elsewhere in the system if all the characteristic roots are inside or on the unit circle such that the system is stable. Hence, even though a variable may be error increasing in e.g. the medium run, this is offset by error correcting behavior in the long run or in another variable (see Juselius and Assenmacher (2017)). Even though cointegration measures co-movements and not causality, long-run adjustment may thus be assessed through the $\alpha$ coefficients in combination with $\beta ^\prime X_t$ and $\beta ^\prime X_t + d^\prime \Delta X_t$.

The moving average representation of the VAR model can, in a simplified manner, be given as

$$\begin{aligned} X_t = \tilde{\beta }_{\bot ,2}\alpha _{\bot ,2}^\prime \sum _{j=1}^{t}\sum _{i=1}^{j}(\varepsilon _i+\mu _0)+C_1\sum _{j=1}^{t}(\varepsilon _j+\mu _0)+C^*(L)(\varepsilon _t+\mu _0)+A+Bt \text{, } \end{aligned}$$

(7)

where $\alpha _{\bot ,2}^\prime$ gives information on the sources of exogenous shocks and $\tilde{\beta }_{\bot ,2}$ on how the I(2) trends $\alpha _{\bot ,2}^\prime \sum _{j=1}^{t}\sum _{i=1}^{j}\varepsilon _i$ loads into the variables in the system, $X_t$. This simplified representation is sufficient for the purpose of our analysis, as argued in Juselius and Stillwagon (2018). See (Juselius 2006, p. 313) for this simplification or Johansen (1992) for a more detailed description.

3.3 The I(2) model and the EKC relationship

Firstly, $\hbox {CO}_2$ may be I(2) and have a concave shape, following the non-linear relationship postulated by the EKC. We may then consider $\beta 'X_t$ as the long run relationship with highly persistent deviations from long-run static equilibrium errors. This highly persistent deviation may then be what is causing the concave shape of $\hbox {CO}_2$ emissions as suggested by the EKC, or it may also be in line with long-run deviations constituting an N-shape of $\hbox {CO}_2$ emissions (as found e.g. in Grossman and Krueger (1991)).

If we find that the rank is $r=1$, we have, from $\beta ^\prime X_t$, that the long-run relation between the log of CO2 emissions per capita, the log of GDP per capita, the log of energy use per capita and the log of net trade can be written as

$$\begin{aligned} lco2_t = \beta _1 lgdp_t + \beta _2 lenergy_t + \beta _4 ltrade_t + z_t \text{, } \end{aligned}$$

(8)

where $z_t$ is a residual integrated of order one. This is similar to the theoretical EKC relationship in (1) with control variables, except for not including the square of GDP in (8).^{Footnote 1} I also exempt from writing the constant term as it is not restricted to be in $\beta$, and I include a deterministic trend in the unrestricted $\beta ^\prime X_t$ to test for the presence of this. The I(2) model adds $d^\prime \Delta X_t$ in order to get a stationary relationship, which measures medium run changes in the variables. This can be thought of as a proxy for squared GDP in the EKC, such that the non-linearity may be accounted for through the I(2) model by $d^\prime X_t$ rather than by adding a quadratic term to the regression model. See, e.g., Juselius and Assenmacher (2017).

According to the EKC, the relationship between emissions and GDP depends on the level of GDP, where a quadratic term is added to explain the concave relationship. This is theoretically motivated as being a positive effect from GDP to emissions when a country’s GDP is low and it uses more emission-intensive resources in e.g. manufacturing, while the effect of GDP on emissions is smaller when GDP is high due to less use of emission-intensive factor inputs. Hence, technological advancements may cause this non-linear effect of GDP on emissions. GDP may therefore both be considered to error increase and error correct the system in the I(2) model, since deviations from a linear relationship such as in (8) is both increased and decreased by GDP. Error increasing factors will imply reinforcing effects, while error correcting factors adjust back to equilibrium. The non-linearity of the EKC can thus be considered the disequilibrium in (8), and should, according to the EKC, be a result of technological development which is explained through GDP. Hence, if technological improvements yield the shape of the EKC, then shocks to GDP should be relevant for $\hbox {CO}_2$ emissions. We should therefore observe that twice cumulated shocks to GDP will generate an I(2) trend that is concave and that these shocks feed into $\hbox {CO}_2$ emissions. This may be investigated through the moving average representation of the I(2) model as shown in (7).

The dynamic relation $\beta ^\prime X_t+d^\prime \Delta X_t$ (if $r=1$) will be given by

$$\begin{aligned}& lco2_t - \beta _1 lgdp_t - \beta _2 lenergy_t - \beta _4 ltrade_t \nonumber \\&\quad = d_0 \Delta lco2_t - d_1 \Delta lgdp_t - d_2 \Delta lenergy_t - d_4 \Delta ltrade_t + v_t \text{, } \end{aligned}$$

(9)

(also including a trend and a constant if applicable) where $v_t$ is stationary. Given a significant $\alpha$, this will show error correcting and increasing behavior in the medium run. This also shows how the non-stationary deviation from the equilibrium in the EKC without a quadratic term (i.e. the representation in (8)) can be explained through the estimated parameters and medium run error correction and increasing behavior. Hence, while the non-linearity and concave relationship in the EKC model is modeled through a polynomial relationship, the non-linearity in the I(2) model will be due to deviations from the estimated long-run equilibrium.

4 Data and empirical results

4.1 Data

I use log of $\hbox {CO}_2$ emissions per capita, energy use per capita, GDP per capita and trade intensity (the sum of imports and exports as a share of GDP) in the empirical model. References to the data on GDP, $\hbox {CO}_2$ emissions, energy use and trade in the next sections thereby refers to the natural logarithm of these series. The data set covers the US in the period 1960-2014. Per capita $\hbox {CO}_2$ emissions are measured as emissions stemming from the burning of fossil fuels and the manufacture of cement, while per capita energy use is the use of primary energy before transformation to other end-use fuels, and includes energy from combustible renewable sources and waste.^{Footnote 2} For GDP, I use real GDP per capita, while trade intensity is measured as the sum of imports and exports as a percentage of GDP. The data are plotted in Fig. 1, and a description of the data sources can be found in Appendix A1. These over 50 years of data includes many changes in the US economy such as variation in its composition of industrial sectors. The sample starts with the consumer boom in the 1960 s, followed by the recessions and inflationary periods in the 70 s and 80 s (French and French 1997). Increased labor productivity in the second half of the 90 s, in particular due to information technology (Oliner and Sichel 2000) is also related, and the sample ends after the financial crisis.

As argued in e.g. Itkonen (2012) and Jaforullah and King (2017), using energy consumption as an explanatory variable for $\hbox {CO}_2$ emissions may lead to underestimation of the other explanatory variables and systematic volatility in the estimated coefficients. It may also cause misleading cointegration test results, and should thereby not be used when estimating the effects on $\hbox {CO}_2$ emissions. However, in this paper, our main concern is the unobserved shocks driving the stochastic trends of the system and not the estimated parameters, such that this should be of less importance here. Even though the data series for $\hbox {CO}_2$ emissions are constructed partially based on energy consumption, the difference between the shocks to the two variables may provide important information regarding whether changes in energy originates from changes in the use of renewable sources and thereby do not lead to an increase in $\hbox {CO}_2$ emissions. The estimated system for the US when excluding energy use also suggests that GDP may be excluded from the long run relation, such that the effect of GDP on $\hbox {CO}_2$ emissions does not seem to be underestimated when including energy use in the system. Excluding energy use also suggests no long run relationship between GDP and $\hbox {CO}_2$ emissions (see Appendix B3), indicating that including energy use in the model does not underestimate the effect between GDP and $\hbox {CO}_2$ emissions or the I(2) cointegration test results. Additionally, when using an error correction form such as in (3), (4) and (5), which contain our estimates, the multicollinearity effect is significantly reduced (see Juselius 2006, p. 60), such that using both energy use and $\hbox {CO}_2$ emissions in our model is less of a concern.

4.2 Unrestricted I(2) model

The results from the estimations I conduct here were obtained using CATS 3 for OxMetrics, see Doornik and Juselius (2017).^{Footnote 3} I follow Rahbek et al. (1999) and restrict the constant term to be in $d'x_{t-1}$ and the deterministic trend to be in $\beta 'x_{t-1}$ in order to avoid quadratic trends as pointed out in Sect. 3.2. This enables us to separate between a quadratic trend and the presence of I(2) due to double unit roots.

I set the lag length of the unrestricted VAR to $k=3$, which is the most parsimonious well specified model without the need to include any step or indicator dummies. See Table 1 for the residual analysis which shows that the VAR(3) model is well specified. The tests of the residuals are described in Doornik and Juselius (2017).

Table 1 Misspecification tests for the unrestricted VAR(3) model

Long run non-linearity in CO2 emissions: the I(2) cointegration model and the environmental Kuznets curve

Abstract

Similar content being viewed by others

Investigating Environmental Kuznets Curve for CO $$_{2}$$ Emissions: A Cointegration Analysis for Morocco

Does CO2 emissions–economic growth relationship reveal EKC in developing countries? Evidence from Kazakhstan

The trilemma among CO2 emissions, energy use, and economic growth in Russia

1 Introduction

2 Empirical assessment of the environmental Kuznets curve

3 Theoretical framework and econometric approach

3.1 The EKC relationship

3.2 The I(2) model

3.3 The I(2) model and the EKC relationship

4 Data and empirical results

4.1 Data

4.2 Unrestricted I(2) model

4.3 Reduced rank and preliminary tests

5 Results

5.1 Long and medium run relations

5.2 Concavity and estimated I(2) trends

5.3 Short run relations

5.4 Robustness and sensitivity analysis

6 Conclusion

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest statement

Additional information

Publisher's Note

Appendices

Appendix A: Data and univariate unit root tests

1.1 A1. Data

1.2 A2. Univariate unit root tests

Appendix B: Robustness and sensitivity analysis

1.1 B1. Simulated data

1.2 B2. Estimating the I(1) model (and looking for indications of I(2))

1.3 B3. I(2) model for the US: \(\hbox {CO}_2\) emissions and GDP

1.4 B4. The UK and China

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation

Long run non-linearity in CO₂ emissions: the I(2) cointegration model and the environmental Kuznets curve