A simple theory of the link energy, emissions, and economic activity
In light of the preliminary evidence above, our goal is to establish an empirical relationship between CO2 emissions and the joint evolution of economic activity and energy use. We build upon the tradition of DICE (Dynamic Integrated Climate-Economy) models, and more precisely on the technological assumptions in Brock and Taylor (2005, 2010) and our previous work in Alvarez et al. (2005), and recently Díaz et al. (2019) or Díaz et al. (2020), to establish that relationship. The starting point is to consider a neoclassical production function augmented with an aggregate of energy use, \(E_t\). We assume that production (per unit of labor, \(L_t\)), requires capital and energy (whatever the source) in the following way:
$$\begin{aligned} y_{t} = \left\{ \begin{array}{ll} {\widetilde{A}}_t\,k_{t}^{\alpha }\,e_{t}^{\theta }, &{} \hbox {if }e_{t}= v_{t}\,k_t; \\ 0, &{} \hbox {otherwise,} \end{array} \right. \end{aligned}$$
where \(v_t\) is a technological (energy saving) index of the unit of capital [cf. Díaz and Puch (2019) for such an environment at the plant level], and \({\widetilde{A}}_t\) is an unadjusted measure of total factor productivity. Notice that we can write the production function (per worker) as:
$$\begin{aligned} y_t = A_t \left( \frac{e_t}{y_t}\right) ^{\frac{\alpha +\theta }{1-\alpha -\theta }}, \quad \text{ where } A_t = \left( {\widetilde{A}}_t\,v_t^{-\alpha } \right) ^{\frac{1}{1-\alpha -\theta }}. \end{aligned}$$
(3.1)
To make explicit the different sources of energy and, therefore, the energy mix, we specify carbon emissions in line with Stokey (1998).Footnote 4 We assume that we can express the flow of CO2 emissions:
$$\begin{aligned} P_t = {\widetilde{E}}_t^{\phi }\,Y_t^{\varphi }, \end{aligned}$$
where now, \({\widetilde{E}}_t\) is counting energy in units of CO2 emissions, whereas \(E_t\) in the production technology is expressed in units of energy. We do not need to be explicit on how the different energy technologies enter in the energy aggregate, \(E_t,\) or the emissions’ generating process, \({\widetilde{E}}_t\) [see, for instance, Díaz et al. (2019), based on a preliminary version of Hassler et al. (2020)]. We do not need either to specify how the climatic damage is built from the flow of CO2 emissions, \(P_t,\) in every period t, [see, for instance, Golosov et al. (2014)]. We adopt the simplifying assumptions that there is some form of imperfect substitution between the different energy technologies both in production and in carbon emissions, on the one hand, and that the feedback from climate damage to the economy operates, on the other hand, by diminishing total factor productivity in the long run. This later assumption implies the feedback occurs well beyond the short-run scope of our empirical implementation.
Using (3.1), we can rewrite the flow of CO2 emissions as:
$$\begin{aligned} P_t = \frac{{\widetilde{E}}_t}{E_t}^{\phi }\,{E_t}^{\phi }\, \left[ L_t\,A_t\,\left( \frac{{E}_t}{Y_t} \right) ^{\frac{\alpha +\theta }{1-\alpha -\theta }} \right] ^{\varphi }. \end{aligned}$$
Finally, taking into account the energy requirement in the production technology, \(E_t = v_t\,K_t,\) we can fully recover a parameterized version of this specification in the form:
$$\begin{aligned} P_t = \left( \frac{{\widetilde{E}}_t}{E_t}\right) ^{\phi }\, {\widetilde{A}}_t^{\gamma _1}\,v_t^{\gamma _2}\,Y_t^{\gamma _3} \,L_t{\gamma _4}\,\left( \frac{{E}_t}{Y_t} \right) ^{\gamma _5}. \end{aligned}$$
(3.2)
In this expression, the energy mix, \({\widetilde{E}}_t/E_t\), the energy intensity, \(E_t/Y_t,\) and the aggregate economic activity, \(Y_t,\) are made explicit, whereas the inertia of the model is embedded in both forms of technical progress we consider, that is, neutral technical progress, \({\widetilde{A}}_t,\) and the energy saving technical change index, \(v_t.\) We assume, therefore, that technical change in the state of the energy technology in the short run can be summarized in part into \(P_{t-1}\) through the process of carbon dynamics. Moreover, the reduced form specification of the state of the aggregate technology above can be made consistent with crossed effects of economic activity with energy intensity and the energy mix.
The variables selected with this theoretical background are based on well-established models in existing literature, following Brock and Taylor (2010) or Marrero (2010), and up to Díaz et al. (2020) as indicated above. It could be argued, though, that there are omitted variables. However, it is important to notice that the cross-sectional dimension we are considering is short, and that the selected set of countries share in common a lot of the institutional and regulatory framework. Therefore, we believe that the dynamic panel data framework with fixed effects we propose next, based on the production and emissions technologies we have specified in this section, is adequate to provide measurement of the short-run within-country effects we are looking for.
The empirical model
We use annual data, and we consider either growth rates or annual changes of the relevant variables. Thus, our approach is more business cycle oriented than long-run growth based. This is motivated because we want to characterize the existing heterogeneity of the short-run within-country CO2–GDP elasticity. An adequate understanding of this elasticity is needed to assess the interaction between CO2 emissions and economic activity for policy purposes in Western Europe.
From the previous assumptions and with some further parameterization suited for these data [cf. Marrero (2010), Díaz et al. (2019, 2020)], we specify a version of Eq. (3.2) linearized:
$$\begin{aligned} \Delta \ln P_{i,t}= & {} \beta _0 + C_i + T_t + \beta _1 \, \ln P_{i,t-1} + \beta _2 \, \Delta \ln Y_{i,t} + \beta _3 \, \Delta EI_{i,t} \nonumber \\&+ \beta _4 \, \Delta R_{i,t} + \varepsilon _{i,t}, \end{aligned}$$
(3.3)
where \(\Delta \ln P_{i,t}\) denotes per capita CO2 emissions annual growth; \(C_i\) is a country-fixed effect that captures the long-run (unobservable) differences across countries; \(T_t\) represents a time-fixed effect that captures the global business cycle effects and other global shocks that may be jointly driving emissions and economic activity in our sample; \(P_{i,t-1}\) accounts for a one-period lag in per capita CO2 emissions (inertia or convergence term); \(\Delta \ln Y_{i,t}\) is per capita GDP annual growth; \(\Delta EI_{i,t}\) denotes the annual change in energy intensity; and \(\Delta R_{i,t}\) represents the change in the share of renewables, which captures in a very parsimonious way the main source of variation in carbon intensity of energy use. Finally, \(\varepsilon _{i,t}\) is a mean zero and constant variance \(\sigma ^2\) innovation to this data generation process.
Notice that reverse causality (that is, whether \(\Delta \ln P_{i,t}\) causes \(\Delta \ln Y_{i,t}\)) is not relevant in our application as it associates with a long-run feature of the data that goes from climatic damage to neutral progress as in Golosov et al. (2014), and the cross-sectional dimension we consider is short. In any case, we explicitly explore this issue below. Also, institutional and regulatory variables exhibit limited time variability in our sample for the set of countries we consider, and therefore, those potential treatment variables should be captured within the country-specific fixed effect. Moreover, we found that adding those variables brings loss of efficiency in the estimator due to potential correlation with the fixed effect.Footnote 5 Finally, global elements of technical change, which are expected to be common across the set of countries in our sample, are expected to be captured by the time-specific fixed effect.
Under these circumstances, the key parameter is the elasticity \(\beta _2\), which should be interpreted as an average within-country CO2–GDP annual elasticity. Thus, it can be compared with parameters \(\delta _{0}\) and \(\delta _{1}\) in Eqs. (2.1) and (2.2), and thus, the estimates in Table 1. The parameter \(\beta _{1}\), which is expected to be negative, is associated with the conditional convergence speed of CO2 emissions in our sample.Footnote 6 As we show next, the negative sign of this elasticity is confirmed in the panel regressions, whereas it was not always present in the country by country regressions. The estimated \(\beta _3\) and \(\beta _4\) denote the direct impact of the energy elements on the CO2 emissions. Since the shares of renewables and non-renewables add up to one, the \(\beta _4\) coefficient measures the effect of a change in the renewables share with respect to the change in fossil fuels. For a better quantitative assessment of these relationships, the variables are scaled in such a way the estimated \(\beta _{3}\) and \(\beta _{4}\) represent the effect of a one standard deviation change over the annual emissions growth rate.
In the baseline specification, we assume \(\beta _{2}\) to be constant across countries. However, as discussed above, this is an unrealistic assumption. We extend equation (3.3) including two interaction terms between \(\Delta \ln Y_{i,t}\) and the lagged levels of the energy variables, say: \(\beta _{21}\,EI_{i,t-1}\,\Delta \ln Y_{i,t}\), and \(\beta _{22}\,R_{i,t-1}\,\Delta \ln Y_{i,t}\). Thus, we are allowing the short-run within-country elasticity between per capita CO2 emissions and GDP to be country-and-yearly-specific, as it depends on the lagged levels of energy intensity and the lagged share of renewables, that is: \(\beta _{2}+\beta _{21}\,EI_{i,t-1}+\beta _{22}\,R_{i,t-1}\). As we will show below, in all the specifications considered \(\beta _{22}\) is not statistically different from zero, and therefore, we will focus on the CO2–GDP elasticity as a function of the position in energy intensity: \(\beta _{2}+\beta _{21}EI_{i,t-1}\).
Econometric issues
In addition to exploiting the entire panel information of these data, our specification is convenient for at least two reasons. First, because it controls for time-varying and cross-country fixed and unobserved heterogeneity. Not considering these sources of heterogeneity may result in seriously biased estimates when those sources of heterogeneity exist (Hsiao (1986)). This feature of the data will be illustrated in Table 2 when comparing regressions (1), (2), and (3), showing that the estimated coefficients of \(\beta _{1}\) and \(\beta _{2}\) change with the inclusion of the fixed effects \(C_{i}\) and \(T_{t}\) in the model. Second, the estimated parameters represent what we actually want to measure: the (average) within-country and short-run partial correlations.
This specification does not guarantee unbiased estimations of \(\beta _{1},\) though. Actually, \({\widehat{\beta }} _{1}\) is expected to be downward biased, as far as the estimate from the pool-OLS would be upward bias [again, Hsiao (1986)]. As we will further discuss in the next section (again, Table 2), the inclusion of energy variables in the model seems to reduce this potential bias. However, estimated results of \(\beta _{2}\) could still be valid if \(E[\Delta \ln Y_{i,t}\varepsilon _{i,t}]=0 \) in equation (3.3), and in its extended version including crossed terms. Roughly speaking, this condition is likely to be satisfied if, first, \(\Delta \ln Y_{i,t}\) is weakly correlated with \(\ln P_{i,t-1}\), and second, there is no reverse causality in our sample (i.e., \(\Delta \ln P_{i,t}\) does not cause \(\Delta \ln Y_{i,t}\)). With respect to the first condition, we already obtained the evidence of a moderate explanatory power of lagged emissions in Table 1, as discussed above. Moreover, the linear correlation coefficient between \(\Delta \ln Y_{i,t}\) and \(\ln P_{i,t-1}\) is just 0.050 (non-significant) for the entire pool, and nearly the same, 0.048 (non-significant), for those variables controlled by the fixed effects (i.e., the within-country and within-year correlation). Regarding the second aspect, it has been argued in Sect. 3.1 that the effect on GDP from climate damage through productivity as in Golosov et al. (2014) is fundamentally forward looking. The environmental damage would end up affecting total factor productivity through its effect on, for instance, health and then on human capital, but this mechanism will not operate in the short run.
Nevertheless, we take the endogeneity concern more seriously and perform endogeneity tests to every right-hand side variable included in equation (3.3), and when extended with the crossed terms. We follow the three-step procedure proposed by Wooldridge (2002). First, an OLS regression is estimated drawing on the lagged levels of the dependent variable (i.e., per capita CO2 emissions), controlling by country- and time-fixed effects. Second, the residuals of this regression are included in our main models as an exogenous variable. Finally, we conduct a post-estimation Wald test on the estimates corresponding to the residual term under the null hypothesis that such parameter is equal to zero. Rejecting the null hypothesis should raise concerns about endogeneity in the models. In our case, the test fails to reject the null hypothesis (p value \(= 0.30\)) which is an undoubtable symptom that endogeneity is not an important issue in the sample. The homogeneity of our data (specifically, a strongly balanced panel of Western European countries, starting from 1980) is clearly helping to reduce endogeneity problems.
The usual, mechanical, way to proceed when estimating the dynamic panel model is to use an instrumental variable approach. In the absence of external instruments, the alternative is to use internal instruments (i.e., lagged value of the endogenous variable and of the regressors).Footnote 7 We have used one or two lagged levels of the variables as instruments, and we have obtained similar estimation results, but with the inconvenient that the Hansen test of overidentifying restriction fails in several specifications. The common alternative of the system-GMM (Arellano and Bover 1995), which uses a larger set of instruments, is specially designed for a large cross section in comparison with the time dimension, which is the opposite to our sample. In our case, we always have overfitting problems, even when using any method to reduce the number of instruments (Roodman 2009), hence system-GMM estimations are strongly inefficient in our case.Footnote 8 For all that, an instrumental variable approach, in the absence of a good exogenous instrument (the most common situation in these macroeconomic models) and in the presence of exogenous (statistically speaking) regressors, would generate estimation problems, and using a pool-OLS with fixed effects (country and year) would be a more convenient and conservative strategy (see Bun and Sarafidis 2015). Our estimation results in the following section are based on this latter approach.
Notwithstanding, there is the important issue of time-variant unobserved heterogeneity that cannot be addressed with fixed effects. For instance, one may think of differences in regulation between the north and the south along the sample. We believe that the structural part of those differences must be channeled through the energy technologies, precisely, in the form of differences in energy intensity and in the share of renewables in the primary energy supply. On top of that, those differences (in regulation) related to the energy dimension are hard to observe at the panel frequency. Consequently, as we do not include them into the model, we assume that they are, if temporary and therefore unchanneled through the energy variables, incorporated to the residuals. Whenever this part incorporated in the residuals is small (notice the high \(R^2\) of the regressions in the following section), and it is uncorrelated with energy aspects, its omission should not be affecting the estimation of our key coefficients.