1 Introduction

The Production Gap Report 2020 and Emission Gap Report 2020 released by the United Nations Environment Programme (UNEP) point out that, despite a brief dip in carbon dioxide emissions caused by the COVID-19 pandemic, the world is still heading for a temperature rise in excess of 3 °C this century—far beyond the Paris Agreement goals of limiting global warming to well below 2 °C and pursuing 1.5 °C. The increasing carbon emissions have been a major concern for most countries around the world (Shobande and Asongu 2021). And as a result, every country is concerned about developing appropriate strategies to curtail it (Chen et al. 2020; Yang et al. 2021). As a major economy and the largest carbon emitter in the world, China has pledged to reduce the carbon intensity (carbon dioxide emissions divided by gross domestic product (GDP)) by 60–65% by 2030, compared with 2005 levels, and achieve carbon neutrality before 2060 by implementing a green pandemic recovery plan.

Therefore, investigating China’s carbon emissions has been attracting more attention from numerous studies. Energy consumption is the crucial impetus of the carbon emissions, and the leading factors of carbon emissions have been identified as follows: industrial structure, economic growth and investment in treatment of environmental pollution (Song et al. 2015; Xu et al. 2016; Ridzuan et al. 2020; Du and Li 2020; Xuan et al. 2020; Abbasi et al. 2021; Aluko et al. 2021; Aslam et al. 2021; Cheng et al. 2021; Hossain and Chen 2021; Shabani et al. 2021).

Based on these factors, a large number of literatures have discussed the contributors of carbon emissions focusing on some industries or some regions in China by using the classic methods. On the one hand, there are some studies focusing on the carbon emissions of some industries in China. Teng et al. (2017) measured the carbon productivity of Chinese service industry by SBM directional distance function and GML index. Tian and Ma (2020) established the Kaya decomposition model of China’s industrial carbon intensity and used LMDI method to analyse the contribution of different factors to the industrial carbon intensity. Sun et al. (2020) constructed a Stackelberg differential game model to analyse the factors that influence China’s manufacturing industrial carbon emissions. Using the input–output analysis and the three-stage data envelopment analysis (DEA) model, Wang et al. (2020a) proposed an improved method for estimation of China’s embodied carbon emissions efficiency in the service sector. On the other hand, some studies are focusing on carbon emissions of some regions in China. Chang et al. (2020) evaluated the impact of energy consumption structure on carbon emission performance in the Bohai Rim Economic Circle (BREC) and allocated the carbon emission quotas in 2030. Wang et al. (2020b) evaluated the spatial correlation and relevant factors of carbon emissions in Chengdu-Chongqing urban agglomeration based on SNA and QAP. Jiang et al. (2020) constructed a three-stage DEA model to evaluate and compare the transportation carbon emission efficiency of the provinces in Yangtze River Economic Belt. Huang et al. (2019) applied the life cycle assessment approach to quantify the efforts of Shenzhen’s public building practices and evaluated its real ‘achievement’ by quantifying the carbon emissions reduction in the past decade. Taking energy intensity as the threshold variable, Wang et al. (2019) established the Threshold-STIRPAT model and determined the contributors of carbon emissions in 6 megacities.

But the above literatures have rarely taken spatial heterogeneity into consideration when analysing the relevant factors of carbon emissions. Anselin (1988) proposed spatial econometrics, which provided a basic model to contain spatial heterogeneity in classic econometric models. Pan and Zhao (2018) built a Spatial Autoregressive Model (SAR) to simulate the spatial–temporal distribution of carbon emissions in China. Cheng et al. (2018) used dynamic spatial panel models to analyse the effects of industrial structure and technical progress on carbon intensity, and explored those factors that may lead to a reduction in carbon intensity in China. Lu et al. (2019) applied Spatial Durbin Model (SDM) to analyse the direct and spillover effects of low-carbon technological innovation on carbon emissions in China. Liu and Zhang (2021) Investigated the relationship between heterogeneous industrial agglomeration, technological innovation and carbon productivity using SDM in china. Guo et al. (2021) investigated the spatial aggregation and determinants of Guangdong’s energy intensity using SDM.

However, the above spatial econometric models may have some limitations: firstly, they failed to consider the cross-products of relevant factors, which may play an important role on the analysis of carbon intensity (Shi and Lee 2017). Secondly, they did not consider heterogeneity of the direct and spillover effects across space and over time (Li et al. 2019). Furthermore, they did not take into account that energy price may have a determined negative effect on energy consumption (Ren et al. 2009; Du 2019; Ashraf et al. 2020; Wang et al. 2020c; Neya et al. 2020).

In fact, Elhorst et al. (2019) proposed a dynamic general nesting spatial panel model (GNS) with common factors, which introduced cross-products and showed the direct and spillover effects of relevant factors across space and over time in their problem. In order to achieve short-term and long-term control of carbon intensity of different regions in China, this study tries to explore the spatial–temporal effects of relevant factors on carbon intensity in China. The contributions of this paper are presented as follows: (1) following the method of Elhorst et al. (2019), this paper constructs our GNS with common factors and analyses all industries of 30 provinces in China from 2003 to 2017. (2) Based on the leading factors in previous studies, this study examines the direct and spatial–temporal spillover effects of industrial structure (IS), GDP per capita (PGDP), investment in anti-pollution projects as percentage of GDP (EI) and energy price (PE) on carbon intensity. (3) This paper is a first attempt to explore the spatial–temporal effects of relevant factors on carbon intensity by introducing the cross-products in the basic models. (4) It is the inclusion of cross-products in our model that makes it possible to provide effective economic explanations and reasonable policy implications of IS, PGDP, EI and PE for the observed heterogeneity from spatial and time dimensions, respectively.

The remainder of this paper is organized as follows. Section 2 describes the research methodology, variables, data description and constructs our model. Section 3 reports and discusses the spatial aggregation of carbon intensity and the empirical results of the models. Section 4 gives the conclusions and proposes several policy implications.

2 Materials and methods

2.1 Carbon intensity estimates

According to the report of the United Nation’s Intergovernmental Panel on Climate Change (IPCC), the use of fossil energy is the main source of the increase in carbon emissions. For convenient analysis, this study calculates carbon intensity from fossil energy including coal, coke, crude oil, gasoline, kerosene, diesel, fuel oil and natural gas. Specifically, in order to obtain carbon intensity, we need firstly evaluate the carbon emissions, which can be calculated as follows:

$$ C_{it} = \mathop \sum \limits_{j = 1}^{k} (E_{ijt} \times a_{j} \times b_{j} ) \times \frac{44}{{12}} $$
(1)

where \( C_{it} \) represents the carbon emissions of province \(i\) in period \(t\);\( E_{ijt} \) stands for the total consumption of energy \(j\) by province \(i\) in period \(t\);\( a_{j} \) is the standard coal-equivalent coefficient of energy \(j\), which is from China Energy Statistical Yearbook; \(b_{j}\) represents the carbon-emission coefficient of energy \(j\), which is from IPCC Report. And \(a_{j}\) and \(b_{j} \) are shown in “Table 7 in Appendix”.

Based on this formular, we can denote the carbon intensity as:

$$ CI_{it} = \frac{{C_{it} }}{{GDP_{it} }} $$
(2)

where \(CI_{it} \) and \(GDP_{it}\) stand for the carbon intensity and the regional gross domestic product of province \(i\) in period \(t\), respectively.

2.2 Testing for cross-sectional dependence

Based on the Cross-Sectional Dependence (CD) test developed in Pesaran (2015) and the α-exponent estimator developed in Bailey et al. (2016a), Bailey et al. (2016b) presented a two-step procedure to distinguish between weak and strong cross-sectional dependence. Under the null hypothesis 0 < α < 1/2, the CD-test statistic is defined as the Eq. (3), and the average correlation coefficient has the order property of Eq. (4):

$$ CD = \sqrt {2T/N\left( {N - 1} \right)} \mathop \sum \limits_{i = 1}^{N - 1} \mathop \sum \limits_{j = i + 1}^{N} \hat{\rho }_{ij} $$
(3)
$$ \overline{\rho }_{N} = \frac{2}{{N\left( {N - 1} \right)}}\mathop \sum \limits_{i = 1}^{N} \mathop \sum \limits_{j = i + 1}^{N} \rho_{ij} = O\left( {N^{2\alpha - 2} } \right) $$
(4)

where \(N\) represents the number of provinces (\(N\) = 30) and \(T \) stands for the time periods (\(T\) = 15); \(\hat{\rho }_{ij} \) denotes the sample correlation coefficient between \(CI_{it} \) and \( CI_{jt }\) of two provinces \(i \) and \(j\) in period \(t\); \(\overline{\rho }_{N} \) is the average correlation coefficient; And \(CD\mathop \sim \limits^{a} N\left( {0,1} \right)\); \(\alpha\) is a parameter that can take values on the interval (0,1) (Bailey et al. 2016b), for 0 < \(\alpha\) < 1/2, \(\overline{\rho }_{N} \) convergenes to zero very fast, pointing to weak dependence. The range 1/2 ≤ \(\alpha\) < 3/4 is considered to represent moderate dependence and 3/4 ≤ \(\alpha\) < 1 quite-strong cross-sectional dependence.

2.3 Testing for spatially stratified heterogeneity

The spatially stratified heterogeneity (SSH) refers to ubiquitous phenomena (those within strata are more similar than those between strata), implies potential distinct mechanisms by stratum, and enforces the applicability of statistical inferences (Wang et al. 2016). Confounding arises if a global model was applied to a SSH population, leading to statistical insignificance. The problem can be simply avoided if SSH is identified by geographical detector (GeoDetector) \(q\)-statistic then modelling in the strata, separately. The GeoDetector \(q\)-statistic is generally applied to quantitatively evaluate the SSH of an explained variable (Wang et al. 2010, 2016) and assess the determinant power of explanatory variables and their interactions without linear assumptions (Yin et al. 2019). The fundamental formula of the \(q\)-statistic is given by:

$$ q = 1 - \frac{{\mathop \sum \nolimits_{h = 1}^{L} N_{h} \sigma_{h}^{2} }}{{N\sigma^{2} }} $$
(5)

where \(q\), with a value ranging from 0 to 1, is the SSH measure of an explained variable or the determinant power of a factor to the objective, and the larger the \(q\)-statistic, the more pronounced SSH of \(Y \) is; \(N\) is the number of explained variable observations, and \(\sigma^{2}\) indicates the variance of all the observations; The explained variable is stratified into \(L\) strata, denoted by \(h\) = 1, 2,..., \(L\), which are determined by prior knowledge, the determinant factor, or a classification algorithm; \(N_{h}\) is the number of observations, and \(\sigma_{h}^{2}\) is the corresponding variance within stratum \(h\).

2.4 Spatial econometric model

This paper adopts a dynamic general nesting spatial panel model (GNS) with common factors, which can be written as:

$$ \begin{aligned} lnCI_{it} & = \tau lnCI_{it - 1} + \delta \mathop \sum \limits_{j = 1}^{N} w_{ij} lnCI_{jt} + \eta \mathop \sum \limits_{j = 1}^{N} w_{ij} lnCI_{jt - 1} + \mathop \sum \limits_{l = 1}^{L} \beta_{l} lnx_{ilt} \\ & \quad + \mathop \sum \limits_{l = 1}^{L} \theta_{l} \mathop \sum \limits_{j = 1}^{N} w_{ij} lnx_{jlt} + \frac{1}{2}\mathop \sum \limits_{l = 1}^{L} \mathop \sum \limits_{m = 1}^{L} \beta_{lm} lnx_{ilt} lnx_{imt} + \rho lnz_{it} \\ & \quad + \mathop \sum \limits_{r = 1}^{R} \tau_{ir} f_{rt} + v_{it} \\ \end{aligned} $$
(6)

where

$$ v_{it} = \lambda \mathop \sum \limits_{j = 1}^{N} w_{ij} v_{jt} + \varepsilon_{it} $$
(7)

where \(lnCI_{it - 1}\) and \(\mathop \sum \limits_{j = 1}^{N} w_{ij} lnCI_{jt}\) represent, respectively, the temporal and spatial lag, and \(\mathop \sum \limits_{j = 1}^{N} w_{ij} lnCI_{jt - 1} \) represents spatial–temporal lag of \(lnCI_{it}\); \(\tau ,\delta \) and \(\eta \) are the corresponding response parameters of these variables, respectively, the serial, spatial and spatial–temporal autoregressive coefficients; \(w_{ij}\) is the element of an \(N \times N \) non-negative matrix \(W \) of known constants describing the spatial arrangement of the provinces in the sample; \(L\) = 3,\({ }x_{i1t}\), \(x_{i2t}\) and \(x_{i3t} \) represent IS, PGDP, EI of the province \(i\) in period \(t\), respectively; And so in terms of these three explanatory variables, the number of cross-products amounts to six; \(\beta_{l}\),\(\theta_{l}\) and \( \beta_{lm}\) are the coefficients of the exogenous explanatory variables, the exogenous spatial lag explanatory variables and the cross-products of the exogenous variables, respectively; \(z_{it}\) stands for PE, with coefficient \( \rho\), the first three single explanatory variables are dominated by variation in the cross-sectional domain, while PE is dominated by variation in the time domain; The common factors, which cover potential global cross-sectional dependence, can be subdivided into observable and non-observable factors. In our model, PE is an observable common factor. The hypothesis is that if PE in China increases (resp. decrease), the \(CI \) will diminish (resp. increase) in all of its provinces. In addition, the \(CI\) may increase or decrease due to \(R\) non-observable common factors; \(\tau_{ir }\) is the \( i\) th column of \(\tau_{r}\), which is a vector of length \(N\) representing the factor loadings of common factor \(r\). \(f_{t}\) is of order \(R \times T\) such that its transpose consists of \(R\) columns of length \(T\). The proposed model encompasses many models of empirical interest, among which the popular dynamic spatial panel model with additive spatial and time period fixed effects. Shi and Lee (2017) demonstrated that this model can be obtained by imposing the restrictions \(R \) = 2,\({ }\tau_{1}\) = \(\left( {\mu_{1} \ldots \mu_{N} } \right)\), \(\tau_{2}\) = \(\left( {1 \ldots 1} \right)\) and \(f_{t}\) = \(\left( {1\xi_{t} } \right)^{^{\prime}}\), where \(\mu_{i}\) and \({ }\xi_{t} \) represent spatial and time period fixed effects, respectively; Finally, the error term \(v_{it} \) is assumed to follow a local spatial autoregressive process, where \(\varepsilon_{it} \) reflects an i.i.d disturbance term with zero mean and finite variance \(\sigma^{2}\). The coefficients of the model specified can be estimated by the quasi-maximum likelihood (QML) estimator developed by Shi and Lee (2017).

2.5 Direct effect and spillover effect (indirect effect)

The matrix of the long-term direct and indirect effects of the expected value of the \( CI\) with respect to the \(lnx_{ilt} \) can be expressed as follows (Elhorst 2014; Elhorst et al. 2018):

$$ \left( {\begin{array}{*{20}c} {\frac{{\partial E\left( {lnCI_{1t} } \right)}}{{\partial lnx_{1lt} }}} & \cdots & {\frac{{\partial E\left( {lnCI_{1t} } \right)}}{{\partial lnx_{Nlt} }}} \\ \vdots & \ddots & \vdots \\ {\frac{{\partial E\left( {lnCI_{Nt} } \right)}}{{\partial lnx_{1lt} }}} & \cdots & {\frac{{\partial E\left( {lnCI_{Nt} } \right)}}{{\partial lnx_{Nlt} }}} \\ \end{array} } \right) = \left( {\left( {1 - \tau } \right)I_{N} - \left( {\delta + \eta } \right)W} \right)^{ - 1} $$
$$ \times \left( {\begin{array}{*{20}c} {\beta_{l} + \mathop \sum \limits_{m = 1}^{L} \beta_{lm} lnx_{1mt} } & {\theta_{l} w_{12} } & \cdots & {\theta_{l} w_{1N} } \\ {\theta_{l} w_{21} } & {\beta_{l} + \mathop \sum \limits_{m = 1}^{L} \beta_{lm} lnx_{2mt} } & \cdots & {\theta_{l} w_{2N} } \\ \vdots & \vdots & \ddots & \vdots \\ {\theta_{l} w_{N1} } & {\theta_{l} w_{N2} } & \cdots & {\beta_{l} + \mathop \sum \limits_{m = 1}^{L} \beta_{lm} lnx_{Nmt} } \\ \end{array} } \right) $$
(8)

where every diagonal element of this matrix represents the direct effect of one unit change in one of the factors on \(CI\) of the province \(i\). Due to the inclusion of \(lnx_{ilt}\), the diagonal elements of the second matrix on the right-hand side will vary across space and over time, so will these direct effects; Every column sum of off-diagonal elements represents the spillover effect of one unit change in one of the factors on \(CI\) in all provinces other than the province instigating this change. Their short-term counterparts can be obtained by setting \( \tau = \eta = \) 0.

2.6 Spatial weight matrix

\(W_{1} \) represents a binary contiguity matrix, when the province \(i\) and province \(j\) are adjacent, \(w_{ij} = 1\), otherwise \(w_{ij} = 0\). Based on the inverse of the average distance (K) between the capitals of adjacent provinces, the element of \( W_{2}\) takes \(0\) if K > 450 km or 1 otherwise. At the same time, \(W_{1}\) and \( W_{2}\) are standardized.

2.7 Data description

The energy consumption data, IS (the proportion of the output value of the secondary industry in GDP), PGDP and EI in each province and PE were obtained from the Statistical Yearbooks of each province, China Energy Statistical Yearbook and China Environmental Statistical Yearbook. Due to the complexity of PE, a general method is to find a good proxy index for PE. This study chose the purchasing prices for raw materials, fuels and power, which has been chosen for the good proxy index for PE (for example Du 2019). PGDP and PE were converted into standard prices using a price index (2003 was the base year). In order to compare the regional differences of the effects of relevant factors on \(CI\), this paper introduced the regional division of China (see Table 1). The distance data came from Google Maps. Following the general treatment in the studies of \(CI\) in China, this study did not collect data from Hong Kong, Macao, Taiwan and Tibet.

Table 1 Regional division of China

3 Results and discussion

In this section, this paper firstly gives a graphical analysis of spatial aggregation of \(CI\) by the software of ArcGIS. Furthermore, we show a detailed spatial econometrics analysis of \(CI\) based on our GNS model.

3.1 The spatial aggregation of \(CI\) and the three rates of change of relevant factors

ArcGIS software can depict the spatial distribution of research objects directly and vividly through a graphical display. In this paper, the degree of spatial aggregation among 30 provinces in China by dividing \(CI\) into four regions from low level to high level was measured by ArcGIS. For the succinctness and representative of analysis, we used the data of 2003, 2008, 2013 and 2017. The results are shown in Fig. 1. Totally, China’s \(CI\) showed obvious spatial agglomeration. Specially, North China, Northeast China and Northwest China belonged to “high-high” agglomeration regions, while some coastal provinces in Central China and East China belonged to “low-low” agglomeration regions. In addition, China’s \( CI\) has been declining from 2003 to 2017.

Fig. 1
figure 1

China’s agglomeration map of carbon intensity (t/10,000 yuan) in 2003, 2008, 2013 and 2017

Furthermore, this paper calculated the three rates of change of relevant factors during the period 2003–2008, 2008–2013 and 2013–2017, respectively (the rate of change = ((current value)/(previous value)-1)\(*100\)), which are shown in Figs. 2, 3 and 4. Firstly, the PGDP of all provinces continued to grow during the period 2003–2017, especially the provinces in North China and East China. Secondly, during different periods, IS and EI of different provinces in China had different trends. From 2003 to 2008, except for a few provinces and cities such as Beijing and Shanghai, the IS of most provinces and cities was increasing. From 2008 to 2013, the IS of almost half of the provinces, such as Henan Province and Zhejiang Province, was decreasing, while EI of most provinces and cities was increasing sharply during this period. From 2013 to 2017, in the “12th Five-Year Plan”(2011–2015) and early stage of the “13th Five-Year Plan”(2016–2020), the industrial structure has been adjusted, the IS of all provinces were declining, while most provinces had a slight decrease in EI.

Fig. 2
figure 2

The rate of change in IS (2003–2008, 2008–2013 and 2013–2017)

Fig. 3
figure 3

The rate of change in PGDP (2003–2008, 2008–2013 and 2013–2017)

Fig. 4
figure 4

The rate of change in EI (2003–2008, 2008–2013 and 2013–2017)

3.2 Empirical results of models

Cross-sectional dependence is prerequisite of using the spatial econometric analysis. In order to test the cross-sectional dependence of \(CI\) in China, we will apply CD-test statistic (Bailey et al. 2016b; Pesaran 2021; Elhorst et al. 2021) and global Moran’s I (Zhou et al. 2019) (see “Table 8 in Appendix”). The results indicated a strong spatial dependence of \(CI\) in China. This result is consistent with the previous graphical analysis. Therefore, the influence of spatial effects should be taken into account in the study of \(CI\) in China.

Furthermore, in order to test whether there is the SSH of the huge country and confounding of a global modelling, the SSH and the degree of influence of different factors on \( CI \) were investigated through the GeoDetector \(q\)-statistic (Wang et al. 2010, 2016) of \(CI\) for different regions of China over the period 2003–2017 (see “Tables 9, 10 in Appendix”). The results indicated that although the cross-products detectors may reduce the validity of applying the global models (i.e. SAR models, SDM models and GNS models), the q-statistics imply that from the view of control functions, the primary variables such as IS, PGDP and EI are not affected by the spatial stratified heterogeneity, which implies the global model is a reliable spatial econometric model applied to analyze the effects of these primary variables on \(CI\) in China. Therefore, the global models are reliable spatial econometric models applied to focus on the effects of primary variables on the dependent variable for all regions, and this paper constructs the global models for the analysis of effects of these factors on \(CI\) in China, respectively.Footnote 1

Specifying spatial weight matrix is the first step of constructing the spatial econometric model. In this paper, two spatial weight matrices, \(W_{1} \) and \(W_{2}\), which are introduced in Sect. 2, will be used (characteristics of two spatial weight matrices see “Table 11 in Appendix”).

In terms of cross-sectional dependence above, spatial econometric models should be applied to analyze the effects of various relevant factors on \(CI\) in China. In this paper, based on two spatial weight matrices above, the classic SAR models (fixed effects in time and space), SDM models (fixed effects in time and space) and a kind of new model, GNS models will be applied to our analysis. Table 2 reports the estimation results of six models.

Table 2 The estimation results of models

From Table 2, we can find that: firstly, the LR-test (− 2 * (460.7400–524.4776) = 127.4752) implies that the coefficients of the cross-products and spatial lags are jointly significant and that introducing the cross-products and spatial lags in the model is reasonable. Secondly, the elasticities of PE in three models SAR(\(W_{1}\)), SAR(\(W_{2}\)) and SDM(\(W_{1}\)) are all greater than 1, which are not consistent with the actual situation in China (Zhang 2008; Du 2019). Thirdly, the higher values of adjusted R-squared and log-likelihood show that the GNS is more fitting to Chinese data. Finally, the sum of absolute value of the coefficients, \(\tau ,\delta \) and \(\eta\) in GNS(\(W_{2}\)) is greater than 1, which fails to satisfy the stability condition, and in addition, the probability of the empirical regularity (Parent and Lesage 2011, 2012) in GNS(\(W_{2}\)) is lower than that in GNS(\(W_{1}\)). Therefore, GNS (\(W_{1}\)) is a reliable spatial econometric model applied to the analysis of \(CI\) in China, and so we will focus on GNS(\(W_{1}\)).

The estimation results of GNS (\(W_{1}\)) imply that: (1) \(\tau\) is greater than 0 and significant at 1% level, indicating that China’s \(CI\) showed temporal “inertia” from 2003 to 2017, that is, for province \(i\), the \(CI\) of the previous period was significantly and positively correlated with that of the next period. This “snowball effect” illustrates that adjusting some policies such as optimizing industrial structure, regulating energy price and increasing or decreasing investment in treatment of environmental pollution, which usually had a habit persistence in China (Lu et al. 2019). In addition, δ and η are close to 0 and insignificant, which means the existence of local spillovers of \(CI\) and there were not global spillovers of \(CI\). (2) The coefficient of IS is positive and significant, which implies that the higher IS, the larger the \(CI \) in China was. This might be attributed to the unreasonable IS of most provinces in China. This is because relative unreasonable IS will lead to a high level of industrial activities, which inevitably leads to a high level of \(CI \) (Cheng et al. 2018; Lu et al. 2019; Yang et al. 2021). In fact, about 70% of China’s primary energy came from industrial energy consumption, and around 69% of that came from high energy consuming industries, such as steel, building materials, chemical industry, coking and petroleum processing (Ren and Xia 2017). (3) The coefficient of PGDP is positive and significant at 1% level, however a high level of PGDP led to not only a high level of industrial activities but also frequent activities of daily living, such as burning gasoline when we drive, burning oil or gas for home heating and so on, which might increase \(CI\) in China. (4) The coefficient of PE is negative. On the one hand, the increasing PE would decrease energy consumption, which would lead to the lower \( CI\). On the other hand, the rising price encouraged enterprises not only to use energy-saving products but also to improve energy efficiency, which can also decrease \( CI\) in China (Du 2019; Wang et al. 2020c). (5) The coefficient of EI is negative and significant at 1% level. Central government and local governments continued to pay more attention to the reduction of carbon emissions and pollution. In fact, China’s investment in treatment of environmental pollution increased from162.77 billion yuan in 2003 to 953.895 billion yuan in 2017, accounting for around 1.5% of GDP (Du and Li 2020; Wu et al. 2021). (6) The coefficient of the cross-products between PGDP and EI is positive, suggesting that the mere pursue GDP by local governments would weaken the restrictive effect of investment in treatment of environmental pollution on \(CI\) (Xuan et al. 2020; Yang et al. 2021). (7) The coefficients of \(W*lnIS\) and \(W*lnPGDP \) are both negative, which indicates that industrial development and economic growth had a negative effect on \(CI\) of adjacent provinces (Lu et al. 2019; Liu and Zhang 2021).

3.3 The short-term and long-term effects of relevant factors on carbon intensity

Table 3 shows how the short-term (ST) and long-term (LT) effects of the relevant factors evolve over time, using cross-sectional averages of the variables over 30 provinces for each year (It should point out that ST effect and LT effect in Table 3 stand for total effect, which is sum of direct effect and indirect effect). Tables 4, 5 and 6 show how the short-term and long-term effects of three relevant factors vary across province, respectively, using time-average of the variables over the whole time span of the sample for each province.

Table 3 The short-term and long-term total effects of relevant factors on carbon intensity from 2003 to 2017
Table 4 The short-term and long-term effects of IS on carbon intensity in 30 provinces
Table 5 The short-term and long-term effects of PGDP on carbon intensity in 30 provinces
Table 6 The short-term and long-term effects of EI on carbon intensity in 30 provinces

3.3.1 Evolution of ST and LT of relevant factors

Table 3 shows that: (1) from 2003 to 2017, the absolute values of the coefficients of long-term effects are all larger than those of short-term effects, indicating that all relevant factors had a strong lasting impact on \(CI\). (2) Specifically, the long-term effect of IS first increased and then gradually decreased. This result is consistent with the findings of Zhang (2015) and China Petroleum and Chemical Industry Progress Report. The report finds that, during the “12th Five-Year Plan”, China’s energy consumption per unit of industrial-added value continued to decline, and achieved the energy-saving targets by optimizing industrial structure and reducing the proportion of energy consuming industries. (3) Similarly, the long-term effect of PGDP increased initially and then continued to decline. This evolution is compatible with the fact that from 2003 to the early period of the “12th Five-Year Plan”, China has been in the rapid economic development, which led to a high level of industrial activities and frequent activities of daily living and then resulted in large \(CI\), however from the late period of the “12th Five-Year Plan” to the early period of the “13th Five-Year Plan”, China speeded up green and low-carbon economic development, which led to slow down the trends of \(CI\) (Yang et al. 2021). (4) Moreover, the long-term effect of PE was negative and significant at 1% level, which might be attributed to the substitution effect of PE in China (Ren et al. 2009; Du 2019). For example, if the increase in the price of fossil fuels will encourage people to switch to renewable energy sources, such as solar and wind, which can offer the benefits of lower carbon emissions and other types of pollution. (5) Obviously, in the short and long term, the total effects of EI were both negative and significant, and the trends remained stable, which indicates that more and more financial resources have been committed to protection of the environment. Furthermore, according to China’s Government Work Reports, the government expenditure on environmental protection constantly increased in the past decade. From 2003 to 2017, the Chinese government’s investment in environmental protection is mainly used for nine key projects, including the capacity building of environmental supervision, the proper disposal of hazardous waste items (HHW), urban wastewater treatment, urban waste management, flue gas desulfurization (FGD), the management of important ecological function areas, the capacity building of national-level nature reserves, the nuclear safety and radiation protection, which has effectively contributed to the reduction of carbon intensity (Wu et al. 2021).

3.3.2 Spatial variation of ST and LT of relevant factors

Table 4 shows that: the short-term indirect effect of IS in each province on \(CI\) is negative and insignificant, while the long-term indirect effect is positive and significant, indicating that in the long term, the spillover effect of IS was evident and a rise in the proportion of heavy industry in a province would drive up the \(CI\) in adjacent provinces. This may be attributed to the fact that the carbon emission is a non-localized environmental challenge with significant spatial spillover characteristics, which could dissuade local governments from implementing carbon emission control (Du et al.2020; Yang et al. 2021). Table 5 shows that: the short-term indirect effect of PGDP is negative, while the long-term indirect effect is positive and significant, which might because that in the long term, one government’s pursuit of economic growth without environmental costs has aggravated the environmental pollution, due to the adjacent provinces’ pressure of competition responsibility (Lu and Yang 2019; Li and Zhang 2019). Table 6 shows that: in the short and long term, the direct and total effects of EI are negative, which is similar to the finding of 3.3.1 and shows that investment on environmental protection from local governments constantly decreased the \(CI\).

Furthermore, based on the regional division of China, we also find that: (1) in the short and long term, the direct and total effects of IS and PGDP on \(CI\) in North China and Northeast China are larger than those in other regions. In fact, North China contains abundant petroleum and coal resources, for example, Shanxi has been the core territory for coal production in China, the coal resources account for 40.4% of the province’s land area. According to Announcement on the Production Capacity of the Province’s Production Coal Mines released by the Shanxi Provincial Energy Bureau points out that, by the end of 2017, there are 613 producing coal mines in Shanxi with a production capacity of 909.8 million tons per year, and the value added of coal industry leads to more carbon emissions. Meanwhile, Northeast China is the largest old industrial base of China, the enterprises of high energy consumption and high pollution industries accounted for a large proportion in secondary industries, which were the largest source of carbon emissions (Li et al. 2016). (2) In contrast to North China and Northeast China, the short-term and long-term direct effects of IS in East China and some provinces and cities in Central China are relatively low, while the direct effects of EI are relatively large. Actually, East China and some provinces and cities in Central China took the lead in transformation of industry-oriented structure into service-oriented industrial structure, and the local governments placed a priority on investing in treatment of environmental pollution in the past decade (Li et al. 2016; Huang et al. 2019; Wang et al. 2019; Guo et al. 2021). (3) In Northwest China, the direct and total effects of IS and the total effects of PGDP are low in the short and long term, which might because that this region had slow economic development and its primary industry had a high share compared to other regions (Fan et al. 2019), such as agriculture and animal husbandry, which has long played a major role in regional gross domestic product, and its heavy industry grew slowly.

4 Conclusion and policy implications

Based on panel data from 30 provinces in China between 2003 and 2017, this paper constructs our GNS with common factors to examine the direct and spatial–temporal spillover effects of IS, PGDP, EI and PE on \(CI\). Furthermore, we provide effective economic explanations for the observed heterogeneity from spatial and time dimensions, respectively. Perhaps the cross-products reduce the validity of applying the global models, while from the results of the \(q\)-statistic of the factors and the view of control functions, these factors are not affected by the SSH. The main findings are as follows: (1) China’s carbon intensity has been declining from 2003 to 2017 and showed obvious spatial agglomeration through a graphical display. (2) The estimation results of our GNS model further verified the spatial agglomeration and temporal “inertia” of \(CI\) in China. (3) From the time dimension, all relevant factors had a strong lasting impact on \(CI\). The long-term total effects of IS and PGDP first increased and then gradually decreased. Due to the substitution effect of PE in China, the long-term effect of PE was negative and significant. Furthermore, the short-term and long-term total effects of EI were both negative and significant, and the trends remained stable. (4) From the spatial dimension, there were regional differences in the short-term and long-term effects of the relevant factors on \(CI\). In North China and Northeast China, the enterprises of high energy consumption accounted for a large proportion in secondary industries. In East China and some provinces and cities in Central China, the tertiary industry remained the leading sector and the local governments placed a priority on investing in treatment of environmental pollution in the past decade. Northwest China had slow economic development and its primary industry has long played a major role in regional gross domestic product.

Based on the above conclusions, this study puts forward the following policy implications: firstly, since the industrial activities and economic growth are the main factors that increase carbon intensity in China, the government should transform the mode of economic development by optimizing industrial structure, and guide and encourage individual low-carbon lifestyles, such as driving less, installing a low-flow showerhead, bringing reusable bags when shopping and so on.

Secondly, due to the spatial heterogeneity of carbon intensity, different low-carbon development strategies should be applied in different regions. North China and Northeast China should optimize secondary industry, limit industrial development with high energy consumption and high carbon emissions to some extent and encourage the utilization of clean and renewable energy. East China and Central China should continue to encourage the development of tertiary industry, further optimize the energy structure and promote the application of advanced energy technologies. Northwest China should speed up its economic development by accelerating the transformation of industrial structure and improving energy utilization efficiency.

Thirdly, China should continue to promote the market-oriented reform of energy price and gradually improve the carbon emission trading mechanism, which will encourage individuals and enterprises to use low-carbon technologies. If emitting carbon becomes more expensive, consumers and enterprises may seek technologies and products to reduce their costs. As an efficient means, the market mechanism will help promote a shift to a clean energy economy and innovation in low-carbon technologies.

Finally, since the environmental expenditure is an important factor on the path towards a low-carbon economy, all provinces and cities should increase the environmental expenditure, and ensure the efficient use of funds to protect the environment.