Introduction

Global warming caused by the greenhouse effect is one of the factors that seriously threatens human survival and development, and the increase in CO2 emissions is considered the main cause of the greenhouse effect. According to the Global Climate 2015–2019 report released by the World Meteorological Organization, the growth rate of CO2 in the atmosphere from 2015 to 2019 was 18% higher than that during the previous 5 years, and the average temperature was 0.2 °C higher than that during the previous 5 years. Hence, this period encompasses the hottest 5 years on record. British Petroleum (BP) World Energy Statistics (2020) indicated that in 2019, the global CO2 emissions reached 34,169 million tons. Among the various countries, the USA and India rank second and third worldwide, respectively, in terms of their CO2 emissions, and the total CO2 emissions of these two countries account for 21.8% of the world emissions. The Asia Pacific region accounts for 50.5% of the global CO2 emissions. Upon entering the twenty-first century, the growth of global CO2 emissions is soaring. Thus, from 2009 to 2019, the CO2 emissions in the USA basically remained at approximately 5000 MT, revealing a trend of continuous fluctuation. The CO2 emissions in India grew the fastest, by 55.4%. The growth rate of the CO2 emissions in the Asia Pacific region reached 30.4%, which is also one of the fastest growing economies in the world. From the perspective of global CO2 emissions, although the growth rate is not high, with an average annual growth rate of 1.1%, CO2 emissions remain on the rise. To achieve the target of the Paris Agreement, more than 20 countries put forward carbon neutrality goals at the 2020 Climate Ambition Summit, and more than 40 countries established new commitments to independently improve national contributions. Therefore, accurate prediction of future CO2 emission data of the USA, India, Asia Pacific region, and the world can help policy-makers formulate more scientific and reasonable environmental policies to truly achieve the goal of carbon emission reduction.

The existing models for CO2 emission forecast can be classified into three categories. The first category includes nonlinear intelligent models, such as the least squares support vector machine (Sun and Liu 2016), extreme learning machine (Sun and Sun 2017), generalized regression neural network (Heydari et al. 2019); and improved chicken swarm optimization (ICSO-SVM) model using the ICSO algorithm to optimize support vector machine parameters (Wen and Cao 2020). The second category involves the statistical analysis model, which studies the quantitative relationship between CO2 emissions and influencing factors and applies the relationship equation to predict CO2 emissions. These approaches include trend analysis (Köne and Büke 2010), logistic equations (Meng and Niu 2011), improved Gaussian process regression (Fang et al. 2018), panel quantile regression (Zhu et al. 2018), log-average decomposition index (Xu et al. 2019), and comprehensive methods combining multiple regression analysis, input–output techniques, and structural decomposition analysis (Xia et al. 2019). The last category is the grey prediction model, which was first proposed by Deng (1982). The grey prediction model was established founded on a small amount of incomplete information to describe the development trend of objects more accurately. Compared to machine learning–based and statistical prediction methods founded on large data samples, the grey prediction model can realize the simulation and prediction of small data samples. The classical grey model GM(1,1) and its extended grey prediction model are widely used in the fields of energy, environment, and social management (Tsai 2016; Liu et al. 2020; Liu et al. 2021a; Wang et al. 2020b; Liu et al. 2021b).

There are two main kinds of grey single-variable forecasting models: the first-order grey differential model GM(1,1) and the grey Bernoulli model GBM(1,1). Deng first proposed the GBM(1,1) model in 1985; namely, a power exponent was introduced into the differential Bernoulli equation. When the exponent is equal to 2, the model is also referred to as the grey Verhulst model. Chen et al. (2008) proposed the nonlinear grey Bernoulli model NGBM(1,1) for the first time. Compared to the general GM(1,1) model, the NGBM(1,1) model can better reflect the nonlinear growth trend of data series. Chen et al. (2010) proposed a new grey prediction model NGBM(1,1), by optimizing the background value and power index simultaneously. Subsequently, many scholars improved the NGBM(1,1) model from different perspectives; e.g., Pao et al. (2012) proposed an iterative method to optimize the parameters of the NGBM(1,1) model. Wang (2017) optimized the background coefficient and initial conditions and considered the weighted method in analysis. Guo et al. (2016) proposed a new model by combining the self-memory principle of a dynamic system with the NGBM(1,1) model. Ma et al. (2019) constructed the NGBM(1, n) model by combining the GMC(1, n) model and Bernoulli equation. Wu et al. (2019b) extended the first-order accumulation operation and established the FANGBM(1,1) model based on fractional accumulation. Liu and Xie (2019) proposed the establishment of the WBGM(1,1) model by combining the fitting performance of the NGBM(1,1) model with the Weibull cumulative distribution. Şahin (2020) proposed the OFANGBM(1,1) model based on the integral mean value theorem. Wu et al. (2020a) established the NGBM(1,1, k, c) model by combining the NGM(1,1, k, c) model with the NGBM(1,1) model. Jiang and Wu (2021) constructed a nonlinear grey Bernoulli model based on the fractional order reverse accumulation, namely, the FANGBM(1,1) model. At present, a simpler and more convenient conformable fractional cumulative grey model (CFGM) was proposed (Ma et al., 2020). On this basis, Xie et al. (2020) proposed the conformable fractional grey model in opposite direction (CFGOM). Zheng et al. (2021) proposed the conformable fractional nonhomogeneous Bernoulli model (CFNHGBM (1,1, K)).

The grey prediction model is widely implemented in CO2 emission prediction. Lin et al. (2011) applied the grey model to predict CO2 emissions in Taiwan. Pao et al. (2012) used the NGBM model to predict the CO2 emissions and real GDP growth of China. Lotfall et al. (2013) forecast CO2 emissions based on a grey model and an autoregressive integrated moving average (ARIMA) model. The prediction accuracy of these two methods was compared according to the root mean square error (RMSE), mean absolute error (MAE), and mean absolute percentage error (MAPE). Gao et al. (2015) established a new discrete fractional order cumulation model, i.e., FAGM(1,1, D), to predict CO2 emissions. Hamzacebi and Karakurt (2015) used the grey prediction model to predict the energy-related CO2 emissions in Turkey. Yuan et al. (2017) established a linear programming model reflecting the relationship between the economic development and CO2 emissions in China and employed the GM(1,1) model to predict the parameters of the planning model. Wang and Ye (2017) developed a nonlinear multivariable grey model to discuss the relationship between economic growth and CO2 emissions. Xu et al. (2019) combined an adaptive grey model with the buffer rolling method to predict the greenhouse gas emissions in China from 2017 to 2025. Wang and Li (2019) adopted the nonequidistant grey Verhulst model to analyze the relationship between CO2 emissions and economic growth. Wu et al. (2020b) implemented a conformable fractional nonhomogeneous grey model to predict the CO2 emissions of BRICS countries. Chiu et al. (2020) proposed a multivariate grey prediction model using neural networks based on feature selection and residual correction to predict China’s carbon emissions. Based on the grey Verhulst model, Duan and Luo (2020) introduced an extrapolation method to optimize the background value and predicted the CO2 emissions for three coal resources in China. Wang et al. (2020a) established the metabolic nonlinear grey model (MNGM)-ARIMA method, established a new MNGM-BPNN combination model based on the MNGM model and back-propagation (BP) neural network and analyzed the CO2 emissions in China, the USA, and India with these two methods. Zhou et al. (2021) proposed a method to process original sequence data by means of an average weakening buffer operator based on the grey rolling mechanism of the new information priority principle and predicted the trend of CO2 emissions in China. Xie et al. (2021) established a new continuous conformable fractional nonlinear grey Bernoulli model to forecast CO2 emissions from fuel combustion in China.

Scholars have greatly promoted the optimization and application of grey models. However, although the existing studies have optimized the grey model considering the structure or parameters, each optimization method only improves the model performance to a certain extent, and the accuracy remains insufficient. In addition, most models only apply one optimization algorithm to determine the optimal parameters, and research on the application of multiple optimization algorithms is rare. Therefore, to better predict carbon emissions, based on the optimization of existing models, namely, NGBM(1,1) and FAGM(1,1,\(t^{\alpha }\)) (Wu et al. 2019a), this paper proposes a new grey prediction model FGBM(1,1,\(t^{\alpha }\)) and considers a variety of optimization algorithms to determine the optimal structural parameters of the model. The main contributions of this paper are as follows:

  1. (1).

    Based on the advantages of the grey Bernoulli prediction model NGBM(1,1) and FAGM(1,1, \(t^{\alpha }\)), a new model, i.e., FGBM(1,1,\(t^{\alpha }\)), is proposed. The new model can be transformed into other grey prediction models by changing its parameters.

  2. (2).

    In the FAGM(1,1,\(t^{\alpha }\)) model, application of the trapezoidal integral method to obtain an approximate solution yields errors, but this paper provides an analytical solution of the time response function in the FGBM(1,1,\(t^{\alpha }\)) model, and the result is more accurate.

  3. (3).

    Grey wolf optimization (GWO), particle swarm optimization (PSO), and quantum genetic algorithm (QGA) are applied to solve the parameters of the FGBM(1,1,\(t^{\alpha }\)) model.

  4. (4).

    The FGBM(1,1,\(t^{\alpha }\)) model is used to predict the CO2 emissions of four countries and regions over the next 5 years.

This paper is organized as follows: the second section first introduces the basic theory of the NGBM(1,1) and FAGM(1,1,\(t^{\alpha }\)) models. Second, a new grey prediction model FGBM(1,1,\(t^{\alpha }\)) is constructed, which mainly includes the modeling mechanism, model characteristics, and solution method. Then, three common optimization algorithms (GWO, PSO, and QGA) are employed to solve the parameters of the FGBM(1,1,\(t^{\alpha }\)) model, and two widely adopted error measures are introduced. The third section considers the CO2 emission data of four economies to verify the feasibility and effectiveness of the FGBM(1,1,\(t^{\alpha }\)) model. The fourth section applies the FGBM(1,1,\(t^{\alpha }\)) model to predict the CO2 emissions of the USA, India, Asia Pacific region and the world over the next five years. The fifth section contains the conclusion of this paper.

Prerequisite knowledge

Grey Bernoulli model NGBM(1,1) and FAGM(1,1,\(t^{\alpha }\))

This section mainly introduces the grey Bernoulli model NGBM(1,1) and the FAGM(1,1,\(t^{\alpha }\)) model as follows:

Definition 1: Given a nonnegative sequence \(X^{(0)} = \{ x^{(0)} (1),x^{(0)} (2), \cdots ,x^{(0)} (n)\}\), \(X^{(1)} = \{ x^{(1)} (1),x^{(1)} (2), \ldots ,x^{(1)} (n)\}\) is referred to as the first-order generating sequence of \(X^{(0)}\), where \(x^{(1)} (k) = \mathop \sum \limits_{i = 1}^{k} x^{(0)} (i),k = 1,2, \cdots n\). Based on Chen et al. (2008), it can be found that the expression of NGBM(1,1) is:

$$\frac{{dx^{(1)} (t)}}{dt} + ax^{(1)} (t) = b(x^{(1)} (t))^{\gamma }$$
(1)

The above is a nonlinear equation, and the exponent exponent \(\gamma \ne 1\).

The parameters \(a,b\) of the NGBM(1,1) model are calculated as follows:

$$\left(a,b\right)^T=\left(B^TB\right)^{-1}B^TY$$
(2)
$$\begin{array}{cc}B=\begin{pmatrix}-z^{\left(1\right)}\left(2\right)&\left(z^{\left(1\right)}\left(2\right)\right)^\gamma\\-z^{\left(1\right)}\left(3\right)&\left(z^{\left(1\right)}\left(3\right)\right)^\gamma\\\vdots&\vdots\\-z^{\left(1\right)}\left(n\right)&\left(z^{\left(1\right)}\left(n\right)\right)^\gamma\end{pmatrix},&Y=\begin{pmatrix}x^{\left(0\right)}\left(2\right)\\x^{\left(0\right)}\left(3\right)\\\vdots\\x^{\left(0\right)}\left(n\right)\end{pmatrix}\end{array}$$

where \(Z^{(1)} = \left( {z^{(1)} (2),z^{(1)} (3), \ldots ,z^{(1)} (n)} \right)\), \(z^{(1)} (k) = 0.5x^{(1)} (k - 1) + 0.5x^{(1)} (k),k = 2,3, \ldots ,n\), and \(n\) is the sample number of the modeling sequence.

The time response function of the NGBM(1,1) model is as follows:

$$\hat{x}^{(1)} (k) = \{ [(x^{(0)} (1))^{1 - \gamma } - \frac{b}{a}] \cdot e^{ - a(1 - \gamma )(k - 1)} + \frac{b}{a}\}^{{\frac{1}{1 - \gamma }}} ,k = 2,3, \cdots n$$
(3)

The predicted values of the model are as follows:

$$\hat{x}^{(0)} (k) = \hat{x}^{(1)} (k) - \hat{x}^{(1)} (k - 1),k = 2,3, \cdots n$$
(4)

Definition 2: Given a nonnegative sequence \(X^{(0)} = \{ x^{(0)} (1),x^{(0)} (2), \cdots ,x^{(0)} (n)\} ,\quad r \in R^{ + }\), and its \(r\)-th order accumulation sequence is \(X^{(r)} = \{ x^{(r)} (1),x^{(r)} (2), \ldots ,x^{(r)} (n)\}\) (Wu et al, (2013). Denoted as matrix \(A^{r}\), the r-AGO matrix that satisfies \(X^{(r)} = {\text{A}}^{r} X^{(0)}\) is:

$${\text{A}}^{r} = \left( {\begin{array}{*{20}c} {\left[ {\begin{array}{*{20}c} r \\ 0 \\ \end{array} } \right]} & 0 & 0 & \cdots & 0 \\ {\left[ {\begin{array}{*{20}c} r \\ 1 \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} r \\ 0 \\ \end{array} } \right]} & 0 & \cdots & 0 \\ {\left[ {\begin{array}{*{20}c} r \\ 2 \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} r \\ 1 \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} r \\ 0 \\ \end{array} } \right]} & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ {\left[ {\begin{array}{*{20}c} r \\ {n - 1} \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} r \\ {n - 2} \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} r \\ {n - 3} \\ \end{array} } \right]} & \cdots & {\left[ {\begin{array}{*{20}c} r \\ 0 \\ \end{array} } \right]} \\ \end{array} } \right)_{n \times n}$$
(5)

with \(\left[ {\begin{array}{*{20}l} r \hfill \\ i \hfill \\ \end{array} } \right] = \frac{r(r + 1) \cdots (r + i - 1)}{{i!}} = \left( {\begin{array}{*{20}c} {r + i - 1} \\ i \\ \end{array} } \right) = \frac{(r + i - 1)!}{{i!(r - 1)!}},\left[ {\begin{array}{*{20}l} 0 \hfill \\ i \hfill \\ \end{array} } \right] = 0,\left[ {\begin{array}{*{20}l} 0 \hfill \\ 0 \hfill \\ \end{array} } \right] = \left( {\begin{array}{*{20}l} 0 \hfill \\ 0 \hfill \\ \end{array} } \right) = 1\).

Denoted as matrix \(D^{r}\), the r-IAGO matrix that satisfies \(X^{(0)} = D^{r} X^{(r)}\) is:

$${\text{D}}^{r} = \left( {\begin{array}{*{20}c} {\left[ {\begin{array}{*{20}c} { - r} \\ 0 \\ \end{array} } \right]} & 0 & 0 & \cdots & 0 \\ {\left[ {\begin{array}{*{20}c} { - r} \\ 1 \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} { - r} \\ 0 \\ \end{array} } \right]} & 0 & \cdots & 0 \\ {\left[ {\begin{array}{*{20}c} { - r} \\ 2 \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} { - r} \\ 1 \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} { - r} \\ 0 \\ \end{array} } \right]} & \cdots & 0 \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ {\left[ {\begin{array}{*{20}c} { - r} \\ {n - 1} \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} { - r} \\ {n - 2} \\ \end{array} } \right]} & {\left[ {\begin{array}{*{20}c} { - r} \\ {n - 3} \\ \end{array} } \right]} & \cdots & {\left[ {\begin{array}{*{20}c} { - r} \\ 0 \\ \end{array} } \right]} \\ \end{array} } \right)_{n \times n}$$
(6)

with: \(\left[ {\begin{array}{*{20}c} { - r} \\ i \\ \end{array} } \right] = \frac{ - r( - r + 1) \cdots ( - r + i - 1)}{{i!}} = ( - 1)^{i} \frac{r(r - 1) \cdots (r - i + 1)}{{i!}},\left[ {\begin{array}{*{20}c} { - r} \\ i \\ \end{array} } \right] = 0,i > r\).

Matrices \(A^{r}\) and \(D^{r}\) satisfy \(A^{r} D^{r} = I_{n}\).

The whitening differential equation of the FAGM(1,1,\(t^{\alpha }\)) model is:

$$\frac{{dx^{(r)} (t)}}{dt} + ax^{(r)} (t) = bt^{\alpha } + c,r > 0,\alpha \ge 0$$
(7)

Applying integration to Eq. (7) in the time interval of [k − 1, k], we obtain:

$$\mathop \smallint \nolimits_{k - 1}^{k} \frac{{dx^{(r)} (t)}}{dt}dt + a\mathop \smallint \nolimits_{k - 1}^{k} x^{(r)} (t)dt = b\mathop \smallint \nolimits_{k - 1}^{k} t^{\alpha } dt + c\mathop \smallint \nolimits_{k - 1}^{k} dt$$
(8)

Based on the trapezoid equation and \(z^{(r)} (k) = 0.5x^{(r)} (k - 1) + 0.5x^{(r)} (k),k = 2,3, \ldots ,n\),

Equation (8) is written as:

$$x^{(r)} (k) - x^{(r)} (k - 1) + az^{(r)} (k) = b\frac{{k^{1 + \alpha } - (k - 1)^{1 + \alpha } }}{1 + \alpha } + c$$
(9)

Based on Eq. (9), unknown parameters \(a,b,c\) of FAGM(1,1,\(t^{\alpha }\)) satisfy:

$$(\hat{a},\hat{b},\hat{c})^{T} = \left( {B^{T} B} \right)^{ - 1} B^{T} Y$$
(10)
$$B = \left( {\begin{array}{*{20}c} { - z^{(r)} (2)} & {\frac{{2^{1 + \alpha } - 1}}{1 + \alpha }} & 1 \\ { - z^{(r)} (3)} & {\frac{{3^{1 + \alpha } - 2^{1 + \alpha } }}{1 + \alpha }} & 1 \\ \vdots & \vdots & \vdots \\ { - z^{(r)} (n)} & {\frac{{n^{1 + \alpha } - (n - 1)^{1 + \alpha } }}{1 + \alpha }} & 1 \\ \end{array} } \right),\quad Y = \left( {\begin{array}{*{20}c} {x^{(r)} (2) - x^{(r)} (1)} \\ {x^{(r)} (3) - x^{(r)} (2)} \\ \vdots \\ {x^{(r)} (n) - x^{(r)} (n - 1)} \\ \end{array} } \right)$$
(11)

The time response function of the FAGM(1,1,\(t^{\alpha }\)) model is:

$$\begin{array}{*{20}l} {\hat{x}^{(r)} (t)} \hfill & { = e^{{ - \hat{a}t}} \left\{ {x^{(0)} (1)e^{{\hat{a}}} + \hat{b}\mathop \smallint \nolimits_{1}^{t} s^{\alpha } e^{{\hat{a}s}} ds + \frac{{\hat{c}}}{{\hat{a}}}\left( {e^{{\hat{a}t}} - e^{{\hat{a}}} } \right)} \right\}} \hfill \\ {} \hfill & { = \left( {x^{(0)} (1) - \frac{{\hat{c}}}{{\hat{a}}}} \right)e^{{ - \hat{a}(t - 1)}} + \hat{b}e^{{ - \hat{a}(t - 1)}} \mathop \smallint \nolimits_{1}^{t} s^{\alpha } e^{{\hat{a}(s - 1)}} ds + \frac{{\hat{c}}}{{\hat{a}}}} \hfill \\ {} \hfill & { = \left( {x^{(0)} (1) - \frac{{\hat{c}}}{{\hat{a}}}} \right)e^{{ - \hat{a}(t - 1)}} + \frac{{\hat{b}}}{2}e^{{ - \hat{a}(t - 1)}} \mathop \sum \limits_{i = 1}^{t - 1} \left( {i^{\alpha } e^{{\hat{a}(i - 1)}} + (i + 1)^{\alpha } e^{{\hat{a}i}} } \right) + \frac{{\hat{c}}}{{\hat{a}}}} \hfill \\ \end{array}$$
(12)

Moreover, the restored value of \(\hat{x}^{(0)} (k),k = 2,3, \ldots ,n\) is given as: \(\hat{X}^{(0)} = D^{r} \hat{X}^{(r)}\).

Description of the FGBM(1,1,\(t^{\alpha }\))

The expression of the FGBM(1,1,\(t^{\alpha }\)) model is as follows:

$$\frac{{dx^{(r)} (t)}}{dt} + ax^{(r)} (t) = (bt^{\alpha } + c)(x^{(r)} (t))^{\xi }$$
(13)

where \(0 \le r \le 1,0 \le \alpha \le 4,0 \le \xi \le 3,\xi \ne 1\).

Theorem 1. The time response function of FGBM (1,1,\(t^{\alpha }\)) is derived as:

$$x^{(r)} (k) = \left\{ {(\left[ {x^{(0)} (1)} \right]} \right.^{1 - \xi } - \frac{{c^{\prime}}}{{a^{\prime}}})e^{{ - a^{\prime}(k - 1)}} + \frac{{c^{\prime}}}{{a^{\prime}}} + b^{\prime}e^{{ - a^{\prime}(k - 1)}} \int_{1}^{k} {\tau^{\alpha } } e^{{a^{\prime}(\tau - 1)}} \left. {d\tau } \right\}^{{\frac{1}{1 - \xi }}} ,k = 1,2,3 \cdots ,n$$
(14)

The reduction value of \(\hat{x}^{(r)} (k)\) is \(\hat{x}^{(0)} (k)\),

$$\hat{x}^{(0)} (k) = D^{r} \hat{x}^{(r)} (k),k = 1,2,3, \cdots ,n$$
(15)

Proof, Multiplying both sides of Eq. (13) by \(x^{(r)} (t)^{ - \xi }\). Let \(y^{(r)} = [x^{(r)} (t)]^{1 - \xi }\), one can obtain:

$$\frac{{dy^{(r)} (t)}}{dt} + a(1 - \xi )y^{(r)} (t) = (bt^{\alpha } + c)(1 - \xi )$$
(16)

Let the left side of Eq. (13) be 0, \(a(1 - \xi ) = a^{\prime},b(1 - \xi ) = b^{\prime},c(1 - \xi ) = c^{\prime}\),we can get:

$$\frac{{dy^{(r)} (t)}}{dt} + a(1 - \xi )y^{(r)} (t) = \frac{{dy^{(r)} (t)}}{dt} + a^{\prime}y^{(r)} (t) = 0$$
(17)

The general solution formula of the equation is as follows:

$$y^{(r)} (t) = Ce^{ - a^{\prime}t}$$
(18)
$$\frac{dC(t)}{{dt}} = e^{a^{\prime}t} (b^{\prime}t^{\alpha } + c^{\prime})$$
(19)

To perform the definite integral on interval \([1,t]\), we know that:

$$C(t) = \int {(b^{\prime}t^{\alpha } + c)e^{a^{\prime}t} dt} = C(1) + \int_{1}^{t} {(b^{\prime}\tau^{\alpha } + c)e^{a^{\prime}\tau } } d\tau = C(1) + b^{\prime}\int_{1}^{t} {\tau^{\alpha } e^{a^{\prime}\tau } } d\tau + \frac{c^{\prime}}{{a^{\prime}}}(e^{a^{\prime}t} - e^{a^{\prime}} )$$
(20)
$$C(1) = y^{(r)} (1)e^{a^{\prime}} = y^{(0)} (1)e^{a^{\prime}}$$
(21)
$$\begin{gathered} y^{(r)} (t) = e^{ - a^{\prime}t} [y^{(0)} (1)e^{a} + b^{\prime}\int_{1}^{t} {\tau^{\alpha } e^{a^{\prime}\tau } } d\tau + \frac{c^{\prime}}{{a^{\prime}}}(e^{a^{\prime}t} - e^{a^{\prime}} )] \hfill \\ \;\;\;\;\;\;\;\;\; = (y^{(0)} (1) - \frac{c^{\prime}}{{a^{\prime}}})e^{ - a^{\prime}(t - 1)} + \frac{c^{\prime}}{{a^{\prime}}} + b^{\prime}e^{ - a^{\prime}(t - 1)} \mathop \smallint \nolimits_{1}^{t} \tau^{\alpha } e^{a^{\prime}(\tau - 1)} d\tau \hfill \\ \end{gathered}$$
(22)

And because \(y^{(r)} (t) = [x^{(r)} (t)]^{1 - \xi }\),\(y^{(r)} (1) = y^{(0)} (1) = [x^{(r)} (1)]^{1 - \xi } = [x^{(0)} (1)]^{1 - \xi }\), the time response function can be derived from Eq. (22) by

$$x^{(r)} (k) = [(x^{(0)} (1)^{1 - \xi } - \frac{c^{\prime}}{{a^{\prime}}})e^{ - a^{\prime}(k - 1)} + \frac{c^{\prime}}{{a^{\prime}}} + b^{\prime}e^{ - a^{\prime}(k - 1)} \int_{1}^{k} {\tau^{\alpha } e^{a^{\prime}(\tau - 1)} d\tau } ]^{{\frac{1}{1 - \xi }}}$$
(23)

Although \(\alpha\) is not an integer, \(\int_{1}^{k} {\tau^{\alpha } e^{a^{\prime}(\tau - 1)} d\tau }\) can be integral by numerical integration method to obtain the real number.

The predicted values \(\hat{x}^{(0)} (k)\) can be obtained by:

$$\hat{x}^{(0)} (k) = D^{r} \hat{x}^{(r)} (k),k = 1,2,3, \cdots ,n$$
(24)

Parameters estimation of the FGBM(1,1,\(t^{\alpha }\))

By integrating on \([k - 1,k]\) both side of Eq. (16) at the same time, the following conclusion can be obtained:

$$y^{(r)} (k) - y^{(r)} (k - 1) + a^{\prime}\int\limits_{k - 1}^{k} {y^{(r)} (t)dt = } \frac{{k^{\alpha + 1} - (k - 1)^{\alpha + 1} }}{\alpha + 1}b^{\prime} + c^{\prime}$$
(25)

According to the integral median theorem, we can get:

$$\int\limits_{k - 1}^{k} {y^{(r)} (t)dt = } \lambda y^{(r)} (k) + (1 - \lambda )y^{(r)} (k - 1)$$
(26)

where \(0 \le \lambda \le 1\).

The results are as follows:

$$y^{(r)} (k) - y^{(r)} (k - 1) + a^{\prime}[\lambda y^{(r)} (k) + (1 - \lambda )y^{(r)} (k - 1)] = \frac{{k^{\alpha + 1} - (k - 1)^{\alpha + 1} }}{\alpha + 1}b^{\prime} + c^{\prime}$$
(27)

According to the commonly used method to solve the parameters of grey prediction model, the least square criterion of FGBM (1,1,\(t^{\alpha }\)) model can be described as the following unconstrained optimization problems:

$$\mathop {\min }\limits_{{a^{^{\prime}} ,b^{^{\prime}} ,c^{^{\prime}} }} \mathop \sum \limits_{k = 2}^{n} \left\{ {y^{(r)} (k) - y^{(r)} (k - 1) + a^{\prime}[\lambda y^{(r)} (k) + (1 - \lambda )y^{(r)} (k - 1)] - \frac{{k^{\alpha + 1} - (k - 1)^{\alpha + 1} }}{\alpha + 1}b^{\prime} - c^{\prime}} \right\}^{2}$$
(28)

The solution of this optimization problem is:

$$[\hat{a}^{\prime} \, \hat{b}^{\prime} \, \hat{c}^{\prime}]^{T} = \left( {B^{T} B} \right)^{ - 1} B^{T} Y$$
(29)

where

$$B = \left( {\begin{array}{*{20}c} { - [\lambda y^{(r)} (2) + (1 - \lambda )y^{(r)} (1)]} & {\frac{{2^{\alpha + 1} - 1^{\alpha + 1} }}{1 + \alpha }} & 1 \\ { - [\lambda y^{(r)} (3) + (1 - \lambda )y^{(r)} (2)]} & {\frac{{3^{\alpha + 1} - 2^{\alpha + 1} }}{1 + \alpha }} & 1 \\ \vdots & \vdots & \vdots \\ { - [\lambda y^{(r)} (n) + (1 - \lambda )y^{(r)} (n - 1)]} & {\frac{{n^{\alpha + 1} - (n - 1)^{\alpha + 1} }}{1 + \alpha }} & 1 \\ \end{array} } \right),Y = \left[ {\begin{array}{*{20}c} {y^{(r)} (2) - y^{(r)} (1)} \\ {y^{(r)} (3) - y^{(r)} (2)} \\ \vdots \\ {y^{(r)} (n) - y^{(r)} (n - 1)} \\ \end{array} } \right]$$
(30)

In order to facilitate readers to understand the solution process of the FGBM (1,1,\(t^{\alpha }\)) model, Fig. 1 shows the algorithm implementation process of FGBM (1,1,\(t^{\alpha }\)) model.

Fig. 1
figure 1

Algorithm implementation process of FGBM (1,1,)

Properties of the FGBM(1,1,\(t^{\alpha }\)) model

The expression of the FGBM(1,1,\(t^{\alpha }\)) model indicates that the FGBM(1,1,\(t^{\alpha }\)) model can be transformed into other grey prediction models when different values are assigned to parameters \(\xi\) and \(r,\alpha\), such as GM(1,1) (Deng 1982), FGM(1,1) (Wu et al. 2013), FAGM(1,1,\(t^{\alpha }\)) (Wu et al. 2019a), NGM(1,1,k,c) (Cui et al. 2009), GM(1,1,\(t^{2}\)) (Qian et al. 2012), GM(1,1,\(t^{\alpha }\)) (Qian et al. 2012), NGBM(1,1) (Chen et al. 2008), FANGBM(1,1) (Wu et al. 2019a), and NGBM(1,1,k,c) (Wu et al. 2020a). The relationships between FGBM (1,1,\(t^{\alpha }\)) with other grey models are shown in the Fig. 2.

Fig. 2
figure 2

Relationships between FGBM (1,1,) with other grey models

To minimize the model error, we must determine the optimal values of parameters \(r,\lambda ,\alpha ,\xi\) and adopt the MAPE as the main criterion. The optimization problems when solving the optimal parameters are as follows:

$$\mathop {\min }\limits_{r,\lambda ,\alpha ,\xi } f(r,\lambda ,\alpha ,\xi ) = \frac{1}{n - 1}\mathop \sum \limits_{t = 2}^{n} \left| {\frac{{x^{(0)} (t) - \hat{x}^{(0)} (t)}}{{x^{(0)} (t)}}} \right| \times 100{\text{\% }}$$
(31)
$$s.t.\left\{ \begin{gathered} 0 \le r \le 1,0 \le \lambda \le 1,0 \le \alpha \le 4,0 \le \xi \le 3,\xi \ne 1 \hfill \\ [\hat{a}^{\prime} \, \hat{b}^{\prime} \, \hat{c}^{\prime}]^{T} = \left( {B^{T} B} \right)^{ - 1} B^{T} Y \hfill \\ B = \left( {\begin{array}{*{20}c} { - [\lambda y^{(r)} (2) + (1 - \lambda )y^{(r)} (1)]} & {\frac{{2^{\alpha + 1} - 1^{\alpha + 1} }}{1 + \alpha }} & 1 \\ { - [\lambda y^{(r)} (3) + (1 - \lambda )y^{(r)} (2)]} & {\frac{{3^{\alpha + 1} - 2^{\alpha + 1} }}{1 + \alpha }} & 1 \\ \vdots & \vdots & \vdots \\ { - [\lambda y^{(r)} (n) + (1 - \lambda )y^{(r)} (n - 1)]} & {\frac{{n^{\alpha + 1} - (n - 1)^{\alpha + 1} }}{1 + \alpha }} & 1 \\ \end{array} } \right),Y = \left[ {\begin{array}{*{20}c} {y^{(r)} (2) - y^{(r)} (1)} \\ {y^{(r)} (3) - y^{(r)} (2)} \\ \vdots \\ {y^{(r)} (n) - y^{(r)} (n - 1)} \\ \end{array} } \right] \hfill \\ \hat{x}^{(r)} (t) = [(x^{(0)} (1)^{1 - \xi } - \frac{{\hat{c}^{\prime}}}{{\hat{a}^{\prime}}})e^{{ - \hat{a}^{\prime}(t - 1)}} + \frac{{\hat{c}^{\prime}}}{{\hat{a}^{\prime}}} + \hat{b}^{\prime}e^{{ - \hat{a}^{\prime}(t - 1)}} \mathop \smallint \nolimits_{1}^{t} \tau^{\alpha } e^{{\hat{a}^{\prime}(\tau - 1)}} d\tau ]^{{\frac{1}{1 - \xi }}} \hfill \\ \hat{x}^{(0)} (t) = \left( {\hat{x}^{(r)} (t)} \right)^{( - r)} ,t = 1,2,3, \cdots ,n \hfill \\ \end{gathered} \right.$$
(32)

The above optimization problems are essentially nonlinear programming problems with equality constraints, which can be solved with intelligent optimization algorithms or heuristic algorithms. In this paper, GWO, PSO, and QGA are adopted to solve the parameters. The reason for choosing these three algorithms is: GWO algorithm has the characteristics of simple structure, has few parameters to be adjusted, and is easy to implement. Among them, there are adaptive convergence factors and information feedback mechanism, which can achieve a balance between local optimization and global search. Therefore, GWO algorithm has good performance in solving the problem accuracy and convergence speed. The advantage of PSO algorithm is that it has memory ability. In the implementation of intelligent search, it can combine the individual and global optimal location to realize location. It has a very fast speed of approaching the optimal solution and can effectively optimize the parameters of the system. The population coding method used by QGA greatly enriches the diversity of the population, at the same time, the quantum revolving gate is used to update the population and evolve based on the information of the current optimal individual. It has the characteristics of strong adaptability, fast convergence and suitable for global search. The parameters of the algorithm are shown in Table 1.

Table 1 The parameters setting for the three algorithms

Error metric

This section considers two error metrics widely used for prediction models to test the effectiveness and applicability of the grey prediction model, as indicated in Table 2.

Table 2 Error metrics of prediction model

Validation of FGBM(1,1,\(t^{\alpha }\))

In this section, four cases are presented to check the accuracy of the FGBM(1,1,\(t^{\alpha }\)), and the forecasting results are compared to those obtained with other grey prediction models, such as NGM(1,1), SIGM, GMP, FGM, NGBM(1,1,k,c), and FAGM(1,1,\(t^{\alpha }\)). In Sections 3.1 to 3.4, the raw data in Table 1 and the above seven grey models are used to simulate and predict the CO2 emissions of the USA, India, Asia Pacific region, and the world. The original time series data from 2009 to 2016 are used to build FGBM(1,1,\(t^{\alpha }\)), NGM(1,1), SIGM, GMP, FGM, NGBM(1,1,k,c), and FAGM(1,1,\(t^{\alpha }\)), and the original time series data from 2017 to 2019 are used to check the accuracy of the above grey prediction models.

CO 2 emissions of the USA

According to Statistics Review of World Energy 2020, the USA produces the highest CO2 emissions worldwide. In 2019, the CO2 emissions of the USA reached 4964.69 million tons, accounting for 14.5% of the total global CO2 emissions, ranking second in the world. The task of emission reduction in the USA is far from realization. In this regard, the USA has put forward the Zero Carbon Action Plan (ZCAP), which deeply considers stabilization of domestic employment, economic innovation, and coordinated development of the environment in the postepidemic era. Therefore, this section adopts the USA as an example to test the accuracy of the FGBM(1,1,\(t^{\alpha }\)) model. The parameters and MAPE values of FGBM(1,1,\(t^{\alpha }\)) calculated with the three optimization algorithms are listed in Table 3. Figure 3 shows the number of iterations of the three optimization algorithms and the relationship between MAPE and the parameters. The MAPE and test_MAPE values for QGA are 1.1320% and 1.5805%, respectively, which are the smallest among those for the three optimization algorithms, and the performance is the best. Therefore, the QGA results are selected here. The fitting and prediction results for each model are shown in Fig. 4 and Table 4, and the error measurement results for each model are shown in Fig. 5 and Table 5. The prediction value of the FGBM(1,1,\(t^{\alpha }\)) model is the closest to the actual value. Based on the fitting values, the two error measurement indexes of the FGBM(1,1,\(t^{\alpha }\)) model are the best. This also indicates that the FGBM(1,1,\(t^{\alpha }\)) model is more accurate than the other grey models in predicting the CO2 emissions in the USA.

Table 3 Parameters and MAPEs of the FGBM (1,1,\(t^{\alpha }\)) model based on different optimization algorithms(Case 1)
Fig. 3
figure 3

Iterations, MAPE, and parameters of the three optimization algorithms

Fig. 4
figure 4

Results of CO2 emissions of USA

Table 4 Fitting and prediction results of CO2 emissions of USA
Fig. 5
figure 5

Error metrics of CO2 emissions of USA

Table 5 Error metrics of CO2 emissions of USA

CO 2 emissions of India

According to Statistics Review of World Energy 2020, in 2019, the CO2 emissions in India reached 2480.35 million tons, accounting for 7.3% of the global CO2 emissions, ranking third in the world. From 2010 to 2019, the CO2 emissions in India increased, at an annual growth rate of 5.48%. With the rapid growth of the Indian economy and population, its CO2 emissions also increased notably, which is estimated to account for 11% of the global CO2 emissions by 2030. In the face of the call of the international community to promote carbon emission reduction, the Indian government has promised to reduce greenhouse gas emissions by 33–35% from 2015 to 2030. Therefore, this section chooses India as an example to test the accuracy of the FGBM(1,1,\(t^{\alpha }\)) model. The parameters and MAPE values of the FGBM(1,1,\(t^{\alpha }\)) model calculated with the three optimization algorithms are listed in Table 6. The MAPE and test_MAPE values for GWO are 0.5090% and 0.7130%, respectively, which are the smallest among the three optimization algorithms, and the performance is the best. Therefore, GWO is selected here. Figure 6 shows the number of iterations and the relationship between MAPE and the parameters. The fitting and prediction results for each model are shown in Fig. 7 and Table 7, and the error measurement results for each model are shown in Fig. 8 and Table 8. The prediction value of the FGBM(1,1,\(t^{\alpha }\)) model is the closest to the actual value. Based on the prediction and fitting values, the two error measurement indexes of the FGBM(1,1,\(t^{\alpha }\)) model are the best. This further demonstrates that the FGBM(1,1,\(t^{\alpha }\)) model is more accurate than the other grey models in predicting the CO2 emissions of India.

Table 6 Parameters and MAPEs of the FGBM (1,1,\(t^{\alpha }\)) model based on different optimization algorithms(Case 2)
Fig. 6
figure 6

Iterations, MAPE, and parameters of the three optimization algorithms

Fig. 7
figure 7

Results of CO2 emissions of India

Table 7 Fitting results and prediction results of CO2 emissions of India
Fig. 8
figure 8

Error metrics of CO2 emissions of India

Table 8 Error metrics of CO2 emissions of India

CO 2 emissions of the Asia Pacific region

According to Statistics Review of World Energy 2020, in 2019, the CO2 emissions of the Asia Pacific region amounted to 17,269.46 million tons, accounting for 50.54% of the world’s CO2 emissions. From 2010 to 2019, the CO2 emissions in the Asia Pacific region indicated a rising trend, at an average annual growth rate of 2.60%. This section selects the Asia Pacific region as an example to test the accuracy of the FGBM(1,1,\(t^{\alpha }\)) model. The parameters and MAPE values of the FGBM(1,1,\(t^{\alpha }\)) model calculated with the three optimization algorithms are shown in Table 9. The MAPE and test-MAPE values obtained with PSO are 0.1076% and 0.6854%, respectively, which are the smallest and the best among the three optimization algorithms, so the PSO algorithm is selected here. Figure 9 shows the relationship between the number of iterations, MAPE, and parameters. The fitting and prediction results for each model are shown in Fig. 10 and Table 10, and the error measurement results for each model are shown in Fig. 11 and Table 11. The predicted value of the FGBM(1,1,\(t^{\alpha }\)) model is the closest to the actual value. According to the prediction and fitting values, the two error metrics of the FGBM(1,1,\(t^{\alpha }\)) model are the best. This also indicates that the FGBM(1,1,\(t^{\alpha }\)) model is more accurate than the other grey prediction models in predicting the CO2 emissions in the Asia Pacific region.

Table 9 Parameters and MAPEs of the FGBM (1,1,\(t^{\alpha }\)) model based on different optimization algorithms(Case 3)
Fig. 9
figure 9

Iterations, MAPE, and parameters of the three optimization algorithms

Fig. 10
figure 10

Results of CO2 emissions of Asia Pacific

Table 10 Fitting results and prediction results of CO2 emissions of Asia Pacific
Fig. 11
figure 11

Error metrics of CO2 emissions of Asia Pacific

Table 11 Error metrics of CO2 emissions of Asia Pacific

Total CO 2 emissions of the world

According to Statistics Review of World Energy 2020, in 2019, the global CO2 emissions reached 34,169.00 million tons, which has increased greatly over the past few decades. From 2010 to 2019, the global CO2 emissions increased by 1.102% annually, exhibiting a slow rising trend. However, economic recovery after the COVID-19 pandemic will facilitate a global rebound in CO2 emissions. Therefore, this section adopts the world as an example to test the accuracy of the FGBM(1,1,\(t^{\alpha }\)) model. The parameters and MAPE values of the FGBM(1,1,\(t^{\alpha }\)) model calculated with the three optimization algorithms are summarized in Table 12. The MAPE and test_MAPE values obtained with QGA are 0.168% and 2.352%, respectively, which are the smallest and the best among the three optimization algorithms. Therefore, QGA is selected here. Figure 12 shows the relationship between the number of iterations, MAPE, and parameters. The fitting and prediction results for each model are shown in Fig. 13 and Table 13, and the error measurement results for each model are shown in Fig. 14 and Table 14. The predicted value of the FGBM(1,1,\(t^{\alpha }\)) model is the closest to the actual value. Based on the prediction and fitting values, the two error metrics of the FGBM(1,1,\(t^{\alpha }\)) model are the best. This further verifies that the FGBM(1,1,\(t^{\alpha }\)) model is more accurate than the other grey prediction models in predicting the global CO2 emissions.

Table 12 Parameters and MAPEs of the FGBM (1,1,\(t^{\alpha }\)) model based on different optimization algorithms(Case 4)
Fig. 12
figure 12

Iterations, MAPE, and parameters of the three optimization algorithms

Fig. 13
figure 13

Results of CO2 emissions of total world

Table 13 Fitting results and prediction results of CO2 emissions of total world
Fig. 14
figure 14

Error metrics of CO2 emissions of total world

Table 14 Error metrics of CO2 emissions of total world

The above results reveal that although FGBM(1,1,\(t^{\alpha }\)) achieves the best effect in predicting the global CO2 emissions among the many models, due to the sudden increase in the original data of CO2 emissions in 2016, the fitting effect of the model is good, but the prediction error is large. The predicted global CO2 emissions over the next 5 years may be far from the real value. To resolve this problem, global carbon emission data can be divided into two parts, namely, Organisation for Economic Co-operation and Development (OECD) and non-OECD data. The trend of CO2 emissions based on OECD and non-OECD data is more stable than that based on global data. Therefore, this paper applies the FGBM(1,1,\(t^{\alpha }\)) model to fit and predict the CO2 emissions based on OECD and non-OECD data, as listed in Tables 15, 16, and 17. Tables 15 and 16 reveal that between the OECD and non-OECD data, the prediction error obtained with QGA is the smallest, so QGA is selected to calculate the model parameters. The global CO2 emissions are determined by adding the OECD and non-OECD-based CO2 emissions, as summarized in Table 17. The fitting and prediction errors of the global CO2 emissions are listed in Table 18 and are lower than those listed in Table 13. Table 19 provides the OECD and non-OECD based CO2 emissions over the next 5 years. The global CO2 emissions over the next 5 years can be obtained by adding the two types of emissions. The fitting and prediction results are more accurate than the direct prediction results of the global CO2 emissions.

Table 15 Parameters and MAPEs of the FGBM (1,1,\(t^{\alpha }\)) model based on different optimization algorithms(OECD)
Table 16 Parameters and MAPEs of the FGBM (1,1,\(t^{\alpha }\)) model based on different optimization algorithms (non-OECD)
Table 17 Fitting and prediction results of CO2 emissions in OECD, non-OECD and total world
Table 18 Error metrics of CO2 emissions in total world(Add_values)
Table 19 Predictions for the CO2 emissions over the next 5 years in OECD and Non-OECD

Forecasting CO2 emissions over the next 5 years

In this section, we employ the FGBM(1,1,\(t^{\alpha }\)) model to predict the CO2 emissions of the USA, India, Asia Pacific region, and the world over the next 5 years (2020–2024). The prediction results are summarized in Table 20 and Fig. 15. The CO2 emissions in India, the Asia Pacific region, and the world will gradually increase over the 5 years. In addition, the CO2 emissions in the USA will slowly decline.

Table 20 Predictions for the CO2 emissions over the next 5 years
Fig. 15
figure 15

Predictions for the CO2 emissions over the next 5 years

Conclusions and policy implications

To better describe the future CO2 emissions of the USA, India, Asia Pacific region, and the world, a new grey prediction model, i.e., the FGBM(1,1,\(t^{\alpha }\)) model, is proposed based on the NGBM(1,1) and FAGM(1,1,\(t^{\alpha }\)) models, and a precise solution of the new model is obtained via the numerical integration method. Moreover, this paper applies three common optimization algorithms to calculate the model parameters. By changing the model parameters, the FGBM(1,1,\(t^{\alpha }\)) model can be transformed into other models, so the model achieves a strong adaptability. The CO2 emission fitting and forecasting results in the above four economies indicate that the FGBM(1,1,\(t^{\alpha }\)) model is more effective and accurate than the existing NGM(1,1), SIGM, GMP, FGM, NGBM(1,1,k,c), and FAGM(1,1,\(t^{\alpha }\)). Moreover, we employ the FGBM(1,1,\(t^{\alpha }\)) model to predict the CO2 emissions of the USA, India, Asia Pacific region, and the world over the next 5 years. The forecast results reveal that from 2020 to 2024, the CO2 emissions of India, the Asia Pacific region, and the world will gradually rise. The CO2 emissions of the USA will slowly decline over the next 5 years. Notably, the grey prediction FGBM(1,1,\(t^{\alpha }\)) model can be applied not only in the prediction of CO2 emissions but also in the prediction of other data, with a high adaptability.

In order to meet the requirements for carbon emission reduction in the sustainable development goals, several suggestions are put forward based on the above findings. First, promoting the green and low-carbon transformation of the energy system is the key. We should focus on energy conservation and improve the dual control system of total energy consumption and intensity. We should increase the utilization ratio of renewable energy such as wind power and photovoltaic power generation, and develop hydropower, geothermal energy, marine energy, hydrogen energy, biomass energy, and photothermal power generation according to local conditions. Second, the fiscal and tax support should be strengthened to encourage the research and development of green and low-carbon technologies. Financial funds should be used to support the development of green environmental protection industry and energy efficient utilization, and a number of scientific and technological projects should be arranged in the fields of energy conservation and environmental protection, cleaner production, and clean energy. Third, the recycling of renewable resources should be strengthened to realize a green and low-carbon lifestyle. We should promote the classification, reduction, and resource utilization of domestic waste according to local conditions, accelerate the construction of waste material recovery system, and strengthen the recycling of waste paper, waste plastics, waste tires and other resources.

This study still has some limitations. First, the model has many hyper-parameters, which may lead to over-fitting problems. Second, the model is a univariate model, which may ignore some important influencing factors (e.g., economic development, industrial policy, etc.), so the multivariate grey model can also be applied to predict carbon emissions, which is the optimization direction in the future.