Introduction

In the modern industrial era, petroleum has become a very important energy resource and a strategic economic resource. The scarcity of petroleum and the sustainable development of petroleum industry directly affect the development process of the national economy and defense security [1]. On the one hand, China is on an accelerated path to industrialization and urbanization [2], and this fact undoubtedly accelerates petroleum consumption in the secondary and tertiary industries. Specifically, Fig. 1 shows the proportion of petroleum consumption in different sectors of China; it is seen that the proportion of petroleum consumption of the secondary and tertiary industries basically dominants the total volume of petroleum consumption; therefore, the development trend of petroleum consumption will increase fast. On the other hand, Fig. 2 shows that China’s petroleum resources are located in poor and remote areas, which increases the difficulty of petroleum exploration. This fact embodies that China’s petroleum demand will rely on foreign import to a large degree [3]. On the foundation of the above-mentioned background, accurately estimating China’s petroleum consumption can assist in formulating rational plans and reducing capital waste [4].

Fig. 1
figure 1

The proportion of China’s petroleum consumption by sector in 2008

Fig. 2
figure 2

The statistical distribution of petroleum storage

In this context, the research regarding petroleum consumption has been received more attention. Having reviewed literature, these methods can be divided into three groups: the statistical models, machine learning methods, and grey forecasting models. For example, to overcome the challenge of modeling and forecasting the oil consumption with traditional methods, Turanoglu et al. [5] used the artificial neural network for unfolding the oil consumption forecasting using data for population, GDP, and import and export of Turkey in the period of 1965–2010. Dritsaki et al. [6] modeled and forecasted oil consumption in Greece using Box–Jenkins methodology during 960–2020, and their results showed a downturn in oil consumption for the following years. Yuan et al. [7]. used grey model with rolling mechanism to predict global petroleum consumption. Considering the nonlinear trend in petroleum consumption. More forecasting techniques for petroleum consumption can be seen in Table 1.

Table 1 Recent forecasting techniques for petroleum consumption

By reference with Ref. [15], there is little variation in the original data, and precise forecasts can be obtained. In addition, Ofosu-Adarkwa et al. [16] argued that it is better to focus on a time-series involving the most relevant shock when modeling a time-series sequence impacted by shocks. Taking into account data availability and volatility, this paper selects the grey modeling technique as the optimal forecasting model for accurately forecasting China’s petroleum consumption. In the past decades, grey prediction model has been widely used in various fields such as energy [17,18,19,20,21], environment [22], COVID-19 [23], tourism [24], automobile industry [25], and economics [26]. Due to its wide application range, a variety of derivative forms have been developed from the following aspects.

As many studies show that the traditional grey model (GM(1,1)) is suitable for processing time-series with homogeneous exponential law. To this end, Cui et al. [27] proposed a new model for predicting sequences with approximate non-homogeneous exponential law. Considering the influence of time-delay effect, Ma et al. [28] suggested time-delay polynomial grey model and applied this model in China’s natural gas consumption. Qian et al. [29] designed a new grey model with time power term. Wei et al. [30] developed a grey prediction model with polynomial term and optimized this model from the perspective of background value. Liu et al. [31] developed a combined grey model based on Refs. [29, 30]. Considering the nonlinear trend in the modeling sequence, Chen et al. [32] introduced the Bernoulli equation into GM(1,1); as a consequence, the nonlinear grey Bernoulli model (NGBM(1,1)) was proposed.

In the grey modeling procedure, the cumulative operation is applied to reduce the randomness of time-series sequence, which does achieve the purpose of accurate prediction. However, some scholars found that the prediction performance was poor when we take into account the integer-order accumulation. To address this issue, Wu et al. [33] introduced the fractional accumulated generation operator into the GM(1,1) model for enhancing the prediction performance. After that, Xie et al. [34] proposed an opposite-direction fractional grey model for forecasting China’s electricity consumption. Ma et al. [35] put forward a conformable fractional grey model and explored the relationship to the traditional fractional grey model.

Additionally, to estimate the model’s parameters, the trapezoid formula are usually used to approximate the integral in grey forecasting models. Zheng et al. [36] figured that there is an inherent error using trapezoid formula. To this end, Ma et al. [37] replaced trapezoid formula by Simpson formula to improve the accuracy of background value, and he and his colleagues demonstrated the feasibility and effectiveness of the newly designed model through a range of real cases. Şahin [38] used the integral mean value theorem to optimize the background value to reduce prediction errors. Subsequently, Liu et al. [39] further used a more complicated integral mean value theorem to enhance the prediction performance of the model.

Having reviewed the above literature, we know that the applicable scope, accumulated generation operator, and background value of the grey forecasting model still have defects. To address this issue, this paper develops a novel nonlinear grey Bernoulli model with combined fractional accumulation (abbreviated as ONGBM(1,1,k,c)). First, we introduce a combined fractional accumulated generation operator by incorporating the traditional fractional accumulation [33] and conformable fractional accumulation [35] into the optimized nonlinear grey Bernoulli model [38]. Second, the background-value coefficient is set as a variable for increasing the flexibility of the newly designed model. Third, the marine predation algorithm (MPA) [39] is applied to determine the optimal coefficients of the proposed model. Finally, to accurately predict China’s petroleum consumption in the following years, a case of coal consumption and two sets of China’s petroleum consumption are used for validating the effectiveness of the proposed model; after that, China’s petroleum consumption and terminal consumption in the next 5 years are forecasted by the newly designed model, and some suggestions are provided for policy-makers in the relevant sectors.

The rest of this paper can be organized as follows. “Optimized nonlinear grey Bernoulli model” describes the modeling procedure of the proposed model. “Numerical validation” conducts a numerical experiment for validating the effectiveness of the proposed model. “”Application in China's petroleum consumption applies the novel model to predict China’s petroleum consumption and “”Conclusion and research direction concludes.

Optimized nonlinear grey Bernoulli model

Combined fractional accumulated generation operator

As many studies show, the fractional accumulation generation (FAGO) has been widely used in various grey prediction models because of its flexibility and effectiveness. To further improve the performance of FAGO, this paper proposes a combined fractional accumulated generation operator (CFAGO) by incorporating the traditional fractional accumulation [33] and conformable fractional accumulation [35]. The relevant calculation formula can be seen in Theorem 1, which is helpful in establishing the proposed model in the next subsection.

Theorem 1.

Assume that \(X^{(0)} = \left( {x^{(0)} (1),x^{(0)} (2), \ldots ,x^{(0)} (n)} \right),n \ge 4\) is nonnegative time-series sequence, and the combined fractional accumulated generation operator (CFAGO) of \(X^{(0)}\) can be given by

$$ x^{(\alpha )} (k) = \sum\limits_{i = 1}^{k} {\sum\limits_{j = 1}^{i} {\frac{{\Gamma \left( {r + j - i} \right)x^{(0)} (k)}}{{\Gamma \left( {j - i + 1} \right)\Gamma \left( r \right)j^{[\tau ] - \tau } }}} } , $$
(1)

where \(k = 1,2, \ldots ,n;\tau ,r \in (0,1]\), and its inverse form (namely, the inverse conformable fractional accumulated generation operator (ICFAGO) is obtained as

$$ \begin{aligned} x^{(0)} (k) &= \sum\limits_{i = 0}^{k - 1} {\left( { - 1} \right)^{i} \frac{{\Gamma \left( {r + 1} \right)\left( {k - i} \right)^{1 - \tau } }}{{\Gamma \left( {i + 1} \right)\Gamma \left( {r - i + 1} \right)}}} \\ & \quad \left( {x^{(\alpha )} (k - i) - x^{(\alpha )} (k - i - 1)} \right). \end{aligned} $$
(2)

Proof.

Assume that \(X^{(0)} = \left( {x^{(0)} (1),x^{(0)} (2), \ldots ,x^{(0)} (n)} \right)\) is nonnegative sequence; according to [33], we can obtain the fractional accumulated generation operator of \(X^{(0)}\) as \(X^{(r)} = \left( {x^{(r)} (1),x^{(r)} (2), \ldots ,x^{(r)} (n)} \right)\), where

$$ x^{(r)} (k) = \sum\limits_{i = 1}^{k} {\frac{{\Gamma \left( {r + k - i} \right)}}{{\Gamma \left( {k - i + 1} \right)\Gamma \left( r \right)}}} x^{(0)} (i). $$
(3)

Similarly, according to [35], we get the conformable fractional accumulated generation operator (CFAGO) as

$$ x^{(\alpha )} (k) = \sum\limits_{i = 1}^{k} {\frac{{x^{(r)} (k)}}{{i^{[\tau ] - \tau } }}} , $$
(4)

where \(\tau \in (0,1]\), that is, the CFAGO of \(X^{(0)}\) is

$$ x^{(\alpha )} (k) = \sum\limits_{i = 1}^{k} {\sum\limits_{j = 1}^{i} {\frac{{\Gamma \left( {r + j - i} \right)x^{(0)} (k)}}{{\Gamma \left( {j - i + 1} \right)\Gamma \left( r \right)j^{[\tau ] - \tau } }}} } . $$
(5)

After an inverse calculation, we get

$$ x^{(r)} (k) = k^{1 - \tau } \left( {x^{(\alpha )} (k) - x^{(\alpha )} (k - 1)} \right). $$
(6)

Based on this, we easily get the inverse fractional accumulated generation operator of \(X^{(r)}\) as

$$ x^{( - r)} (k) = \sum\limits_{i = 1}^{k} {\left( {\begin{array}{*{20}c} {k - i - r - 1} \\ {k - i} \\ \end{array} } \right)} x^{(0)} (i). $$
(7)

Then, Eq. (7) becomes

$$ x^{( - r)} (k) = \sum\limits_{i = 1}^{k} {\left( { - 1} \right)^{i} \frac{{\Gamma \left( {r + 1} \right)x^{(0)} (k - i)}}{{\Gamma \left( {i + 1} \right)\Gamma \left( {r - i + 1} \right)}}} . $$
(8)

Furthermore, we can obtain the predicted value of \(X^{(0)}\) expressed as

$$ \begin{aligned} x^{(0)} (k) &= \sum\limits_{i = 0}^{k - 1} {\left( { - 1} \right)^{i} \frac{{\Gamma \left( {r + 1} \right)\left( {k - i} \right)^{1 - \tau } }}{{\Gamma \left( {i + 1} \right)\Gamma \left( {r - i + 1} \right)}}}\\ & \quad \left( {x^{(\alpha )} (k - i) - x^{(\alpha )} (k - i - 1)} \right). \end{aligned} $$
(9)

This completes the proof.

By reference with [33, 35], it is easily found that both the FAGO and CAGO are special cases of the CFAGO.

Model establishment

Inspired by Wu et al. [38], this paper presents an optimized nonlinear grey Bernoulli model by introducing the CFAGO, and the different equation of the newly-designed ONGBM(1,1,k,c) model can be defined as

$$ \frac{{d\left[ {x^{(\alpha )} (t)} \right]}}{dt} + ax^{(\alpha )} (t) = \left( {bt^{2} + ct + d} \right)\left( {x^{(\alpha )} (t)} \right)^{\gamma } , $$
(10)

where \(a\) is the development coefficient,\(b\),\(c\) and \(d\) are the grey qualities, and \(\gamma\) is the power index and \(\gamma \ne 1\). Obviously, when \(\alpha = 1\), this model is transformed into NGBM(1,1,k,c); when \(\alpha = 1,b = 0\), this model is simplified as NGBM(1,1); when \(\alpha = 1,b = 0,\gamma = 0\), this model is presented as NGM(1,1,k,c); and when \(\alpha = 1,\gamma = 0\), this model is converted to GMP(1,1,2). Therefore, the proposed ONGBM(1,1,k,c) model has a more flexible structure and compatibility with other grey models.

Exact solution to ONGBM(1,1,k,c)

This section gives the exact solution to the proposed model and the predicted value of the original data sequence.

First, we multiply both sides of Eq. (2) by \(\left[ {x^{(\alpha )} (t)} \right]^{ - \gamma }\), and one gets

$$ \frac{{d\left[ {x^{(\alpha )} (t)} \right]}}{dt}\left[ {x^{(\alpha )} (t)} \right]^{ - \gamma } + a\left[ {x^{(\alpha )} (t)} \right]^{1 - \gamma } = bt^{2} + ct + d. $$
(11)

Let \(\psi^{(\alpha )} (t) = \left[ {x^{(\alpha )} (t)} \right]^{1 - \gamma }\), Eq. (11) can be written as

$$ \frac{{d\left[ {\psi^{(\alpha )} (t)} \right]}}{dt}\frac{1}{1 - \gamma } + a\psi^{(\alpha )} (t) = bt^{2} + ct + d. $$
(12)

Furthermore, let \(A = a\left( {1 - \gamma } \right)\), \(B = b\left( {1 - \gamma } \right)\), \(C = c\left( {1 - \gamma } \right)\), and \(D = d\left( {1 - \gamma } \right)\), Eq. (12) can be then expressed as

$$ \frac{{d\left[ {\psi^{(\alpha )} (t)} \right]}}{dt} + A\psi^{(\alpha )} (t) = Bt^{2} + Ct + D. $$
(13)

By solving Eq. (13), we obtain the time response function of the ONGBM(1,1,k,c) model expressed as

$$ x^{(\alpha )} (k) = \left\{ \begin{gathered} e^{ - A(k - 1)} \left[ {\frac{B}{A}k^{2} + \frac{AC - 2B}{{A^{2} }}k + \frac{{2B - AC - + A^{2} D}}{{A^{3} }}} \right] \hfill \\ + \frac{B}{A}k^{2} + \frac{AC - 2B}{{A^{2} }}k + \frac{{2B - AC - + A^{2} D}}{{A^{3} }} \hfill \\ \end{gathered} \right\}^{{\frac{1}{1 - \gamma }}} . $$
(14)

With the help of the ICFAGO, the predicted value of the original series is given as

$$ \begin{aligned} x^{(0)} (k) &= \sum\limits_{i = 0}^{k - 1} {\left( { - 1} \right)^{i} \frac{{\Gamma \left( {r + 1} \right)\left( {k - i} \right)^{1 - \tau } }}{{\Gamma \left( {i + 1} \right)\Gamma \left( {r - i + 1} \right)}}}\\ & \quad \left( {x^{(\alpha )} (k - i) - x^{(\alpha )} (k - i - 1)} \right). \end{aligned} $$
(15)

Parameter estimation

This section deduces the model’s parameters (\(a\),\(b\),\(c\) and \(d\)); integrating both sides of Eq. (13) over the interval \([k - 1,k]\), one can write

$$ \psi^{(\alpha )} (k) - \psi^{(\alpha )} (k - 1) + A\int\limits_{k - 1}^{k} {\psi^{(\alpha )} (t)dt} = \frac{{k^{3} - (k - 1)^{3} }}{3}B + \frac{{k^{2} - (k - 1)^{2} }}{2}C + D. $$
(16)

Using the trapezoid formula, Eq. (16) can be converted as

$$ \psi^{(\alpha )} (k) - \psi^{(\alpha )} (k - 1) + Az^{(\alpha )} (k) = \frac{{k^{3} - (k - 1)^{3} }}{3}B + \frac{{k^{2} - (k - 1)^{2} }}{2}C + D. $$
(17)

In Eq. (17),\(z^{(\alpha )} (k)\) is the background value at point \(k\), and

$$ z^{(\alpha )} (k) = \left[ {\kappa \psi^{(\alpha )} (k) + (1 - \kappa )\psi^{(\alpha )} (k)} \right],\kappa \in [0,1]. $$
(18)

Using the least square method, we get the parameter vector \(\hat{\rho } = \left( {a,b,c,d} \right)^{{\text{T}}}\) expressed as

$$ \hat{\rho } = \left( {U^{{\text{T}}} U} \right)^{ - 1} U^{{\text{T}}} I, $$
(19)

where

$$ U = \left[ {\begin{array}{*{20}c} { - z^{(\alpha )} (2)} & {\frac{{2^{3} - 1^{3} }}{3}} & {\frac{{2^{2} - 1^{2} }}{2}} & 1 \\ { - z^{(\alpha )} (3)} & {\frac{{3^{3} - 2^{3} }}{3}} & {\frac{{3^{2} - 2^{2} }}{2}} & 1 \\ \vdots & \vdots & \vdots & \vdots \\ { - z^{(\alpha )} (n)} & {\frac{{n^{3} - \left( {n - 1} \right)^{3} }}{3}} & {\frac{{n^{2} - \left( {n - 1} \right)^{2} }}{2}} & 1 \\ \end{array} } \right],I = \left[ {\begin{array}{*{20}c} {\psi^{(\alpha )} (2) - \psi^{(\alpha )} (1)} \\ {\psi^{(\alpha )} (3) - \psi^{(\alpha )} (2)} \\ \vdots \\ {\psi^{(\alpha )} (n) - \psi^{(\alpha )} (n - 1)} \\ \end{array} } \right]. $$

Optimization of hyperparameters

As above-mentioned modeling steps, it is found that the four parameters are assumed to be known. By reference with [18, 40], we establish a simple optimization problem to obtain the optimal hyperparameters in the ONGBM(1,1,k,c) model. The objective function with constraints can be defined as follows:

$$ \begin{gathered} \mathop {\min }\limits_{\alpha ,\gamma ,\tau ,\kappa } MAPE = \frac{1}{n}\sum\limits_{k = 1}^{n} {\left| {\frac{{\hat{x}^{(\alpha )} (k) - x^{(\alpha )} (k)}}{{x^{(\alpha )} (k)}}} \right|} \times 100\% \hfill \\ s.t.\left\{ \begin{gathered} \alpha ,\tau \in (0,1],\kappa \in [0,1],\gamma \ne 1 \hfill \\ X^{(0)} = \left( {x^{(0)} (1),x^{(0)} (2), \ldots ,x^{(0)} (n)} \right) \hfill \\ x^{(\alpha )} (k) = \sum\limits_{i = 1}^{k} {\sum\limits_{j = 1}^{i} {\frac{{\Gamma \left( {r + j - i} \right)x^{(0)} (k)}}{{\Gamma \left( {j - i + 1} \right)\Gamma \left( r \right)j^{[\tau ] - \tau } }}} } \hfill \\ \hat{\rho } = \left( {U^{{\text{T}}} U} \right)^{ - 1} U^{{\text{T}}} I \hfill \\ x^{(\alpha )} (k) = \left\{ \begin{gathered} e^{ - A(k - 1)} \left[ {\frac{B}{A}k^{2} + \frac{AC - 2B}{{A^{2} }}k + \frac{{2B - AC - + A^{2} D}}{{A^{3} }}} \right] \hfill \\ + \frac{B}{A}k^{2} + \frac{AC - 2B}{{A^{2} }}k + \frac{{2B - AC - + A^{2} D}}{{A^{3} }} \hfill \\ \end{gathered} \right\}^{{\frac{1}{1 - \gamma }}} \hfill \\ x^{(0)} (k) = \sum\limits_{i = 0}^{k - 1} {\left( { - 1} \right)^{i} \frac{{\Gamma \left( {r + 1} \right)\left( {k - i} \right)^{1 - \tau } }}{{\Gamma \left( {i + 1} \right)\Gamma \left( {r - i + 1} \right)}}} \left( {x^{(\alpha )} (k - i) - x^{(\alpha )} (k - i - 1)} \right) \hfill \\ \end{gathered} \right.. \hfill \\ \end{gathered} $$
(20)

It is difficult to solve Eq. (20) because of its nonlinear characteristics. To this end, the MPA algorithm is employed to seek the optimal values of the four hyperparameters by minimizing the mean absolute percentage error (MAPE) between the fitted and original values.

MPA divides the whole system into three optimization stages based on different speed ratios, including high speed ratio, unit speed ratio, and low speed ratio. The specified algorithm is described as follows.

Initialization phase. Similar to most metaheuristic algorithms, MPA randomly initializes the prey positions within the search space to initiate the optimization process. The mathematical description is as follows:

$$ X_{0} = X_{\min } + rand\left( {X_{\max } - X_{\min } } \right), $$
(21)

where \(X_{\max }\) and \(X_{\min }\) indicate the search range, and \(rand()\) is generated within the random numbers.

Optimization phase. At the beginning of the iteration, when the predator speed is faster than the prey speed, the mathematical description of the MPA optimization process based on the exploration strategy is as follows:

$$ \left\{ \begin{gathered} {\text{stepsize}}_{i} = R_{B} \otimes \left( {{\text{Elite}}_{i} - R_{B} \otimes {\text{Prey}}_{i} } \right) \hfill \\ {\text{Prey}}_{i} = {\text{Prey}}_{i} + P \cdot R \otimes {\text{stepsize}}_{i} \hfill \\ \end{gathered} \right., $$
(22)

where \(i = 1,2, \ldots ,n\);\({\text{Iter}} < \frac{1}{3}{\text{Max\_Iter}}\). In Eq. (22), \({\text{stepsize}}\) is the movement step. \(R_{B}\) is the Brownian wandering random vector with normal distribution;\({\text{Elite}}\) is the elite matrix constructed from the top predators.\({\text{Prey}}\) is the prey matrix with the same dimension as the elite matrix.\(\otimes\) is the term-by-term multiplication operator.\(P\) is equal to 0.5.\(R\) is a uniform random vector within [0, 1];\(n\) is the population size.\({\text{Iter}}\) and \({\text{Max\_Iter}}\) are the current, and maximum, numbers of iterations, respectively.

In the middle of the iteration, when the predator and the prey have the same speed, the prey is responsible for exploitation based on the Lévy wandering strategy; the predator is responsible for exploration based on the Brownian wandering strategy, and gradually shifts from the exploration strategy to the exploitation strategy. The mathematical description of exploitation and exploration is as follows:

$$ \left\{ \begin{gathered} {\text{stepsize}}_{i} = R_{L} \otimes \left( {{\text{Elite}}_{i} - R_{L} \otimes {\text{Prey}}_{i} } \right) \hfill \\ {\text{Prey}}_{i} = {\text{Prey}}_{i} + P \cdot R \otimes {\text{stepsize}}_{i} \hfill \\ \end{gathered} \right., $$
(23)

where \(i = 1,2, \ldots ,\frac{n}{2}\),\(\frac{1}{3}{\text{Max\_Iter}} < {\text{Iter}} < \frac{2}{3}{\text{Max\_Iter}}\)

$$ \left\{ \begin{gathered} {\text{stepsize}}_{i} = R_{B} \otimes \left( {R_{B} \otimes {\text{Prey}}_{i} - {\text{Elite}}_{i} } \right) \hfill \\ {\text{Prey}}_{i} = {\text{Elite}}_{i} + P \cdot CF \otimes {\text{stepsize}}_{i} \hfill \\ \end{gathered} \right., $$
(24)

where \(i = \frac{n}{2}, \ldots ,n\),\(\frac{1}{3}{\text{Max\_Iter}} < {\text{Iter}} < \frac{2}{3}{\text{Max\_Iter}}\). In Eqs. (23) and (24), CF is the random vector with Lévy distribution and

$$ {\text{CF}} = \left( {1 - \frac{{{\text{Iter}}}}{{{\text{Max\_Iter}}}}} \right)^{{\frac{{2{\text{Iter}}}}{{{\text{Max\_Iter}}}}}} $$
(25)

is the adaptive parameter to control the movement step of the predator, and the other parameters have the same meaning as above. At the end of the iteration, when the predator speed is slower than the prey speed, the predator adopts an exploitation strategy based on Lévy wandering. The mathematical description is as follows:

$$ \left\{ \begin{gathered} {\text{stepsize}}_{i} = R_{L} \otimes \left( {{\text{Elite}}_{i} - R_{L} \otimes {\text{Prey}}_{i} } \right) \hfill \\ {\text{Prey}}_{i} = {\text{Elite}}_{i} + P \cdot CF \otimes {\text{stepsize}}_{i} \hfill \\ \end{gathered} \right., $$
(26)

where \(i = 1,2, \ldots ,n\), \({\text{Iter}} > \frac{2}{3}{\text{Max\_Iter}}\).

\(FADs\) effect. Fish aggregation devices (FADs) or eddy effects usually change the foraging behavior of marine predators, and this strategy enables MPA to overcome the early convergence problem and escape from local extremes in the process of finding the optimal value. Its mathematical description is as follows:

When \(r \le FADs\),

$$ {\text{Prey}}_{i} = {\text{Prey}}_{i} + CF\left[ {X_{\min } + R_{L} \otimes \left( {X_{\max } - X_{\min } } \right)} \right] \otimes U. $$
(27)

When \(r > FADs\),

$$ {\text{Prey}}_{i} = \left[ {FADs(1 - r) + r} \right]\left( {{\text{Prey}}_{r1} - {\text{Prey}}_{r2} } \right), $$
(28)

where \(FADs\) are the influence probabilities, taken as 0.2. \(U\) is the binary vector.\(r\) is the random number within [0, 1].\(r1\) and \(r2\) are the random indices of the prey matrix, respectively.

The detailed computational steps for the MPA can be seen in Fig. 3.

Fig. 3
figure 3

The computational steps for the MPA algorithm-based ONGBM(1,1,k,c) model

Evaluation criterion

In this paper, four statistical indicators [41] are used to evaluate the effectiveness of the model, including the mean absolute percentage error (MAPE), normalized mean absolute percentage error (NMAPE), root-mean-square error (RMSE) and normalized root-mean-square error (NRMSE). And the relevant calculation formulas can be seen as follows:

$$ MAPE = \frac{1}{n}\sum\limits_{k = 1}^{n} {\left| {\frac{{\hat{x}^{(0)} (k) - x^{(0)} (k)}}{{x^{(0)} (k)}}} \right|} \times 100\% $$
(29)
$$ NMAPE = \frac{1}{n}\sum\limits_{k = 1}^{n} {\left| {\frac{{\hat{x}^{(0)} (k) - x^{(0)} (k)}}{{\sum\nolimits_{k = 1}^{n} {x^{(0)} (k)} }}} \right|} \times 100\% $$
(30)
$$ RMSE = \sqrt {\frac{1}{n}\sum\limits_{k = 1}^{n} {\left( {\hat{x}^{(0)} (k) - x^{(0)} (k)} \right)^{2} } } $$
(31)
$$ NRMSE = \frac{{\sqrt {\sum\nolimits_{k = 1}^{n} {\left( {\hat{x}^{(0)} (k) - x^{(0)} (k)} \right)^{2} } } }}{{\sqrt {\sum\nolimits_{k = 1}^{n} {\left( {x^{(0)} (k)} \right)^{2} } } }}, $$
(32)

where \(x^{(0)} (k)\) and \(\hat{x}^{(0)} (k)\) are the actual value and predicted value at point \(k\), respectively.

Numerical validation

This section provides an example to validate the flexibility and effectiveness of the proposed ONGBM(1,1,k,c) model. The basic grey model (GM(1,1)) [42] and optimized nonlinear grey Bernoulli model (ONGBM(1,1)) [43] are selected as the benchmark models. These computational results have computed on the software MATLAB in authors’ labs.

In this section, we consider a time-series sequence from the National Bureau of Statistics of China (http://www.stats.gov.cn/tjsj/ndsj/) that describes the changing trend of China’s coal consumption. The data from 2000 to 2016 are applied to calibrate the model, while the data from 2017 to 2019 are used for examining the model’s accuracy. From Table 2, it is concluded that the proposed model is superior over other competitive models in this case.

Table 2 Fitted and predicted values and errors (%) by the different grey models in China’s coal consumption

Application in China’s petroleum consumption

Data collection and experimental design

The modeling data of petroleum consumption and petroleum terminal consumption from 2001 to 2018 are abstracted from the National Bureau of Statistics of China (http://www.stats.gov.cn/tjsj/ndsj/), as shown in Table 3.

Table 3 China’s petroleum consumption and terminal consumption from 2001 to 2018 [unit:104 tons]

Since China’s accession to the WTO in 2001, the Chinese economy has grown rapidly, thus leading to a dramatic increase in energy demand, and petroleum is seen as the blood of industry. Therefore, accurately forecasting China’s petroleum consumption is the motivation of this paper. To this end, this paper develops a new method for estimating the trend of China’s petroleum consumption and petroleum terminal consumption. In addition to the benchmarks mentioned in “Numerical validation”, the polynomial regression (PR) [45], artificial neural network (ANN) [46], and support vector regression (SVR) [47] are also chosen for demonstrating the effectiveness of the newly designed model.

Experimental results

The two datasets (annual petroleum consumption and petroleum terminal consumption) are conducted on the foundation of the mentioned competitive models, and the modeling results are listed in Tables 4, 5, 6 and 7. For visualization and intuition purposes, the forecasted values and the absolute percentage errors (APEs) generated by the six different models are graphed in Figs. 4, 5, 6 and 7. The competitive analysis will be implemented from the following angles: (1) the simulation and prediction deviations among the grey- and non-grey-based models, and (2) the effectiveness and flexibility of the proposed ONGBM(1,1,k,c) model using four calculated error-value metrics.

Table 4 Simulated and forecasted values of China’s petroleum consumption using different models
Table 5 Comparison of the prediction accuracies of the six models (2001–2018)
Table 6 Simulated and forecasted values of China’s petroleum terminal consumption using different models
Table 7 Comparison of the prediction accuracies of the six models in China’s petroleum terminal consumption
Fig. 4
figure 4

Comparison of forecasted and actual values from 2001 to 2018

Fig. 5
figure 5

APE(%) of the six different models in China’s petroleum consumption

Fig. 6
figure 6

Comparison of forecasted and actual values in China’s petroleum terminal consumption

Fig. 7
figure 7

APE(%) of the six different models in China’s petroleum terminal consumption

In Case 1, observed from Table 4 and Figs. 4 and 5, it is concluded that in the in-sample period, the intelligent methods (i.e., SVR and ANN) outperform other candidate models, because the modeling results generated by these two models are closer to the actual samples, followed by the grey models and polynomial regression. In addition, the range of APE of the intelligent methods is smaller than that of the grey-based models. While the performance of the mentioned competitive models will be changed when it comes to the out-of-sample period. The calculated results by the proposed model are closer to the actual values and its range of APE values is narrowest among these models.

Next, we compare four different error indicators (MAPE, NMAPE, RMSE, and NRMSE). The calculated errors are tabulated in Table 5. It is obviously seen from Table 5 that the MAPE and NMAPE of the SVR model are smallest, and RMSE and NRMSE of the ONGBM(1,1,k,c) model are lowest. On the contrary, the four error indicators of the newly proposed ONGBM(1,1,k,c) model are all lowest, and they are 0.91%, 0.89%, 611.14, and 0.01, respectively, in the test set. The newly designed model shows a higher accuracy with respect to forecasting China’s petroleum consumption.

In case 2, we conduct the two-step analysis similar to Case 1. It is seen from Table 6 and Fig. 6 that our calculated results are closer to the actual observations of China’s petroleum terminal consumption for both the training and test sets, and the newly designed model has a narrower changing range for APE values (Fig. 7), indicating that the performance stability of the proposed model is superior over other benchmarks.

Considering the four different error indices in Case 2, it is known from Table 7 that the newly designed ONGBM(1,1,k,c) model is more accurate, because its MAPE, NMAPE, RMSE, and NRMSE values are lowest for both the in-sample and out-of-sample periods. In particular, the competitors mentioned in this paper perform well in this case by reference with the Lewis standard [47]. Through the in-depth analysis, the GM(1,1) model and PR has comparatively poor forecasting abilities. And when GM(1,1) is further modified by Wang et al. [43], as a result, the forecasting result generated by ONGBM(1,1) can be improved. The intelligent methods can be treated as the alternative tools for forecasting Cases 1–2. Our results investigate that the newly designed ONGBM(1,1,k,c) model is the optimal model for forecasting China’s petroleum consumption in this case, and the reason for this situation is that the proposed model has a more flexible structure owing to its flexible fractional accumulated generation operator and power index.

Out-of-sample predictions and suggestion

Our tests suggest that the newly designed ONGBM(1,1,k,c) model is more suitable for forecasting China’s petroleum consumption and terminal consumption than the competitive models. Therefore, the proposed model is selected as the optimal model for making out-of-sample predictions of China’s petroleum consumption and terminal consumption in the next 5 years, and the calculated results are listed in Table 8.

Table 8 Forecast results of China’s petroleum consumption and terminal consumption from 2019 to 2023 (104 tons)

According to the historical data and forecast data of petroleum consumption mentioned above, China’s petroleum consumption presents a trend of continuous growth. However, it can be seen from Fig. 2 that it is very difficult to carry out the exploration work of the petroleum in China, sustaining China’s petroleum exports at a very low level. Therefore, the supply–demand relationship of China’s petroleum will become more and more unbalanced. To avoid the contradiction between supply and demand affecting social development and national defense security, some suggestions and countermeasures for the development of China’s petroleum industry are gained as follows.

  1. 1.

    At present, the growth rate of China’s petroleum production is much lower than the growth rate of petroleum demand, and in recent years, petroleum production has shown a downward trend. This situation is unbelievable, because China has very large petroleum reserves. The problem that causes the unbalance of petroleum supply and demand mainly lies in the intensity of petroleum exploration. According to Fig. 2, the main reason for the difficulty in petroleum exploration is the uneven geographical distribution of China’s petroleum resources. Abundant petroleum resources are mostly stored in inaccessible places such as the Great Northwest and Northeast, where exploitation costs and difficulties are too great. Therefore, the petroleum industry should increase investment in petroleum exploitation and increase exploration and development efforts. At the same time, it is necessary to increase the input of logistics facilities and equipment, carry out crude petroleum pipeline transportation plans, and reduce the cost of crude petroleum transportation.

  2. 2.

    Figure 1 shows that the petroleum consumption of transportation, storage, and postal services has surpassed that of industry and has become the first industry in terms of petroleum consumption. Therefore, in the field of transportation, which belongs to the tertiary industry, the purchase of small-displacement cars can be encouraged; in the aspect of car design, energy-saving technology can be further studied to achieve the goal of energy-saving and emission reduction. In addition, related supporting facilities of new energy vehicles should be further improved, such as reasonable layout of charging piles, distribution stations, maintenance points, and other infrastructure, so as to solve the problem that cars cannot meet long-distance driving due to insufficient electric capacity. In addition, petroleum consumption in industry is second only to that of transportation, warehousing, and postal services. Therefore, China should adjust and upgrade its industrial structure, devote itself to developing high-tech and high value-added industries, and abandon the development of industries with high energy consumption.

  3. 3.

    By reference with the forecast results in the previous section, we can see that China’s overall petroleum consumption shows an increasing trend, while petroleum exploration will not play its due role immediately. Until China can quickly explore for petroleum, its main source of petroleum is still import. Therefore, China’s energy sector can reasonably import petroleum according to the forecast results of this paper, so as to avoid unnecessary waste caused by the excessive import of petroleum. In addition, the Chinese government can also sign agreements with major petroleum-exporting countries to continue imports, so that exporting countries can give certain discounts.

Conclusion and research direction

To accurately predict China’s petroleum consumption, this paper proposes a newly optimized nonlinear grey Bernoulli model with combined fractional accumulated generation operator, and these varying parameters are optimized by marine predators algorithm for precisely excavating the development patterns of various time-series sequence. Specifically, we introduce a new combined conformable fractional accumulated generation operator into the existing modified nonlinear grey Bernoulli model. When we take the different values of the emerging coefficients (the fractional accumulation orders, background-value coefficient, and power index), the proposed model can be suitable for forecasting various time-series sequence issues.

As the numerical results in a series of cases show that the newly designed ONGBM(1,1,k,c) model does reach a higher accuracy in forecasting China’s coal consumption and petroleum consumption, compared with other competitive models that embrace the traditional grey model, optimized nonlinear grey Bernoulli model, polynomial regression, artificial neural network, and support vector regression. Our tests suggest that the newly proposed model should be considered the optimal technique for forecasting China’s petroleum consumption and terminal consumption in the next period. Based on the forecasts of this paper, some suggestions are recommended for the relevant sectors in formulating reasonable plans and strategies.

Up to this point, the newly optimized nonlinear grey Bernoulli model with combined conformable fractional accumulation has prominent advantages over the other benchmarks, whereas there remains room for improvement. For example, the proposed model is essentially a single variable-based model, neglecting relevant influential factor in practice. Additionally, other latest algorithms should be employed for improving the effectiveness of the proposed model in our next work.