Introduction

On January 27, 2022, Cable News Network (CNN) reported that the US Pentagon said Russian buildup had increased “in the last 24 hours” near Ukraine, and tensions on the Russia–Ukraine border simmered (CNN 2022). When we examine the data, we can see that the Russia–Ukraine war had a significant impact on natural gas futures prices due to concerns about the imbalance in natural gas supply and demand.

In recent literature on natural gas, there have been studies on (i) natural gas hydrates as a new and clean energy source (e.g., Koh et al. 2016; Ding et al. 2017; Liu et al. 2018b; Xu et al. 2018a, 2018b; Gambelli and Rossi 2019; Tupsakhare and Castaldi 2019; Zhu et al. 2020), (ii) substitutes for natural gas production through CO methanation (e.g., Bian et al. 2016; Tao et al. 2016, 2017; Liu et al. 2018a; Zhang et al. 2020; Cisneros et al. 2021), (iii) sustainable hydrogen production through enhanced natural gas production (e.g., Sanusi and Mokheimer 2019; Guban et al. 2020; Ji and Wang 2021; Olabi et al. 2021; Qureshi et al. 2022; Younas et al. 2022), (iv) mercury control technologies in natural gas (e.g., Pontes et al. 2014; Koulocheris et al. 2018; Chalkidis et al. 2019; Wang et al. 2020; Zhou et al. 2020; Koulocheris et al. 2023; Shen et al. 2023), (v) the impact of natural gas and hydrogen properties on internal combustion engine performance (e.g., Wei and Geng 2016; Thiruvengadam et al. 2018; Tutak et al. 2020; Kim et al. 2021; Wang et al. 2021; Chen et al. 2023; Li et al. 2023), and (vi) market prices for natural gas (e.g., Wang et al. 2022; Akcora and Kocaaslan 2023; Dastan 2023; Liu et al. 2023; Su et al. 2023; Mensi et al. 2024).

Reviewing the recent studies on market prices for natural gas, Wang et al. (2022) investigated the volatilities of oil rents and natural gas rents. Akcora and Kocaaslan (2023) examined the price bubbles in European natural gas markets. Dastan (2023) investigated the Turkish natural gas market during the supply disruptions that occurred in early 2022. Further, Liu et al. (2023) analyzed the impacts of oil and natural gas prices on China’s carbon efficiency. Su et al. (2023) examined whether there were multiple bubbles in the European natural gas market. Mensi et al. (2024) investigated the connectedness between bond, oil, and natural gas prices. Although not focused on natural gas, many studies have argued for the importance of analyzing market prices for energy (e.g., Ju et al. 2014; Tsuji 2018, 2020; van Eyden et al. 2019; Dutta et al. 2021; Puig-Gamero et al. 2021; Jingjian et al. 2023).

Given this and considering the importance of actual natural gas market prices, our current analysis focuses on the impact of the the Russia–Ukraine war on natural gas futures prices. We suggest that in doing such new analysis, it is vital to incorporate fat-tailed distributions and structural breaks. However, in existing studies, although either fat-tailed distributions (e.g., Geweke 1993; Fan et al. 2008) or structural breaks (e.g., Wang and Moore 2009; Malik 2022) are considered, both components are not incorporated into a single model simultaneously. Therefore, our approach of newly considering and incorporating both in this study fills a research gap left by existing literature.

Hence, we examine natural gas futures volatilities of the period around the Russian invasion of Ukraine, with a particular focus on the impact of simmering tensions on the Russia–Ukraine border on natural gas futures volatilities, especially because at that time, a very large volatility spike was observed. We consider that this spike was due to heightened concerns about the imbalance in natural gas supply and demand, particularly at that time.

Our research questions are as follows. “How do we well capture the natural gas futures volatility spike when the tensions simmered on the Russia–Ukraine border by considering both fat-tailed errors and structural breaks as hybrid models?” and “What is the beneficial characteristic of the volatility estimated by our superior hybrid models?” To clarify these matters is the goal of this study. For this purpose, we develop new hybrid generalized autoregressive conditional heteroscedasticity (GARCH) models, which incorporate not only fat-tailed distribution errors but also structural breaks. To the best of our knowledge, no existing study has analyzed the points raised in our research questions mentioned above.

As a result of our careful examinations applying four new hybrid models, i.e., the autoregressive (AR) mean Student’s t (T) distribution error and structural break (SB) incorporated GARCH model (hereinafter, the AR–T–SB–GARCH model), the AR–generalized error distribution (GED)–SB–GARCH model (hereinafter, the AR–GED–SB–GARCH model), the AR–T–SB–exponential GARCH (EGARCH) model (hereinafter, the AR–T–SB–EGARCH model), and the AR–GED–SB–EGARCH model, we have uncovered the following findings as new contributions to this topic area.

First, we reveal that it was not when Russia invaded Ukraine in February 2022, but rather when tensions on the Russia–Ukraine border simmered in January 2022, that the Russia–Ukraine war had the most impact on natural gas futures prices. Second, we clarify that our hybrid models, which incorporate both fat-tailed distribution errors—either Student’s t distribution or GED errors—and structural breaks, are effective to capture the natural gas futures volatility jump in a timely manner when tensions on the Russia–Ukraine border simmered.

Third, we also reveal that our hybrid modeling approach—incorporating both fat-tailed errors and structural breaks—is effective not only in GARCH but also in EGARCH modeling. This proves the robustness of our hybrid approach for modeling natural gas futures volatilities. Fourth, we further uncover that the volatility estimates from our hybrid models of the AR–T–SB–GARCH and the AR–GED–SB–GARCH models have predictive power for the volatilities of nonhybrid versions of the AR–T–GARCH and AR–GED–GARCH models. We consider that because of this characteristic, our two hybrid GARCH models well capture the natural gas futures volatility spike when the tensions on the Russia–Ukraine border simmered.

Fifth, we further reveal that the volatility estimates from our exponential-form hybrid AR–T–SB–EGARCH and AR–GED–SB–EGARCH models also have forecast power for the volatilities of nonhybrid AR–T–EGARCH and AR–GED–EGARCH models. This indicates the robustness of both (i) the volatility predictive power and (ii) our perspective that because of this timely characteristic of the volatility estimates from our hybrid models, our four hybrid GARCH and EGARCH models well capture the jump in natural gas futures volatility when tensions on the Russia–Ukraine border simmered.

Sixth, we suggest that our new evidence of the predictive power of hybrid models for the volatilities of nonhybrid models means that the volatilities from the nonhybrid models lag behind those of the hybrid models. We consider that because of this characteristic, the nonhybrid models are inferior to hybrid models in capturing the natural gas futures volatility spike in a timely manner when tensions on the Russia–Ukraine border simmered. Seventh, in addition to the points mentioned above, we also derive many valuable interpretations, implications, and innovative perspectives for future energy informatics research and risk management in the energy industries through the use of artificial intelligence (AI). This emphasizes the significance of our work.

The remainder of the paper is organized as follows. Section “Data and the characteristics” details the data and the characteristics, Section “Hybrid GARCH models” presents our hybrid GARCH models, and Section “Why are hybrid GARCHs effective?” tests the predictive power of their volatility estimates. Afterwards, Section “Hybrid EGARCH models” provides our hybrid EGARCH models, Section “Why are hybrid EGARCHs effective?” examines the predictive power of their volatility estimates, and Section “Implications and perspectives” provides the implications and our innovative perspectives. Finally, Section “Contributions and conclusions” concludes the paper.

Data and the characteristics

This study uses the natural gas futures price data from the New York Mercantile Exchange. Using the price series, denoted \(p^{\textit{NG}}\), we compute the daily log-difference percentage return series as \(\textit{dlr}_{t}^{\textit{NG}}=\ln (p^{\textit{NG}}_t/p^{\textit{NG}}_{t-1})\times 100\). To examine the effectiveness of our hybrid models described later for capturing the natural gas futures price fluctuations during the Russian invasion of Ukraine, we analyze the returns, \(\textit{dlr}_{t}^{\textit{NG}}\), from January 3, 2020, to January 5, 2024. We note that this sample period also means the post-COVID-19 outbreak period.

Fig. 1
figure 1

Daily price evolution of natural gas futures, January 2020 to January 2024. Natural gas futures prices in US dollars

Fig. 2
figure 2

Daily return evolution of natural gas futures, January 2020 to January 2024. Natural gas futures returns in percent

Figures 1 and 2 plot the price and return evolution of the natural gas futures, respectively. From these figures, we can see that before the Russian invasion of Ukraine in February 2022, natural gas futures prices had jumped (Fig. 1), and the returns largely and sharply fluctuated (Fig. 2). Referring to the return data, the largest return jump took place on January 27, 2022, which was when tensions on the Russia–Ukraine border simmered as discussed.

Table 1 provides summary statistics of the daily natural gas futures returns. The high kurtosis values and large Jarque–Bera statistics in this period indicate that it is meaningful to employ non-normal distribution errors in quantitative models like our present analysis. Moreover, the augmented Dickey–Fuller test statistic indicates that natural gas futures returns are stationary, suggesting that applying GARCH models like ours is applicable.

Table 1 Summary statistics for natural gas futures returns, January 2020 to January 2024

Furthermore, as Fig. 2 also indicates, there clearly exist structural breaks in natural gas futures returns. Hence, we investigated and identified the structural break points using the iterated cumulative sums of squares (ICSS) algorithm (Inclán and Tiao 1994). We stress that this ICSS algorithm is a very effective algorithm for identifying structural breaks in time series data (e.g., Wang and Moore 2009; Malik 2022). More precisely, as the Bayesian information criterion (BIC) suggests the appropriate autoregressive lag order for the natural gas futures returns is one, we estimate the regression, \(\textit{dlr}_{t}^{\textit{NG}} = \mu +\kappa \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{t}\), and identify the structural break points in the return residuals, \(\tau _{t}\).

Table 2 Structural break points for natural gas futures return residuals, January 2020 to January 2024

As shown in Table 2, we find that the number of break points for our analysis period is seven. We also show these break points with the bands of \(\pm 3\) standard deviations of the return residuals in Fig. 3. We consider that this figure clearly indicates the importance of taking structural breaks into consideration to analyze the impact of the Russia–Ukraine war on natural gas futures prices.

Fig. 3
figure 3

Structural breaks in natural gas futures return residuals. Bands of ±3 standard deviations and change points identified by the ICSS algorithm

Hybrid GARCH models

Model constructions

This section constructs our hybrid volatility estimation models. Note that we use ‘hybrid’ to refer to the incorporation of both non-normal fat-tailed errors and structural breaks throughout this study. Due to the non-normality of natural gas futures returns shown in Table 1, we employ either Student’s t distribution or GED errors in our models. This is because existing literature suggests the effectiveness of modeling fat-tailed asset returns using Student’s t or the GED distribution (e.g., Geweke 1993; Fan et al. 2008). As stated, the BIC suggests the adequate autoregressive lag order for natural gas futures returns is one, and we thus include only the first autoregressive variable in the mean equations of all our models.

Accordingly, the base models used to construct our hybrid GARCH models are the following AR–T–GARCH and AR–GED–GARCH models:

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu +\kappa \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{t,t},\nonumber \\ h_{t,t}= & {} \: \psi + \chi \tau ^2_{t,t-1} +\xi h_{t,t-1}, \end{aligned}$$
(1)

and

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu +\kappa \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{g,t},\nonumber \\ h_{g,t}= & {} \: \psi + \chi \tau ^2_{g,t-1} +\xi h_{g,t-1}, \end{aligned}$$
(2)

where \(\textit{dlr}_{t}^{\textit{NG}}\) denotes the return of natural gas futures at time t; \(\tau _{t,t}\) (\(\tau _{g,t}\)) denotes the Student’s t distribution (GED) errors at time t with \(\nu\) (k) being the distribution shape parameter; and \(h_{t,t}\) (\(h_{g,t}\)) is the variance to be estimated by model (1) (model (2)) at time t. In addition, \(\mu\) denotes the mean equation intercept; \(\kappa\) denotes the autoregressive term coefficient; \(\psi\) is the variance equation intercept; \(\chi\) is the ARCH-term coefficient; and \(\xi\) is the GARCH-term coefficient.

As the next step, we construct two hybrid GARCH models by including structural break effects into models (1) and (2). Namely, our hybrid GARCH models are the following AR–T–SB–GARCH and AR–GED–SB–GARCH models:

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu +\kappa \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{tsb,t},\nonumber \\ h_{tsb,t}= & {} \: \psi + \chi \tau ^2_{tsb,t-1} +\xi h_{tsb,t-1}+\sum _{i=1}^{n}\phi _{i}D^{\textit{SB}}_{i,t}, \end{aligned}$$
(3)

and

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu +\kappa \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{gsb,t},\nonumber \\ h_{gsb,t}= & {} \: \psi + \chi \tau ^2_{gsb,t-1} +\xi h_{gsb,t-1}+\sum _{i=1}^{n}\phi _{i}D^{\textit{SB}}_{i,t}, \end{aligned}$$
(4)

where \(\tau _{tsb,t}\) (\(\tau _{gsb,t}\)) denotes the Student’s t distribution (GED) errors in model (3) (model (4)) at time t; and \(h_{tsb,t}\) (\(h_{gsb,t}\)) is the variance to be estimated by model (3) (model (4)) at time t.

In addition, \(D^{\textit{SB}}_{i,t}\) denotes the i-th structural break dummy variable, n is the number of structural breaks, and \(\phi _{i}\) is the i-th structural break dummy variable’s coefficient. The i-th structural break dummy variable takes a value of zero until the break point identified for the return residuals by the ICSS algorithm, and one thereafter. This study employs the ICSS algorithm because existing studies suggest its effectiveness in capturing structural breaks in asset returns (e.g., Wang and Moore 2009; Malik 2022). For details of the ICSS algorithm, see Inclán and Tiao (1994). Note that including the parameters of the non-normal fat-tailed distributions, \(\nu\) and k, the other notations of models (3) and (4) described above are the same as those of models (1)–(2).

Verifications

Table 3 provides the estimation results of models (1)–(4). We note that all model parameters are estimated using the maximum likelihood method throughout the paper. Panels A–D present that all the model parameters including the shape parameters of the fat-tailed distribution errors are generally well estimated. In addition, as Panels B and D show, most of the coefficients of our structural break dummy variables—\(\phi _{1}\) to \(\phi _{7}\)—are statistically significant.

Table 3 Estimation results for hybrid GARCH models

It is noteworthy that when incorporating structural breaks, the ARCH effect disappears (The significant coefficients, \(\chi\), in Panels A and C of Table 3 become insignificant in Panels B and D.) and the GARCH effect is weakened (The significant values of the coefficients, \(\xi\), in Panels A and C of Table 3 become smaller in Panels B and D.). This also indicates the effectiveness of structural breaks in explaining the evolution of natural gas futures volatilities.

We also conduct likelihood ratio (LR) tests to compare the performance of models (1)–(4) and present the results in Table 4. As shown in Table 4, the null hypotheses—the AR–GARCH model, which has normal distribution errors, is superior to the AR–T–GARCH model (Panel A), the AR–T–GARCH model is superior to the AR–T–SB–GARCH model (Panel B), the AR–GARCH model is superior to the AR–GED–GARCH model (Panel C), and the AR–GED–GARCH model is superior to the AR–GED–SB–GARCH model (Panel D)—are all strongly rejected.

Table 4 LR tests for GARCH model discrimination

Moreover, the values of the Akaike’s information criterion (AIC) for models (1)–(4) are 5,784.032, 5,794.934, 5,716.146, and 5,716.906, respectively. That is, the LR test results and the AIC values clearly indicate the superiority of our hybrid models, the AR–T–SB–GARCH model and the AR–GED–SB–GARCH model, and demonstrate the effectiveness for taking both non-normal fat-tail errors and structural breaks into account in our analysis for natural gas futures volatilities.

Why are hybrid GARCHs effective?

This section considers why our hybrid GARCH models are effective. For this purpose, we plot the volatility estimates in Figs. 4 and 5 to compare our model estimations. Specifically, Fig. 4 illustrates the volatility estimates from the AR–T–GARCH model with those from the AR–T–SB–GARCH model, and Fig. 5 presents the volatility estimates from the AR–GED–GARCH model with those from the AR–GED–SB–GARCH model.

Fig. 4
figure 4

Estimated daily volatilities from AR–T–GARCH and AR–T–SB–GARCH models

Fig. 5
figure 5

Estimated daily volatilities from AR–GED–GARCH and AR–GED–SB–GARCH models

From Figs. 4 and 5, we can see that when volatilities spike, the volatility estimates from the AR–T–SB–GARCH and the AR–GED–SB–GARCH models lead the volatility estimates from the AR–T–GARCH and the AR–GED–GARCH models, respectively. Considering this and clarifying why our hybrid GARCH models are effective, we test below whether our hybrid model volatility estimates have predictive power for the volatilities of the nonhybrid models.

Testing models

This section examines the predictive power of the hybrid model volatility estimates for the nonhybrid model volatilities (precisely, variances). Specifically, to test the forecast power of the volatility estimates from the AR–T–SB–GARCH model (3), we first use the following AR–T–GARCH–X model with \(\sqrt{{\hat{h}}_{tsb,t-1}}\) and the AR–GED–GARCH–X model with \(\sqrt{{\hat{h}}_{tsb,t-1}}\):

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu +\kappa \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{t,t},\nonumber \\ h_{t,t}= & {} \: \psi + \chi \tau ^2_{t,t-1} +\xi h_{t,t-1}+\lambda \sqrt{{\hat{h}}_{tsb,t-1}}, \end{aligned}$$
(5)

and

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu +\kappa \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{g,t},\nonumber \\ h_{g,t}= & {} \: \psi + \chi \tau ^2_{g,t-1} +\xi h_{g,t-1}+\lambda \sqrt{{\hat{h}}_{tsb,t-1}}, \end{aligned}$$
(6)

where \(\sqrt{{\hat{h}}_{tsb,t-1}}\) denotes the one-day lagged volatility estimates from our hybrid model (3), and \(\lambda\) is the coefficient. Hence, the statistically significant positive \(\lambda\) values indicate the predictive power of the volatility estimates from our hybrid model, the AR–T–SB–GARCH model (3). Note that the other notations of models (5)–(6) are the same as those of models (1)–(2).

Moreover, to test the forecast power of the volatility estimates from the AR–GED–SB–GARCH model (4) for the nonhybrid model volatilities (variances), we use the following AR–T–GARCH–X model with \(\sqrt{{\hat{h}}_{gsb,t-1}}\) and the AR–GED–GARCH–X model with \(\sqrt{{\hat{h}}_{gsb,t-1}}\):

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu +\kappa \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{t,t},\nonumber \\ h_{t,t}= & {} \: \psi + \chi \tau ^2_{t,t-1} +\xi h_{t,t-1}+\lambda \sqrt{{\hat{h}}_{gsb,t-1}}, \end{aligned}$$
(7)

and

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu +\kappa \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{g,t},\nonumber \\ h_{g,t}= & {} \: \psi + \chi \tau ^2_{g,t-1} +\xi h_{g,t-1}+\lambda \sqrt{{\hat{h}}_{gsb,t-1}}, \end{aligned}$$
(8)

where \(\sqrt{{\hat{h}}_{gsb,t-1}}\) denotes the one-day lagged volatility estimates from our hybrid model (4), and \(\lambda\) is the coefficient. Therefore, the statistically significant positive \(\lambda\) values again mean the predictive power of the volatility estimates from our hybrid model, the AR–GED–SB–GARCH model (4). Note that the remaining notations of models (7)–(8) are the same as those of models (1)–(2).

Results

Table 5 shows the results for the predictive power of our hybrid model estimates. Panel A shows the results for the volatility estimates from the AR–T–SB–GARCH model, and Panel B presents those for the volatility estimates from the AR–GED–SB–GARCH model.

Table 5 Results for predictive power tests of hybrid GARCH model volatility estimates

In these two panels, the estimated coefficients, \(\lambda\)s, are always statistically significant with positive signs. This indicates that the volatility estimates from our hybrid models, the AR–T–SB–GARCH and the AR–GED–SB–GARCH models, have predictive power for the volatilities (variances) of the nonhybrid AR–T–GARCH and AR–GED–GARCH models.

We consider that these new findings mean that volatility estimates from our hybrid GARCH models (3)–(4) lead the volatilities of nonhybrid GARCH models (1)–(2). We thus believe that because of this favorable characteristic, the two hybrid GARCH models well captured the volatility jump of the natural gas futures when the tensions simmered on the Russia–Ukraine border.

Hybrid EGARCH models

Model constructions

This section formulates alternative hybrid volatility estimation models using exponential-form GARCH (Nelson 1991). As before, we employ the Student’s t distribution or GED errors, and we include the first autoregressive variable in the mean equations of our models.

The base models for building our alternative hybrid models are the following AR–T–EGARCH and AR–GED–EGARCH models:

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu _{e}+\kappa _{e} \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{t,t}^{e},\nonumber \\ \ln (h_{t,t}^{e})= & {} \: \psi _{e} + \chi _{e} \frac{\big |\tau _{t,t-1}^{e}\big |}{\sqrt{h^{e}_{t,t-1}}} +\xi _{e}\ln (h_{t,t-1}^{e}), \end{aligned}$$
(9)

and

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu _{e}+\kappa _{e} \textit{dlr}_{t-1}^{\textit{NG}} +\tau _{g,t}^{e},\nonumber \\ \ln (h_{g,t}^{e})= & {} \: \psi _{e} + \chi _{e} \frac{\big |\tau _{g,t-1}^{e}\big |}{\sqrt{h^{e}_{g,t-1}}} +\xi _{e}\ln (h_{g,t-1}^{e}), \end{aligned}$$
(10)

where \(\tau _{t,t}^{e}\) (\(\tau _{g,t}^{e}\)) denotes the Student’s t distribution (GED) errors at time t with the distribution parameter \(\nu _e\) (\(k_e\)); and \(h_{t,t}^{e}\) (\(h_{g,t}^{e}\)) is the variance to be estimated by model (9) (model (10)) at time t. Further, \(\mu _e\) denotes the mean equation intercept; \(\kappa _e\) denotes the autoregressive term coefficient; \(\psi _e\) is the variance equation intercept; \(\chi _e\) is the ARCH-term coefficient; and \(\xi _e\) is the GARCH-term coefficient.

As before, we construct the hybrid EGARCH models by including structural break effects into models (9) and (10). Our exponential-form hybrid models are the following AR–T–SB–EGARCH and AR–GED–SB–EGARCH models:

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu _{e}+\kappa _{e} \textit{dlr}_{t-1}^{\textit{NG}} +\tau ^{e}_{tsb,t},\nonumber \\ \ln (h^{e}_{tsb,t})= & {} \: \psi _{e} + \chi _{e} \frac{\big |\tau ^{e}_{tsb,t-1}\big |}{\sqrt{h^{e}_{tsb,t-1}}} +\xi _{e}\ln (h^{e}_{tsb,t-1})+\sum _{i=1}^{n} \phi _{e,i}D^{\textit{SB}}_{i,t}, \end{aligned}$$
(11)

and

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu _{e}+\kappa _{e} \textit{dlr}_{t-1}^{\textit{NG}} +\tau ^{e}_{gsb,t},\nonumber \\ \ln (h^{e}_{gsb,t})= & {} \: \psi _{e} + \chi _{e} \frac{\big |\tau ^{e}_{gsb,t-1}\big |}{\sqrt{h^{e}_{gsb,t-1}}} +\xi _{e}\ln (h^{e}_{gsb,t-1})+\sum _{i=1}^{n}\phi _{e,i}D^{\textit{SB}}_{i,t}, \end{aligned}$$
(12)

where \(\tau ^{e}_{tsb,t}\) (\(\tau ^{e}_{gsb,t}\)) denotes the Student’s t distribution (GED) errors in model (11) (model (12)) at time t; \(h^{e}_{tsb,t}\) (\(h^{e}_{gsb,t}\)) means the variance to be estimated by model (11) (model (12)) at time t.

Furthermore, \(D^{\textit{SB}}_{i,t}\) denotes the i-th structural break dummy variable and n is the number of the structural breaks as before, and \(\phi _{e,i}\) is the coefficient of the i-th break dummy variable. The dummy variable is the same as in models (3) and (4). We also note that including the parameters of fat-tailed distributions, \(\nu _e\) and \(k_e\), the other notations of models (11) and (12) described above are the same as those of models (9)–(10).

Verifications

Table 6 presents the estimation results of models (9)–(12). Panels A–D present that all the model parameters including the shape parameters of the fat-tailed distribution errors are well estimated. Further, as Panels B and D show, the coefficients of our structural break dummy variables—\(\phi _{e,1}\) to \(\phi _{e,7}\)—are all statistically significant.

We note that when incorporating structural breaks, the ARCH effect again disappears (The significant coefficients, \(\chi _e\), in Panels A and C of Table 6 become insignificant in Panels B and D.) and the GARCH effect is again weakened (The significant values of the coefficients, \(\xi _e\), in Panels A and C of Table 6 become smaller in Panels B and D.). Therefore, we understand that this again shows the effectiveness of structural breaks in explaining the evolution of natural gas futures volatilities.

Table 6 Estimation results for hybrid EGARCH models

Moreover, we performed the LR tests to compare the performance of models (9)–(12) as before, and showed the results in Table 7. As Table 7 indicates, the four null hypotheses—the AR–EGARCH model, which has normal distribution errors, is superior to the AR–T–EGARCH model (Panel A), the AR–T–EGARCH model is superior to the AR–T–SB–EGARCH model (Panel B), the AR–EGARCH model is superior to the AR–GED–EGARCH model (Panel C), and the AR–GED–EGARCH model is superior to the AR–GED–SB–EGARCH model (Panel D)—are all strongly rejected.

Table 7 LR tests for EGARCH model discrimination

Furthermore, the AIC values for models (9)–(12) are 5775.172, 5789.563, 5718.041, and 5718.860, respectively. That is, the LR test results and the AIC values clearly show the superiority of our alternative hybrid models, the AR–T–SB–EGARCH model and the AR–GED–SB–EGARCH model, and prove the effectiveness for taking both non-normal fat-tail errors and structural breaks into consideration in our natural gas futures volatility analyses.

Why are hybrid EGARCHs effective?

This section considers why our hybrid EGARCH models are effective. For this purpose, we plot the volatility estimates from our alternative hybrid models (11) and (12) in Figs. 6 and 7 to compare the model estimates. Specifically, Fig. 6 exhibits the volatility estimates from the AR–T–EGARCH model with those from the AR–T–SB–EGARCH model, and Fig. 7 displays the volatility estimates from the AR–GED–EGARCH model with those from the AR–GED–SB–EGARCH model.

Fig. 6
figure 6

Estimated daily volatilities from AR–T–EGARCH and AR–T–SB–EGARCH models

Fig. 7
figure 7

Estimated daily volatilities from AR–GED–EGARCH and AR–GED–SB–EGARCH models

From Figs. 6 and 7, we can see that when volatilities jump, the volatility estimates from the AR–T–SB–EGARCH and the AR–GED–SB–EGARCH models lead the volatility estimates from the AR–T–EGARCH and the AR–GED–EGARCH models, respectively. Considering this and to clarify why our hybrid EGARCH models are effective, we below test whether the volatility estimates from our two hybrid EGARCH models have predictive power for the volatilities of the nonhybrid EGARCH models.

Testing models

This section examines the forecast power of our hybrid EGARCH model estimates for the nonhybrid EGARCH model volatilities (precisely, variances). Specifically, to test the predictive power of the volatility estimates from the AR–T–SB–EGARCH model (11), we first use the following AR–T–EGARCH–X model with \(\sqrt{\hat{h^{e}}_{tsb,t-1}}\) and the AR–GED–EGARCH–X model with \(\sqrt{\hat{h^{e}}_{tsb,t-1}}\):

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu _{e}+\kappa _e \textit{dlr}_{t-1}^{\textit{NG}} +\tau ^{e}_{t,t},\nonumber \\ \ln (h^{e}_{t,t})= & {} \: \psi _{e} + \chi _{e} \frac{\big |\tau ^{e}_{t,t-1}\big |}{\sqrt{h^{e}_{t,t-1}}} +\xi _{e}\ln (h^{e}_{t,t-1})+\lambda _{e}\sqrt{\hat{h^{e}}_{tsb,t-1}}, \end{aligned}$$
(13)

and

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu _{e}+\kappa _{e} \textit{dlr}_{t-1}^{\textit{NG}} +\tau ^{e}_{g,t},\nonumber \\ \ln (h^{e}_{g,t})= & {} \: \psi _{e} + \chi _{e} \frac{\big |\tau ^{e}_{g,t-1}\big |}{\sqrt{h^{e}_{g,t-1}}} +\xi _{e}\ln (h^{e}_{g,t-1})+\lambda _{e}\sqrt{{\hat{h}}^{e}_{tsb,t-1}}, \end{aligned}$$
(14)

where \(\sqrt{\hat{h^{e}}_{tsb,t-1}}\) denotes the one-day lagged volatility estimates from our hybrid model (11), and \(\lambda _{e}\) is the coefficient. Hence, the statistically significant positive \(\lambda _{e}\) values mean the predictive power of the volatility estimates from our hybrid EGARCH model, the AR–T–SB–EGARCH model. Note that the other notations of models (13)–(14) are the same as those of models (9)–(10).

Moreover, also to test the forecast power of the volatility estimates from the AR–GED–SB–EGARCH model (12), we use the following AR–T–EGARCH–X model with \(\sqrt{{\hat{h}}^{e}_{gsb,t-1}}\) and the AR–GED–EGARCH–X model with \(\sqrt{{\hat{h}}^{e}_{gsb,t-1}}\):

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu _{e}+\kappa _{e} \textit{dlr}_{t-1}^{\textit{NG}} +\tau ^{e}_{t,t},\nonumber \\ \ln (h^{e}_{t,t})= & {} \: \psi _{e} + \chi _{e} \frac{\big |\tau ^{e}_{t,t-1}\big |}{\sqrt{h^{e}_{t,t-1}}} +\xi _{e}\ln (h^{e}_{t,t-1})+\lambda _{e}\sqrt{{\hat{h}}^{e}_{gsb,t-1}}, \end{aligned}$$
(15)

and

$$\begin{aligned} \textit{dlr}_{t}^{\textit{NG}}= & {} \: \mu _{e}+\kappa _e \textit{dlr}_{t-1}^{\textit{NG}} +\tau ^{e}_{g,t},\nonumber \\ \ln (h^{e}_{g,t})= & {} \: \psi _{e} + \chi _{e} \frac{\big |\tau ^{e}_{g,t-1}\big |}{\sqrt{h^{e}_{g,t-1}}} +\xi _{e}\ln (h^{e}_{g,t-1})+\lambda _{e}\sqrt{{\hat{h}}^{e}_{gsb,t-1}}, \end{aligned}$$
(16)

where \(\sqrt{{\hat{h}}^{e}_{gsb,t-1}}\) denotes the one-day lagged volatility estimates from our hybrid EGARCH model (12), and \(\lambda _{e}\) is the coefficient. Thus, the statistically significant positive \(\lambda _{e}\) again indicates the predictive power of the volatility estimates from our alternative hybrid model, the AR–GED–SB–EGARCH model. Note that the other notations of models (15)–(16) are the same as those of models (9)–(10).

Results

Table 8 presents the results for the predictive power of our hybrid EGARCH model estimates. Panel A exhibits the results for the volatility estimates from the AR–T–SB–EGARCH model, and Panel B shows those for the volatility estimates from the AR–GED–SB–EGARCH model.

Table 8 Results for predictive power tests of hybrid EGARCH model volatility estimates

In these two panels, the estimated coefficients, \(\lambda _e\)s, are always statistically significant with positive signs; and thus this means that the volatility estimates from our hybrid EGARCH models, the AR–T–SB–EGARCH and the AR–GED–SB–EGARCH models, have predictive power for the volatilities (variances) of the nonhybrid models, the AR–T–EGARCH and the AR–GED–EGARCH models.

We consider that these new findings mean that volatility estimates from our hybrid EGARCH models (11)–(12) lead the volatilities of nonhybrid EGARCH models (9)–(10). Hence as before, we consider that because of this favorable characteristic, the two hybrid EGARCH models timely captured the volatility spile of the natural gas futures when the tension summered in the Ukraine-Russia border.

Implications and perspectives

This section discusses how we can interpret our results and derive significant implications and perspectives for academic energy research and practical industries in the post-COVID-19 era. As in Ahmed et al. (2022), Chaklader et al. (2023), and Dirican (2015), AI will play a more significant role in various industries in the future. The implications of our study along with the innovative perspectives derived from our research are as follows.

Firstly, as we have demonstrated, to accurately capture the impact of significant events, natural gas futures volatilities should be estimated by incorporating fat-tailed errors and structural breaks. We believe that capturing structural breaks in a timely manner is particularly crucial for volatility modeling and future risk management in the energy markets and industries. This is because, as we found in Figs. 4, 5, 6, and 7, the volatilities derived from nonhybrid models lag behind the volatilities from our four hybrid models, which incorporate structural breaks. The lag is especially evident during the time when tensions simmered on the Russia–Ukraine border. Therefore, it is crucial for us to accurately forecast energy price volatilities without lagging behind significant events by utilizing innovative AI techniques such as machine learning. This will lead to more effective risk management in the energy markets and industries in the post-COVID-19 era.

Secondly, given the increasing importance of structural breaks in energy markets that we have observed, it is now more crucial than ever for us to accurately predict these breaks. We should note that these breaks are caused by information related to the energy markets. We should strive to accurately predict these breaks by using cutting-edge AI technologies. This will improve risk management in the energy markets and industries, particularly in the unpredictable post-COVID-19 world. We also believe that accurately predicting structural breaks allows us to forecast volatilities without falling behind sudden changes in energy prices caused by significant events.

Thirdly, as a policy implication, we suggest that policy makers and energy-related industry practitioners need to pay closer attention not only to the demand and supply balance for energy, but also to various news and its impact on energy market prices, even during normal circumstances. These efforts and close observation should help prevent potential imbalances in energy markets in the future. Furthermore, our results suggest that policy making and risk management in the energy industry should be conducted more swiftly than in the past. This is due to the fact that energy prices now respond more quickly to various events and news than ever before. One deeper interpretation of our results is that market efficiency in energy markets may have shifted. We believe that we should take this viewpoint into consideration in policy making and risk management in the energy industry.

Fourth, an interesting implication for future research would be to utilize machine learning approaches, such as reinforcement learning or deep learning for predicting energy price fluctuations. We consider that comparing traditional econometric models with machine learning approaches in our context would be a valuable direction for future research. We believe that our methods can be applied to other real-world assets, such as fuel and other energy commodities.

While this study does not provide methods for forecasting structural breaks, the comprehensive inspections conducted in this paper provide many insightful interpretations, implications, and innovative perspectives. Collectively, these demonstrate the importance of our current work for future energy research and practice. Therefore, we believe that our empirical findings, along with the significant interpretations, implications, and innovative viewpoints discussed above, all signify our substantial contribution to current and future research on energy informatics.

Contributions and conclusions

Paying keen attention to the Russia–Ukraine war, and particularly its impact on natural gas futures prices when tensions simmered on the Russia–Ukraine border, this paper empirically examined natural gas futures volatilities. Using the four hybrid models, the AR–T–SB–GARCH, the AR–GED–SB–GARCH, the AR–T–SB–EGARCH, and the AR–GED–SB–EGARCH models, which incorporated both fat-tailed errors and structural breaks, we derived the following findings as new contributions.

  • First, we found that very interestingly, it was not when Russia invaded Ukraine in February 2022, but rather when tensions on the border simmered in January 2022, that the matter of the Russia–Ukraine war had the most impact on natural gas futures prices.

  • Second, we revealed that our hybrid models, not only with fat-tailed errors—Student’s t distribution or GED errors—but also with structural breaks, effectively capture the natural gas futures volatility spike when tensions on the Russia–Ukraine border simmered in January 2022.

  • Third, we also uncovered that our hybrid modeling approach—incorporating both fat-tailed errors and structural breaks—is highly effective, not only in GARCH models but also in EGARCH models. This demonstrates the robustness of the effectiveness of our hybrid modeling approach in capturing natural gas futures volatilities.

  • Fourth, we also newly found that the volatility estimates from our hybrid models of the AR–T–SB–GARCH and the AR–GED–SB–GARCH models have predictive power for the volatilities of nonhybrid AR–T–GARCH and AR–GED–GARCH models. We consider that because of this beneficial characteristic, our hybrid GARCH models well captured the natural gas futures volatility jump when the tensions on the Russia–Ukraine border simmered.

  • Fifth, we also uncovered that the volatility estimates from our exponential-form hybrid models of the AR–T–SB–EGARCH and the AR–GED–SB–EGARCH models also have forecast power for the volatilities of nonhybrid AR–T–EGARCH and AR–GED–EGARCH models. This indicates the robustness of the predictive power, and again, we suggest that because of this characteristic, our hybrid EGARCH models also timely captured the natural gas futures volatility spike when the tensions in Russia–Ukraine border simmered.

  • Sixth, we further elucidated that the evidence of the predictive power of our hybrid models for the volatilities of nonhybrid models means that the volatility estimates from the nonhybrid models lag behind the volatilities of the hybrid models. We consider that because of this characteristic, the nonhybrid models are inferior to the hybrid models in capturing sudden changes in natural gas futures volatilities.

  • Seventh, in addition to the above, we have also developed and shared numerous valuable interpretations, implications, and innovative perspectives for future risk management, policy-making practices, and related research in energy informatics. This represents another significant contribution of our work. These interpretations, implications, and innovative perspectives, wihch include the effective use of AI, should be valuable not only for academic researchers but also for industry practitioners.

This study examined the impact of war as a manmade disaster on natural gas futures prices, which is of great importance for research in energy informatics. Additionally, the findings from our study indicate the growing significance of structural breaks in energy markets. This highlights the need to closely monitor the immediate news impact on contemporary energy markets.

As a broader implication for future research, studies that explore similar models in different energy-related markets or under various geopolitical scenarios would be interesting. Further research into the adaptability and scalability of the proposed hybrid models in our current study would provide a pathway for ongoing contributions in the field of energy informatics. Given this, we believe that the many new findings derived from our present study make a significant contribution not only to the body of academic energy informatics research but also to actual practice in energy-related industries.