Abstract
This work aims to extend previous research on how a trifactorial stochastic model, which we call \(CIR^3\), can be turned into a forecasting tool for energy time series. In particular, in this work, we intend to predict changes in the industrial production of electric and gas utilities. The model accounts for several stylized facts such as the mean reversion of both the process and its volatility to a short-run mean, non-normality, autocorrelation, cluster volatility and fat tails. In addition to that, we provide two theoretical results which are of particular importance in modelling and simulations. The first is the proof of existence and uniqueness of the solution to the SDEs system that describes the model. The second theoretical result is to convert, by the means of Lamperti transformations, the correlated system into an uncorrelated one. The forecasting performance is tested against an ARIMA-GARCH and a nonlinear regression model (NRM).
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Further to previous research [13], in this article, we illustrate a stochastic model to forecast changes in the industrial production (and then market demand) of electric and gas utilities. The examined time series is highly irregular and difficult to predict, with maximum volatility reaching up to 800%, as shown in Fig. 1 and Table 1. Therefore, we have considered a three-factor model [16, 17] and extended it to fit the current case, incorporating a number of changes described in this paper.
In his work, Chen suggested a bond pricing formula ”under a non-trivial, three-factor model of interest rates” [16] where the future short rate ”depends on (1) the current short rate, (2) the short-term mean of the short rate, and (3) the current volatility of the short rate”. In addition, Chen assumed that ”both the short-term mean and the volatility are stochastic”. The rationale of his model lies in the observation that ”short rates should be better modelled as reverting to a short-run mean, rather than to a long-run constant mean”. Similarly, Chen observed that short-rate volatility is not constant and mean reverting as well.
Stochastic models, such as the one proposed by Chen, are designed to replicate short-term interest rates \(r_t\), along with their mean \(\theta _t\) and volatility \(v_t\), for pricing purposes. However, in the case under consideration, our objective is to forecast changes in industrial production. Therefore, we replace interest rates with the production of electric and gas utilities, denoted by \(S_t\), and we use \(\theta _t\) and \(v_t\) to represent its short-term mean and volatility, respectively. Furthermore, we link the processes \(S_t\), \(\theta _t\), and \(v_t\) together through correlations, which is a crucial novelty in our approach. Finally, our approach differs from the literature cited because we do not model the short-term mean \(\theta _t\) as a Cox–Ingersoll–Ross process [19] due to the stochastic volatility coefficient that depends on \(v_t\).
Our first achievement is an existence and uniqueness result for the solution of the system of stochastic differential equations (SDEs) describing our three-factor model (see Theorem 4.1 and Corollary 4.2 below). To our knowledge, this issue has never been discussed before. Our system of SDEs does not satisfy the classical local Lipschitz condition hence we can not apply well known results of the (global) existence and uniqueness of the solution. For this reason, we first discuss existence in a weak sense (see, for instance, Chapter 4 in [36] or Chapter 5 in [25]), where the probability space and Brownian motions are not fixed a priori but they are part of the solution itself. For the unidimensional CIR model where mean and volatility coefficients are positive constants, the well-known Feller’s condition implies that a CIR process starting from a positive initial point stays strictly positive (see, Section 6.3.1 in [37]). This implies that in our model the volatility \(v_t\) is a non-explosive strictly positive process under Feller’s condition. While, even if starting from positive initial points \(S_t\) and \(\theta _t\), due to their unbounded stochastic volatility \(v_t\), hit zero almost surely. Nevertheless, we prove that our model is well defined in a local sense, that is, in terms of a unique weak solution until \(\tau = \tau ^S \wedge \tau ^\theta \), where \(\tau ^S\) and \(\tau ^\theta \) denote the first hitting times of zero for \(S_t\) and \(\theta _t\), respectively.
Finally, by Theorem 1.1, Chapter 4 in [36] or Corollary 3.2.3, Chapter 5 in [58], (local) weak existence and pathwise uniqueness of a (strict positive) solution implies also (local) strong existence.
As well as in [13], by using the Lamperti transformation, which applies thanks to the strict positivity of \((S_t, \theta _t,v_t)\), we show how the correlated process \(S_t\) can be turned into an uncorrelated auxiliary process \(X_t\), which is important for simulations and forecasting. We give a rigorous proof of equivalence between the two systems of SDEs related to the dynamics of the triples \((S_t, \theta _t, v_t)\) and \((X_t, \theta _t, v_t)\).
Last but not least, we show that the proposed model accounts for several stylized facts such as the mean reversion of both the process and its volatility to a short-run mean, non-normality, cluster volatility and fat tails. This is because, by design, both the process and its mean are reverting, the volatility is time-dependent and the processes are correlated. A discussion on the choice of the model is provided and specific analysis is carried out on the time series of the industrial production of electric and gas utilities. Subsequently, the implementation shows that the proposed model provides the best fit for the data.
This paper is organized as follows. Section 2 provides the rationale for our quest on model selection and a brief account of the relevant literature. Section 3 shows the time series we are considering and its main statistical characteristics. In Section 4 the three-factor model is presented and the main results are provided (i.e. existence and uniqueness result and the presentation of the Lamperti transformations that lead to the new dynamics of the auxiliary process). Section 5 illustrates a numerical implementation in the following order: calibration, in-sample simulation, out-of-sample forecasts. The last Section contains the concluding remark.
2 Literature and model selection
The reason why we thought of a three-factor model is that, as for the interest rates, the time series considered seems to adapt to some characteristics that are well explained by this model. Namely, the level of industrial production of electric and gas utilities, \(S_t\), reverts to its mean \(\theta _t\) which, in turn, is time-varying and reverting to a constant long-term mean. The distribution of \(S_t\) is highly non-normal, displays fat tails (see Fig. 2), its volatility depends on time and seems to be mean reverting as well, as the interest rate’s volatility [44]. In other models volatility is represented either as an Ornstein–Uhlenbeck (OU) process [11, 54, 61] or as a log-normal process [11]. However, in the first case, volatility can take undesirable negative values (except for non-Gaussian Ornstein–Uhlenbeck (OU) processes [5]) and in the second, volatility has no mean reversion [16]. The Chen model, instead, simulates volatility with a square root process, with the advantage that it excludes negative values and allows mean reversion. Furthermore, the model can be designed in such a way as to correlate processes with each other and, as a by-product, this allows for autocorrelation. For reference, on autocorrelation for an OU process see [9] and on autocorrelation for geometric Brownian motion see [57].
As detailed in Sect. 3, both \(S_t\) and \(v_t\) are mean reverting. Consequently, another class of models that we have considered is the so-called autoregressive integrated moving average (ARIMA) adopted for example by Chavez et al. [15] to simulate and predict future energy production and consumption in Asturias. Other examples of the use of ARIMA models for forecasting can be found in Shi et al. [59] for short-term wind power generation, in Jiang et al. [38] for China’s coal future consumption, in Mahia et al. [46] for industrial electricity consumption in Guangdong, etc. However, since we obtained unsatisfactory results in our simulations, we supplemented the classical ARIMA with a generalized autoregressive conditional heteroskedasticity (GARCH) to model clustering in volatility. Among those who tested the class of ARIMA-GARCH model, we mention Soares et al. [60] who modelled the hourly electricity load in the area covered by an electric utility located in southeastern Brazil. Gupta et al. [29] that implemented the ARIMA-GARCH for wind power prediction. Hussin et al. [35] who used the model for forecasting wind speed. Yotto et al. [64] that employed the ARIMA-GARCH for estimating and forecasting electricity load. Mohammadi et al. [50] who examined ”the usefulness of several ARIMA-GARCH models for modelling and forecasting the conditional mean and volatility of weekly crude oil spot prices” and Diallo et al. [22] who estimated the spread between Hungarian (HUPX) and German (EEX) day-ahead power prices. The latter, in their analysis, found out that NGARCH, TGARCH, EGARCH, GJR GARCH perform similarly in terms of RMSE and MAE. In addition, they claim that all models perform better than ”the simple ARIMA model” [22]. Finally, Bufalo and Orlando [13] have recently used the ARIMA-GARCH as a benchmark against the \(CIR^3\) to predict the production of energy material. Thus, for the above mentioned reasons, we find the ARIMA-GARCH a popular and suitable reference model.
In our model’s selection, we also considered the following nonlinear regression model (NRM)
Eq. (1) is consistent with the expectations of the Ornstein–Uhlenbeck process and, in general, with the expectations coming from multifactor Hull–White model (e.g. G2++ by Brigo and Mercurio [11]), which are widely used in finance [20, 27, 52]. Nonlinear mean reversion in financial time series has been reported by many (e.g. see [4, 18, 28]). Among those that used nonlinear models for energy, we mention Bilgili et al. [7], Kumru et al. [40] and Noskov et al. [51]. To run a robust estimation we adopted the iteratively re-weighted least squares algorithm by Holland [33]. The algorithm recalculates the weights based on the residual from the previous iteration and progressively downweights outliers so that iterations continue until the weights converge.
In summary, drawing inspiration from the literature, ARIMA-GARCH and NRM are the two models used as benchmarks to test the performance of the proposed approach.
3 Data
Figure 1 displays the monthly percent change (i.e., the month-to-month variation in the industrial production) of electric and gas utilities, as classified by the North American Industry Classification System (NAICS) and represented by the IPUTIL index. The data was retrieved from the Federal Reserve Economic Data (FRED) [8]. As a side note, we would like to emphasize that while we model the IPUTIL time series level (i.e. \(S_t\)), the results displayed are in terms of percent change. This approach is chosen because percent changes are more challenging to model and may hold financial significance, as what matters most are the variations rather than the absolute levels.
Board of Governors of the Federal Reserve System (US), Industrial Production: Electric and Gas Utilities (NAICS = 2211,2) [IPUTIL] [8]. Percent change. Monthly data from 1939-02-01 to 2020-11-01. Shaded grey areas correspond to recessions and the yellow strip to the right highlights the COVID-19 pandemic
3.1 Statistical characteristics
Regarding the model’s selection mentioned in Sect. 2, as shown in Fig. 2 and Table 1, not only the time series we are considering is very volatile, but its statistical characteristics are quite different from those of the Gaussian distribution.
3.2 Mean reversion and stationarity
Mean reversion contrasts with random walk behavior, which is used to support the efficient market hypothesis in theoretical studies in finance [10, 12, 26]. However, studies have found mixed or mean-reverting processes in both developed and emerging markets [2, 34, 55]. Mean reversion is used for modeling electricity and natural gas prices [1] because ”most energy and commodity markets exhibit mean-reversion” [56]. As mentioned by Hoque et al. [34], since the pioneering work of Lo and MacKinlay [45], variance ratio (VR) tests have been widely used econometric tools for testing the random walk hypothesis (RWH). The VR test on both the levels and the volatility of the IPUTIL rejects the random walk hypothesis with p-values of \(2.33\times 10^{-9}\) and \(3.20\times 10^{-19}\), respectively.
Mean reversion is also linked to the absence of a unit root and to stationarity. For assessing that, a number of tests have been developed in the literature such as Augmented Dickey–Fuller, KPSS, Pierre–Perron, and DFGLS tests. Table 2 does not confirm the presence of stationarity in data. Notice that, unlike the other tests, for the KPSS test the null hypothesis is that the time series is trend stationary therefore h = 0 means that there is no statistical indication of a unit root.
3.3 Autocorrelation
Figure 3 plots the sample autocorrelation function (ACF) and sample partial autocorrelation function (PACF) of the IPUTIL index (changes). By a visual inspection, there is autocorrelation at lag 1 and 2. The Ljung–Box Q-test (LBQ) confirms the presence of autocorrelation with a p-value of 5.8953 \(\times 10^{-13}\).
As a comparison Fig. 4 displays the ACF and PACF of a CIR process. Notice the autocorrelation at lag 1 but the absence of autocorrelation at lag 2. For that reason, we need a more advanced model such as the one proposed in Sect. 4.
As one can see from Fig. 5, the \(CIR^3\) model exhibits a significant autocorrelation both at lag 1 and 2 (with a LBQ p-value around 5.1507\(\times 10^{-13}\)), and a partial autocorrelation at lag 11 according to the original time series.
3.4 Cluster volatility
Lastly, we check whether the difference between the mean and the realizations display heteroscedasticity (cluster volatility). According to Engle’s ARCH [24] the null hypothesis of no heteroscedasticity should be rejected with a p-value of 5.9 \(\times 10^{-13}\).
4 A three-factor stochastic model
Let us denote by \(\{S_t\}_{t \ge 0}\) the stochastic process modelling the level of industrial production of electric and gas utilities. In addition, the correlated processes referring to the volatility and the short-run mean of \(\{S_t\}_{t \ge 0}\) are, respectively, \(\{v_t\}_{t \ge 0}\) and \(\{\theta _t\}_{t \ge 0}\).
Let \(k_v, \eta , \gamma , k_\theta , \zeta , \beta , k\) and \(\alpha \) be positive constants. We consider the following system of SDEs
with the initial condition \((S_0,\theta _0,v_0)\in (0, + \infty )^3\). Here \(\{W^{(i)}_t\}_{t\ge 0}\), \(i =1,2,3\), are three standard correlated Brownian motions such that
\(\rho _{\theta }, \rho _v \in (-1,1)\). Moreover, the correlation coefficients satisfy the following relation
System (2) represents a three-factor type model that we call \(CIR^3\). In our framework, each dynamic process follows a square-root process, and, differently from the model in [16], the variance of the processes \(S_t\) is proportional to the variance of the process itself, i.e., \(v_t\), as often suggested by financial literature (see, [3, 30, 32, 43, 62, 63]). The same holds true for the mean process \(\theta _t\).
By introducing the stochastic process
which is a standard Brownian motion, independent both from \(W^{(2)}_t\) and \(W^{(3)}_t\), the system (2) reads as
with \((S_0,\theta _0,v_0)\in (0, + \infty )^3\) and \(\{ W^*_t, W^{(2)}_t, W^{(3)}_t\}_{t \ge 0}\) a three-dimensional standard Brownian motion.
Assumption 4.1
We assume Feller’s condition
We will work with the weak solution to system (5) according to the following definition. We say that \((\Omega ,\mathcal{F}, {\mathbb P};\textbf{F} = (\mathcal{F}_t)_{t\ge 0}, \textbf{W}, S, \theta , v)\) is a solution of system (5) if:
-
\((\Omega ,\mathcal{F}, {\mathbb P};\textbf{F})\) is a filtered probability, \(\textbf{W} = \{W^*_t, W^{(2)}_t, W^{(3)}_t\}_{t \ge 0}\) is a 3-dimensional standard \(\textbf{F}\)-Brownian motion, and \(\{S_t, \theta _t, v_t \}_{t \ge 0}\) is an \({\mathbb R}^3\)-valued process with continuous sample paths.
-
\(\{S_t, \theta _t, v_t \}_{t \ge 0}\) satisfies the initial condition \((S_0,\theta _0,v_0)\in (0, + \infty )^3\).
-
System (5) holds a.s.
We first prove pathwise uniqueness and weak existence of a local solution to system (5) with state-space \((0, + \infty )^3\).
Theorem 4.1
Under Assumption 4.1, system (5) admits a weak solution \(\{ S_t, \theta _t, v_t \}_{t \ge 0}\) with state-space \([0, + \infty )^2 \times (0, + \infty )\). Pathwise uniqueness of the solution to system (5) holds over the stochastic interval \([0, \tau = \tau ^S \wedge \tau ^\theta ]\), where
are the hitting time of zero for \(S_t\) and \(\theta _t\), respectively. The random time \(\tau \) is such that \({\mathbb P}(\tau >0) = 1\) and for all \(t <\tau \) the process \((S_t, \theta _t, v_t)\) takes values in \((0, + \infty )^3\).
Proof
Step (i) Weak existence of a global solution
The coefficient matrix of the diffusion term \(\Sigma (S,\theta , v)\) and the drift \(b(S, \theta , v)\) associated to system (5) are given by
and
respectively. Note that \(\Sigma (S,\theta , v)\) and \(b(S, \theta , v)\) are continuous functions and satisfy growth conditions. In fact,
and by the inequality \(ab \le \frac{1}{2} (a^2 + b^2)\), \(a, b \in {\mathbb R}\), follows the sublinear growth condition
for some constant \(K>0\). Moreover, let \(x = (S,\theta , v) \in {\mathbb R}^3\), we have
and again by \(ab \le \frac{1}{2} (a^2 + b^2)\), \(a, b \in {\mathbb R}\), we get the inequality
for some constant \(K>0\). Let us observe that b also satisfies the sublinear growth condition
for some constant \(K>0\).
Thanks to Eqs. (7) and (8) we can apply Theorem 3.10, in [25] (or Theorems 2.3 and 2.4, Chapter 6, in [36]), since \(\Sigma \) and b are continuous and satisfy the sublinear growth conditions (7) and (9)). Thus there exists a weak solution \(\{ S_t, \theta _t, v_t \}_{t \ge 0}\) to system (5) for any initial condition \((S_0,\theta _0,v_0)\in {\mathbb R}^3\), which does not explode in finite time. Let us note that continuity of \(\Sigma (S,\theta , v)\) and \(b(S, \theta , v)\) ensures existence of a weak solution (see Theorems 2.3, Chapter 6, in [36]) but this solution could explode (i.e. it could tend to infinity in finite time) and so we need an additional condition, as sublinear growth, which implies that the solution does not explode (see Theorems 2.4, Chapter 6, in [36]).
Summarizing, we have proved that there exist a filtered probability space \((\Omega ,\mathcal {F}, {\mathbb P}, \textbf{F} = (\mathcal {F}_t)_{t\ge 0})\), a three-dimensional \(\textbf{F}\)-Brownian motion \(\{ W^*_t, W^{(2)}_t, W^{(3)}_t\}_{t \ge 0}\), and an \(\textbf{F}\)-adapted process with sample paths in \(C_{{\mathbb R}^3}[0, + \infty )\), \(\{ S_t, \theta _t, v_t \}_{t \ge 0}\), such that (5) holds \({\mathbb P}\)-a.s. Moreover, we have that for all \(t\ge 0\)
Step (ii) Nonnegativity of the global solution.
At this point, we are going to show that if the process starts from a strictly positive initial condition \((S_0,\theta _0,v_0)\in (0, + \infty )^3\) then \(\{ S_t, \theta _t, v_t \}_{t \ge 0}\) takes values in \([0, + \infty )^2 \times (0, + \infty )\). Under the Assumption 4.1, for any initial condition \(v_0 >0\), it is known that there exists a unique strong solution to the third equation in System (5), the so-called CIR-process, which is strictly positive (see, for instance, Section 6.3.1 in [37]). Hence we get that the process \(\{v_t \}_{t \ge 0}\) is strictly positive. We can now prove by comparison result that for any initial conditions \(S_0 >0\) and \(\theta _0>0\), both the processes \(\{S_t\}_{t \ge 0}\) and \(\{\theta _t\}_{t \ge 0}\) take values in \([0, + \infty )\).
We first prove that for all \(t\ge 0\), \(\theta _{t} \ge 0\), \( {\mathbb P}-a.s.\). Let us consider, on the probability space \((\Omega ,\mathcal {F}, {\mathbb P}, \textbf{F} = (\mathcal {F}_t)_{t\ge 0})\) where the processes \(\{\theta _t\}_{t\ge 0}\) and \(\{v_t\}_{t\ge 0}\) are defined, the following SDE
and note that for any \(t\ge 0\), \(\theta ^1_t=0\) solves Eq. (11). We can proceed as in the proof of Theorem 1.1, Chapter 6 in [36] with
Note that, for all \(x\in \mathbb {R}\) \(b_1(x) < b_2(x)\), \(b_1\) is Lipschitz continuous, i.e.
and \(\sigma \) satisfies
where \(\rho (x) = \alpha \beta \sqrt{x}\) is a strictly increasing function defined on \([0, + \infty )\) such that \(\rho (0)=0\) and satisfying Eq. (1.1) in Chapter 6 of [36].
We can not apply directly Theorem 1.1, Chapter 6 in [36] because the diffusion coefficient depends on the process \(\{v_t\}_{t\ge 0}\), which is an unbounded process. Therefore, we use a localization argument and define for all \(N\in \mathbb {N}\)
The sequence of stopping times \(\{\eta _N\}_{N \in \mathbb {N}}\) is non-decreasing and such that \(\eta _N \rightarrow +\infty \) as \(N \rightarrow + \infty \) (since the process \(\{v_t\}_{t\ge 0}\) does not explode in a finite time).
We can now consider the non-decreasing sequence of continuous functions \(\{\varphi _n(x)\}_{n \in \mathbb {N}}\) as defined in the proof of Theorem 1.1, Chapter 6 of [36], which satisfy \(\varphi _n \in C^2(\mathbb {R})\), \(\varphi _n(x)=0\) for \(x\le 0\), \(0 \le \varphi '_n(x) \le 1\) and \(\varphi _n(x) \rightarrow (x)_+ =\) \(\max \{x,0\}\) as \(n \rightarrow +\infty \).
We apply Itô’s rule and by similar computations as in that proof, we get that
where
It is clear that \({\mathbb E}[I_1(n,{t \wedge \eta _N} )] =0\) and since
we can proceed as in the proof of Theorem 1.1, Chapter 6 of [36] obtaining
Observing that for all \(t \ge 0\)
and again as in the proof of Theorem 1.1, Chapter 6 of [36] we get that
Hence
and by letting \(n \rightarrow +\infty \), we obtain
By Gronwall’s Lemma we deduce that for all \(t\ge 0\), \(N \in \mathbb {N}\), \({\mathbb E}[(\theta ^1_{t \wedge \eta _N} - \theta _{t \wedge \eta _N})_+] =0\), which in turn implies that
Finally, letting \(N \rightarrow + \infty \) and recalling that for all \(t \ge 0\), \(\theta ^1_{t}=0\), we obtain
Similarly, we can prove that, for any \(t\ge 0\), \(S_{t} \ge 0\), \( {\mathbb P}-a.s.\) Let us consider, on the probability space \((\Omega ,\mathcal {F}, {\mathbb P}, \textbf{F} = (\mathcal {F}_t)_{t\ge 0})\) where the processes \(\{\theta _t\}_{t\ge 0}\), \(\{v_t\}_{t\ge 0}\) and \(\{S_t\}_{t\ge 0}\) are defined, the following SDE
and note that \(S^1_t=0, \forall t \ge 0\) solves Eq. (12). We now take
and \(\sigma (t, \omega , x)\) as before. Note that for all \(x \in \mathbb {R}\) and \((t, \omega ) \in [0, + \infty )\times \Omega \),
(because, for all \(t\ge 0\), \(\theta _{t} \ge 0, {\mathbb P}-a.s.\) and \(k>0\)) and \(b_1\) is Lipschitz continuous with Lipschitz constant equals to k. Finally, observing that for all \((t, \omega ) \in [0, + \infty )\times \Omega \)
we can perform the same computations as before.
Step (iii) Pathwise uniqueness of a strict positive local solution
We want to show that starting from \((S_0,\theta _0,v_0)\in (0, + \infty )^3\) the process \(\{ S_t, \theta _t, v_t \}_{t \ge 0}\) is the unique solution (in pathwise sense) until one of the processes \(\{ S_t\}_{t \ge 0}\) or \(\{ \theta \}_{t \ge 0}\) reaches zero. Note that \(\Sigma (S,\theta , v)\) is not Lipschitz-continuous in \([0, +\infty )^2 \times (0, + \infty )\) but, it is in the open set \(U_N = ({1 \over N}, N)^3\), for any \(N >0\). In fact,
By Lagrange’s Theorem, for all \(x,x' \in (a,b)\), \(0<a<b\), there exists \(\bar{x} \in (a,b)\) such that
hence
It is clear that for all \(x,x', y, y' \in ({1 \over N}, N)\),
By applying the above inequalities we get that for all \( (S, \theta , v), (S', \theta ', v') \in U_N\)
for some constant \(K_N>0\). Then, we can apply Theorem 3.7 in [25], and we get that pathwise uniqueness holds over the stochastic interval \([0, \tau _N]\) where
Note that \(U_N \subset U_{N+1}\), so \(\tau _N\) is an increasing sequence of stopping times with \(\tau _N \rightarrow \tau \) as \(N \rightarrow +\infty \) due to the fact that \(\{S_t\}_{t\ge 0}\), \(\{\theta _t\}_{t\ge 0}\), and \(\{v_t\}_{t\ge 0}\) do not explode and \(v_t > 0\) for all \(t \ge 0\). Therefore, by taking the limit as \(N \rightarrow +\infty \), we obtain pathwise uniqueness on \([0, \tau ]\).
Finally, from continuity of trajectories of \(\{ S_t, \theta _t \}_{t \ge 0}\) since \(S_0>0\) and \(\theta _0>0\) we get that \({\mathbb P}(\tau > 0)=1\) and this concludes the proof.
Ultimately, (local) weak existence and pathwise uniqueness of the solution to system (5) imply (local) strong existence.
Corollary 4.2
Let consider any initial condition \((s_0, \theta _0, v_0) \in (0, + \infty )^3\). The system (5) admits a unique strong solution \((S_t, \theta _t, v_t)\) taking values in \((0, + \infty )^3\), for all \(t<\tau \), where
Proof
The assertion directly comes from Theorem 4.1 and [39, Corollary 3.23, Chapter 5] (or [36, Theorem 1.1, Chapter 4]).
Thanks to the strict positivity of \((S_t,\theta _t,v_t)\) for all \(t < \tau \), we can apply a suitable Lamperti transformation that converts the correlated system (5) into an uncorrelated one, as described in (19) below. This transformation will be useful in Sect. 5 for numerical purposes. We first provide a preliminary result.
Lemma 4.1
For any \(t\le \tau \), define
Then \(X_t\) solves for \(t < \tau \)
where
and
Proof
By virtue of Itô’s formula we have that for any \(t < \tau = \tau ^S \wedge \tau ^\theta \)
By substituting the expressions of \(dS_t\), \(d\theta _t \) and \(dv_t\) in Eq. (18) and observing that
we obtain Eq. (15).
Based on Lemma 4.1, we introduce the following system of SDEs
where \(c_t\), and \(c_{i,t}\), \(i=0, 1,2\) are defined in Eqs. (16) and (17).
Note that the drift in the dynamics of \(X_t\) in system (19) explodes if \(\theta _t\) or \(X_t + c(\theta _t, v_t)\) hits zero. As a consequence of Theorem 4.1 and Corollary 4.2 we will prove that system (19) admits a unique strong solution over the random interval \([0, \tau ^X \wedge \tau ^\theta )\), where
Theorem 4.3
Let us consider any deterministic initial condition, \((x_0, \theta _0, v_0) \in \mathbb {R} \times (0, + \infty )^2\), such that \(x_0 + c(\theta _0, v_0)>0\). Then there exists a unique strong solution to system (19) over the random time interval \([0, \tau ^X \wedge \tau ^\theta )\).
Proof
Step (i) Existence of a strong local solution.
From Theorem 4.1 and Lemma 4.1 we get existence of a solution to system (19) over the random time interval \([0, \tau = \tau ^S \wedge \tau ^\theta \)). By construction, for all \(t < \tau = \tau ^S \wedge \tau ^\theta \), \(X_t + c(\theta _t, v_t) =2 \sqrt{S_t} >0\) and \(X_{\tau ^S} \ + c(\theta _{\tau ^S}, v_{\tau ^S}) = 2 \sqrt{S_{\tau ^S}} =0.\) This implies that \(\tau ^X =\tau ^S\), and hence the solution is defined on the random time interval \([0, \tau = \tau ^X \wedge \tau ^\theta \)).
Step (ii) Pathwise uniqueness of the local solution
Let \((\widetilde{X}_t, \widetilde{\theta }_t, \widetilde{v}_t )\) be a solution to (19), for any \( t < \tau ^{\widetilde{X}} \wedge \tau ^{\widetilde{\theta }}\), starting from any deterministic initial condition \((x_0, \theta _0, v_0) \in \mathbb {R} \times (0, + \infty )^2\) satisfying \(x_0 + c(\theta _0, v_0)>0\). Let us introduce, on the same probability space where \((\widetilde{X}_t, \widetilde{\theta }_t, \widetilde{v}_t )\) is defined, and, for any \(t < \tau ^{\widetilde{X}} \wedge \tau ^{\widetilde{\theta }}\) the process
From Itô’s formula we get that for any \(t < \tau ^{\widetilde{X}} \wedge \tau ^{\widetilde{\theta }}\) the triple \((\widetilde{S}_t, \widetilde{\theta }_t, \widetilde{v}_t )\) solves system (5), with initial condition
By construction, for any \(t < \tau ^{\widetilde{X}} \wedge \tau ^{\widetilde{\theta }}\), \(\widetilde{S}_t >0\) and \(\widetilde{S}_{\tau ^{\widetilde{X}} } =0\). Thus, we have that \(\tau ^{\widetilde{S}} =\tau ^{\widetilde{X}}\). By strong uniqueness of a strict positive solution (see Corollary 4.2), we get that \((\widetilde{S}_t, \widetilde{\theta }_t, \widetilde{v}_t )\) and \((S_t, \theta _t, v_t)\) coincides for any \( t < \tau ^{\widetilde{S}} \wedge \tau ^{\widetilde{\theta }} = \tau ^{ S} \wedge \tau ^{\theta }\).
System (19) is equivalent to (5). In particular, from the solution to (19) we can derive by a simple transformation, see (21) below, the solution of our original system (5).
Corollary 4.4
Let us consider any deterministic initial condition, \((x_0, \theta _0, v_0) \in \mathbb {R} \times (0, + \infty )^2\), such that \(x_0 + c(\theta _0, v_0)>0\). Let \((X_t, \theta _t, v_t)\) be the unique strong solution to system (19) over the random time interval \([0, \tau ^X \wedge \tau ^\theta )\). Then \((S_t, \theta _t, v_t)\), where
and \(c(\theta _t,v_t)\) given in (16), is the unique strong solution to system (5) over the random time interval \([0, \tau ^S \wedge \tau ^\theta )\).
Proof
The proof follows directly from Step (ii) in the proof of Theorem 4.3.
As just said the main advantage given by the process \(X_t\) is to provide a fast and independent simulation of the process \(S_t\). Indeed, we can first simulate the pair \((\theta _t,v_t)\), and next the process \(X_t\), whose stochastic component \(W^*_t\) is uncorrelated with those of \(\theta _t\) and \(v_t\). Finally, the dynamics of \(S_t\) can be obtained by the transformation (21).
Remark 4.1
Note that in our model \(S_t\) or \(\theta _t\) have unbounded stochastic volatility due to the presence of \(v_t\). It is known, see for instance Section 6.3.1 in [37], that a CIR process hits zero almost surely if Feller’s condition is not fulfilled, i.e. the volatility is not sufficiently small. This implies that \(S_t\) or \(\theta _t\) reaches zero almost surely, that is, \({\mathbb P}(\tau ^S< + \infty ) = {\mathbb P}(\tau ^\theta < + \infty )=1\). However, even if existence and uniqueness of solution to systems (5) and (19) are only in local sense, and we do not have a lower bound for the random time \(\tau \), we observe that \({\mathbb E}[\tau ^S]={\mathbb E}[\tau ^{\theta }]=+\infty \). This property is observed only through numerical inspection, and its formal treatment will be the subject of future research. Specifically, to test this numerically, we simulated the process together with 10,000 randomly bootstrapped realizations. In no iteration did we find \(S_t\) or \(\theta _t\) to be zero.
We conclude the Section with a remark which will be useful in the next sections.
Remark 4.2
It is easy to verify that
Indeed the processes \(\int _0^t e^{k_\theta s}\sqrt{v_s \theta _s}\, dW^{(2)}_s\), and \(\int _0^t e^{k_v s}\sqrt{v_s}\, dW^{(3)}_s\), \(t \ge 0\), thanks to Eq. (10) turn to be \(\textbf{F}\)-martingales.
5 Results
In this section, we apply our model to the change in the industrial production of electric and gas utilities already mentioned. Other models used as a benchmark are the ARIMA-GARCH and the non-linear regression model (NRM) specified in Eq. (1). Note that, with reference to the model (2), let \((s_1,\ldots ,s_n)\) be the observations of \(S_t\), and \((\Theta _1,\ldots ,\Theta _n)\) those of the mean process \(\theta _t\), taken as the exponential weighted moving average (EWMA) of \((s_1,\ldots ,s_n)\). Moreover, the observations \((\nu _1,\ldots ,\nu _n)\) of the volatility process \(v_t\) are given by the so-called pointwise volatility
Once again, the results displayed are in terms of percent change because, economically, the focus is on the variations rather than absolute levels.
5.1 Parameters calibration
In order to estimate \(S_t,\theta _t\), and \(v_t\) the involved parameters \(k, k_{\theta }, k_{v},\eta ,\zeta ,\alpha ,\beta ,\gamma \) and the correlations \(\rho _{\theta },\rho _v\) in Eq. (2) need to be calibrated to the market prices. To estimate the correlation \(\rho _{\theta }\) we use the Spearman correlation between the realizations of \(S_t\) and \(\theta _t\); analogously for \(\rho _v\).
Among many approaches existing in the literature to estimate the parameters of the square-root models (see, for instance, [41] and references therein), we consider the estimating function approach for ergodic diffusion models introduced in Bibby et al. [6]. This method proved to be very useful in obtaining optimal estimators for the parameters of discretely sampled diffusion-type models whose likelihood function is usually not explicitly known. In [6, Example 5.4] the authors constructed an approximately optimal estimating function for the square-root model, from which they derived the following explicit estimators of the three parameters based on a sample of n observed market prices. For example, with regard to the process \(v_t\), the parameters \(k_v,\eta ,\gamma \) on a sample \((\nu _1,\dots ,\nu _n)\) are given by
These estimators exist provided that the argument of the logarithm in the first equation is strictly positive (the authors observed that this happens with a probability tending to one as \(n\rightarrow \infty \), see Example 5.4 in [6].
Analogously, given the observations \((\Theta _1,\ldots ,\Theta _n)\), we may compute
where \(\hat{v}\) is computed through the average (in [1, n]) of the discretization scheme described in Sect. 5.3, once the related parameters are estimated by Eq. (24). Similarly to \(\hat{v}\), we obtain \(\hat{\theta }\). Finally, given the observations \((s_1,\ldots ,s_n)\), we take
and from \(\hat{\alpha }\) we derive \(\hat{\beta }=\frac{\widehat{(\alpha \beta )}}{\hat{\alpha }}\).
5.2 Accuracy statistics
5.2.1 Normalized root mean square error (NRMSE)
The root mean squared error (RMSE) is a measure of the closeness between the observed data and the simulated values from a given model. So, it represents the accuracy of the model in terms of goodness of fit. It is defined by
where \(e_u\) denotes the residuals between the observed data and their simulations, over n times. Hence, a value near 0 indicates a perfect fit to the data, and values lower than 1 represent a good result. Note that the RMSE depends on the scale of observed data, thus it is sensitive to the outliers; consequently, larger errors have a disproportionately large effect. To solve this issue, we adopt the so-called normalized root mean squared error (NRMSE)
where \(s_{\max }\) denotes the maximum value and \(s_{\min }\) is the minimum value of the observed sample data.
5.2.2 Mean absolute percentage error (MAPE)
The mean absolute percentage error (MAPE) is a measure of prediction accuracy of a forecasting method. It is defined as
where \(e_u\) denotes the residuals between the observed data \(s_u\) and their previsions. Table 3 suggests the accuracy levels of the MAPE criterion.
5.3 In-sample simulation
As mentioned, for simulations, the pointwise volatility of \(S_t\) is used as a proxy for the latent variable \(v_t\), while the trend of \(S_t\), represented by the EWMA, is captured by the latent variable \(\theta _t\).
To simulate the processes \(v_t, \theta _t\) we apply the strong convergent Milstein discretization ([49]) to the second and third SDE of Eq. (2). Brigo and Mercurio in [11, Section 22.7] showed that the Milstein scheme converges in a much better way than other numerical algorithms for the square-root process. Here the Lévy area terms are expressed by means of the square of the increments of the Brownian motion, as for instance discussed in [21].
Hence, for any \(1\le u\le (n-1)\),Footnote 1 we compute
and
respectively, where \(\Delta \) is the time step and \((\varepsilon ^{(i)}_u)_{u\ge 1}\) \((i=1,2,3)\) are i.i.d. (standard) normal random variables.
Once calibrated the model parameters, we simulate the auxiliary process \(X_t\) (see Eq. (15))
where
where
and
Next, we obtain by Eq. (21) in Corollary 4.4
Figure 6 and Table 4 display the results of our simulations on the entire dataset of the proposed model versus the considered benchmarks mentioned in the literature review. The fitted values are obtained by averaging 100,000 simulations. In particular, we simulate the changes of \(S_t\), jointly with their pointwise volatility and their trend (see Fig. 1 and Sect. 5).
As shown, our approach can provide an accurate fit for the considered time series. Observe that the numerical investigation confirms that \(\hat{X}_{u+1} + \hat{c}(\hat{\theta }_{u+1}, \hat{v}_{u+1}) >0\) (for any \(1\le u\le (n-1)\)) as a consequence of the equivalence between systems (5) and (19).
Real data (changes) versus simulated data via the \(CIR^3\) model Eq. (2). The top left graph shows the volatility, while the top right graph shows the trend (i.e., the EWMA). The bottom graph in the center displays the changes of real data. In-sample results
5.3.1 Partitioning and regime changes in data
One may wonder if, because of the extended time period under consideration, data can be more simply explained by a classical ARIMA which models both the moving average and autoregressive components (see Sect. 3.3), coupled with a GARCH process to take into account volatility clustering (see Sect. 3.4). To check this, we divided the dataset using the Lavielle method [42], which identifies the optimal segmentation of a time series by minimizing a contrast function that quantifies the difference between the original and segmented series. Figure 7 visually depicts the dataset partitioned into three segments, while Table 5 presents the model performance results. The proposed model outperforms an ARIMA-GARCH model in all three segments.
Real data (changes) versus simulated data via the \(CIR^3\) model Eq. (2). The top left graph shows the volatility, while the top right graph shows the trend (i.e., the EWMA). The bottom graph in the center displays the changes of real data. In-sample results. The vertical green bars highlight the different intervals identified by Lavielle’s algorithm [42]
Real trend (changes). Actual versus forecasted trend (1 month) obtained through the \(CIR^3\) model Eq. (2), ARIMA-GARCH model and NRM model. Out of sample results
Real data (changes) versus \(CIR^3\) Eq. (2) forecasts (1 month). Out of sample results
5.4 Forecasting
To predict changes in the industrial production of electric and gas utilities through our model in system (2), we use the expectations (22), (23) for \(\theta _t\) and \(v_t\), respectively. In addition, as the distribution of \(X_t\) is unknown, we take the Monte Carlo approximation, i.e.
where, for each iteration r, \(\hat{X}_{u+z,r}\) is computed as in Eq. (31), and N =100,000.
Figures 8 and 9 show how close the \(CIR^3\) model is to both real data and the selected benchmarks. Table 6 summarizes the results in terms of MAPE and NMRSE thus confirming, once again, that \(CIR^3\) forecasts are quite accurate. Here we are interested in forecasting the next data point in terms of process and trend while, for longer horizons, we check how far the estimate goes. Note that it makes sense to add the MAPE to the error analysis as, in this instance, we are dealing with forecasts.
Comparison of competing predictions Given an actual series and two competing predictions, the Diebold and Mariano test [23] calculates a measure of the predictive accuracy of those models. The null hypothesis is that the two methods have the same forecast accuracy. Similarly, the Harvey, Leybourne and Newbold test [31], checks the hypothesis of equal accuracy in forecast performance of two sources of predictions. Table 7 demonstrates that the forecasts of the three models are statistically different.
Receiver operating characteristic (ROC) analysis Having confirmed that the models provide statistically different forecasts. Next is to complement the accuracy of forecasts presented in Table 6 with the receiver operating characteristic (ROC) analysis. The ROC curve plots the true positive rate (TPR) against the false positive rate (FPR) at various threshold settings. The TPR, also said sensitivity, indicates probability of detection. The FPR, also called sensitivity, indicates the probability of a false alarm. In general, the closer the plot is to the top and left-hand borders, the more accurate the test is. Red circles indicate coordinates in terms of 1-specificity (x-axis) and sensitivity (y-axis) of the optimal threshold. This means that the closer the red dot is to the origin, the better. For more details and applications see [14, 47, 53, 65]. Figure 10 and Table 8 confirm that the best model is the \(CIR^3\). Note that, for sake of space, we show only the results over 1-month horizon but similar results have been obtained over the 3 and 6-month horizon.
6 Conclusions
In this article, we have shown how a three-factor stochastic model, which we call \(CIR^3\), can be used to predict changes in the industrial production of electric and gas utilities. To this end, we introduce a model described by a system of SDEs (which accounts for several stylized facts including mean reversion to a stochastic level, stochastic volatility, and correlations/autocorrelations) and discuss the existence and uniqueness of the solution. Next, since the process \(S_t\) is correlated with its mean and volatility, by means of Lamperti transformations we obtained an uncorrelated auxiliary process \(X_t\) useful for simulation. Numerical simulations show that the proposed model has an edge over the benchmarks considered.
Data Availability
The data that support the findings of this study are available on request from the corresponding author, G.O.
Notes
That means we assume that only the first n observations are available for \(S_t,\theta _t,v_t\).
References
Abadie, L.M.: Energy market prices in times of COVID-19: the case of electricity and natural gas in Spain. Energies 14(6), 1632 (2021)
Ahmed, R.R., Vveinhardt, J., Streimikiene, D., Channar, Z.A.: Mean reversion in international markets: evidence from G.A.R.C.H. and half-life volatility models. Econ. Res.-Ekon. Istraživanja 31(1), 1198–1217 (2018)
Antonov, A., Konikov, M., Spector, M.: Sabr spreads its wings. Risk 26(8), 58 (2013)
Bali, T.G., Demirtas, K.O., Levy, H.: Nonlinear mean reversion in stock prices. J. Bank. Financ. 32(5), 767–782 (2008)
Barndorff-Nielsen, O.E., Shephard, N.: Non-Gaussian Ornstein–Uhlenbeck-based models and some of their uses in financial economics. J. R. Stat. Soc.: Ser. B (Stat. Methodo.) 63(2), 167–241 (2001)
Bibby, B.M., Jacobsen, M., Sørensen, M.: Estimating functions for discretely sampled diffusion-type models. In: Handbook of Financial Econometrics: Tools and Techniques. pp. 203–268. Elsevier (2010)
Bilgili, M., Sahin, B., Yasar, A., Simsek, E.: Electric energy demands of Turkey in residential and industrial sectors. Renew. Sustain. Energy Rev. 16(1), 404–414 (2012)
Board of Governors of the Federal Reserve System (US) (2020). Industrial Production: Utilities: Electric and Gas Utilities (NAICS = 2211,2) [IPUTIL]. https://fred.stlouisfed.org/series/IPUTIL#0. Accessed 22 Dec 2020
Bourret, R.C., Frisch, U., Pouquet, A.: Brownian motion of harmonic oscillator with stochastic frequency. Physica 65(2), 303–320 (1973)
Boya, C.M.: From efficient markets to adaptive markets: evidence from the French stock exchange. Res. Int. Bus. Financ. 49, 156–165 (2019)
Brigo, D., Mercurio, F.: Interest Rate Models-Theory and Practice: With Smile, Inflation and Credit. Springer Science & Business Media, Berlin (2007)
Brown, S.J.: The efficient market hypothesis, the financial analysts journal, and the professional status of investment management. Financ. Anal. J. 76(2), 5–14 (2020)
Bufalo, M., Orlando, G.: A three-factor stochastic model for forecasting production of energy materials. Financ. Res. Lett. 51, 103356 (2022)
Carter, J.V., Pan, J., Rai, S.N., Galandiuk, S.: ROC-ing along: evaluation and interpretation of receiver operating characteristic curves. Surgery 159(6), 1638–1645 (2016)
Chavez, S.G., Bernat, J.X., Coalla, H.L.: Forecasting of energy production and consumption in Asturias (northern Spain). Energy 24(3), 183–198 (1999)
Chen, L.: A bond pricing formula under a non-trivial, three-factor model of interest rates. Econ. Lett. 51(1), 95–99 (1996)
Chen, L.: three-factor model of the term structure of interest rates. In:Interest Rate Dynamics, Derivatives Pricing, and Risk Management, pp. 1–36. Springer (1996b)
Chortareas, G.E., Kapetanios, G., Shin, Y.: Nonlinear mean reversion in real exchange rates. Econom. Lett. 77(3), 411–417 (2002)
Cox, J.C., Ingersoll Jr, J.E., Ross, S.A.: A theory of the term structure of interest rates. In: Theory of Valuation, pp. 129–164. World Scientific (2005)
De Jong, P., Heller, G.Z.: Generalized linear models for insurance data. Cambridge University Press (2008)
Deya, A., Neuenkirch, A., Tindel, S.: A Milstein-type scheme without Lévy area terms for SDEs driven by fractional Brownian motion. Ann. del’I.H.P. Probabilités et Stat. 48(2), 518–550 (2012)
Diallo, A., Kácsor, E., Vancsa, M.: Forecasting the spread between HUPX and EEX dam prices the case of Hungarian and German wholesale electricity prices. In: 2018 15th International Conference on the European Energy Market (EEM), pp. 1–5. IEEE (2018)
Diebold, F.X., Mariano, R.S.: Comparing predictive accuracy. J. Bus. Econ. Stat. 13(3), 253–263 (1995)
Engle, R.F.: Autoregressive conditional heteroscedasticity with estimates of the variance of United Kingdom inflation on JSTOR. Econometrica 50(4), 987–1007 (1982)
Ethier, S.N., Kurtz, T.G.: Markov Processes: Characterization and Convergence. Wiley (1986)
Fama, E.F.: Random walks in stock market prices. Financ. Anal. J. 21(5), 55–59 (1965)
Goldburd, M., Khare, A., Tevet, D.: Generalized linear models for insurance rating. Number 5 in CAS Monographs Series. Casualty Actuarial Society (2016)
Gulcu, A., Yildirim, D.: Smooth breaks and nonlinear mean reversion in real interest parity: evidence from East Asian countries. J. Int. Trade Econ. Dev. 28(6), 668–685 (2019)
Gupta, A., Sharma, K.C., Vijayvargia, A., Bhakar, R.: Very short term wind power prediction using hybrid univariate ARIMA-GARCH model. In: 2019 8th International Conference on Power Systems (ICPS), pp. 1–6. IEEE (2019)
Hagan, P.S., Kumar, D., Lesniewski, A.S., Woodward, D.E.: Managing smile risk. Best Wilmott 1, 249–296 (2002)
Harvey, D., Leybourne, S., Newbold, P.: Testing the equality of prediction mean squared errors. Int. J. Forecast. 13(2), 281–291 (1997)
Heston, S.L.: A closed-form solution for options with stochastic volatility with applications to bond and currency options. Rev. Financ. Stud. 6(2), 327–343 (1993)
Holland, P.W., Welsch, R.E.: Robust regression using iteratively reweighted least-squares. Commun. Stat.-Theory Methods 6(9), 813–827 (1977)
Hoque, H.A.A.B., Kim, J.H., Pyun, C.S.: A comparison of variance ratio tests of random walk: a case of Asian emerging stock markets. Int. Rev. Econ. Financ 16(4), 488–502 (2007)
Hussin, N.H., Yusof, F., Norrulashikin, S.M., et al.: Forecasting wind speed in Peninsular Malaysia: an application of ARIMA and ARIMA-GARCH models. Pertan. J. Sci. Technol. 29(1), 31–58 (2021)
Ikeda, N., Watanabe, S.: Stochastic Differential Equations and Diffusion Processes. North Holland Publ. Co., Amsterdam— Oxford—New York 1981. John Wiley & Sons, Ltd (1986)
Jeanblanc, M., Yor, M., Chesney, M.: Mathematical methods for financial markets. Springer Science & Business Media (2009)
Jiang, S., Yang, C., Guo, J., Ding, Z.: ARIMA forecasting of China’s coal consumption, price and investment by 2030. Energy Sourc. Part B 13(3), 190–195 (2018)
Karatzas, I., Shreve, S.: Brownian Motion and Stochastic Calculus. Springer Science & Business Media (2012)
Kumru, M, Kumru, P.: Calendar-based short-term forecasting of daily average electricity demand. In: 2015 International Conference on Industrial Engineering and Operations Management (IEOM), pp. 1–5. IEEE (2015)
Laurini, M.P., Hotta, L.K.: Generalized moment estimation of stochastic differential equations. Comput. Stat. 31(3), 1169–1202 (2016)
Lavielle, M.: Using penalized contrasts for the change-point problem. Signal Process. 85(8), 1501–1510 (2005)
Lipton, A., Gal, A., Lasis, A.: Pricing of vanilla and first-generation exotic options in the local stochastic volatility framework: survey and new results. Quant. Financ. 14(11), 1899–1922 (2014)
Litterman, R.B., Scheinkman, J., Weiss, L.: Volatility and the yield curve. J. Fixed Income 1(1), 49–53 (1991)
Lo, A.W., MacKinlay, A.C.: The size and power of the variance ratio test in finite samples: a Monte Carlo investigation. J. Econom. 40(2), 203–238 (1989)
Mahia, F., Dey, A.R., Masud, M.A., Mahmud, M.S.: Forecasting electricity consumption using ARIMA model. In: 2019 International Conference on Sustainable Technologies for Industry 4.0 (STI), pp. 1–6. IEEE (2019)
Mandrekar, J.N.: Receiver operating characteristic curve in diagnostic test assessment. J. Thorac. Oncol. 5(9), 1315–1316 (2010)
Martínez-Cagigal, V.: ROC Curve: File Exchange—MATLAB CentralFile Exchange—MATLAB Central (2018). Accessed 24 Sep 2022
Mil’shtein, G.: A method of second-order accuracy integration of stochastic differential equations. Theory Probab. Appl. 23(2), 396–401 (1979)
Mohammadi, H., Su, L.: International evidence on crude oil price dynamics: applications of ARIMA-GARCH models. Energy Econ. 32(5), 1001–1008 (2010)
Noskov, S., Vergasov, A.: Regression model of electricity generation at nuclear power plants in Russia with respect to the nonlinear predictors. Adv. Appl. Stat. 70(2), 229–233 (2021)
Ohlsson, E., Johansson, B.: Non-life Insurance Pricing with Generalized Linear Models, vol. 174. Springer, Berlin (2010)
Orlando, G., Bufalo, M.: Modelling bursts and chaos regularization in credit risk with a deterministic nonlinear model. Financ. Res. Lett. 47, 102599 (2022)
Petroni, N.C., Sabino, P.: Gamma related Ornstein–Uhlenbeck processes and their simulation (2020). arXiv preprint arXiv:2003.08810
Rui, D., Teixeira, N., Machova, V., Pardal, P., Horak, J., and Vochozka, M.: Random walks and market efficiency tests: evidence on US, Chinese and European capital markets within the context of the global Covid-19 pandemic (2020). Accessed 3 Jun 2023
Sabino, P., Cufaro Petroni, N.: Fast pricing of energy derivatives with mean-reverting jump-diffusion processes. Appl. Math. Financ. 28(1), 1–22 (2021)
Sagadavan, R., Djauhari, M.A.: Autocorrelated multivariate process control: a geometric Brownian motion approach. AIP Conf. Proc. 1557(1), 571–575 (2013)
Scheidt, J.V.: Karatzas, I.; Shreve, S. E., Brownian Motion and Stochastic Calculus. New York, Springer-Verlag 1988. XXIII, 470 pp., 10 figs., DM 138, ISBN 3–540-96535-1 . Z. angew. Math. Mech., 69(11):423 (1989)
Shi, J., Qu, X., Zeng, S.: Short-term wind power generation forecasting: direct versus indirect ARIMA-based approaches. Int. J. Green Energy 8(1), 100–112 (2011)
Soares, L.J., Medeiros, M.C.: Modeling and forecasting short-term electricity load: a comparison of methods with an application to Brazilian data. Int. J. Forecast. 24(4), 630–644 (2008)
Uhlenbeck, G.E., Ornstein, L.S.: On the theory of the Brownian motion. Phys. Rev. 36(5), 823 (1930)
Van der Stoep, A.W., Grzelak, L.A., Oosterlee, C.W.: The Heston stochastic-local volatility model: efficient Monte Carlo simulation. Int. J. Theor. Appl. Financ. 17(07), 1450045 (2014)
Wong, H.Y., Lo, Y.W.: Option pricing with mean reversion and stochastic volatility. Eur. J. Oper. Res. 197(1), 179–187 (2009)
Yotto, H. C.S., Chetangny, P.K., Houndedako, S., Aredjodoun, J., Chamagne, D., Barbier, G., Vianou, A.: Estimation and forecasting electricity load in Benin: using econometric model ARIMA/GARCH. In: 2021 International Conference on Electrical, Communication, and Computer Engineering (ICECCE), pp. 1–6. IEEE (2021)
Zou, K.H., O’Malley, A.J., Mauri, L.: Receiver-operating characteristic analysis for evaluating diagnostic tests and predictive models. Circulation 115(5), 654–657 (2007)
Acknowledgements
The authors are members of INdAM-GNAMPA (Istituto Nazionale di Alta Matematica, Italy). The first author has been partially supported through Project U-UFMBAZ-2020-000791. M.B. acknowledges support from GNAMPA (Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni) for researching on anomalous diffusion and its applications to fractal domains: Physics and Mathematical Finance. G.O. acknowledges support from the Spanish Ministry of Science and Innovation (PID2022-138144NB-I00), the Center for Market Studies and Spatial Economics at the HSE University, and GNAMPA (Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni) for researching on anomalous diffusion and its applications to fractal domains: Physics and Mathematical Finance. The authors thank the anonymous reviewers for their valuable feedback.
Funding
Open access funding provided by Universitá degli Studi di Bari Aldo Moro within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Ceci, C., Bufalo, M. & Orlando, G. Modelling the industrial production of electric and gas utilities through the \(CIR^3\) model. Math Finan Econ (2024). https://doi.org/10.1007/s11579-023-00350-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11579-023-00350-y