1 Introduction

Standard stochastic volatility (SV) models such as the well-known Heston or Bates models have been introduced long ago to address the shortcomings of the celebrated model of Black & Scholes. However, such models are still limited and cannot reproduce some important empirical facts of the historical volatility and of the observed implied volatility surface.

In their seminal paper, Gatheral et al. [26] propose a model called “Rough Fractional Stochastic Volatility” (RFSV) where the log-volatility process is modeled in terms of a fractional Brownian motion. More precisely, a fractional Ornstein-Uhlenbeck process is used with \(H<1/2\), in contrast with the “Fractional Stochastic Volatility” model (FSV) previously introduced by Comte and Renault [15], where the Hurst index is assumed to satisfy \(H > 1/2\). Gatheral et al. [26] find a highly consistent model with empirical estimates of the volatility time series. Moreover, the RFSV volatility process is stationary, which ensures reasonableness of the model in the long run. However, due to the need of Monte-Carlo simulations, this RFSV model does not provide rapidly option prices, nor an effective calibration procedure. More tractable pricing models such as the rBergomi from Bayer et al. [4] or the rough Heston from El Euch et al. [19] were then derived from the RFSV model to improve the efficiency of model calibration and option pricing. These two models appear to be extremely good at fitting the implied volatility surface while still reproducing the empirical properties of the historical volatility due to their rough sample paths. This paper shows how to expand these two rough-type models at larger time scales in order to price long-term life insurance contracts and explains why they provide a more accurate value of such contracts.

First, we will see that the stationary property of the volatility in the RFSV model will be lost when considering the rBergomi and rough Heston models. Despite this non-stationary property, we will manage to build for the rough Heston model a stable regime for the volatility in the long-run so as to price consistently long-term life insurance claims. On the contrary, for the rBergomi model, we will show that the generated volatility at large time scales will not be realistic and will tend to exhibit extremely large swings. Moreover, since these long-term properties of the rough Heston and rBergomi models depend heavily on the forward variance curve, we will show how to extrapolate the values of this curve beyond the observed maturities. In particular, we will derive a methodology based on a SSVI parametrization of the volatility surface which allows the forward variance curve to be consistent with the absence of static arbitrage and to be stable for large maturities. We will then verify practically these long-term properties by valuating an equity-linked endowment embedding a cliquet option (using four different models: Heston, Bates, rBergomi and rough Heston). This will allow to highlight the added-value of using rough-volatility models (and especially the rough Heston) for pricing such long-term life insurance contracts.

The main contribution of this paper is therefore to propose a calibration and a pricing methodology for long-term life insurance contracts using rough volatility models. We also contribute with this work to highlight and study the divergent long-term properties between rough volatility models and standard SV models. This way, we will be able to explain why these rough volatility models provide a more consistent and accurate fair value of such long-term claims. We will indeed find a significantly different fair value of our equity-linked endowment with models incorporating rough volatility, especially at long time scales with the rough Heston model. In fact, we will see that this rough Heston model exhibits the best long-term properties while being highly tractable. This paper also aims to compare these models in terms of fit of the observed European option prices and implied volatility surface, which will confirm the high robustness of rough volatility models (and especially of the rough Heston model).

2 Life insurance contract

Our aim is thus to price the following equity-linked endowment using rough volatility models and compare them with standard SV models, such as the Heston and Bates models. We thus consider an insurance contract that provides a lump sum payment at maturity T in case of survival, or a payment at the time of death if it occurs before T. We first denote by \(\tau _0\) the random residual lifetime of the policyholder at time \(t=0\). We suppose an equity-linked case, where payment amounts depend on the market value of a reference fund F(t) and where an amount F(0) is invested in it at \(t=0\). We then introduce a yearly minimal guarantee \(\kappa _g\) for the policyholder and a maximal yearly return \(\kappa _{m}\) that this policyholder can earn on the fund. The survival benefit is then given by \(F_T \, \mathbbm {1}_{\tau _0 \ge T}\) and the death benefit by \(F_{\tau _0} \, \mathbbm {1}_{\tau _0 <T}\), where \(F_t\) is given by a series of guarantees (known as a cliquet option) for \(t \in [0, T]\):

$$\begin{aligned} F_t&= F_0 \prod _{u=1}^{\left\lfloor {t}\right\rfloor } \min \left\{ e^{\kappa _m} \, ; \, \max \left\{ 1 + \pi \left( \frac{S_u}{S_{u-1}} - 1 \right) \, ; \, e^{\kappa _g}\right\} \right\} \nonumber \\ \, \,\quad & \times \min \left\{ e^{\kappa _m(t-\left\lfloor {t}\right\rfloor )} \, ; \, \max \left\{ 1 + \pi \left( \frac{S_t}{S_{\left\lfloor {t}\right\rfloor }} - 1 \right) \, ; \, e^{\kappa _g(t-\left\lfloor {t}\right\rfloor )}\right\} \right\} . \end{aligned}$$
(1)

In the equation above, \(S_t\) is the price process of each fund unit defined on a filtration \(({\mathcal {F}}_t)_{t\ge 0}\). The cliquet above implicitly assumes yearly resettlements. Moreover, the rate \(\pi \) identifies the portion of yearly return recognized to the policyholder with \(\pi \in (0,1]\).

In this life insurance claim, we will model mortality using the following assumptions. We first introduce the non-homogeneous Poisson process \(N_t:= \mathbbm {1}_{\tau _0 <t}\) defined on a filtration \(({\mathcal {G}}_t)_{t\ge 0}\), which is equal to zero as long as the individual is alive and jumps to one at death. We then assume independence between the mortality process \(N_t\) and the price process \(S_t\). The intensity of \(N_t\) is given by the deterministic Makeham force of mortality \(\mu _x(t) = c+ab^{\,x+t}\) for a policyholder aged x at time \(t=0\). Hence, her survival probability at time T, being alive at \(t=0\), is equal to

$$\begin{aligned} _{T}p_{x}= P(N_T = 0 \, | \,{\mathcal {G}}_0) = \exp \left( -\int _0^T \mu _x(u) \, du \right) . \end{aligned}$$

We now introduce the main characteristics of the RFSV model from Gatheral et al. [26] and then derive two tractable models from it, the rBergomi and the rough Heston models, that will help us to give a more consistent and accurate market value to this insurance contract.

3 Key features of the RFSV model

Gatheral et al. in [26] build the Rough Fractional Stochastic Volatility model (RFSV) with the following SDE under the real-world measure \({\mathbb {P}}\):

$$\begin{aligned} dS_t&= \mu \, S_t \, dt + \sigma _t \, S_t \, dW_t \, , \end{aligned}$$
(2)
$$\begin{aligned} d \log (\sigma _t)&= \lambda (\theta - \log (\sigma _t))dt + \nu \, dB^H_t \, , \end{aligned}$$
(3)

where \(\mu \) is a drift term, \(W_t\) a standard Brownian motion and \(B^H_t\) is a fractional Brownian motion (fBm) with Hurst exponent H. \(W_t\) and \(B^H_t\) are in general correlated through the constant correlation \(\rho \) between \(W_t\) and the Brownian motion driving the Mandelbrot & Van Ness representation [32] of \(B^H_t\). Recall that the sample paths of \(B^H_t\) are Hölder-continuous of order \(\beta \) for any \(\beta < H\). Similarly, sample paths of \(B^H_t\) are almost surely nowhere Hölder-continuous of order \(\beta \) for any \(\beta > H\). Therefore, the larger the Hurst exponent H, the smoother the sample paths and the lower the H, the rougher they are. More precisely, when \(H<1/2\), the sample paths of the fBm are called rough. Since \(\log (\sigma _t)\) is driven in Eq. (3) by such fBm, the sample paths of \(\log (\sigma _t)\) are also Hölder-continuous of order \(\beta \) for any \(\beta < H\) and exhibit the same regularity behavior as \(B^H_t\).

Given that Eq. (3) models the log-volatility as a fractional Ornstein-Uhlenbeck (fOU) process, we refer to Cheridito et al. [14] for the derivation of its stationary solution. One important property of the RFSV is that the volatility process \(\sigma _t\) itself generated by this model is again stationary, which ensures reasonableness of the model at large time scales when pricing long-term claims on the underlying. The cornerstone of [26] is to impose the Hurst exponent H of the fBm \(B^H_t\) to be in \(\left( 0, \frac{1}{2} \right) \) and a very long reversion time scale with \(\lambda \ll 1/T\). Such conditions aim to generate rough sample paths and to introduce short-range dependence for the volatility process as explained by Gatheral et al. in [26]. Their findings are extremely consistent at any time scales with empirical statistical properties of the observed volatility time series as confirmed in Bennedsen et al. [7]. Firstly, Gatheral et al. show, for \(\lambda \ll 1/T\) and \(0< H < 1\), that the log-volatility process \(\log \sigma _t\) behaves locally as a fBm. Indeed, as \(\lambda \rightarrow 0\), they prove that the log-volatility is close to the fBm \(B^H_t\) under the RFSV model (3). Therefore, this model approximately reproduces the well-known scaling property of the fBm (cfr Karatzas and Shreve [30]). This scaling property is clearly verified empirically for the discrete historical log-volatility process and allows us to estimate the Hurst index H of the observed volatility process. Many studies (as in [7]) consistently find \(H \approx 0.1\), which confirms the rough property of the historical volatility, i. e. a Hurst exponent \(H<1/2\). Secondly, neither the autocovariance function \(\text {cov}(\sigma _{t+\Delta }, \sigma _t)\) of the volatility process in the RFSV model nor the empirical counterpart of \(\text {cov}(\sigma _{t+\Delta }, \sigma _t)\) decay as a power law (cfr Figure 12 on p. 941 in [26] with \(\Delta >0\)). Therefore, these authors claim that historical volatility data are in accordance with the RFSV model, on the contrary to long memory models such as the FSV model of Comte and Renault [15].

Finally, the RFSV model is not only consistent with empirical statistical properties of the volatility time series but also with the observed implied volatility surface. Indeed, standard stochastic volatility models such as the Hull and White and Heston models do not provide a good fit of the volatility surface (especially for short expirations). Particularly, these models generate a term structure of at-the-money (ATM) volatility skewFootnote 1 under a risk-neutral measure \({\mathbb {P}}^*\) that does not match the observed one for small time to maturity \(\tau =T-t\). Instead, empirical studies show that this observed term structure of ATM skew is well approximated by power-law functions of the form \(\psi (\tau ) \sim \tau ^{H-1/2}\) with \(0< H < 1/2\), which can be generated by stochastic volatility models where the log-volatility is driven by fBm with values of the Hurst exponent in (0, 1/2). The explosion of the volatility skew as \(\tau \rightarrow 0\) can therefore be modeled using fBm without introducing jumps in our model, as we will confirm in Sect. 5.

Although empirically and theoretically grounded, the RFSV does not allow to price rapidly option prices and therefore to calibrate effectively a pricing model. Indeed, we need Monte-Carlo simulations which are fairly slow, especially for calibration. Therefore, we now show how to adapt the RFSV to obtain more tractable models that can be used to price long-term claims on the underlying. We first analyze in more detail a simple case of the RFSV model when \(\lambda =0\), called the rBergomi model, built upon a forward variance curve. Secondly, we study an extension of the classical Heston model incorporating a rough fractional volatility process. This rough Heston model has the nice property of generating a realistic long-term behavior for the volatility process and of having a characteristic function of the log-stock price in quasi-closed form.

3.1 rBergomi model

First, let denote \(v_u = \sigma ^2_u\) the instantaneous variance at time u and the forward variance curve (also called variance swap curve) \(\xi _t(u) = {\mathbb {E}}[v_u | {\mathcal {F}}_t], \, \, u \ge t\). We can easily recover \(v_t\) from \(\xi _t(u)\) as \(v_t = \lim _{u\rightarrow t} \xi _t(u)\). Following Bergomi and Guyon [9], forward variance models are models that can be written as a function of this curve \(\xi _t(u).\)

From Bayer et al. [4], the rBergomi model is obtained from the RFSV model by setting \(\lambda = 0\) in the log-volatility dynamic (3). Under this assumption and from the Mandelbrot & Van Ness representation [32] of the fBm, these authors show that the variance process \(v_u\) in the rBergomi model can be rewritten under the physical measure \({\mathbb {P}}\) as

$$\begin{aligned} v_u&= {\mathbb {E}}^{{\mathbb {P}}}(v_u | {\mathcal {F}}_t) \exp \left( \eta \, {\tilde{W}}_t(u) - \frac{1}{2} {\mathbb {E}}^{{\mathbb {P}}} \Big [ |\eta {\tilde{W}}_t(u)|^2 \Big ] \right) = {\mathbb {E}}^{{\mathbb {P}}}(v_u | {\mathcal {F}}_t) \, {\mathcal {E}} \left( \eta {\tilde{W}}_t(u) \right) , \end{aligned}$$

where \({\tilde{W}}_t (u):= \sqrt{2\,H} \int _t^u (u-s)^{H-1/2 \,}d{\hat{W}}_s\) is a square-integrable continuous martingale with \({\hat{W}}_t\) a standard Brownian motion under \({\mathbb {P}}\). We also define the stochastic exponential \({\mathcal {E}}:= \exp \left( \Phi - 1/2 \, {\mathbb {E}}[|\Phi |^2] \right) \) with \(\Phi \) a zero-mean Gaussian process. Finally, we define \(\eta := 2 \, \nu \, C_H / \sqrt{2\,H}\) where \(C_H: = \sqrt{\frac{2\,H \, \Gamma (3/2-H)}{\Gamma (H+1/2)\, \Gamma (2-2\,H)}}\).

Under an equivalent risk-neutral measure \({\mathbb {P}}^*\) (with a deterministic change of measure), the authors in [4] finally find for variance process in the rBergomi model

$$\begin{aligned} v_u = {\mathbb {E}}^{{\mathbb {P}}^*}[v_u | {\mathcal {F}}_t] \, \exp \Big \{ \eta \sqrt{2H} \int _t^u \frac{1}{(u-s)^{1/2-H}}dW^*_s - \frac{\eta ^2}{2}(u-t)^{2H} \Big \} \, , \end{aligned}$$
(4)

where \(W^*_t\) is a standard Brownian motion under \({\mathbb {P}}^*\) and where \({\mathbb {E}}^{{\mathbb {P}}^*}[v_u | {\mathcal {F}}_t]\) is the forward variance curve \(\xi _t(u)\) observed on the market. In Eq. (4), it is important to note the presence at \(t=0\) of a Riemann-Liouville fBm \(\int _0^u C_H \, (u-s)^{H-1/2} \, dW^*_s\) as defined in [13], with a Hurst index H that induces the rough behavior of the variance process \(v_u\). In particular, its paths are \((H-\varepsilon )\)-Hölder continuous, as classical fBm. Moreover, since \({\mathbb {E}}^{{\mathbb {P}}^*}[v_u | {\mathcal {F}}_t] \ne {\mathbb {E}}^{{\mathbb {P}}^*}[v_u | v_t] \), this model is non-Markovian. Nevertheless, given the state vector \(\xi _t(u)\), which can in principle be computed from observed option prices (cfr section 4.3), the dynamics of the model are well-determined. Finally, the pricing model to be simulated under \({\mathbb {P}}^*\) is

$$\begin{aligned} S_T&= S_0 \, e^{rT} \, {\mathcal {E}} \left( \int _0^T \sqrt{v_u} \, dW^{*, S}_u \right) , \nonumber \\ v_t&= \xi _0(t) \, {\mathcal {E}} \left( \eta \, {\tilde{W}}^{*}_t \right) = {\mathbb {E}}^{{\mathbb {P}}^*}[v_t | {\mathcal {F}}_0] \, {\mathcal {E}} \left( \eta \, {\tilde{W}}^{*}_t \right) , \end{aligned}$$
(5)

where \(W^{*,S}_t = \rho \, W^*_t + \sqrt{1- \rho ^2} \, W^{'^{\,*}}_t\) and \(W^{'^{\,*}}_t\) a Brownian motion under \({\mathbb {P}}^*\), independent from \(W^*_t\). We can hence model the leverage effect in the rBergomi model with \(\rho < 0\). It is important to note that the assumption \(\lambda = 0\) prevents us from having a stationary model for the volatility and the variance processes. Long-term behavior of the rBergomi model will be discussed more thoroughly in Sect. 3.3.

3.2 Rough Heston

A reminder of the standard Heston model is given in [24] for interested readers. Bergomi and Guyon in [9] show that this Heston model can be written in terms of the forward variance curve \(\xi _t(u)\) (cfr Appendix) by

$$\begin{aligned} d\xi _t(u) = \nu e^{-\lambda (u-t)} \sqrt{v_t} \, d{\hat{W}}^*_t \,. \end{aligned}$$
(6)

Furthermore, from the classical Mandelbrot-van Ness [32] representation of the fBm, we clearly see that the kernel \((u-s)^{H-1/2}\) plays a central role in the rough dynamic of the fBm with \(H<1/2\). Indeed, as said above, one can show that the Riemann-Liouville fBm \(\int _0^u C_H \, (u-s)^{H-1/2} \, d{\hat{W}}^*_s \,\) has Hölder regularity \(H - \varepsilon \) for any \(\epsilon > 0\). Therefore, in order to allow for a rough behavior of the variance process in a Heston-type model, El Euch and Rosenbaum [21] naturally introduce the kernel \((u-s)^{H-1/2}\) in the risk-neutral stochastic differential equation of the Heston model with \(C_H:= 1 / \Gamma (H+1/2)\) as follows

$$\begin{aligned} v_u = v_t + \frac{\lambda }{\Gamma (H + 1/2)}\int _t^u \frac{\theta ^t(s)-v_s}{(u-s)^{1/2-H}} \, ds + \frac{\nu }{\Gamma (H+1/2)}\int _t^u \frac{ \sqrt{v_s}}{(u-s)^{1/2-H} } \, d{\hat{W}}^*_s \,, \end{aligned}$$
(7)

where \(\theta ^t(.)\) is assumed to be continuous, \({\mathcal {F}}_t\)-measurable and represents a time-dependent mean reversion level. When \(H=1/2\), one can verify that we indeed recover the classical Heston model. It can also be shown that the trajectories of the volatility itself are almost surely Hölder-continuous of order \(H - \varepsilon \), for any \(\varepsilon > 0\).

Moreover, El Euch et al. show in [20] and [19] that \(\lambda \theta ^t(.)\) can be directly inferred from the forward variance curve observed at time t on the market \(\xi _t(u)\). By doing so, they can rewrite the model (7) in the asymptotic setting \(\lambda \rightarrow 0\) by

$$\begin{aligned} v_u = \xi _t(u) + \frac{1}{\Gamma (H+1/2)} \int _t^u (u-s)^{H-1/2} \nu \sqrt{v_s} \, d{\hat{W}}^*_s \,. \end{aligned}$$
(8)

Since the forward variance curve \(\xi _0(u)\) at time \(t=0\) is observed (or at least can be retrieved) from the market, we only have three parameters left to estimate: H, \(\nu \) and \(\rho \). Reduction of parameters for the rough Heston is of utmost importance for improving the efficiency of the calibration methodology. Moreover, the fact that we consider \(\lambda = 0\) implies that the average long-term behavior of the variance process in the rough Heston will be ruled by the forward variance curve \(\xi _0(t)\) and not by the mean-reversion parameters \(\lambda \) and \(\theta ^0(\cdot )\) anymore (see Sect. 3.3 for more details). We also note that the limit \(u \rightarrow t\) of the rough Heston makes no sense. This reflects the fact that this model (as the rBergomi) is not Markovian with respect to the current variance state \(v_t \,\). However, it is directly visible from equation (8) that the rough Heston is Markovian in the forward variance curve \(\xi _t(u)\). Finally, the rough Heston can also be expressed in the forward variance form as

$$\begin{aligned} d\xi _t(u) = \frac{\nu }{\Gamma (H+1/2)}(u-t)^{H-1/2} \sqrt{v_t} \, d{\hat{W}}^*_t \,, \end{aligned}$$

which is highly similar to (6), with the presence of a Riemann-Liouville kernel in addition.

Furthermore, from Gatheral and Keller-Ressel [27], we have that the Heston and rough Heston forward variance models are said to be affine. In both cases, they show that the characteristic function of the log-asset price \(X_T = \log S_T\) at time t can be written as

$$\begin{aligned} \Phi _t(a,T) = {\mathbb {E}}\left[ e^{i a X_T}|{\mathcal {F}}_t \right] = \exp \left( i a \left( X_t + r(T-t) \right) + \int _t^T \xi _t(s) \, g(T-s,a) \, ds \right) , \end{aligned}$$
(9)

where \(g(\cdot ,a):{\mathbb {R}}_+ \rightarrow {\mathbb {R}}_-\) is the unique global continuous solution of a convolution Riccati equation (cfr [27]). In the classical Heston model, they find that

$$\begin{aligned} g(t, a) = \partial _t \, h(t, a) + \lambda \, h(t,a)\,, \end{aligned}$$

where \(h(\cdot , a):{\mathbb {R}}_+ \rightarrow {\mathbb {R}}_-\) is the unique \(C^1\)-function solving the Riccati ODE

$$\begin{aligned} \partial _t \, h(t, a) =-\frac{1}{2} a(a+i) - (\lambda - i\, a \, \rho \, \nu \,) h(t,a) + \frac{1}{2}\nu ^2 \, h(t,a)^2, \ \ \ \ \ h(0,a) = 0 \,. \end{aligned}$$
(10)

In the rough Heston model with \(\lambda =0\), we have this time that \(h(\cdot , a)\) is the unique continuous solution of the following fractional Riccati equation

$$\begin{aligned} g(t, a) = D^\alpha h(t, a) = -\frac{1}{2} a(a+i) + i \, a \, \rho \, \nu \, h(t,a) + \frac{1}{2} \nu ^2 \, h(t,a)^2, \ \ \ \ \ I^{1- \alpha } h(0, a) = 0 \,, \end{aligned}$$
(11)

with \(D^\alpha \) the Rieman-Liouville fractional derivative of order \(\alpha = H +1/2\) and \(I^{1 - \alpha }\) the Rieman-Liouville fractional integral of order \(1-\alpha \) (cfr [19]). This equation is exactly the same as in the classical Heston model (with zero-mean reversion) but with the time derivative replaced by a fractional derivative that makes the model rough.

We thus have a quasi-closed formula for the characteristic function in the rough Heston model since it is given by the simple expression (9). Contrary to the classical Heston case, the drawback is that there is no explicit solution to (11). However, this fractional Riccati equation can be solved numerically quasi instantaneously using the method described in Gatheral & Radoičić [28], built upon the combination of a short-time and an asymptotic expansion of the solution h(ta). As shown in [28], this method is particularly fast, simple and accurate to compute the approximate solution of the fractional Riccati equation (11). We refer to Section 5 of [28] for a more thorough discussion on the quality and convergence of their approximation for h(at). Then, European option prices may be obtained efficiently from the characteristic function (9) using standard Fourier techniques such as the Carr-Madan formula [12]. Finally, this fractional Riccati ODE also allows to study portfolio insurance strategies, as developed in [18].

3.3 Long-term behavior of rBergomi and rough Heston models

The Riemann-Liouville fBm in Eq. (5) for the rBergomi and in Eq. (8) for the rough Heston is a non-stationary process with non-stationary increments (cfr [13]), which thus also makes the variance process of both models non-stationary. Hence, we need to verify whether the variance process exhibits a reasonable long-term behavior in the rBergomi and rough Heston models since we do not have this stationary property anymore as we have in the RFSV model with \(\lambda \ne 0\) (which ensures reasonable and stable properties on the long-run for the RFSV variance process, cfr [26]). More precisely, we have to make sure that this variance process (and particularly its two first conditional moments) do not explode for large maturities.

First, as shown in [16] and in [2], the Riemann-Liouville fBm can be represented as an infinite superposition of Ornstein-Uhlenbeck processes with different mean reversion speeds. Hence, even if \(\lambda = 0\), authors in [1] show that there is an inherent mean reversion feature around the forward variance curve in rough volatility models coming from the Riemann-Liouville fBm. This mean-reversion property combined with an appropriate behavior of the long-end of the forward variance curve may thus allow for a sufficiently well behaved variance process at large maturity in rough volatility models, as we will see. For both the rough Heston and the rBergomi models, the fact that we consider \(\lambda = 0\) implies that the average long-term behavior of the variance process will be ruled by the forward variance curve \(\xi _0(t)\) and not anymore by the mean-reversion parameters \(\lambda \) and \(\theta ^t(\cdot )\) of Eq. (7) for the rough Heston, nor by the parameters \(\lambda \) and \(\theta \) of Eq. (3) for the rBergomi. Indeed, this simply comes from the fact that the conditional mean of the variance process is now given by \({\mathbb {E}}^{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t] = \xi _t(u)\). Hence, we will discuss in Sect. 4.3 how to build such forward variance curves with non-exploding long-term behavior. However, long-term behavior of the variance process in both models is not solely driven by the form of this forward variance curve (since this curve only controls the average value of the variance process). The non-stationary Riemann-Liouville fBm has also a strong impact on the conditional variance of the variance process given by \({\mathbb {V}}_{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t]\), as explained in Appendix. Indeed, this quantity \({\mathbb {V}}_{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t]\) increases exponentially with u in the rBergomi model and hence tends to exhibit extreme values for large maturities (see Fig. 15). For the rough Heston model, the rate of increase of this quantity is much slower and hence, we do not observe this exponential growth of \({\mathbb {V}}_{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t]\) for the considered maturities, as again depicted on Fig. 15. The generated variance sample paths at large time scales are thus more reasonable and less volatile in the rough Heston while they exhibit extremely large swings for the rBergomi. This behavior will have an impact on the price of long-term life insurance claims as we will see in the last section of this paper.

4 Calibration

In this section, we will show how to build a consistent calibration methodology. We will then compare four models (Heston, Bates, rBergomi and rough Heston) in terms of fit of the observed European option prices and implied volatility surface for the CAC 40 index. We will mainly focus on the long-term behavior of such models and we will then use these calibrated models to price our equity-linked endowment embedding the cliquet option (1).

As in [26], we first provide a statistical estimation of the RFSV parameters based on historical data for the CAC 40 index from January 2004 to July 2020 (obtained from the Oxford-Man Institute). Using the scaling property of [26] to estimate the smoothness of the volatility process for the CAC 40, we find an estimated value \({\hat{H}} = 0.13\). The historical estimate of \(\nu \) is also derived from this scaling property and is equal to \({\hat{\nu }} = 0.31\).

The aim of the calibration method is to find, for each model, the model parameters that match derivative model prices as best as possible with the observed market prices of vanilla derivatives in the market. It then boils down to an optimization problem where we have to find the minimum distance between model prices/model-based implied volatility and market prices/market implied volatility. In this paper, we choose to minimize the distance between model-based implied volatility \({\widehat{\sigma }}^{\,imp}_j\) and market implied volatility \(\sigma ^{imp}_j\) since it leads to more stable results. Finally, we introduce the weighted Root Mean Square Error as the loss function, which is defined as

$$\begin{aligned} \text {RMSE} = \sqrt{ \sum _{j=1}^N \frac{w_j}{N} \left( \sigma ^{imp}_j - {\widehat{\sigma }}^{\,imp}_j\right) ^2 } \,, \end{aligned}$$
(12)

where \(w_j = \frac{1}{\text {Ask}_j - \text {Bid}_j}\), i. e. the weight is the inverse of the bid-ask spread, expressed in terms of implied volatility. This way, we give more importance to observations with small bid-ask spread. Indeed, if an option is liquid, then its bid-ask spread is supposed to be small and its price/implied volatility is more accurate. Mathematically, it boils down to find the optimal set \(\Theta ^* \in {\mathbb {R}}^p\) of the p model parameters such that

$$\begin{aligned} \Theta ^* = \text {arg} \min _{\Theta } {\mathcal {L}}\left( \sigma ^{imp}, {\widehat{\sigma }}^{\, imp}(\Theta )\right) , \end{aligned}$$

where \({\mathcal {L}}\) is the RMSE loss function. Finally, we highlight the fact that the results of the optimization methodology can be highly dependent on the initial parameters. Even for the Heston model, there is no consensus among researchers on whether the objective function for the calibration is convex or not, as explained in [17]. In any case, to overcome this problem of guessing the initial parameters, we use a global optimizerFootnote 2 combined with a local optimizer (the standard nlminb function in R). Indeed, global optimizers can find a solution on their own even without any initial guess while local optimizer need a suitable starting point. Therefore, we use the approach described in [35] which takes the final parameters provided by the global optimizer as initial values for the local optimization. This combination of global and local optimizers allows to refine the final parameters of the calibration and minimize the loss function, as explained and shown in [35]. However, even with this technique, we will see that the rBergomi model still appears to be highly unstable due to the need of Monte-Carlo simulations for this model.

4.1 Market data

We first choose as starting date \(t=0\), the \(6^{th}\) of July 2020. The maturity of vanilla options as well as the risk-free term structure will be defined with respect to this date.

Firstly, the risk-free rates used for pricing purposes are derived from the observed swap term structure. The technique for constructing this swap term structure divides the curve into two term buckets. The short end of the swap term structure is built using interbank deposit rates. We will here consider EURIBOR rates from one day to 12 months. The long end of the risk-free interest rate curve (maturity above 1 year) will be based on the European swap term structure available from the EIOPA for the month of JulyFootnote 3. Table 12 in Appendix provides a summary of the risk-free rates used between 0 and 20 years. The dividend yield is chosen to be 1% but this assumption does not have a significant impact on the results and conclusions of this paper.

Secondly, we have that the price \(S_0\) at time 0 of the CAC40 index is equal to 5028.56 €. As mentioned above, we also need market prices of European options on the CAC40 in order to calibrate later our model since these are the most traded financial instrument in the equity world. Indeed, there are no liquid long-term products traded on the market on which we can calibrate our models for large maturities. The best we can hope so as to be the most-market consistent is therefore to reproduce as best as possible the observed implied volatility surface (and hence prices of European options) and then extrapolate the obtained trend for large maturities. In particular, we will see two methods below for obtaining the values of the forward variance curve beyond the available maturities. From Bloomberg, we thus gather a database that includes 366 European options with the option bid price and the option ask price for different strikes and maturities. The true market price is here considered as the mid value between the bid price and the ask price, which is a common assumption. Using the one-to-one relationship between market prices and implied volatility from the Black and Scholes formula, we can build a database with the bid, ask and mid implied volatility for each strike and maturity. We then apply the following common filters as in Moyaert and Petitjean [34] and select the following options:

  • Out-of-the-money options: Since out-of-the-money options are more actively traded than in-the-money options (as a protection), the quotes on out-of-the-money options are usually more reliable.

  • The bid-ask spread (on the option prices) is less than 5%. Otherwise, we consider that there is no sufficient liquidity to take the option into account.

  • Maturity: We reject options with maturity equal to 0.9 year due to data quality issue for this particular maturity.

We have a final database of 145 OTM options for which we can observe the strike, the maturity, the bid, ask and mid implied volatility. We now turn to the specific calibration method for each model. Indeed, even if the general methodology is the same, the way we price our European options will vary depending on the model.

4.2 Heston and Bates models

We have at disposal for these two models a closed-form expression for their characteristic function, as reviewed in [24]. Then, for a given maturity, the Carr-Madan formula [12] generates prices of European options for a whole range of strikes in one run very fast using Fourier inversion. By calling many times this pricing algorithm on the available maturities in the data, we can thus find the optimal parameters of each model.

  • Heston

A reminder of the standard Heston model is given in [24]. The set of initial parameters obtained from the Simulated Annealing algorithm is given by (0.2898, 2.9176, 0.0966, 1.4703, \(-\)0.7010). Using the local optimizer nlminb in R, we then find a minimal RMSE of 0.11694 with the following final parameter values \(\Theta ^*\):

\(\sigma ^*_0\)

\(\lambda ^*\)

\(\theta ^*\)

\(\nu ^*\)

\(\rho ^*\)

0.28992

2.91760

0.09664

1.47027

-0.7010

We see that we have a high speed of mean reversion \(\lambda ^*\) and a high level of the vol-of-var parameter \(\nu ^*\) which implies large swings in the variance process. We also obtain a large negative correlation between the stock process and the variance process, which is consistent with the leverage effect. Finally, the mean reversion level \(\theta ^*\) towards which the variance will converge is 9.66% (which represents a volatility of 31,09%). It is no wonder to obtain this high level of volatility as well as vol-of-var parameter given the state of the economy due to the COVID-19 outbreak in July 2020. Finally, the Heston model with the parameters obtained above leads to Fig. 1 when compared to market prices of European options on the CAC 40 index.

Fig. 1
figure 1

Comparison of Heston option prices in red crosses vs. market option prices in black dots (CAC 40)

The calibration of the forward-variance specification of the Heston model, as described in [9] and [10], is provided in Appendix. This appendix will allow to compare more thoroughly the Heston model with rough volatility models which are also based on this forward variance curve.

  • Bates

For a reminder of the Bates (or SVJ) model, we refer to [24]. The set of initial parameters obtained from the SA global optimizer is given by (0.27, 0.62, 0.12, 1.24, \(-\)0.66, \(-\)0.10, 0.11, 0.82). Using our local optimizer, we then find a minimal RMSE of 0.08465 with the following final parameter values \(\Theta ^*\):

\(\sigma ^*_0\)

\(\lambda ^*\)

\(\theta ^*\)

\(\nu ^*\)

\(\rho ^*\)

\(\mu _j^*\)

\(\sigma _j^*\)

\(\lambda _j^*\)

0.2589

0.1555

0.2709

1.1593

-0.6640

-0.1004

0.1072

0.9080

Firstly, the RMSE is substantially lower than the one obtained with the Heston model, which means that the Bates model provides a better fit of the observed European option prices. The mean-reversion speed \(\lambda ^*\) and the volatility of variance parameters \(\nu ^*\) are also lower, which implies that the swings in the variance process are less pronounced than in the Heston model. Moreover, we now observe jumps in the price process that are on average negative since \(e^{\mu _j + \sigma ^2_j/2} -1 < 0\). The annual frequency of these jumps is equal to 90.8%. Finally, we observe that the long-run variance \(\theta ^*\) is higher than in the Heston model. This Bates models leads to Fig. 2 when compared to the observed market prices. The fit is clearly better than in the Heston model as said above. More precisely, we see that this improvement comes from the better fit of OTM call options with large maturities when using the Bates model. However, we also observe on Fig. 2 that option prices with the third largest maturity (\(T=0.45\)) are still significantly out-of-market.

Fig. 2
figure 2

Comparison of Bates option prices in red crosses vs. market option prices in black dots (CAC 40)

Note that the presence of jumps makes the calibration of the forward-variance specification of the Bates model really difficult and time-consuming. Indeed, the characteristic function of the Bates model cannot be written as an integral of the forward variance curve as in the Heston and rough Heston case with equation (9) since the Bates model is not an affine forward variance model due to its jump term (cfr [27]). We will therefore not consider this specification in this paper.

4.3 Rough Heston calibration

As explained above, we have a quasi-closed form for the characteristic function of the log-price \(X_t\) under the rough Heston model (8). We can then apply exactly the same calibration technique as for the Heston and Bates models described in the previous section. However, recall that the rough Heston is not Markovian in the current variance state \(v_t\) but Markovian in the (forward) variance swap curve \(\xi _t(u)\). Hence, the rough Heston’s characteristic function depends on this variance swap curve. In practice however, the variance swap curve \(\xi _0(u)\) at time \(t=0\) is not obtained directly from the financial markets. Indeed, it is hard to obtain high-quality variance swap data since variance swaps are OTC contracts as explained in [4]. We thus choose to proxy the value of a variance swap of maturity T by the value of a \(\log \)-contract (also of maturity T), as described for example in Chapter 11 of Gatheral [24]. Therefore, even if variance swaps are OTC contracts, we can obtain variance swap prices directly from the observed European options which are heavily traded on the market. This way, we can consider in this context that variance swap quotes derived from our European options are also reliable. The first method to obtain the initial variance swap curve is therefore to calibrate it based on our dataset of observed option prices for the CAC 40. We hence obtain in Fig. 3 the following market variance swap curve, denoted \(\xi ^{mkt}_0(t)\), between 0 and 1.2 years when interpolated between observed maturities.

Fig. 3
figure 3

Initial market forward variance curve \(\xi ^{mkt}_0(t)\) between 0 and 1.2 years

However, we only have European options with observed maturities until 1.2 years. In order to price long-term claims on the underlying, we need to extrapolate the values of \(\xi ^{mkt}_0(t)\) beyond 1.2 years and hence, we need to consider a term-structure parametrization of \(\xi ^{mkt}_0(t)\). Since the forward variance curve is estimated from the data and is not an output of the model, the average long-time behavior of the variance process is embedded in this forward variance curve parametrization (cfr Sect. 3.3). Indeed, recall that the forward variance curve controls the average value of the variance process \(v_t\) since \(\xi _0(t):= {\mathbb {E}}^{{\mathbb {P}}^*}[v_t | {\mathcal {F}}_0]\). The long-end of this curve will thus have a strong impact on the behavior of the model for large maturities and we hence need to avoid parametrizations generating explosive or unrealistic forward variance curves. Otherwise, the variance process (and hence also the price process) will tend to explode when the maturity increases. The easiest way to achieve this is to impose a constant long-term value for the initial forward variance curve \(\xi ^{mkt}_0(t)\). Therefore, it is reasonable to choose a parametrization with asymptotic line such that

$$\begin{aligned} \lim _{t\rightarrow \infty } \xi ^{mkt}_0(t) = cst \,. \end{aligned}$$

This assumption allows the variance process of the rough Heston to be in a stable regime for large maturities. A classical choice for such parametrization is the Gompertz function

$$\begin{aligned} \xi ^{mkt}_0(t) = z_1 e^{-z_2 \, e^{-z_3 \, t}} \,, \end{aligned}$$
(13)

where \(z_1\) is the asymptote, \(z_2\) sets the displacement along the x-axis (here, the time to maturity) and \(z_3\) sets the growth rate. Fitting the Gompertz function to the observed forward variance curve, we find:

$$\begin{aligned} z_1 = 0.03954 \qquad \, z_2 = -0.7992 \qquad \, z_3 = 0.5672 \end{aligned}$$

Therefore, the asymptotic level of the volatility is given by \(\sqrt{0.03954} = 19.88\%\). Finally, combining the curve obtained in Fig. 3 with the Gompertz fit for \(\tau > 1.2\), we obtain the initial market forward variance curve \(\xi ^{mkt}_0(t)\) in Fig. 4. We can observe that we enter a stable regime as of the 6\(^{th}\) year.

Fig. 4
figure 4

Initial market forward variance curve \(\xi ^{mkt}_0(t)\) until \(t=20\) years

Yet, we are aware that calibrating such Gompertz function based on only 6 different maturity points can seem dubious. Indeed, estimating an asymptotic value for the forward variance from the information carried out by short-maturity option data does not provide a lot of confidence in our results. Moreover, this way of building and extrapolating the forward variance curve does not ensure to avoid static arbitrage. Therefore, we now show how to tackle both of these issues. A better alternative to estimate the forward variance curve is to fit the SSVI parametrization [25] of the observed implied volatility surface and compute the forward variance curve associated with this parametrization. A brief reminder of such arbitrage-free SSVI parametrization is given in Appendix. The advantage of this procedure is to ensure that the estimated forward variance curve is consistent with the absence of calendar spread and butterfly arbitrages. Furthermore, the extrapolation of the forward variance curve does not require to specify a particular function for this curve anymore since its extrapolation beyond 1.2 years in the SSVI parametrization only relies on an arbitrage-free argument, as explained in Appendix. We therefore do not need anymore to determine a long-term value of \(\xi _0(t)\) based solely on short-term maturity data. We obtain this way the following SSVI forward variance curve \(\xi ^{SSVI}_0(t)\) depicted on Fig. 5.

Fig. 5
figure 5

Initial SSVI forward variance curve \(\xi ^{SSVI}_0(t)\) until \(t=80\) years. Black circles are the variance swap values for the available maturities

We see that this forward variance curve \(\xi ^{SSVI}_0(t)\) tends to approximately exhibit the same shape as the market forward variance curve between 0 and 80 years. Moreover, the SSVI forward variance curve appears to be stable and does not explode for large maturities, which also ensures a stable regime and a reasonable average behavior of the rough Heston model on the long-term.

The last alternative we will consider is to use a constant forward variance curve denoted \(\xi ^{cst}_0(t)\). We decide arbitrarily to fix \(\xi ^{cst}_0(t) = \xi ^{mkt}_0(0) = 0.0925 \,,\, \forall t\). Note that this constant level is very close to the long-term mean \(\theta ^*\) obtained in the Heston calibration (\(= 0.0966\)). This parametrization for the forward variance curve will hence allow to compare more thoroughly rough volatility models with the latter. Results for calibration and pricing with \(\xi ^{cst}_0(t)\) are given in Appendix.

The SSVI forward variance curve being arbitrage-free, more reliable and more consistent, we now perform the calibration of the rough Heston model based on \(\xi _0^{SSVI}(t)\). Results for the market and the constant forward variance curves are given in Appendix. Using the Carr-Madan formula [12] and the numerical approximation of Gatheral & Radoičić [28] for the fractional Riccati equation (11), we obtain in a first step the following set of initial parameters from the SA global optimizer: (0.1702, 0.6241, \(-\)0.6724). Using our local optimizer, we then find a minimal RMSE of 0.08457 with the following (same) final parameter values \(\Theta ^*\):

\(H^*\)

\(\nu ^*\)

\(\rho ^*\)

0.1703

0.6241

-0.6725

Based on the RMSE, we then have a slightly better fit than with the Bates model. It is impressive knowing that the rough Heston model in the asymptotic setting \(\lambda \rightarrow 0\) has only 3 effective parameters to estimate while the Bates model has 8 parameters ! We will also analyze in the next section which of the models best fits the behavior of the implied volatility surface (and particularly the term structure of ATM skews). Moreover, it is important to note that the Hurst index \(H^*\) obtained via the risk-neutral calibration method is consistent with the Hurst index obtained via statistical estimation (under the real-world measure) equal to 0.13. Yet, we have a higher \(\nu ^*\) than the one estimated based on the historical time series (\(=0.31)\) due to the market price of risk included in the risk-neutral valuation. Finally, we obtain a leverage effect of the same order as in the Heston and Bates models. Figure 6 depicts the fit of the rough Heston options prices compared with the observed market prices of European options.

Fig. 6
figure 6

Comparison of rough Heston option prices vs. market option prices (CAC 40)

4.4 rBergomi Calibration

The calibration of this model departs significantly from the three models described above. Indeed, no characteristic function is available in (quasi-)closed form. We then rely on Monte-Carlo simulations with the Hybrid scheme of [8], as applied in [33], to calibrate the parameters \(\rho ,\, \eta \) and H of the rBergomi model under \({\mathbb {P}}^*\). More precisely, since the Riemann-Liouville fBm is a special case of Brownian Semistationary (BSS) processes, we follow the papers of Bennedsen et al. [8] and of McCrickerd et al. [33] in order to improve the efficiency of simulations in the rBergomi model. Note that the initial forward variance curve is exactly the same as the one derived in the calibration of the rough Heston model. Hence, we will again use \(\xi ^{SSSI}_0(t)\) in this section.

We find a RMSE of 0.08631 with the initial set of parameters equal to (0.15, 1.67, \(-\)0.92). This leads to the following final values \(\Theta ^*\):

\(H^*\)

\(\eta ^*\)

\(\rho ^*\)

0.1515

1.6803

-0.9275

First, the RMSE is slightly higher compared with the rough Heston and Bates models but lower than the Heston model. We see that the calibrated Hurst exponent \(H^*\) is lower than the one found in the rough Heston model and closer to the historical estimate (\(=0.13\)). Based on a \(\eta ^* = 1.6803\), we find the corresponding \(\nu ^*\) equal to \(\eta ^* \sqrt{2\,H^*} / (2 \,C_H)=0.9969\). Therefore, the volatility of variance parameter \(\nu ^*\) clearly needs to be much higher than the historical \({{\hat{\nu }}}\) to better fit the European option data (again due to the volatility of variance risk premium). The anti-correlation between the stock process and the variance process is stronger than the \(\rho ^*\) derived from the rough Heston. Moreover, it is important to add that these results are quite unstable from simulation to simulation. The hybrid scheme is indeed not yet fast enough to provide a reliable calibration method, at least in our implementation. We used \(m=1,000,000\) and \(\Delta t = t_k - t_{k-1} = T/n\) with \(n=400\) and the optimization was done in more than 5 h with these parameters. We need to increase m and decrease \(\Delta t\) to obtain more precise and stable results but it then requires far more computing time and memory, which is not suitable and tractable in practice. We then obtain Fig. 7 where we can see that the OTM puts are substantially off-market with this model.

Fig. 7
figure 7

Comparison of rBergomi option prices vs. market option prices (CAC 40)

Finally, recall that the SSVI parametrization of the forward variance curve ensures that in the long term we have on average a stable and non-exploding variance process in the rBergomi and rough Heston models (cfr Fig. 5). Nevertheless, we said that the presence of the non-stationary Riemann-Liouville fBm in both models also impacts the long-term behavior of the variance process. More precisely, if we now compare the conditional variance of the variance process \({\mathbb {V}}_{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t]\), we obtain an increasing quantity in u with divergent properties between the two models as shown and explained with Fig. 15 in Appendix and with the corresponding equations 17 and 18. Clearly, the exponential growth of \({\mathbb {V}}_{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t]\) in the rBergomi model does not allow for a reasonable behavior of the variance process in the long term with the calibrated parameters above. The same extreme behavior is obtained using the parameters of the rBergomi model found in the initial paper [4] and in [6]. On the contrary, the conditional variance \({\mathbb {V}}_{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t]\) in the rough Heston exhibits an almost constant behavior for large times to maturity and is hence more appropriate for long-term pricing. This stable regime of the rough Heston model is also obtained using standard parameters such as in [19]. The impact of the quantity \({\mathbb {V}}_{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t]\) on the price of life insurance claims for large maturities u will be analyzed more thoroughly in the last section.

5 Volatility surface fit

We now compare the Heston, Bates, rough Heston and rBergomi models in terms of implied volatility surface fit. More precisely, we will look at the fit of the term structure of ATM volatility skew \(\psi (\tau )\) and confirm that rough volatility models provide a better fit.

From our market option prices database, we can build the term structure of ATM volatility skew \(\psi (\tau )\). First, recall that \(\psi (\tau ):= \left| \frac{\partial }{\partial k}{\hat{\sigma }}^{imp}_{t}(\tau ,k)\right| _{k=0}\) with k being the log-strike and \(\tau = T-t\) the time to maturity. We can approximate it by

$$\begin{aligned} {\widehat{\psi }}(\tau ) = \left| \frac{{\hat{\sigma }}^{imp}_{0}(\tau , k +\delta ) - {\hat{\sigma }}^{imp}_{0}(\tau , k -\delta )}{2\delta }\right| _{k=0} \,, \end{aligned}$$

for small enough \(\delta \) and for each available time to maturity \(\tau \). We then obtain in Fig. 8 the term structure of ATM volatility skew for our data at time \(t=0\).

Fig. 8
figure 8

Red dots are the ATM volatility skews \({\widehat{\psi }}(\tau )\) estimated for the available maturities. The power-law fit is superimposed in blue

The red dots are the estimated ATM skews for each of the available maturities. We clearly see that we obtain a term structure of ATM volatility skew which is consistent with the power-law function. Indeed, the blue line is the power-law fit to the data, obtained by the following linear regression:

$$\begin{aligned} \log \psi (\tau ) = -\alpha \log \tau \,. \end{aligned}$$

We find \({{\hat{\alpha }}} = 0.4546\). From Sect. 3, we know that \(\psi (\tau ) \sim \tau ^{H-1/2}\) holds for the RFSV model. Hence, we have that \({\hat{H}}_{ATM} = 1/2- {{\hat{\alpha }}} = 0.0452\), which is lower than the Hurst exponent estimated with historical data (but again rough !). However, we do not have a lot of different maturities in our option data set, which prevents us from having a consistent estimator \({\hat{H}}_{ATM}\) using this power-law fit. Therefore, we will mainly focus our analysis on the shape of the ATM skew term structure and on whether our different models are able to approximate this blue line rather than on the exact fit.

5.1 Comparison of the fit

We now compare the empirical term structure of ATM volatility skew with the term structures derived from each model. We first analyze the fit of the standard Heston and Bates model thanks to Fig. 16 in Appendix. The blue line is again the power-law fit of the observed ATM skews (with \({{\hat{\alpha }}} = 0.4546)\). The green dots are the ATM skews \({{\widehat{\psi }}}(\tau )\) derived from the different models for the available maturities. For the Heston model, we clearly observe that it does not allow to capture the high values of ATM skews for short maturities and hence, it cannot reproduce the power-law shape of \({\widehat{\psi }}(\tau )\). This was indeed one of the issues of this model that we rose in Sect. 3. For \(T> 0.20\), we can however note that the Heston fit approximates quite well the empirical blue line. Concerning the Bates model, the presence of jumps only provides a slightly better fit of the ATM skews \({{\widehat{\psi }}}(\tau )\) compared with the Heston model. Indeed, the Bates model is still not able to capture the explosion of the ATM skews for very short time to maturity. This is due to the fact that the parameter \(\lambda _j\) controlling the frequency of jumps is only of 0.91 and that the average size of the jumps is also rather low. A higher value of \(\lambda _j\) or of \(|\mu _j |\) would allow to better capture this explosion phenomenon for \(\tau \rightarrow 0\).

We now analyze the fit of the rough Heston and the rBergomi models since we said that rough volatility models should be able to better capture the high curvature of \({{\widehat{\psi }}}(\tau )\) for small time to maturity thanks to the rough behavior of their volatility process. We obtain Fig. 17 in Appendix. We clearly see that the green dots of both figures (i. e. the rough Heston and rBergomi fit for available maturities) exhibit the same shape as the blue line and can therefore be fitted by a power-law function of the form \(\tau ^{H-1/2}\). If we had at disposal even shorter maturities, the rough Heston and rBergomi models would likely better capture the explosion of \({{\widehat{\psi }}}(\tau )\) for \(\tau \rightarrow 0\). We can finally conclude that rough volatility models are more consistent with the observed term structure of ATM volatility skew than standard SV models.

6 Simulation and discretization methodology

In order to price the above equity-linked life insurance contract, we first need to explain how we can discretize and simulate sample paths of the price process and the variance process under our different models.

- Heston

Once the Heston model has been calibrated and the optimal parameters \(\sigma _0, \lambda , \theta , \nu \) and \(\rho \) have been found, we can use a Monte Carlo approach to simulate the sample paths of the Heston model. We first simulate the stock price process and the variance process by generating correlated N(0, 1) random numbers \( \varepsilon _S\) and \( \varepsilon _v\) (with correlation \(\rho \)). The relationship between \( \varepsilon _S\) and \( \varepsilon _v\) can be written as

$$\begin{aligned} \varepsilon _{v,t_i} = \rho \, \, \varepsilon _{S,t_i} + \sqrt{1- \rho ^2} \, \, \varepsilon _{t_i}' \,, \end{aligned}$$

where \( \varepsilon _{S,t_i}\) and \( \varepsilon _{t_i}'\) are independent N(0, 1) random variables. Using an Euler discretization scheme for the price process and the variance process, we have

$$\begin{aligned} S_{t_{i+1}}&= S_{t_i} \left( 1+r\Delta t + \sqrt{v_{t_i}} \sqrt{ \Delta t} \, \varepsilon _{S,t_i} \right) , \end{aligned}$$
(14)
$$\begin{aligned} v_{t_{i+1}}&= v_{t_i} + \lambda (\eta -v_{t_i}) \Delta t + \nu \sqrt{v_{t_i}} \sqrt{\Delta t} \, \varepsilon _{v,t_i} \, , \end{aligned}$$
(15)

for \(i=0,...,n-1\), with \(t_0 = 0\) and \(t_n = T\), where n is the number of time steps and \(\Delta t = T/n\), for a given maturity T. We then simulate m sample paths of this stock price process and of this variance process. Now that we have the full path of \((S_{t_i})_{i=0,...,n}\) for each m, we can price path-dependent options, such as the cliquet option (1). In practice, the variance can become negative under a simulation because of the discretization. We then use the full truncation scheme of Lord et. al [31] to tackle this problem since it overcomes other Euler fixes (absorption, relfection, etc.) and quasi-second order schemes (Milstein scheme, Ninomiya and Victoir scheme [36], etc.) in terms of positive bias. The full truncation scheme therefore better reproduces the true properties of the CIR process. This simulation methodology gives us the following Fig. 9, depicting one sample path of the variance process under the standard Heston model over 10 years. We can clearly see the effect of the high volatility of variance (vol-of-var) parameter \(\nu \) with large swings in the variance process.

Fig. 9
figure 9

One sample path of the variance process under the standard Heston model

- Bates

We again use the Euler scheme with a full truncation fix for the variance process as in the Heston model. However, the discretization scheme of the price process is replaced by

$$\begin{aligned} S_{t_{i+1}} = S_{t_i} \left( 1+r\Delta t + \sqrt{v_{t_i}} \sqrt{ \Delta t} \, \varepsilon _{S,t_i} + (Y_{i}-1) \, \Delta N_{t_i}\right) , \end{aligned}$$
(16)

where \(Y_{i} \sim \text {Log}N(\mu _j, \sigma ^2_j)\) and \(\Delta N_{t_i} \sim Poi(\lambda \Delta t)\).


Rough Heston

The discretized price process is exactly the same as in the Heston model with (14). However, due to the non-Markovian nature of the variance process under the rough Heston, an Euler discretization scheme for simulating \(v_t\) is very slow and does not allow to use the full truncation fix in order to deal with negative variance. A more time-efficient simulation scheme for the rough Heston can be found instead in [1] with a lifted version of the Heston model. This lifted Heston model appears to be a multi-factor approximation of the rough Heston model built as an infinite superposition of square-root (CIR) processes with the same dynamic but mean reverting at different speeds (cfr Sect. 3.3). Such infinite-dimensional Markovian representation of the limiting rough variance process is an appealing trade-off between flexibility and tractability, as explained in [1]. In practice, only few factors are sufficient which drastically speeds up the simulation procedure compared to an Euler discretization scheme. Moreover, this representation in terms of square-root processes allows to use the full truncation fix of [31] described above in order to avoid negative variance and to better reproduce the true properties of the variance process. For simulating our rough Heston model, we take the suggested values of the lifted Heston model defined in [1]: \({\widetilde{n}}=20\) factors and \(r_{20} = 2.5\). The function \(g^{{\tilde{n}}}_0(t)\) of this lifted Heston model is chosen so as to match the forward variance curve \(\xi _0(t)\) and the speed of mean-reversion \(\lambda \) is again set to 0 since mean reversions at different speeds are inherent in the lifted Heston model. This parametrization allows to recover the rough Heston model (8) with its 3 effective parameters.

One sample path of the variance process over 10 years under our calibrated rough Heston model with SSVI forward variance curve is plotted in Fig. 10. We clearly see the rough behavior of the variance process compared to the sample paths of the Bates and Heston models. We also have that the vol-of-var parameter \(\nu \) is quite high, leading to large movements in the variance process.

Fig. 10
figure 10

One sample path of the variance process under the rough Heston model


rBergomi

The price process is again given by (14). However, the variance process is now simply given by

$$\begin{aligned} v_{t_i} = \xi _0(t_i) \, {\mathcal {E}}\left( \eta \, \tilde{W}^{*}_{t_i} \right) , \end{aligned}$$

where we use the SSVI forward variance curve for \(\xi _0(t_i)\,(={\mathbb {E}}^{{\mathbb {P}}^*}[v_{t_i}])\) and the hybrid scheme of [33] and [8] for the simulation of \(\tilde{W}^{*}_{t_i} \,\), with \(i=0,...,n\). The way the variance process is defined in (5) under the rBergomi model prevents us by definition from having negative variance. You can also find in Fig. 11, one simulated sample path of the rBergomi variance process using this Hybrid scheme. Due to the huge vol-of-var parameter \(\eta \), the variance paths tend to explode and exceed 100%. Generated sample paths with the rBergomi are therefore not realistic. Furthermore, if we compute the sample variance \({\mathbb {V}}_{{\mathbb {P}}^*}[v_t]\), we would obtain the exact same shape as in Fig. 15, which confirms the extreme variability of the rBergomi variance process for large maturities. The roughness of the variance process is again clearly visible in Fig. 11.

Fig. 11
figure 11

One sample path of the variance process under the rBergomi model

7 Conclusion of the fit

From the analyses of the implied volatility surface, we can conclude that rough volatility models with the rough Heston and the rBergomi clearly outperform standard SV models in terms of goodness of fit of the observed term structure of ATM volatility skew. Therefore, with the lowest RMSE (0.0846) and a good fit of the implied volatility surface, we conclude that the rough Heston model is the best of the considered models at reproducing the observed risk-neutral data. Secondly, since the rough Heston exhibits a rough variance process (with a H close to the historical estimate), this model can also better reproduce at any time scales numerous statistical properties of the historical volatility process. Therefore, the fact that this model exhibits rough volatility and hence better reproduces the empirical properties of the historical volatility will lead to a better price of insurance contracts at large time scales. More precisely, the parameter H controlling the roughness/memory of the variance process has a direct impact on long-term life insurance contracts since the generated sample paths of the fund’s underlying price process \(S_t\) will strongly vary in function of this index H (as shown below in Sect. 8 and Appendix). In fact, the larger the maturity, the more important will be the effect of the roughness of the variance process on the fair value of such contracts. Moreover, as shown in Sect. 4.3, we can build at large time scales a stable regime for the variance process in the rough Heston, which ensures a reasonable long-term behavior of the model. Combined with the fact that the SSVI forward variance curve \(\xi ^{SSVI}_0(t)\) is consistent with the absence of arbitrage at any time scales, we will obtain in the next section a very reliable and consistent fair value of long-term life insurance contracts. Finally, the rough Heston is also particularly tractable (only three parameters to calibrate and a characteristic function available in quasi-closed form) which is very important in practice for time-efficient calibration and option pricing.

Concerning the rBergomi model, even with calibrated parameters, it provides a slightly higher RMSE than the rough Heston model and hence a less good fit of the observed option prices (at least in our implementation with the SSVI forward variance curve). Secondly, the rBergomi does not provide realistic variance sample paths in the long run due to the huge level of the vol-of-var parameter and an exponentially increasing quantity \({\mathbb {V}}^{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t]\,\) (leading to extreme values for large u). Furthermore, this model is fairly slow in terms of running time, which appears problematic for calibration and option pricing. To quote El Euch et al. [19]: “Even with the introduction of the efficient hybrid scheme [of Bennedsen et al. [8]], practical implementation [of the rBergomi model\(\,]\) has proved to be difficult.” One solution to improve the stability and efficiency of this pricing model is to use asymptotics such as in Fukasawa [23], in Forde and Zhang [22] or in Bayer et al. [5]. Finally, Bayer et al. [6] propose a neural network approach to calibrate more accurately the rBergomi and to approximate the implied volatility surface. We will however not deepen these methods in this paper.

8 Life insurance contract valuation

Although all models are well calibrated and hence vanilla options have approximately the same prices under all models, exotic option prices can differ dramatically. It is important to point out that vanilla options determine the marginal distribution (at maturity T of the option), not the process. Indeed, the underlying fine-grain properties of the process have an important impact on path-dependent option prices. We now highlight the impact of exotic price ranges between our four calibrated models by valuating our endowment life insurance contract described in Sect. 2. This way, we also emphasize the impact of rough volatility models on long-term insurance contracts and compare more thoroughly the rBergomi model (5) and the rough Heston model (8) in terms of long-term properties. For further reference on equity-linked endowment valuation, we refer to Bacinello et al. [3].

8.1 Valuation

By the first fundamental theorem of asset pricing, the fair value at time \(t=0\) of the cash-flow \(F_t\) in (1) is given by its discounted expected value under a risk-neutral measure \({\mathbb {P}}^*\). However, this cash flow also depends on the path of the Poisson process \(N_t\) which is not known at time \(t=0\). Therefore, we take the following expectation conditionally to the filtration \({\mathcal {G}}_0\) to obtain the fair value of our equity-linked endowment contract

$$\begin{aligned} FV_0 = {\mathbb {E}} \left. \left[ (1-N_T) \, e^{-rT} \, {\mathbb {E}}^{{\mathbb {P}}^*}[F_T \, | \, {\mathcal {F}}_0] + \int _0^T e^{-rt} \, {\mathbb {E}}^{{\mathbb {P}}^*}[F_t\, | \, {\mathcal {F}}_0] \, dN_t \, \right| \, {\mathcal {G}}_0 \right] . \end{aligned}$$

The processes \(N_t\) and \(S_t\) being independent, we obtain that

$$\begin{aligned} FV_0 =\, _{T}p_{x} \, e^{-rT} \, {\mathbb {E}}^{{\mathbb {P}}^*}[F_T] + \int _0^T e^{-rt} \, {\mathbb {E}}^{{\mathbb {P}}^*}[F_t] \, _{t}p_{x} \, \mu _x(t) \, dt \,. \end{aligned}$$

We now want to price this insurance contract with the four models we calibrated above (Heston, Bates, rBergomi and rough Heston). We can then use a Monte-Carlo approach to simulate their price process \(S_t\) (and variance process \(v_t\)) and take the average of the paid benefit under each simulation \(j=1,...m\) in order to derive the expectation under \({\mathbb {P}}^*\), i. e.:

$$\begin{aligned} FV_0 = \frac{1}{m} \sum _{j=1}^m \left( e^{-r t_n} F_{t_n, j} \, \mathbbm {1}\left\{ \sum _{i=0}^n \Delta N_{t_i,j}=0 \right\} + \sum _{i=0}^n e^{-rt_i} \Delta N_{t_i,j} \, F_{t_i, j} \, \mathbbm {1}\left\{ \sum _{k=0}^{i-1} \Delta N_{t_k,j}=0 \right\} \right) , \end{aligned}$$

where the underlying price process in \(F_t\) is defined by the risk-neutral pricing equation of each model. The discretization of their price process and variance process is given in the previous Sect. 6. Moreover, the use of a Poisson process above makes it easy to add the mortality effect in the simulations to compute \(FV_0\). Indeed, for each simulation \(j \in m\), we generate \(n = T/ \Delta t = 5/0.001 = 5000\) Poisson random variables \(\Delta N_{t_i,j}\) with parameter \(\lambda = \mu _x(t_i) \, \Delta t\) for \(i=0,...,n\). For the first i where \(\Delta N_{t_i,j}=1\), we compute a death benefit \(F_{t_i,j}\) and if \(\Delta N_{t_i,j}=0\) for all i, we compute a survival benefit \(F_{t_n,j}\).

We consider the following contracts:

  • A female policyholder aged \(x=50\) or \(x=65\) at time \(t=0\).

  • We use the Belgian regulatory life table FR for the Makeham force of mortality \(\mu _x(t)\).

  • The initial amount \(F_0\) invested in the fund equals 10 000 €.

  • The policyholder pays a unique premium P at time \(t=0\). Note that we could have imposed in the following that \(FV_0 = P = F_0 = 10\ 000\) € so as to make the contract fair and then determine the corresponding \(\kappa _g\), \(\kappa _m\) or \(\pi \) in (1) that verifies this condition. However, since we aim to compare the contract’s value \(FV_0\) between models, we prefer to fix arbitrary values for \(\kappa _g\), \(\kappa _m\) and \(\pi \) and then derive the corresponding value of the contract.

  • We then assume the parameters in Table 1 when valuating the contract.

Table 1 Parameters of the equity-linked endowments

We obtain in Table 2 the fair values of the endowments with our different modelsFootnote 4 using Monte-Carlo simulations with \(m=200 \, 000\) and working with the SSVI forward variance curve \(\xi ^{SSVI}_0(t)\) for the rBergomi and rough Heston models:

Table 2 Fair value of the endowment contracts with \(\xi ^{SSVI}_0(t)\)

We can compare Table 2 with the following Table 3 computed using the constant forward variance curve \(\xi ^{cst}_0(t) = 0.0925\).

Table 3 Fair value of the endowment contracts with \(\xi ^{cst}_0(t)\)

We first see that the Heston model leads to a higher price than the Bates model in both tables. Indeed, even if the long-term variance is higher in the Bates model, the negative average jumps combined with a lower initial variance \(v_0\) and a lower speed of mean reversion \(\lambda \) lead to a lower price in the Bates model. Secondly, we have that rough-type models have a lower price than standard SV models. This can be partly explained in Table 2 by the long-term variance (equal on average in rough models to \(\xi ^{SSVI}_0(t)\)), which is lower than the long-term mean \(\theta ^*\) in the Heston and Bates model as depicted on Fig. 5. However, if we analyze Table 3, we still have lower prices for rough volatility models even if the average long-term variance is this time comparable. This remaining difference can thus be explained by the very long reversion time scale in rough volatility models (\(\lambda \rightarrow 0\)) and by the rough properties of the variance process (i.e. the parameter H). Moreover, we can see that the rBergomi model tends to have a higher fair value than the rough Heston due to its lower Hurst index H and the extreme values of \({\mathbb {V}}^{{\mathbb {P}}^*}[v_u|{\mathcal {F}}_t]\) when u is large. This is especially true at long time scales and when the participation rate \(\pi \) is high. Finally, comparing the different contracts, we clearly see that the larger the maturity, the higher is the difference between rough volatility models and standard SV models. This will be confirmed with Table 5 below. Moreover, when the participation rate \(\pi \) decreases, the fair value of the contract is logically lower since the policyholder earn a lower portion of the yearly returns. This lower \(\pi \) also lead to less pronounced differences between rough and standard SV models since their divergent sample paths properties play a lesser role. We observe the same effect when the maximal yearly return \(\kappa _m\) increases, with higher differences in fair value between rough and standard SV models due to their divergent sample paths.

Since we said that the rough Heston model is the most consistent with risk-neutral data while enjoying the best long-term properties, we retain the fourth column of Table 2 as being the contracts’ prices which are the most market-consistent. We confirm these effects with the two following tables, displaying the fair value \(FV_0\) in function of the minimum yearly guaranteed rate \(k_g\) and the maturity of the contract T. These two tables are computed using the SSVI forward variance curve but similar tables derived from \(\xi ^{cst}_0(t)\) and from \(\xi ^{mkt}_0(t)\) can be found in Appendix with Tables 6 and 7 and 8 and 9, respectively. These tables in Appendix are also consistent with the following analyses.

Table 4 Fair value \(FV_0\) for various \(k_g\) with maturity \(T=10\) and with \(\xi ^{SSVI}_0(t)\)

We observe the highest differences between rough and standard SV models when \(k_g\) tends to be small. Indeed, in this case, the minimum yearly guarantee is exercised less often and therefore the yearly returns, which are highly influenced by the sample paths, play a more important role. This thus leads to stronger differences in fair value between models with divergent sample path properties.

Table 5 Fair value \(FV_0\) for various maturities T with \(k_g = 1\%\) and with \(\xi ^{SSVI}_0(t)\)

The differences in fair value between models are also the most exacerbated when the maturity of the contract is large. The impact of the sample paths properties of each model is indeed stronger at longer time scales. In particular, the differences in long-term behavior of the variance process are exacerbated when the maturity is large, as explained in the previous sections and in Appendix. Combined with the fact that rough volatility models better reproduce the statistical properties of the historical volatility thanks to their memory properties, this confirms the importance and the added value of using rough volatility models at large time scales (and especially the rough Heston). Particularly, the importance of the Hurst index (controlling the roughness and memory of the variance process) is emphasized in Appendix when comparing the fair values obtained with the different forward variance curves and with the forward-variance specification of the Heston model. We indeed show that this Hurst index has a huge impact on the contracts’ fair values, especially for large maturities. We finally want to highlight that the numbers given in the tables above can slightly change for each run of simulations but are quite stable with our chosen m, equal to \(200\,000\).

9 Conclusion

Rough fractional stochastic volatility models are excellent candidates for reproducing important stylized facts of the past volatility time series and for providing a consistent implied volatility surface with the observed one. In this paper, we explore more closely two tractable implementations of the RFSV model, the rBergomi and the rough Heston models. We show how to expand these two rough volatility models at larger time scales, we analyze their implications for the pricing of long-term life insurance claims and we explain why they provide a more accurate fair value of such long-term contracts. In particular, we study the long-term properties of these two rough volatility models and compare them with two standard SV models (the Heston and Bates models).

Among these four models, we conclude that the rough Heston is the most consistent with historical and risk-neutral data while enjoying the best long-term properties. Indeed, this rough Heston model with \(H \approx 0.15\) tends to outperform standard SV models in terms of calibration with a better fit of the implied volatility surface and allows to better reproduce the statistical properties of the observed historical volatility, especially at long time scales. We also retain the rough Heston model as a highly tractable implementation of rough volatility models with only three parameters to estimate and a characteristic function available in quasi-closed form, which is not the case of the rBergomi model. Combined with the Carr-Madan formula, the calibration and pricing of European options with the rough Heston model is hence extremely fast and accurate. Moreover, we manage to build at large time scales a stable regime for the variance process in the rough Heston based on a SSVI parametrization of the forward variance curve. This way, we obtain a model with a reasonable long-term behavior for its variance process while being consistent with the absence of static arbitrage. This rough Heston is thus able to overcome the extreme and exponentially increasing variability of the rBergomi variance process. Using the pricing methodology introduced in this paper, a more accurate fair value of long-term equity-linked life insurance contracts is then obtained in comparison with the rBergomi and standard SV models. This is especially true at large maturity where the good properties of the rough Heston model have a stronger impact. More precisely, we obtain with the rough Heston model a fair value significantly lower than in standard SV models due to the long-reversion time scale and the rough property of volatility. This highlights the added-value of using this rough Heston model when pricing long-term life insurance contracts.

As future work, we should re-calibrate all the models considered in this paper on other market data of European options. Choosing an other index/stock with a larger set of available maturities and at another point in time should help us to confirm and validate the results derived with our methodology. Moreover, more advanced mortality models could be considered such as in [29] or in [11].