1 Introduction

Consider the problem of hedging contingent claims written on cryptocurrencies (CC). The dynamics of this new expanding market is characterized by high volatility, as is evident from the Cryptocurrency volatility index \(\texttt {VCRIX}\) (see Kim et al., 2021) and large price jumps (Scaillet et al., 2018). We approach hedging options written on \(\texttt {Bitcoin}\) (BTC) with models from the class of affine jump diffusion models and infinite activity Lévy processes. Similarly to Branger et al. (2012), we assess the hedge performance of implausible, yet complete as well as plausible, but incomplete asset pricing models. Since April 2019, contingent claims written on BTC and \(\texttt {Ethereum}\) (ETH) have been actively traded on \(\texttt {Deribit}\) (\(\texttt {www.deribit.com}\)). The Chicago Merchantile Exchange (CME) introduced options on BTC futures in January 2020. In contrast to traditional asset classes such as equity or fixed income, the market for CC options has only recently emerged and is still gaining liquidity, see e.g. (Trimborn & Härdle, 2018) for an early description of the market. Cryptocurrency markets are known to exhibit high volatility and frequent jumps, see e.g. market crashes on 12 March 2020, 19 May 2021, 17 June 2022, posing challenges to valuation and risk management. This erratic price behaviour may be attributed to the lack of a fundamental value, see e.g. Biais et al. (2022), Athey et al. (2016) and Makarov and Schoar (2020).

As the option market is still immature and illiquid, in the sense that quotes for many specific strikes or maturities are not directly observable or may be stale, we derive options prices by interpolating prices from stochastic volatility inspired (SVI) parametrized implied volatility (IV) surfaces (Gatheral, 2004).

Aside from conducting a historical backtest, and in order to capture a variety of market dynamics, the BTC market is imitated with two different Monte Carlo simulation approaches. In a parametric price path generation approach, we assume that the data-generating process is described by a SVCJ model. The second scenario generation method is based on GARCH-filtered Kernel-density estimation (GARCH-KDE), which can be thought of as a smooth historical simulation taking into account the historical volatility dynamics, and which is therefore close to actual market dynamics.

Under each of the two different market simulation methods, options are hedged by a hedger employing models of different complexity. This deliberately includes models that are “misspecified” in the sense that relevant risk factors may be omitted (Branger et al., 2012). On the other hand, those models are possibly parsimonious enough to yield a complete market. It is known that, when comparing the hedge performance to a more realistic, albeit incomplete market model, the simpler model may outperform the complex model (Detering & Packham, 2015). In our context, a model is “misspecified” if it contains fewer or different parameters than the SVCJ model. Specifically, as models included in the class of SVCJ models, we consider the Black and Scholes (1973) (BS) model, the Merton (1976) jump-diffusion model (JD), the Heston (1993) stochastic volatility model (SV), the stochastic volatility with jumps model (SVJ) (Bates, 1996) and the SVCJ model itself. Infinite activity Lévy hedge models under consideration are the Variance-Gamma (VG) model (Madan et al., 1998) and the CGMY model (Carr et al., 2002). Options are hedged dynamically with the following hedge strategies: Delta (\(\Delta\)), Delta–Gamma (\(\Delta\)\(\Gamma\)), Delta–Vega (\(\Delta\)\(V\)) and minimum variance strategies (MV).

To gain further insights, we separate the full time period, ranging from April 2019 to June 2020, into 3 different market scenarios with a bullish market behavior, calm circumstances with low volatility and a stressed scenario during the SARS-COV-2 crisis. In addition to evaluating the hedge performance, we aim to identify BTC risk-drivers such as jumps. This contributes to the understanding of what actually drives fluctuations on this market. A historical backtest of the hedge performance, in the spirit of Detering and Packham (2016) and Ting and Ewald (2013), completes and confirms the findings of the SVCJ- and GARCH-KDE approaches.

A number of papers investigate the still young market of CC options. Trimborn and Härdle (2018) describe the CC market dynamics via the cryptocurrency index \(\texttt {CRIX}\). Madan et al. (2019) price BTC options and calibrate parameters for a number of option pricing models, including the Black-Scholes, stochastic volatility and infinite activity models. Hou et al. (2020) price \(\texttt {CRIX}\) options under the assumption that the dynamics of the underlying are driven by the (SVCJ) model introduced in Duffie et al. (2000) and Eraker et al. (2003). The literature on the aspects of risk management in CC markets is scarce but growing. Dyhrberg (2016), Bouri et al. (2017) and Selmi et al. (2018) investigate the role of BTC as a hedge instrument on traditional markets. Sebastião and Godinho (2020) and Alexander et al. (2021) investigate the hedge effectiveness of BTC futures, while Nekhili and Sultan (2021) hedge BTC risk with conventional assets. To the best of our knowledge, hedging of CC options has not yet been investigated in this depth and detail. The aspect of risk management and the understanding of the dynamics of CCs is therefore a central contribution of this study.

The remainder of the paper is structured as follows: Sect. 2 describes the methodology, decomposed into market scenario generation, option valuation and hedge routine. The hedge routine presents the hedge models and explains the model parameter calibration and hedge strategy choices. In Sect. 3, we present and evaluate the results of the hedge routine and in Sect. 4, we conclude. The code is available as quantlets, accessible through under the name hedging_cc (https://github.com/Quantlet/hedging_cc).

2 Methodology

In this section, we introduce the methodology, comprising market scenario generation, option valuation and hedging. We take an option seller’s perspective and sell 1- and 3-months (M) contingent claims. The choice is justified by the total trading volume of BTC options. Nearly \(80 \%\) of the trading volume consists of options expiring in at most 1 month. Almost all remaining options expire in 3 months or less (Alexander & Imeraj, 2022).

2.1 Synthetic market data generation

We describe how to generate synthetic market data, which serves as the input for the main analysis. The principal goal of synthetic scenario generation is to imitate the BTC market behavior, especially retaining its statistical properties, with the added flexibility of Monte Carlo simulation to create a large amount of plausible scenarios. In addition, we consider two simulation methods capturing different statistical properties. They represent a trade-off between a parametric model with valuable and traceable risk-factor information and a flexible semi-parametric closer-to-actual-market approach. The parametric model is simulated under the risk neutral measure \(\mathbb {Q}\) with a forward looking perspective. The semi-parametric simulation relates to the past market behavior performed under the physical measure \(\mathbb {P}\). The time frame under consideration is from 1st April 2019 to 30th June 2020. The BTC market behavior in this time period is time-varying. This makes it convenient to segregate the time frame into three disjoint market segments from April to September 2019 (bullish), October 2019 to February 2020 (calm) and March to June 2020 (covid), respectively. Bearing in mind that we are going to hedge 1-month and 3-month options, the minimal segment length is chosen to exceed three months. A graphical representation of the BTC closing price trajectory is illustrated in Fig. 1 with the corresponding summary statistics in Table 1. The first interval is labeled as the bullish segment, because, to a great extent, the market behaves upward-trending. The second period is labeled as the calm period. With an overall standard deviation of \(\widehat{\sigma}=756.55\), price movements are more stagnant compared to the bullish segment.

Fig. 1
figure 1

BTC closing price from 1st April 2019 to 30th June 2020, where the blue trajectory represents the bullish market behavior, the black path the calm period and red path the stressed scenario during the Corona crisis (Color figure online)

Table 1 Summary statistics of the bullish, calm and covid market log returns \(R_t\)

The last segment is the Corona crisis or stressed scenario, where financial markets, especially CC markets, experienced high volatility. A notable mention is the behavior of BTC on 12th March 2020, where its price dropped by nearly \(50 \%\).

We now turn to a formal mathematical framework. Let the BTC market be a continuous-time, frictionless financial market. Borrowing and short-selling are permitted. The constant risk-free interest rate \(r \ge 0\) and the time horizon \(T < \infty\) are fixed. On a filtered probability space \(\left( \Omega ,F , \left( F_ { t } \right) _ { t \in [ 0 , T ] } ,\mathbb P \right)\), the asset price process and the risk-free asset are defined by adapted semimartingales \((S_{t})_{t \ge 0}\) and \(\left( B_{t} \right) _{t \ge 0}\), where \(B_{0}=1\) and \(B_{t} = e^{rt}, \ t \ge 0\), respectively. The filtration is assumed to satisfy the usual conditions (e.g. Protter, 2005). To ensure the absence of arbitrage, we assume the existence of a risk-neutral measure \(\mathbb {Q}\). We consider an option writer’s perspective and short a European call option. The price of the option with strike K and time-to-maturity (TTM) \(\tau = T-t\) at time \(t < T\) is \(C(t, \tau , K)\). For multiple-instrument hedges, we further assume the existence of a liquidly traded call option \(C_2 (t,\tau ,K_2)\), \(K_2\not =K\), suitable for hedging. The dynamic, self-finance hedging strategy \(\xi =( \xi ^{0}, \xi ^{1} )=(\xi ^{0}_{t}, \xi ^{1}_{t} ))_{0 \le t \le T}\) is an \(F_{t}\)-predictable process, where \(\xi ^{0}_{t}\) and \(\xi ^{1}_{t}\) denote the amounts in the risk-free security and the asset, respectively. The resulting portfolio process \(\Pi =(\Pi _t)_{t\ge 0}\) is admissible and self-financing. The evolution of the value process \(\Pi\) is reviewed in detail in Appendices A.1, A.2 and A.3.

For the Monte Carlo simulation, the finite time horizon is partitioned into daily steps of size \(\delta t = \frac{1}{365}\). The number of trajectories of the asset price process is set to \(n=100,000\).

2.1.1 SVCJ model

The parametric scenario generation approach assumes that the dynamics of the asset price process \((S_t)_{t\ge 0}\) and the volatility process \((V_t)_{t\ge 0}\) are described by the SVCJ model introduced in Duffie et al. (2000). This particular choice is motivated by the methodology in Hou et al. (2020), where the model is applied to pricing options on the \(\texttt{CRIX}\) . A high degree of free parameters enables to model various market dynamics. More specifically, the risk-neutral model dynamics are (Broadie & Kaya, 2005)

$$\begin{aligned} \begin{aligned} d S_t&=(r-\lambda \bar{\mu }) S_t d t+\sqrt{V_t} S_t\left[ \rho d W_t^{(1)}+\sqrt{1-\rho ^2} d W_t^{(2)}\right] + Z^{s}_{t} d N_t \\ d V_t&=\kappa \left( \theta -V_t\right) d t+\sigma ^{v} \sqrt{V_t} d W_t^{(1)}+Z^{v }_{t} d N_t \end{aligned} \end{aligned}$$
(1)

where \(W_{t}^{(1)}, W_{t}^{(2)}\) are two independent standard Wiener processes. The scale of \(V_{t}\) is given by \(\sigma ^{v}\), the mean reversion speed is denoted by \(\kappa\) and \(\theta\) is the mean reversion level. The model allows for simultaneous arrivals of jumps in returns and jumps in volatility governed by the Poisson process \((N_t)_{t\ge 0}\) with constant intensity \(\lambda\). The jump sizes in volatility \(Z^{v}_{t}\) are exponentially distributed \(Z^{v}_{t} \sim \varepsilon \left( \mu ^{v}\right)\) and the jump sizes of the asset price are conditionally normally distributed

$$\begin{aligned} \Xi {\mathop {=}\limits ^{def}} Z^{s}_{t} \vert Z^{v}_{t} \sim N\left( \overline{\mu }^{s} + \rho ^{j} Z^{v}_{t}, \left( \sigma ^{s}\right) ^2 \right) \end{aligned}$$
(2)

where \(\overline{\mu^{s}}\) is the conditional mean jump size in the asset price given by

$$\begin{aligned} \overline{\mu^{s}}=\frac{\exp \left\{ \mu ^{s}+\frac{\left( \sigma ^{s}\right) ^{2}}{2}\right\} }{1-\rho ^{j} \mu ^{v}}-1 \end{aligned}$$

In detail, \(\sigma ^{s}\) denotes the jump size standard deviation. The unconditional mean is denoted by \(\mu ^{s}\), which is related to the jump compensator \(\lambda {\mu }^{*}\) by \(\mu ^s=\log \left[ (1+{\mu }^{*})\left( 1-\rho ^{j} \mu ^{v}\right) \right] -\frac{1}{2} \left( \sigma ^{s}\right) ^2\). The correlation parameter \(\rho ^j\) governs the correlations between jump sizes. From an empirical point of view, in most markets, jumps occur seldomly and are difficult to detect, which, as a consequence, makes the calibration of \(\rho ^{j}\) unreliable (Broadie et al., 2007). Chernov et al. (2003), Broadie et al. (2007), Eraker et al. (2003), Eraker (2004) and Branger et al. (2009) therefore recommend to set \(\rho ^{j}=0\). In fact, this finding extends to the BTC market, see Hou et al. (2020), who find that "the jump correlation \(\rho ^j\) is negative but statistically insignificant...". Our main results are therefore calculated assuming \(\rho ^j=0\). Nonetheless, we add some insights into calibrating \(\rho ^j\) and hedging with the calibrated parameter in Sect. 3.5.1. Note that despite a correlation of zero, the SVCJ model does not reduce to an SVJ model, as it still features jumps in the volatility.

The resulting paths are simulated according to the Euler-Maruyama discretization of (1) suggested in Belaygorod (2005). The corresponding model parameters are re-calibrated on a daily basis according to the methodology described in Sect. 2.3.2.

2.1.2 GARCH-KDE approach

Compared to the empirical price process, the SVCJ may appear quite restrictive: aside from being an incomplete market model, the price dynamics are limited by the specification of the stochastic volatility component as well as the jump intensity and size. The semi-parametric method loosens the assumptions by generating scenarios using GARCH-filtered kernel density estimation (GARCH-KDE) as in e.g. McNeil and Frey (2000). Let \((R_{t})_{t \ge 0}\) denote BTC log-returns and \((\widehat{\sigma }_t)_{t \ge 0}\) the estimated GARCH(1,1) volatility, (Bollerslev, 1986). The kernel density estimation is performed on "de-garched" residuals

$$\begin{aligned} \widehat{z}_t = \displaystyle \frac{R_t}{\widehat{\sigma }_t}. \end{aligned}$$
(3)

The rationale is to capture the time-variation of volatility by the GARCH filter and perform kernel density estimation on standardised residuals. The estimated density function is

$$\begin{aligned} \widehat{f}^{h}(z)=\frac{1}{n h} \sum _{t=1}^{n} \mathbf{K}^{*} \left( \frac{\widehat{z}_{t}-z}{h}\right) , \end{aligned}$$
(4)

where \(\mathbf {K}^{*} \left( \cdot \right)\) denotes the Gaussian Kernel. The resulting generated paths are \((S(0,i),\ldots , S(T,i))\), \(i=1,\ldots , n\), withFootnote 1

$$\begin{aligned} S(t,i) = \displaystyle S(0) \exp \left[ \sum _{k=1}^t \widehat{\sigma }_k \widehat{z}_k\right] , \quad t=0,\ldots , T. \end{aligned}$$
(5)

Throughout the paper, the parametric and the semi-parametric method are referred to as the SVCJ and GARCH-KDE framework, respectively.

2.2 Valuation

This section describes how option prices are derived from the implied volatility surface. As the market for CC claims, during the time period of our dataset, is still relatively immature with only a limited number of actively traded options on \(\texttt {Deribit}\) and the Chicago Mercantile Exchange, arbitrage-free option prices are derived through the stochastic volatility inspired (SVI) parameterization of the volatility surface of Gatheral and Jacquier (2014). Let \(\sigma ^{\mathrm {BS}}(k, \tau )\) denote the BS IV with log-moneyness \(k=\ln \left( K /S_0 \right)\) and total implied variance \(w(k, \tau )=\{\sigma ^{\mathrm {BS}}(k, \tau )\}^{2} \tau\). For a fixed \(\tau\), the raw SVI parameterization of a total implied variance smile, as initially presented in Gatheral (2004), is

$$\begin{aligned} w(k, \chi _{R})=a+b\left\{ \rho ^{SVI}(k-m)+\sqrt{(k-m)^{2}+\left( \sigma ^{SVI}\right) ^{2}}\right\} . \end{aligned}$$
(6)

In the parameter set \(\chi _{R}=\{a, b, \rho ^{SVI}, m, \sigma ^{SVI}\}\), \(a \in \mathbb {R}\) governs the general level of variance, \(b \ge 0\) regulates the slopes of the wings, \(\rho ^{SVI} \in [-1, 1]\) controls the skew, \(m \in \mathbb {R}\) enables horizontal smile shifts and \(\sigma ^{SVI}>0\) is the ATM curvature of the smile (Gatheral & Jacquier, 2014). For each maturity, the smile is recalibrated daily. The implied volatility is obtained by a simple root-finding procedure, whereas the parameters \(\chi _{R}\) are calibrated according to the optimization technique explained in Sect. 2.3.2. In addition, the calibration is subject to non-linear constraints prescribed in Gatheral and Jacquier (2014). These constraints ensure convexity of the option price, which rules out butterfly arbitrage. Calendar spread arbitrage is avoided by penalizing fitted smiles, which induce a decrease in the level of the total implied variance for a given strike level. For interpolation, the at-the-money (ATM) total implied variance \(\theta _{T}^{SVI}=w(0,T)\) is interpolated for \(t_1< T < t_2\) as in Gatheral and Jacquier (2014), where \(t_1, t_2\) refer to time points at which implied volatilities are observed. The resulting option price is the convex combination

$$\begin{aligned} C(T, K)=\alpha _{T} C(t_1, K)+\left( 1-\alpha _{T}\right) C(t_2, K), \end{aligned}$$
(7)

where \(\alpha _{T}=\displaystyle \frac{\sqrt{\theta _{t_{2}}^{SVI}}-\sqrt{\theta _{T}^{SVI}}}{\sqrt{\theta _{t_{2}}^{SVI}}-\sqrt{\theta _{t_{1}}^{SVI}}} \in \left[ 0,1\right]\).

2.3 Hedge routine

This section describes the models selected to hedge BTC options as well as the model parameter calibration procedure. Given these model classes, hedge strategies are chosen for the hedge routine.

2.3.1 Hedge models

For hedging purposes, the choice of a hedge model faces the trade-off between sufficient complexity to describe the actual market dynamics and market completeness (Detering & Packham, 2015). In practice, a trader may for instance initiate hedging with an evidently wrong but simple model, such as the complete BS option pricing model. A lower number of parameters provides a parsimonious setup with potentially manageable explanatory power. In our setting, a European option is hedged employing models of increasing complexity. In the following, the model granularity is gradually extended by the addition of risk-factors such as local volatility, jumps, stochastic volatility and others. This covers the empirical finding of the previous literature on CC’s, e.g. (Kim et al., 2021; Scaillet et al., 2018). Accordingly, the hedge models selected encompass affine jump diffusion models and infinite activity Levy processes.

The class of affine jump diffusion models covers well-known models nested in (1). Due to its popularity in the financial world, the simple but complete BS option pricing is selected as one of the hedge models. The volatility is constant with \(V_t=\sigma\) and there are no discontinuities from jumps \(N_{t} = 0\). A slightly more complex model is the JD model. It assumes constant volatility with \(V_{t}=\theta\), \(\sigma ^{v}=0\) and extends the BS model by allowing for jumps in returns. The jump size is \({\text {log}} \xi \sim N\left( \mu ^{s}, \left( \delta ^{s}\right) ^{2} \right)\) distributed.

Evidence for stochastic volatility motivates the choice of the SV model. The jump component is excluded with \(\lambda = 0\) and \(N_{t} = 0\). We also examine the SVCJ model itself as a model used for hedging. It serves as the most general model and its hedge performance provides a meaningful insight for the comparison of the SVCJ and GARCH-KDE framework, while in the SVCJ framework, it provides “anticipated” hedge results (cf. Branger et al. (2012)). Due to the jump scarcity and latent nature of the variance process \(V_{t}\), we also consider the SVJ model for hedging. In difference to the SVCJ model, this model has jumps in returns but no jumps in volatility.

In contrast to affine jump processes, there exists a well-established class of processes that do not entail a continuous martingale component. Instead, the dynamics are captured by a right-continuous pure jump process, such as the Variance Gamma (VG) model (Madan et al., 1998). The underlying \(S_t\) evolves as

$$\begin{aligned} \begin{aligned} d S_{t}&=r S_{t-} d t+ S_{t-} d X^{\mathrm {VG}}_{t} \\ X^{\mathrm {VG}}_{t}&=\theta ^{VG} G_{t} +\sigma ^{VG} W_{G_{t}}, \end{aligned} \end{aligned}$$
(8)

with the characteristic function of the VG-process \(X^{\mathrm {VG}}_{t}\) given by

$$\begin{aligned} \varphi _{\mathrm {VG}}(u ; \sigma ^{VG}, \nu , \theta ^{VG})=\left( 1-\mathrm {i} u \theta ^{VG} \nu +\frac{1}{2} \left( \sigma ^{VG} \right) ^{2} \nu u^{2}\right) ^{-1 / \nu }, \end{aligned}$$
(9)

where r is the risk-free rate, \(W_{t}\) is a Wiener process and \(G_{t}\) is a Gamma process. The overall volatility level is represented by \(\sigma ^{VG}\); \(\theta ^{VG}\) governs the symmetry of the distribution and therefore controls the implied volatility skew; \(\nu\) controls for tails, kurtosis and thus regulates the shape of the volatility surface. An alternative representation of the VG process appealing for practical interpretation is given by the characteristic function

$$\begin{aligned} \varphi _{\mathrm {VG}}(u ; C, G, M)=\left( \frac{G M}{G M+(M-G) \mathrm {i} u+u^{2}}\right) ^{C}, \end{aligned}$$
(10)

where \(C, \ G, \ M >0\). The detailed link between (9) and (10) is described in Appendix A.4. An increase in G (M) increases the size of upward jumps (downward jumps). Accordingly, \(\theta ^{VG}\), M and G account for the skewness of the distribution. An increase in C widens the Lévy-measure. An extension of the VG model is the CGMY model by Carr et al. (2002). On a finite time interval, the additional parameter Y permits infinite variation as well as finite or infinite activity. Formally, in (8) the source of randomness is replaced by a CGMY process \(X^{CGMY}_{t}\) with the characteristic function

$$\begin{aligned} \varphi _{\mathrm {CGMY}}\left( u ; C, G, M,Y \right) =\exp \left[ C t \Gamma (-Y)\left\{ (M-\mathrm {i} u)^{Y}-M^{Y}+(G+\mathrm {i} u)^{Y}-G^{Y}\right\} \right] . \end{aligned}$$
(11)

The \(X^{VG}_{t}\)-process in the representation in Equation (9) is a special case of the CGMY process for \(Y=1\). On a finite time interval, the behavior of the path depends on Y. For \(Y<0\), there is a finite number of jumps, else infinite activity. In case of \(Y \in (1,2]\), there is also infinite variation.

2.3.2 Calibration routine

The model parameters are calibrated following the FFT option pricing technique of Carr and Madan (1999). The price of a European-style option C(TK) is given by

$$\begin{aligned} \begin{aligned} C(T,K)&=\frac{1}{\pi } {\text {e}}^{-\alpha \ln K} \int _{0}^{\infty } {\text {e}}^{-\mathrm {i} v \ln K} \psi _T(v) d v, \quad \text { with} \\ \psi _T(v)&=\frac{\exp ^{-rT} \phi _{T}(v-(\alpha +1)\mathrm {i})}{\alpha ^{2}+\alpha -v^{2}+\mathrm {i}(2 \alpha +1) v}, \end{aligned} \end{aligned}$$
(12)

where \(\phi _T\) is the characteristic function of the \(\alpha\)-damped option price \(c_{T}\left[ \ln (K)\right] =e^{\alpha \ln K} C(T,K), \ \alpha >0\). The ill-posed nature of calibration can lead to extreme values of the model parameters. This is avoided by employing a Tikhonov \(L_2\)-regularization (Tikhonov et al., 2011). At the cost of accepting some bias, this penalizes unrealistic values of the model parameters by giving preference to parameters with smaller norms. Calibration is performed by the optimizer

$$\begin{aligned} \begin{aligned} \theta ^*&= \underset{\theta \in \Theta }{{\text {argmin}}} R(\theta ) \\ R(\theta )&= \sqrt{\frac{1}{n} \sum _i \left\{ IV^{Model}(T^i, K^i, \theta ) - IV^{Market}((T^i, K^i))\right\} ^2} + \theta ^{\top } \Gamma \theta , \end{aligned} \end{aligned}$$
(13)

where \(IV^{Model} \left( \cdot \right) , IV^{Market}\left( \cdot \right)\) describe model and market implied volatilities for maturity and strike \(T^i, K^i\). \(\Gamma\) is a diagonal positive semi-definite matrix. It corresponds to the Tikhonov \(L_2\)-regularization, which gives preference to parameters with smaller norms. The entries in the matrix \(\Gamma\) are chosen individually for each parameter to ensure that they maintain the same reasonable order of magnitude.

The parameter space \(\Theta \subset \mathbb R^d\) of each model in scope is subject to linear inequality constraints. Given that the objective is not necessarily convex, it may have multiple local minima. In order to explore the entire parameter space, simplex-based algorithms are more appropriate than local gradient-based techniques. In our case, we employ the Sequential Least Squares Programming optimization (Kraft, 1988) routine. We adjust for time effects by calibrating parameters on the IV surface instead of option prices.

We impose liquidity and moneyness cut-offs. Claims must have a positive trading volume and an absolute BS Delta in [0.25, 0.75]. This filters options that are close to ATM as is custom in FX trading, see Clark (2011).

2.3.3 Hedging strategies

Any hedging strategy’s target is to protect against market movements and to minimize Profit-and-Loss (P &L) of the hedged position. Hedges either reduce risk by eliminating market-risk-related sensitivities \(\left( \Delta , \Gamma , V \right) = \displaystyle \left( \frac{\partial C }{\partial S} , \frac{\partial ^{2} C}{\partial ^{2} S}, \frac{\partial C }{\partial \sigma } \right)\) or by minimising a risk measures, such as a hedged position’s variance. Broadly, hedging strategies are split into single- and multiple instrument hedges. Single instrument hedges incorporate the \(\Delta\)- and MV-hedging. Föllmer and Sondermann (1986)’s MV hedge aims to find the strategy that minimizes the mean-squared error under \(\mathbb {Q}\)

$$\begin{aligned} \left( \Pi _{0}, \xi ^{MV}_{t}\right) =\underset{\Pi _{0} , \xi ^1_t}{\text {argmin}}\ {\textsf {E}}_\mathbb {Q}\left[ \left( C_{T} -\Pi _{0} -\int _{0}^{T} \xi ^{1}_{u} d S_u\right) ^{2}\right] . \end{aligned}$$
(14)

Under the assumption of symmetric losses and gains, the minimizing strategy is denoted by \(\xi ^{MV}_{t}\). The \(\Delta\)-hedge targets to protect the position against first-order changes in the underlying \((S_t)_{t\le T}\).

In addition to hedging \(\Delta\), multiple instrument hedges eliminate higher-order sensitivities or sensitivities of risk factors other than the underlying, e.g. \(\sigma\). To achieve \(\Delta\)-\(\Gamma\)- or \(\Delta\)-\(V\)-neutrality, an additional liquid option \(C_{2}(S(t),T,K_{1})\) with strike \(K_1 \ne K\) is priced from the SVI parameterized IV surface, as explained in Sect. 2.2. For performance comparison of linear and non-linear effects, the dynamic \(\Delta\)- and \(\Delta\)-\(\Gamma\)-hedging strategies are applied to all hedge models. The \(\Delta\)-\(V\)-hedge is only considered for affine jump diffusion models. The technical aspects of the dynamic hedging strategies are described in Appendices A.2 and A.3. The calibrated model parameters are used to compute hedging strategies \((\xi _t)_{0\le t\le T}\) for each model. Table 2 summarizes the hedging strategies applied to the respective hedge models.

Table 2 Hedge strategy summary applied to the hedge models described in Sect. 2.3.1

The methods for computing sensitivities depend on the model. Where possible, analytic formulas are used (e.g. BS-model). In cases where not analytic formulas are available, e.g. the VG-model, finite differences are applied to FFT-generated option prices.

2.3.4 Backtesting hedges on historical data

In addition to evaluating the hedges in Monte Carlo simulations, the hedging strategies are backtested on the historical BTC price path. The principal idea is to write an at-the-money option with fixed expiry (2 months in our setting) each day. Each option is hedged by a self-financing hedging strategy with daily rebalancing. At expiry, the P &L is recorded. This gives a sample of P &L’s on real data. Details of the self-financing strategy are given in Appendix A.1. The choice of 2-month expiry allows to construct P &L samples of size 60 for each market regime (bullish, calm, Covid).

This setup follows the empirical study in Detering and Packham (2016). A similar type of backtest, recording daily P &L instead of terminal P &L is conducted in Ting and Ewald (2013). Daily P &L, however, depends on the option price and is therefore model-dependent.

2.3.5 Hedge performance measures

Each model’s hedge performance is evaluated by indicators derived from the relative P &L

$$\begin{aligned} \pi ^{rel} = e^{-r T} \frac{\Pi _T }{C (T,K)}. \end{aligned}$$
(15)

In a perfect hedge in a complete market, we have \(\Pi _T=0\), and therefore \(\pi ^{rel}=0\). However, in practice, due to model incompleteness, discretization and model uncertainty, \(\pi ^{rel}\ne 0\). We evaluate the hedge performance with the relative hedge error \(\varepsilon ^{hedge}\) as applied in e.g. Poulsen et al. (2009), defined as

$$\begin{aligned} \varepsilon ^{hedge} =100 \sqrt{ {\text {Var}}\left( \pi ^{rel} \right) }. \end{aligned}$$
(16)

The rationale behind \(\varepsilon ^{hedge}\) is that standard deviation represents a measure of uncertainty. A sophisticated hedge strategy reduces or ideally eliminates uncertainty (Branger et al., 2012). The tail behavior is captured by the expected shortfall

$$\begin{aligned} \text {ES}_{\alpha } = \mathbb E\left[ \pi ^{rel} \mid \pi ^{rel}>F_{\pi ^{rel}}^{(-1)}(\beta )\right] , \end{aligned}$$
(17)

where \(F_{\pi ^{rel}}^{(-1)}(\beta )\) denotes an \(\beta\)-quantile. In the empirical part, these measures are estimated via the empirical distributions from Monte Carlo, resp. historical simulation.

3 Empirical results

3.1 Data

The models are calibrated on the market prices of European-style \(\texttt {Deribit}\) options written on BTC futures. The number of liquidly traded instruments varies significantly with maturity. Therefore, the data is filtered with liquidity cut-offs.

3.2 Option pricing

Option prices are obtained on every day of the hedging period. This is necessary for the calculation of the initial value of the hedging portfolio and to perform multi-asset dynamic hedging. Each option is priced according to the IV surface on the given day. If the option is not traded for the given strike or maturity, the SVI parametrized IV surface is interpolated in an arbitrage-free way. For illustration, we take a look at CC option prices at the beginning of each market period. Figure 2 displays the SVI parametrized interpolated IV surfaces for SVI parameters listed in Table 15. The resulting option prices used in the hedging routine are displayed in Table 3.

Table 3 Interpolated 1-month and 3-months ATM option prices

Recall that for a given IV surface the SVI parameters related by the formula (6) are calibrated for each TTM. The temporal dynamics of the SVI parameters provide the following insights: parameter a increases with TTM, which aligns with the increase of the ATM total variance as TTM rises. Parameter \(\sigma ^{SVI}\) decreases with TTM, indicating decrease of the ATM curvature. Increasing values of parameter b indicate higher slopes of the wings as TTM increases. Skewness, expressed in terms of the parameter \(\rho ^{SVI}\), varies across market segments. Usually negative values of \(\rho ^{SVI}\) indicate a preference for OTM puts over OTM calls. In the bullish period, skewness is close to zero across most maturities.

Fig. 2
figure 2

Market IVs in red and interpolated IV surface in blue on (a) 1st April 2019 (b) 1st October 2019 (c) 1st February 2020. Fitted smiles with very short maturities less than 1 week are excluded from plots, because they are not relevant for the hedging routine. Calibrated SVI parameters of shorter maturities are given in Table 15

3.3 Scenario generation results

Table 4 Summary statistics of estimated historical densities \(\widehat{z_{t}}\) defined in (3) for a respective scenario

For the GARCH-KDE approach, the estimated residual distributions \(\widehat{f^{h}}(z)\) from (4) are displayed in Fig. 3. The empirical moments and quantiles are listed in Table 4. Figure 12 illustrates the GARCH(1, 1) volatility estimates of BTC returns and the 7-day historical BTC volatility. As a consequence of de-garching, all three distributions are roughly symmetric and mean-zero. Deviations are direct results from market moves: the upward-moving market behavior in the bullish period leads to a left-skewed residual distribution. High drops in the stressed period result in a negatively skewed distribution.

Fig. 3
figure 3

Estimated residual density \(\widehat{f^{h}}(z)\) in (4) during bullish market behavior, calm period and the stressed scenario during the Corona crisis for \(h=0.2\)

To demonstrate that the GARCH-KDE method is an appropriate method of sampling "close-to-actual-market" paths, the boxplots in Fig. 4 illustrate the distributions of one simulated GARCH-KDE path and the corresponding historical distribution. The strength of the GARCH-KDE approach, of course, lies in the fact that through Monte Carlo simulation, the analysis is not restricted to one path.

Fig. 4
figure 4

Distributions of one-sampled GARCH-KDE path and historical returns

SVCJ paths are simulated with daily re-calibrated parameters, which are summarized in Appendix Table 6. Selected statistical properties of both scenario generation approaches are given in Table 14. We observe differences in tails, extreme values and standard deviation. Discrepancies in \(\widehat{\sigma }\) are natural consequences from different methodological assumptions. The SVCJ approach assumes volatility to be stochastic, whereas GARCH-KDE models \(\sigma _{t}\) with GARCH(1,1). Discrepancies in path extremes result from the SVCJ model assumptions on return jump size \(\Xi\) in (2). In the calibration routine, the \(L_{2}\)-regularization is applied to control extreme parameter values. Yet, estimated return jump sizes can be very large. Resulting Euler discretized paths contain trajectories with extreme moves of the underlying. These are e.g. extremely low and high prices during the calm and stressed scenario displayed in Table 14. The sometimes erratic BTC price evolution suggests that such price moves are entirely implausible (Tables 5, 6).

Table 5 Summary statistics of calibrated SVCJ jump size \(\Xi\) per market segment
Table 6 Average calibrated SVCJ parameters with market segments

3.4 Calibration results

In each period, calibration is performed daily using instruments satisfying the liquidity and moneyness requirements specified in Sect. 2.3.2. For an overview, average numbers of options per maturity range used for calibration are summarized in Table 7. As a consequence of the moneyness requirement, more longer-dated options are selected. The average parameter values per period are summarized in Table 6. Sections 3.4.1 and 3.4.2 provide a detailed perspective on the dynamics of the calibrated parameters. Calibration is carried out on the market’s mid IVs. Of course, ignoring bid-ask spreads and the possibility of stale prices may produce arbitrage opportunities as well as spikes in parameters and calibration errors. However, this is considered a minor issue and ignored. RMSEs for the models are illustrated in Appendix C.3. Naturally, the model fit improves with increasing model complexity. Hence, the BS model has the highest RMSE values on average while the SVCJ model has the lowest.

Table 7 Overview of average maturity counts of all options in a daily IV surface fullfiling the liquidity and moneyness requirements (Sect. 2.3.2)

3.4.1 Affine jump diffusion models

The calibrated parameter \(\sigma ^{BS}\) provides meaningful insights into market expectations. Levels vary in the range \(\sigma ^{BS} \in [50 \ \%,175 \ \%]\), with summary statistics for this parameter provided in Table 8. Due to the volatile nature of the CC markets, levels of \(\sigma ^{BS}\) are generally higher than in traditional markets (Madan et al., 2019). In comparison, the VIX index in the time period 1990–2021 ranges between 9.5% and 60%, with the 95%-quantile at 33.5%. Figure 5 shows the dynamics of \(\sigma ^{BS}\) over the entire time frame. In the bullish period, volatility levels rise up to \(120 \%\). In the calm period, as expected, the levels are lower than in the other two periods with \(\sigma ^{BS} \in [0.61,0.91]\).

Table 8 Summary statistics of daily \(\sigma ^{BS}\) calibration
Fig. 5
figure 5

Daily calibration \(\sigma ^{BS}\) segregated by market segment in chronological order. Volatility levels are very high compared to equities or indices such as S & P 500

Figure 6 plots the calibrated parameters \(\sigma ^{JD}\) and \(\lambda ^{JD}\) of the JD model over time. In general, levels of \(\sigma ^{JD}\) are lower than \(\sigma ^{BS}\), clearly visible during the calm and stressed scenario. As the JD model is an extension of the BS model, higher levels of \(\sigma ^{BS}\) are partially compensated by the jump component. On many days, \(\sigma ^{JD}\) is close to \(\sigma ^{BS}\). The reason for this are generally low values of the annual jump intensity \(\lambda ^{JD}\) and jump size \(\mu _{y}\). On average, the JD model expects less than one jump in returns per year.

Fig. 6
figure 6

Interplay between \(\sigma ^{JD}\) and \(\lambda ^{JD}\) segregated by market segment in chronological order. Mostly, for high levels of \(\sigma ^{JD}\) we observe low levels of \(\lambda ^{JD}\) and vice versa

The evolution of \(\lambda ^{JD}\) is compared to the jump intensities of extended models \(\lambda ^{SVJ}\) and \(\lambda ^{SVCJ}\) in Appendix Fig. 13. Throughout, annualised jump intensities are low with mostly \(\lambda ^{SV(C)J} \le 2.5\). Overall, the conclusion is that jumps are infrequent. We observe contrasting levels of \(\lambda ^{SVCJ}\) and \(\lambda ^{JD}\). They are not directly comparable, as the jump intensity \(\lambda ^{SVCJ}\) contributes to simultaneous jumps in returns and stochastic volatility, while \(\lambda ^{JD}\) and \(\lambda ^{SVJ}\) corresponds solely to jumps in returns. For example, levels of \(\lambda ^{SVCJ}\) in the calm period are high whereas \(\lambda ^{SVJ}\) is close to zero.

The plausibility of the stochastic volatility assumption is analyzed by the evolution and levels of \(\sigma ^{v}\). In most periods, levels of \(\sigma ^{v}\) are higher compared to traditional markets. In the broad picture, the evolution of \(\sigma ^{v}\) does not depend on model choice a shown in Appendix Fig. 14. Table 16 summarizes statistical properties of this parameter by model and market segment. In the bullish and calm period, the indication for stochastic volatility is strong with vol-of-vol levels at \(q^{50} \ge 80 \%\) and \(q^{50} \ge 75\%\), respectively. In the stressed period, levels of \(\sigma ^{v}\) in SV,SVJ,SVCJ remain high at \(q^{50} \ge 73 \%\).

Empirical evidence suggests that in traditional markets the correlation parameter \(\rho ^{SV(CJ)}\) is usually negative. Specifically, when prices fall, volatility increases. However, across all three market segments and models, \(\rho ^{SV(CJ)}\) is mainly positive and close to zero as illustrated in Appendix Fig. 15. Hou et al. (2020) name this phenomenon the inverse leverage effect in CC markets, that was previous reported on commodity markets by Schwartz and Trolle (2009).

This relationship in the CC markets is also supported by the correlation between the \(\texttt {CRIX}\) and the \(\texttt {VCRIX}\) under the physical measure \(\mathbb P\). Pearson’s correlation coefficient is \(\rho ^{pearson}=0.51\) in the bullish and \(\rho ^{pearson}=0.64\) in the calm period, respectively. In the stressed segment, correlation is negative with \(\rho ^{pearson}=-0.73\).

3.4.2 VG and CGMY

The prospect of infinite variation is evaluated by the calibration of the CGMY model with average calibrated parameters in Table 9. Precisely, we are interested in the evolution of the infinite activity parameter \(Y^{CGMY}\) portrayed in Fig. 7. As in each market segment we mostly have \(Y^{CGMY} > 0\), there is evidence for infinite activity. In the bullish period, there is also evidence of infinite variation, as we mostly have \(Y^{CGMY} \in (1,2 ]\) (Carr et al., 2002).

Table 9 Average calibrated parameters of the CGMY model segregated by market segment
Fig. 7
figure 7

Daily \(Y^{CGMY}\) calibration segregated by market segment. Often, we observe \(Y^{CGMY} >0\). This provides indication for infinite activity. As \(Y^{CGMY} \in (1,2 ]\) in the bullish segment, there is evidence for infinite variation

The bullish period catches high magnitudes of jump size direction increase parameters \(G_{CGMY}\) and \(M_{CGMY}\), reflecting the nature of this market segment. Similarly, the increase in decreased jump size parameter \(M_{CGMY}\) is mainly higher in the stressed scenario. A graphical illustration is given in Appendix Fig. 17. The VG is calibrated under representation (9). Overall, volatility levels of \(\sigma ^{VG}\) are comparable to \(\sigma ^{BS}\), as illustrated in Appendix Fig. 16.

3.5 Hedge results

At the beginning of each market period, we short 1- and 3-months ATM options with option premiums listed in Table 3. As outlined earlier, the price process is simulated in both SVCJ and the GARCH-KDE setting. The exposure in each option is dynamically hedged using the strategies summarized in Table 2. The hedge performance is evaluated in terms of the hedge error \(\varepsilon ^{rel}\) and tail measures \(\text {ES}^{5 \%}\) and \(\text {ES}^{95 \%}\). The hedge results are shown in Tables 10 and 11. For a concise graphical representation, the best performing hedge strategies across models are compared in boxplots displayed in Figs. 8 and 9. For each model, the best performing strategy is selected according to \({\text {ES}}^{5 \%}\), as this provides a trade-off between an extreme, yet plausible tail summary.

These are the main findings: First, with some exceptions, using multiple instruments for hedging, i.e., Delta–Gamma and Delta–Vega hedges, when compared to a simple Delta-hedge lead to a substantial reduction in tail risk. Hence, whenever liquidly traded options are available for hedging, they should be used.

Exceptions are the calm and COVID periods in the GARCH-KDE approach for the short-maturity option as well as the calm period and GARCH-KDE approach for the long-date option—here, no significant improvement is achieved by including a second hedge instrument. In any case, no deterioration takes place when using a second security for hedging. Contrary to the SVCJ approach, which models several risk factors (jumps, stochastic volatility) explicitly, the GARCH-KDE approach, with a smooth KDE density, exhibits less sensitivity to concrete risk factors (e.g. Vega) in the calm period, even despite the GARCH filter, see Fig. 12.

Second, for short-dated options, no substantial differences occur in the optimal hedging strategies across models. The sole exception is worse performance of the VG- and CGMY-models in the calm period when price paths are generated in the SVCJ model.

Third, turning to the long-dated option, although not always best performing, it can be said that stochastic volatility models perform consistently well. Amongst the stochastic volatility models, the SV model as the simplest model, does not underperform and sometimes even is the best-performing model. For the choice of a SV hedge model, the \(\Delta ^{SV}\)-\(V^{SV}\) hedge is a replicating strategy (Kurpiel & Roncalli, 1999) and performs often better than other models under the same or different strategies. As calibrated jump intensities \(\lambda ^{SVJ}\) and \(\lambda ^{SVCJ}\) are low, the SVJ or SVCJ are often similar to the SV leading to comparable hedge results (Table 12).

The simulated hedge results are confirmed in the historical hedge backtest. As before, with expections (calm period), hedges involving multiple hedge instruments consistently achieve desirable variance and tail risk reduction. For example, in the bullish period, the \(\Delta\)-\(V_{SV}\) strategy strikingly outperforms other best performing strategies.

Fig. 8
figure 8

1-month option hedge performance boxplots of \(\pi ^{rel}\) under (a), (c), (e) GARCH-KDE and (b), (d), (f) SVCJ market simulation for different market segments. For illustrative purposes \(\pi ^{rel}\) is truncated at \(q^{5}\) and \(q^{95}\). The vertical axis portrays \(\Delta ^{BS}\) hedge results compared each model’s best performing strategy. This best performing strategy is selected according to the minimal \(\text {ES}^{5 \%}\)

Table 10 1-month ATM hedge performances with the worst (bold values) and best (italic) performing strategy. The table corresponds to results in Fig. 8
Fig. 9
figure 9

Three month option hedge performance boxplots of \(\pi ^{rel}\) under (a), (c), (e) GARCH-KDE and (b), (d), (f) SVCJ market simulation for different market segments. For illustrative purposes \(\pi ^{rel}\) is truncated at \(q^{5}\) and \(q^{95}\). The vertical axis portrays \(\Delta ^{BS}\) hedge results compared each model’s best performing strategy. This best performing strategy is selected according to the minimal \(\text {ES}^{5 \%}\)

Table 11 3-month ATM hedge performance with the worst (bold values) and best (italic values) performing strategy. The table corresponds to results in Fig. 9
Fig. 10
figure 10

Historical backtest (a) bullish, (b) calm and (c) covid hedge performance; \(\pi ^{rel}\) for 2-months ATM options. For illustrative purposes \(\pi ^{rel}\) is truncated at \(q^{5}\) and \(q^{95}\). The vertical axis portrays \(\Delta ^{BS}\) hedge results compared each model’s best performing strategy. This best performing strategy is selected according to the minimal \(\text {ES}^{5 \%}\)

Table 12 Backtest of hedging 2-month ATM options with the worst (bold values) and best (italic values) performing strategies. The table corresponds to results in Fig. 10

3.5.1 Hedges with jump size correlation

The hedges above (Sect. 3.5) are performed under the assumption that the jump size correlation parameter \(\rho ^{j}\) is zero. This assumption is particularly well-founded on the BTC market, because jump size correlation \(\rho ^{j}\) is reportedly insignificant (Hou et al., 2020). Nevertheless, we investigate whether \(\rho ^{j} \ne 0\) impacts hedging and look for differences to the main hedge results from Sect. 3.5. Therefore, the hedge routines are repeated for a daily calibrated \(\rho ^{j}\) and for fixed parameter values \(\rho ^{j} \in \{-0.5, 0.5\}\). For comparison, we look at selected examples. Note that changes to \(\rho ^{j} \ne 0\) impact the SVCJ Monte Carlo simulation and the SVCJ hedge strategies (Table 2).

The calibration results from Fig. 11 and Table 13 show that most calibrated values lie close to \(\rho ^{j}=0\). As such, the SVCJ’s hedge performance results are similar. This is visible in the SVCJ’s hedge performance comparisons in the historical backtest in Table 17 to Table 19.

Fig. 11
figure 11

Distribution of daily calibrated \(\rho ^{j}\), where most values lie close to \(\rho ^{j}=0\)

Table 13 Summary statistics of daily \(\rho ^{j}\) calibration

Appendix B, Table 20 shows hedge results, when bumping the correlation to \(\rho ^j=0.5\), resp. \(\rho ^j=-0.5\). Return jump sizes (2) depend on \(\rho ^{j}\). Unsurprisingly, large correlation changes have a significant impact on the hedge performance.

4 Conclusion

From a risk management perspective, CC markets are a highly interesting new asset class: on the one hand CC prices are subject to extreme moves, jumps and high volatility, while on the other hand, derivatives are actively traded—and have been for several years—on several exchanges. This paper presents an in-depth comparison of different hedging methods, providing concise answers to the trade-off between hedging in a complete, albeit oversimplified model and hedging in a more appropriate, albeit incomplete market model.

As a central part of the methodology, we simulate price paths given the Bitcoin price history in two different ways: First, a semi-parametric approach (under the physical measures \(\mathbb P\)) combines GARCH volatilities with KDE estimates of the GARCH residuals. These paths are statistically close to the actual market behaviour. Second, paths are generated (under the risk-neutral measure \(\mathbb Q\)) in the parametric SVCJ model, where the SVCJ model parameters include valuable information on the contributing risk factors such as jumps. The time period under consideration features diverse market behaviour, and as such, lends itself to being partitioned into “bullish”, “calm” and “Covid-19” periods.

We hedge options with maturities of one and three months. If not directly quoted on the BTC market, option prices are interpolated from an arbitrage-free SVI-parametrization of the volatility surface. The options are then hedged assuming risk managers use market models from the classes of affine jump diffusion and infinite activity Lévy models, which feature risk factors such as jumps and stochastic volatility. The calibration of these models strongly support the following risk factors: stochastic volatility, infrequent jumps, some indication for infinite activity and inverse leverage effects on the market. Under GARCH-KDE and SVCJ, options are hedged with dynamic Delta, Delta–Gamma, Delta–Vega and minimum variance hedging strategies.

For longer-dated options, multiple-instrument hedges lead to considerable tail risk reduction. For the short-dated option, using multiple hedging instruments did not significantly outperform a single-instrument hedge. This is in-line with traditional markets, where even in highly volatile market periods, short-dated options are less sensitive to volatility or Gamma effects. For longer-dated options, multiple-instrument hedges consistently improve the hedge quality. Hence, if several liquidly traded options are available for hedging, they should be used. Among all models, persistently good hedge results are achieved by hedging with stochastic volatility models. This demonstrates that complete market models with stochastic volatility perform well, while models allowing for jump risk, although more realistic, do not produce better hedges due to the associated market incompleteness. These findings are confirmed for a historical backtest, where a 2-month option is written every day, generating a series of daily P &L’s from hedging at expiry.