Modelling and forecasting risk dependence and portfolio VaR for cryptocurrencies

Cheng, Jie

doi:10.1007/s00181-023-02360-7

Open access
Published: 16 January 2023

Volume 65, pages 899–924, (2023)
Empirical Economics

Jie Cheng ORCID: orcid.org/0000-0002-0838-0090¹

Abstract

In this paper, we investigate the co-dependence and portfolio value-at-risk of cryptocurrencies, with the Bitcoin, Ethereum, Litecoin and Ripple price series from January 2016 to December 2021, covering the crypto crash and pandemic period, using the generalized autoregressive score (GAS) model. We find evidence of strong dependence among the virtual currencies with a dynamic structure. The empirical analysis shows that the GAS model smoothly handles volatility and correlation changes, especially during more volatile periods in the markets. We perform a comprehensive comparison of out-of-sample probabilistic forecasts for a range of financial assets and backtests and the GAS model outperforms the classic DCC (dynamic conditional correlation) GARCH model and provides new insights into multivariate risk measures.

1 Introduction

In recent years, cryptocurrencies have gained increasing attention not only from ordinary investors but also from regulatory authorities and policy makers. Cryptocurrencies are decentralized currencies that are powered by their users with no central authority; they are therefore independent of monetary policy and not controlled by the existing banking system^{Footnote 1}. Bitcoin, the largest cryptocurrency, was created in 2009, and numerous other cryptocurrencies have been created since then. After a stable period of development, most cryptocurrencies started to climb and increased dramatically in the period 2016 to 2020, with pricing bubbles in 2018 (Corbet et al. 2018). After that, the prices of all major cryptocurrencies exhibited tremendous fluctuations, with the sharpest drop during the March 2020 selloff, as a result of the COVID-19 outbreak.

Existing literature on the cryptocurrency market includes studies focusing on the hedging and safe-haven properties of cryptocurrencies (e.g. Bouri et al. 2017; Conlon and McGee 2020), market efficiency (e.g. Nadarajah and Chu 2017; Tran and Leirvik 2020), and volatility patterns and portfolios of cryptocurrency markets (Katsiampa 2017), most of which provide the within-sample fit for univariate cases. On the other hand, to account for the structural linkages and interdependencies among cryptocurrencies and other financial assets, different multivariate approaches, including GARCH-DCC models (Guesmi et al. 2019; Ghabri et al. 2021), GARCH-BEKK models (Katsiampa et al. 2019; Stavroyiannis and Babaros 2017) and GARCH-copula models (Bouri et al. 2018; Boako et al. 2019; Syuhada and Hakim 2020), have been documented for volatility forecasting and risk management.

While these studies provide useful analyses, they also confirm that both the conditional volatilities and the correlations of the cryptocurrencies change over time, especially during the bubble period in 2018 and the pandemic era in 2020. We therefore use the observation-driven, time-varying multivariate generalized autoregressive score (GAS) model to examine the price dependency relationships and portfolio value-at-risk (VaR) of cryptocurrencies; in particular, Bitcoin (BTC), Ethereum (ETH), Litecoin (LTC) and Ripple (XRP) are considered. The GAS model was proposed by Creal et al. (2013), and it nests many well-known models, including the GARCH (Bollerslev 1986) and ACD (Engle and Russell 1998) models. Tafakori et al. (2018) consider an asymmetric exponential GAS model to predict Australian electricity returns. Chen and Xu (2019) use both univariate and bivariate GAS models to analyse and forecast volatilities and correlations between Brent, WTI and gold prices. To the best of our knowledge, no other study has used the multivariate GAS model to forecast the volatility and correlation of cryptocurrencies.

Because the literature on cryptocurrencies is relatively young, there are few studies of out-of-sample forecasting performance for both dependence structure and volatility. Amongst those, Syuhada and Hakim (2020) construct a dependence model through vine copula and provide value-at-risk (VaR) forecasts. Chi and Hao (2021) show that the GARCH model’s volatility forecast is better than the option-implied volatility using BTC and ETH prices. In our paper, we evaluate out-of-sample forecasting performance for both point forecasts (e.g. VaR) and density forecasts. In order to see how effectively the GAS model treats different dynamic features simultaneously in a unified way, we compare the forecasting results with those of the classic dynamic conditional correlation generalized autoregressive conditional heteroskedasticity (DCC-GARCH) model (Engle 2002).

Our main findings are as follows. First, besides the most widely applied volatility model, GARCH, asymmetric GARCH specifications including the GJR-GARCH and APARCH models are also considered for the univariate ETH, LTC, BTC and XRP return series. Interestingly, the additional parameters in these models, which are supposed to capture the asymmetric volatility response to past returns (the so-called leverage effect), are not significant for any of the cryptocurrencies in this paper. These results are consistent with those found in Chi and Hao (2021) and Syuhada and Hakim (2020). Several studies apply asymmetric GARCH models to cryptocurrency return series; however, they either use a GARCH-type model with Gaussian innovations (Cheikh et al. 2020) or report only weakly significant additional terms, which are supposed to reflect the asymmetry (Apergis 2021). One possible explanation is that traders and investors in the cryptocurrency market differ from those in the stock market. Unlike the stock market, which is usually dominated by well-informed investors, the cryptocurrency market has more uninformed investors, and the volatility asymmetry, which can be traced to trading activity guided by information asymmetry between well-informed and uninformed traders (Avramov et al. 2006), is not as significant as it is in the stock market.

Second, we find empirical evidence that the forecasting ability of the GAS model is better than that of the DCC-GARCH model. More specifically, the GAS model accounts for large price changes in a very natural way when updating the correlations and volatilities over time, especially during extreme events. This is particularly important when we form a portfolio and estimate the corresponding VaR forecasts. Across a sequence of statistical tests, our results favour the GAS model over the DCC-GARCH model in terms of point (volatility and correlation) forecasts, quantile (value-at-risk) forecasts and density forecasts.

This paper is organized as follows: Section 2 describes the multivariate GAS model and the DCC-GARCH model. Section 3 provides the data source and preliminary analysis. In Sect. 4, we apply the two multivariate models to the daily cryptocurrency returns and present the estimation results for the within-sample period. Moreover, we evaluate the out-of-sample forecasting performance of the two models for volatilities, correlations, VaRs and probability distributions. Section 5 concludes.

2 Empirical models

2.1 The multivariate GAS model

Let ${\varvec{r}}_t$ be an N-dimensional random vector at time t with conditional distribution

$$\begin{aligned} {\varvec{r}}_t\vert F_{t-1} \sim p({\varvec{r}}_t,{\varvec{\theta }}_t), \end{aligned}$$

(1)

where $F_{t-1}$ contains all the information up to time $t-1$, ${\varvec{\theta }}_t$ is a vector of time-varying parameters depending on $F_{t-1}$ and a set of static parameters ${\varvec{\phi }}$ for all time t. The GAS(p,q) model is an observation-driven model, and the time-varying parameters ${\varvec{\theta }}_t$ are governed by the score of the conditional density in (1) and an autoregressive updating equation

$$\begin{aligned} {\varvec{\theta }}_{t+1}={\varvec{\kappa }}+\sum _{i=1}^{p}A_i{\varvec{s}}_{t-i+1}+\sum _{j=1}^{q}B_j{\varvec{\theta }}_{t-j+1}, \end{aligned}$$

(2)

where ${\varvec{\kappa }}$ is a vector of constants, $A_i$ and $B_j$ are coefficient matrices of appropriate dimensions and ${\varvec{s}}_t$ is the scaled score function

$$\begin{aligned} {\varvec{s}}_t={\varvec{S}}_t\nabla _t({\varvec{r}}_t,{\varvec{\theta }}_t), \end{aligned}$$

(3)

with

$$\begin{aligned} \nabla _t&=\frac{\partial \log p({\varvec{r}}_t,{\varvec{\theta }}_t)}{\partial {\varvec{\theta }}_t},\\ {\varvec{S}}_t&=I_t({\varvec{\theta }}_t)^{-\gamma },\\ I_t({\varvec{\theta }}_t)&=E_{t-1}\left[ \nabla _t\nabla _t^{T}\right] =-E_{t-1}\left[ \frac{\partial ^2\log p({\varvec{r}}_t,{\varvec{\theta }}_t)}{\partial {\varvec{\theta }}_t\partial {\varvec{\theta }}_t^{T}}\right] , \end{aligned}$$

where the expectation is taken with respect to the conditional distribution in (1). The additional parameter $\gamma $ is fixed. By choosing different values of $\gamma $, the GAS model encompasses some well-known models (e.g. GARCH, ACD and ACM models, see Creal et al. 2013, for a detailed discussion).

In the application, we consider a GAS(1,1) model with $\gamma =0$, where the conditional distribution in (1) is a multivariate standardized Student-t distribution (Ardia et al. 2019). Therefore, the time-varying parameter vector ${\varvec{\theta }}_t$ (including location $\mu $, scale $\sigma $, correlation $\rho $ and shape $\nu $ parameters) is given by:

$$\begin{aligned} {\varvec{\theta }}_{t+1}={\varvec{\kappa }}+A{\varvec{s}}_t+B{\varvec{\theta }}_t, \end{aligned}$$

and, because $\gamma =0$, the scaling matrix ${\varvec{S}}_t$ is the identity matrix.
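To make the recursion concrete, the following is a minimal univariate sketch of a GAS(1,1) filter with identity scaling. It is not the multivariate specification estimated in the paper: it assumes a zero-mean return, a fixed shape parameter `nu`, and the parameterization $\theta _t=\log \sigma _t^2$, and the function names are hypothetical. Under these assumptions the score of the standardized Student-t log-density is $\frac{1}{2}\left[ (\nu +1)r_t^2/((\nu -2)\sigma _t^2+r_t^2)-1\right] $.

```python
import math


def student_t_score(r, log_var, nu):
    """Score of the standardized Student-t log-density w.r.t. theta = log(sigma^2).

    Assumes a zero-mean return r and nu > 2 (illustrative simplification).
    """
    var = math.exp(log_var)
    return 0.5 * ((nu + 1.0) * r * r / ((nu - 2.0) * var + r * r) - 1.0)


def gas_filter(returns, kappa, a, b, nu, theta0):
    """Iterate theta_{t+1} = kappa + a * s_t + b * theta_t with S_t = 1 (gamma = 0)."""
    thetas = [theta0]
    for r in returns:
        s_t = student_t_score(r, thetas[-1], nu)
        thetas.append(kappa + a * s_t + b * thetas[-1])
    return thetas
```

Because the score is bounded in $r_t$ (it tends to $\nu /2$ as $\vert r_t\vert $ grows), a single extreme return moves the filtered volatility by a limited amount in this sketch.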

2.2 The multivariate DCC-GARCH model

Following Engle (2002), the DCC-GARCH(1,1) model is as follows. Let ${\varvec{r}}_t$ be an N-dimensional random vector at time t with conditional covariance matrix

$$\begin{aligned} Var({\varvec{r}}_t\vert F_{t-1})=H_t=D_tR_tD_t, \end{aligned}$$

(4)

where $F_{t-1}$ is the information available up to time $t-1$, $D_t=\text {diag}(\sqrt{h_{11,t}},\cdots ,\sqrt{h_{NN,t}})$ is a diagonal matrix, $h_{ii,t}$, $i=1,2,\cdots ,N$, is the conditional variance obtained from a univariate model, usually of GARCH type, and $R_t$ is the dynamic conditional correlation matrix. More specifically, let

$$\begin{aligned} {\varvec{r}}_t&={\varvec{\mu }}_{t-1}+{\varvec{\psi }}_t,\end{aligned}$$

(5)

$$\begin{aligned} {\varvec{\psi }}_t&=H_t^{1/2}{\varvec{\varepsilon }}_t, \end{aligned}$$

(6)

then the correlation matrix is $R_t=\text {diag}(Q_t)^{-1/2}Q_t\,\text {diag}(Q_t)^{-1/2}$, where the matrix $Q_t$ is updated by

$$\begin{aligned} Q_t=(1-a-b)\bar{Q}+a{\varvec{Z}}_{t-1}{\varvec{Z}}_{t-1}^T+bQ_{t-1} \end{aligned}$$

where $\bar{Q}$ is the symmetric, time-invariant unconditional covariance matrix of ${\varvec{Z}}_t$ and ${\varvec{Z}}_t=D_t^{-1}{\varvec{\psi }}_t$ is the vector of standardized residuals. In our application, we assume that ${\varvec{\varepsilon }}_t$ follows a multivariate standardized Student-t distribution, as in the GAS(1,1) model.
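The recursion for $Q_t$ can be sketched in a few lines. This is an illustration only: it assumes the standardized residuals ${\varvec{Z}}_{t-1}$ and the scalars a and b are already available from a first-stage estimation, it rescales $Q_t$ to unit diagonal to obtain a correlation matrix as in Engle (2002), and the function names are hypothetical.

```python
import math


def dcc_update(q_prev, z_prev, q_bar, a, b):
    """One step of Q_t = (1 - a - b) * Q_bar + a * z z' + b * Q_{t-1}."""
    n = len(z_prev)
    return [
        [
            (1.0 - a - b) * q_bar[i][j] + a * z_prev[i] * z_prev[j] + b * q_prev[i][j]
            for j in range(n)
        ]
        for i in range(n)
    ]


def to_correlation(q):
    """Rescale Q_t to unit diagonal: R_t = diag(Q_t)^{-1/2} Q_t diag(Q_t)^{-1/2}."""
    n = len(q)
    d = [math.sqrt(q[i][i]) for i in range(n)]
    return [[q[i][j] / (d[i] * d[j]) for j in range(n)] for i in range(n)]
```

With a + b < 1 and a positive definite $\bar{Q}$, each update stays positive definite, so the rescaled matrix is a valid correlation matrix.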

3 Empirical application

Daily cryptocurrency data for Ethereum (ETH), Litecoin (LTC), Bitcoin (BTC) and Ripple (XRP), in US dollars, are obtained from https://www.cryptocompare.com^{Footnote 2} using a Python script. Our sample period is from 1 January 2016 to 31 December 2021. We split the sample into two parts: a within-sample period from 1 January 2016 to 31 December 2018, which includes a total of 1096 daily prices, and an out-of-sample period from 1 January 2019 to 31 December 2021. For each of the datasets, the returns $r_t$ of ETH, LTC, BTC and XRP are calculated as

$$ r_t=100\left[ \log (P_t)-\log (P_{t-1})\right] , $$

where $P_t$ is the daily closing price at time t.
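As a small illustration of the return definition above, the sketch below computes percentage log returns from a list of closing prices. The function name and the input format (a plain list of positive prices in time order) are assumptions for the example.

```python
import math


def log_returns(prices):
    """Percentage log returns r_t = 100 * (log P_t - log P_{t-1})."""
    return [
        100.0 * (math.log(p_now) - math.log(p_prev))
        for p_prev, p_now in zip(prices, prices[1:])
    ]
```

A series of n prices yields n - 1 returns, so the first price only serves as a starting value.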

Cryptocurrency returns are extremely volatile, so we winsorized them at the 0.5% and 99.5% levels. Figure 1 displays the winsorized return series for ETH, LTC, BTC and XRP during the full sample period, i.e. from January 2016 to December 2021. We observe multiple volatile periods for the different return series, but they behave more similarly after 2018. During the March 2020 selloff, all of them experienced their most negative changes. It is worth mentioning that XRP suffered significant price fluctuations during the first half of 2021 due to an SEC lawsuit Ripple faced at the end of 2020. Therefore, volatility changes of XRP after 2021 were mostly caused by updates on the SEC lawsuit. Table 1 reports the descriptive statistics for the ETH, LTC, BTC and XRP return series. All of them have positive mean returns and leptokurtic empirical distributions for both sample periods. Moreover, the skewness for BTC (XRP) is negative (positive) across the full sample, while ETH and LTC present positive skewness before 2019 and negative skewness after 2019. For all return series, the augmented Dickey–Fuller statistics reject the unit root null at the 1% significance level, in favour of stationarity. Normality is strongly rejected by the very large Jarque–Bera statistics, indicating fat-tailed distributions. The results of Engle’s ARCH test (Engle 1982) reveal significant ARCH effects, which motivates the application of GARCH-type models.
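The winsorization step can be sketched as follows. This assumes the cut-offs are the 0.005 and 0.995 empirical quantiles of each series and uses linear interpolation between order statistics; both the quantile convention and the function names are assumptions, since the exact procedure is not spelled out in the text.

```python
def quantile(sorted_vals, q):
    """Empirical quantile with linear interpolation between order statistics."""
    pos = q * (len(sorted_vals) - 1)
    lo = int(pos)  # floor, since pos is non-negative
    hi = min(lo + 1, len(sorted_vals) - 1)
    frac = pos - lo
    return sorted_vals[lo] * (1.0 - frac) + sorted_vals[hi] * frac


def winsorize(values, lower=0.005, upper=0.995):
    """Clip values below the lower quantile and above the upper quantile."""
    ordered = sorted(values)
    lo, hi = quantile(ordered, lower), quantile(ordered, upper)
    return [min(max(v, lo), hi) for v in values]
```

Unlike trimming, winsorizing keeps the sample size unchanged, which matters here because the in-sample and out-of-sample windows have a fixed number of daily observations.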

Table 1 Descriptive statistics for ETH, LTC, BTC and XRP return series

Following Tang and Xiong (2012), we first study the full sample rolling unconditional correlations between the ETH, LTC, BTC and XRP return series using a bivariate approach. We rescale the return series by subtracting their means and dividing by their standard deviations and specify the regression of the rescaled return $r_{m,t}^{r}$ on the rescaled return $r_{l,t}^{r}$, with $l,m=1,2,3,4$ and $l\ne m$:

$$ r_{m,t}^{r}=\mu +\tilde{\rho }r_{l,t}^{r}+\eta _t $$

and $\hat{\tilde{\rho }}$ is the estimated unconditional correlation between the two cryptocurrency returns $r_m$ and $r_l$. The time-varying estimated correlation is obtained by using a rolling window of fixed length equal to 30 days. The rolling correlations of the full-sample return series are plotted in Fig. 2.
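The rolling regression can be sketched as below. The example assumes that the returns are rescaled within each 30-day window, in which case the OLS slope coincides with the sample correlation of that window; whether the rescaling is done per window or over the full sample is not stated in the text, so this is an assumption, as are the function names.

```python
import math


def standardize(xs):
    """Subtract the mean and divide by the (population) standard deviation."""
    m = sum(xs) / len(xs)
    sd = math.sqrt(sum((x - m) ** 2 for x in xs) / len(xs))
    return [(x - m) / sd for x in xs]


def ols_slope(x, y):
    """Slope of the regression of y on x with an intercept."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    sxx = sum((xi - mx) ** 2 for xi in x)
    return sxy / sxx


def rolling_correlation(r_l, r_m, window=30):
    """Rolling slope of rescaled r_m on rescaled r_l over fixed-length windows."""
    out = []
    for end in range(window, len(r_l) + 1):
        x = standardize(r_l[end - window:end])
        y = standardize(r_m[end - window:end])
        out.append(ols_slope(x, y))
    return out
```

A window with constant returns has zero standard deviation and would raise a division error, so real data with stale prices would need an explicit guard.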

Before 2017, the correlation between BTC and LTC stays high and positive, while those between ETH, BTC and XRP are low and negative. This is not surprising: Litecoin was one of the first “altcoins” to draw on Bitcoin’s original open-source code to create a new cryptocurrency and is therefore one of the altcoins most correlated with Bitcoin, whereas Ethereum was launched as a platform that enables building and deploying smart contracts and decentralized applications and competes against Bitcoin for market share; XRP was created as a faster, cheaper and more energy-efficient digital asset that can process transactions within seconds and consume less energy than some counterpart cryptocurrencies.

From the beginning of 2017 to the middle of 2018, distinct spikes can generally be found in the correlations between the cryptocurrencies. Such spikes may reflect significant uncertainty during this stage of the development of the cryptocurrency market. All the correlations rise sharply in the middle of 2018 and remain positive and strong until the end of the sample. This finding is in line with the current literature (Katsiampa 2019; Katsiampa et al. 2019; Chowdhury et al. 2022; Pace and Rao 2023), and the connectedness between the cryptocurrencies is mainly caused by market uncertainty in response to the 2018 cryptocurrency crash (Aslanidis et al. 2019; Antonakakis et al. 2019) and the launch on 10 December 2017 of the Bitcoin futures contracts at the Chicago Board Options Exchange (Blau et al. 2020). Moreover, a significant drop in rolling window correlations can be observed at the beginning of 2021 in the cryptocurrency pairs ETH-XRP, LTC-XRP and BTC-XRP. Again, this is due to the SEC lawsuit Ripple faced. The above bivariate approach considers two return series at a time and, as such, cannot capture the dynamic interdependence among all four series simultaneously. To address this issue, we consider the multivariate GAS and DCC models in the next section.

3.1 In-sample results

For notational convenience, let ${\varvec{r}}_t=(r_{1},r_{2},r_{3},r_{4})$ be the returns of the four assets ETH, LTC, BTC and XRP at time t, and let $\rho _{12}$, $\rho _{13}$, $\rho _{14}$, $\rho _{23}$, $\rho _{24}$ and $\rho _{34}$ be the correlations between ETH and LTC, ETH and BTC, ETH and XRP, LTC and BTC, LTC and XRP, and BTC and XRP, respectively. We fit the multivariate return series ${\varvec{r}}_t$ with both the multivariate GAS(1,1) model and the DCC-GARCH(1,1) model (hereafter GAS and DCC) described in Sect. 2. Based on the fat-tailed, leptokurtic empirical distributions reported in Table 1, the conditional distribution of ${\varvec{r}}_t$ in the GAS model is specified as the multivariate standardized Student-t distribution; the univariate and multivariate residuals in the DCC model are also specified by the t-distribution.

Asymmetric GARCH specifications including GJR and EGARCH models are also considered for both GAS and DCC models. Interestingly, the additional parameters in these models, which are supposed to capture the asymmetric volatility response to past returns (the so-called leverage effect), are not significant for any of the cryptocurrencies in this paper. These results are consistent with those found in Chi and Hao (2021) and Syuhada and Hakim (2020). Several studies apply asymmetric GARCH models to cryptocurrency returns; however, they either use a GARCH-type model with Gaussian innovations (Cheikh et al. 2020) or report only weakly significant additional terms, which are supposed to reflect the asymmetry (Apergis 2021).

Table 2 The LR test results for the multivariate GAS model

For the GAS model, the conditional distribution parameters are as follows:

$$ {\varvec{\theta }}=(\mu _1, \mu _2, \mu _3, \mu _4, \sigma _1, \sigma _2, \sigma _3, \sigma _4, \rho _{12}, \rho _{13}, \rho _{14},\rho _{23}, \rho _{24}, \rho _{34}, \nu ) $$

where $(\mu _1, \mu _2, \mu _3, \mu _4)$, $(\sigma _1, \sigma _2, \sigma _3, \sigma _4)$, $(\rho _{12}, \rho _{13}, \rho _{14},\rho _{23}, \rho _{24}, \rho _{34})$, $\nu $ are the location, scale/volatility, correlation and shape parameters of the conditional t-distribution, respectively. Following Chen and Xu (2019), we conduct a series of likelihood ratio tests (LRT) to see whether these parameters are time varying. We test the null hypothesis $H_0: M=M_i$ against the alternative hypothesis $H_1: M=M_{i+1}$ for $i=1,2,3,4$, where Model 1 to Model 5 are a series of nested time-varying parameter models, i.e. Model 1 assumes all the parameters are time-invariant, Model 2 is the time-varying volatility-only model and Model 5 is the model with time-varying volatility, correlation, location and shape. Clearly, $M_1\subset M_2 \subset M_3 \subset M_4 \subset M_5$, and under the usual regularity conditions, the LRT statistic follows a Chi-square distribution $\chi ^2_k$ with k degrees of freedom if $H_0$ is true. The LRT results are listed in Table 2. Model 5 appears to be a reasonable choice, i.e. the GAS model with time-varying volatility, correlation, location and shape is used for the return series ${\varvec{r}}_t$ over the in-sample period (2016 to 2018).
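A likelihood ratio comparison of two nested models can be sketched as follows. The statistic is taken as $2(\ell _{i+1}-\ell _i)$, where $\ell $ denotes the maximized log-likelihood, and k is the number of additional free parameters in the larger model; the log-likelihood values passed in are placeholders, not results from the paper. The chi-square tail probability is computed with the standard recurrence for the regularized upper incomplete gamma function, so only the standard library is needed.

```python
import math


def chi2_sf(x, k):
    """P(X > x) for a chi-square variable with integer k >= 1 degrees of freedom."""
    if x <= 0.0:
        return 1.0
    y = x / 2.0
    # Start from Q(1, y) = exp(-y) for even k, or Q(1/2, y) = erfc(sqrt(y)) for odd k,
    # then apply Q(a + 1, y) = Q(a, y) + y**a * exp(-y) / Gamma(a + 1).
    if k % 2 == 0:
        a, q = 1.0, math.exp(-y)
    else:
        a, q = 0.5, math.erfc(math.sqrt(y))
    while a < k / 2.0:
        q += y ** a * math.exp(-y) / math.gamma(a + 1.0)
        a += 1.0
    return q


def lr_test(loglik_null, loglik_alt, k):
    """Return the LR statistic and its chi-square(k) p-value."""
    stat = 2.0 * (loglik_alt - loglik_null)
    return stat, chi2_sf(stat, k)
```

For example, `lr_test(-100.0, -95.0, 2)` gives a statistic of 10 on two degrees of freedom, and a small p-value would favour the larger model.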

Table 3 Parameters estimation of the GAS model

The estimation results are presented in Table 3. All the parameters, especially the time-varying parameters of the model (left panel), are significant at the 5% level. We also present the unconditional parameters (right panel) by considering the long-term values of the parameters, i.e. $(I-\hat{B})^{-1}\hat{{\varvec{\kappa }}}$. With regard to the DCC model, similar estimation results are reported in Table 4. The parameters can be divided into two parts, the results of the GARCH model for each individual return series (upper panel) and the dynamic correlation using multivariate t distribution (lower panel).
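The long-term values $(I-\hat{B})^{-1}\hat{{\varvec{\kappa }}}$ are the fixed point of the updating equation when the score term is set to its expected value of zero. A minimal sketch, assuming a diagonal $\hat{B}$ so that the matrix inverse reduces to element-wise division (the numbers used in any call are placeholders, not estimates from Table 3):

```python
def long_run_values(kappa, b_diag):
    """Solve (I - B) theta = kappa element by element for a diagonal B."""
    return [k / (1.0 - b) for k, b in zip(kappa, b_diag)]
```

Each returned value satisfies theta = kappa + b * theta, and it is only meaningful when every diagonal entry of B is below one in absolute value.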

In Figs. 3, 4, 5 and 6, we plot the estimated volatilities for ETH, LTC, BTC and XRP, respectively, using both the GAS and DCC models during the in-sample period. For all four return series, the DCC model seems to produce more strongly fluctuating volatility estimates than the GAS model, especially during the 2018 crash period. Clearly, extreme returns appear to have a strong effect on the estimated volatilities of the GARCH models, whereas those of the GAS model appear to be robust.

The correlation estimates from the two models, which are presented in Figs. 7 and 8, show a substantial difference, though both models identify a significant persistence of correlations at high positive values between the cryptocurrencies since 2018. The GAS model suggests, in general, positive correlations, varying from -0.15 to 1 between three series, while the DCC model gives correlations fluctuating substantially over time, falling to extreme values around -0.6 during June 2016, which is mainly caused by the instability of the Ethereum prices due to the DAO hack. It is worth noting that the dynamic correlations we derive from the DCC multivariate modelling approach appear to be similar to the rolling correlations we estimate in the previously described bivariate setting, while the GAS approach produces smoother correlation estimates owing to its robustness.

3.2 Out-of-sample results

We now turn to the out-of-sample (OOS) forecast performance of the two models. We compare the one-step-ahead forecasting performance of the GAS model and DCC model using a rolling window scheme. The length of the rolling estimation window is set to be 1096 observations, such that 1096 observations (from January 1 2019, until December 31 2021) are left for out-of-sample forecast evaluation.

Table 4 Parameters estimation of the DCC model

3.2.1 Volatility and correlation forecast evaluation

To evaluate the forecasting performance of the two models, we construct two measures of realized volatility and correlation using intraday data. The realized volatility is computed as the sum of squared intraday returns (see, e.g. Andersen et al. 2001),

$$\begin{aligned} RV_t=\sum ^{N_t}_{i=1}r^2_{t,i} \end{aligned}$$

(7)

where $r_{t,i}$ is the intraday return on day t for intraday period i ($i=1,2,\cdots ,N_t$). We use transaction prices of ETH, LTC, BTC and XRP from January 2019 to December 2021, sampled in calendar time and tick-time with 5-minute sampling frequency^{Footnote 3}. The intraday return data are obtained from the Bitfinex exchange^{Footnote 4}, using a Python script. The realized correlation^{Footnote 5} is calculated as:

$$\begin{aligned} RC_{xy,t}=\frac{\sum \nolimits ^{N_t}_{i=1}r_{x,t,i}r_{y,t,i}}{\sqrt{RV_{x,t}}\sqrt{RV_{y,t}}} \end{aligned}$$

where $r_{x,t,i}$ and $r_{y,t,i}$ are the intraday return series for cryptocurrencies X and Y on day t for intraday period i ($i=1,2,\cdots ,N_t$) and $RV_{x,t}$ and $RV_{y,t}$ are the realized volatilities for X and Y on day t.
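The two realized measures translate directly into code. The sketch below takes the intraday returns of one day as plain lists of equal length (for example 288 five-minute returns); the function names are illustrative.

```python
import math


def realized_variance(intraday_returns):
    """RV_t: sum of squared intraday returns for one day."""
    return sum(r * r for r in intraday_returns)


def realized_correlation(x_returns, y_returns):
    """RC_{xy,t}: realized covariance divided by the product of realized volatilities."""
    cov = sum(a * b for a, b in zip(x_returns, y_returns))
    return cov / math.sqrt(realized_variance(x_returns) * realized_variance(y_returns))
```

By the Cauchy–Schwarz inequality the realized correlation always lies between -1 and 1, provided both assets have at least one non-zero intraday return.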

Table 5 Results of out-of-sample forecasting accuracy

Following Patton (2011), we use two popular and robust loss functions, the mean square error (MSE) and the Gaussian quasi-likelihood (QLIKE), to compare the forecast accuracy of the GAS and DCC models on the out-of-sample data. These two loss functions are given by

$$\begin{aligned} \text {MSE}_{\sigma ^2}=\frac{1}{N}\sum _{i=1}^{N}\Big (\sigma _i^2-\hat{\sigma }_i^2\Big )^2,\;\;\;\; \text {MSE}_{\rho }=\frac{1}{N}\sum _{i=1}^{N}\Big (\rho _i-\hat{\rho }_i\Big )^2 \end{aligned}$$

(8)

and

$$\begin{aligned} \text {QLIKE}_{\sigma ^2}=\frac{1}{N}\sum _{i=1}^{N}\Bigg (\log (\hat{\sigma }_i^2)+\frac{\sigma _i^2}{\hat{\sigma }_i^2}\Bigg ),\;\;\;\; \text {QLIKE}_{\rho }=\frac{1}{N}\sum _{i=1}^{N}\Bigg (\log (\hat{\rho }_i)+\frac{\rho _i}{\hat{\rho }_i}\Bigg ), \end{aligned}$$

(9)

where $\hat{\sigma }_i^2$ and $\hat{\rho }_i$ are the rolling forecasts of volatility and correlation for day i from the two models, and $\sigma _i^2$ and $\rho _i$ are the realized volatility and correlation on day i, respectively. N is the total number of volatility/correlation forecasts. We also use the Diebold and Mariano (1995) test for the null hypothesis that the forecasts of the GAS model are less accurate than or equal to the forecasts of the DCC model.
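The two loss functions in (8) and (9) can be sketched as follows, with the realized measure as the target and the model forecast as the prediction. The function names are illustrative, and QLIKE requires strictly positive forecasts because of the logarithm.

```python
import math


def mse(realized, forecast):
    """Mean squared error between realized values and forecasts."""
    return sum((r - f) ** 2 for r, f in zip(realized, forecast)) / len(realized)


def qlike(realized, forecast):
    """Gaussian quasi-likelihood loss: mean of log(f) + r / f (forecasts must be > 0)."""
    return sum(math.log(f) + r / f for r, f in zip(realized, forecast)) / len(realized)
```

For a given realized value, both losses are minimized when the forecast equals that value; QLIKE penalizes under-prediction more heavily than over-prediction, whereas MSE is symmetric.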

Table 5 reports the OOS losses for volatility and correlation, using the loss functions in (8) and (9), for the GAS and DCC models. The Diebold–Mariano statistics on the loss differences are also presented to see whether the gains are statistically significant. Overall, the GAS model forecasts volatility and correlation better than the DCC model. Judging by the MSE and QLIKE, the GAS model delivers significantly better correlation forecasts than the DCC model, although the two models provide similar correlation forecasts between the BTC and XRP return series in terms of MSE.

The comparison of volatility forecasts between the two models is mixed. The MSE favours the GAS model for all volatilities, while the QLIKE supports the GAS model for XRP volatility only. There is no evidence of a significant difference in volatility forecasts for ETH, LTC and BTC in terms of QLIKE. These results are further confirmed by the plots. The difference in correlation forecasts between the two models can be seen across the whole OOS period (Figs. 9 and 10), while the volatility forecasts of BTC are similar for both models (Figs. 11, 12, 13 and 14). Note that the DCC model consistently gives large volatility forecasts for all three return series when there are large changes in the return series.

Interestingly, we find that, on average, for both models, the dynamic correlation forecasts between cryptocurrencies behave similarly in all pairs. With the GAS model, the correlations remain positive and at high levels with a few fluctuations across the whole OOS period, while those from the DCC model show more sensitive dynamics, especially after January 2020. This could be considered a consequence of the COVID-19 effect on cryptocurrencies. In particular, during January 2020 to May 2020, weak correlation forecasts can be observed between XRP and other cryptocurrencies using both models, which is, again, due to the SEC lawsuit.

3.2.2 Density forecast evaluation

To extend the comparison, we use the estimated results for each of the models in the previous section to obtain one-step-ahead density forecasts, and the evaluation is based on scoring rules, which are widely used in weather and climate prediction (Palmer 2012) and financial risk management (Groen et al. 2013). Let ${\textbf {y}}=(y^{(1)},\cdots ,y^{(N)})$ be an observation of the N-dimensional random vector, let f(.) denote a forecast density of ${\textbf {y}}$, let $\Omega $ denote the set of possible values of ${\textbf {y}}$, and let $\mathcal {F}$ denote a convex class of probability distributions on $\Omega $. A scoring rule is a loss function:

$$\begin{aligned} S(f,y):\mathcal {F}\times \Omega \rightarrow \mathbb {R}\cup \{\infty \} \end{aligned}$$

such that a better forecast yields a lower score. A scoring rule S is said to be proper if the expected score is optimized when the true distribution of the observation is issued as a forecast, i.e.

$$\begin{aligned} \mathbb {E}_gS(g,\cdot )\le \mathbb {E}_gS(f,\cdot ) \end{aligned}$$

(10)

for all $f,g\in \mathcal {F}$. Furthermore, a scoring rule is called strictly proper if equality in (10) holds only if $f=g$.

A natural approach is the logarithmic score (Good 1952; Mitchell and Hall 2005; Amisano and Giacomini 2007), which is defined as:

$$\begin{aligned} Log S(f,y)=-\log f({\textbf {y}}). \end{aligned}$$

(11)

However, the logarithmic score is not sensitive to distance, which means it only rewards predictive densities for assigning high probabilities to the realized values but not to neighbouring values. To overcome this problem, Gneiting and Raftery (2007) introduce the energy score, which is a generalization of the univariate continuous ranked probability score (CRPS) and allows for a direct comparison of density forecasts. The energy score is defined as:

$$\begin{aligned} ES(f,y)=E\left( \Vert Y-{\textbf {y}}\Vert ^{\beta }\right) -\frac{1}{2}E\left( \Vert Y-\tilde{Y}\Vert ^{\beta }\right) \end{aligned}$$

(12)

where $\tilde{Y}$ is an independent copy of Y, so it is drawn independently from the same distribution f(.) as Y, $\Vert .\Vert $ is the Euclidean norm. Gneiting and Raftery (2007) show that the energy score is strictly proper with $\beta \in (0,2)$. In application, $\beta =1$ seems to be a standard choice and the score is usually calculated through Monte Carlo methods.
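A Monte Carlo approximation of the energy score in (12) replaces both expectations with averages over draws from the forecast distribution. The sketch below uses the simple estimator that averages over all pairs of draws (including identical pairs, which contribute zero); this choice of estimator and the function names are assumptions, and the draws are expected as a list of equal-length vectors.

```python
import math


def euclid(u, v):
    """Euclidean distance between two vectors of equal length."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def energy_score(samples, y, beta=1.0):
    """Monte Carlo energy score: E||Y - y||^beta - 0.5 * E||Y - Y'||^beta."""
    m = len(samples)
    term1 = sum(euclid(x, y) ** beta for x in samples) / m
    term2 = sum(
        euclid(samples[i], samples[j]) ** beta for i in range(m) for j in range(m)
    ) / (m * m)
    return term1 - 0.5 * term2
```

The double loop costs on the order of m squared distance evaluations, so with thousands of draws per day it is common to subsample pairs; that refinement is omitted here.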

Pinson and Tastu (2013) show that the discrimination ability of the energy score may be limited when the dependence structure of multivariate probabilistic forecasts is misspecified. To overcome this problem, Scheuerer and Hamill (2015) propose the variogram score, which is based on pairwise differences:

$$\begin{aligned} VS(f,y)=\sum _{i,j=1}^{N}w_{ij}\left( \vert y_i-y_j\vert ^p-E\vert x_i-x_j\vert ^p\right) ^2 \end{aligned}$$

(13)

where N is the dimension of the random vector ${\textbf {y}}$, $x_i$ and $x_j$ are the ith and jth components of a random vector ${\textbf {x}}$ drawn from the distribution f, and $w_{ij}$ are nonnegative weights that allow one to emphasize particular pairs of components; the standard choice is $w_{ij}=1$. $p>0$ is the order of the variogram score. The variogram score is proper relative to the class of distributions for which the 2p-th moments of all elements are finite, and it is not strictly proper (Scheuerer and Hamill 2015). In application, the choice of p is a trade-off between all relative moments of the pairwise deviation and outliers. Typical choices of p include 0.5 and 1.
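The variogram score in (13) can be approximated in the same way, replacing the expectation $E\vert x_i-x_j\vert ^p$ with an average over draws from the forecast distribution. This is a sketch with hypothetical function and argument names; unit weights are used when no weight matrix is supplied.

```python
def variogram_score(samples, y, p=0.5, weights=None):
    """Monte Carlo variogram score of order p for observation y and forecast draws."""
    n, m = len(y), len(samples)
    score = 0.0
    for i in range(n):
        for j in range(n):
            w = 1.0 if weights is None else weights[i][j]
            expected = sum(abs(x[i] - x[j]) ** p for x in samples) / m
            score += w * (abs(y[i] - y[j]) ** p - expected) ** 2
    return score
```

Because only pairwise differences enter, shifting every draw by the same constant leaves the score unchanged, which illustrates why the score is proper but not strictly proper: it cannot detect a bias common to all components.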

To test the null hypothesis of equal predictive ability of two competing models based on a given scoring rule, we consider Diebold and Mariano (1995) type tests using the score difference. Given a scoring rule S, the score difference is defined as:

$$\begin{aligned} d_{t}=S(\hat{f}_{1},{\textbf {y}}_{t})-S(\hat{f}_{2},{\textbf {y}}_{t}) \end{aligned}$$

where $\hat{f}_{1}$ and $\hat{f}_{2}$ are the density forecasts. The null hypothesis of equal scores is:

$$\begin{aligned} H_0: E(d_{t})=0, \text { for all }t \end{aligned}$$

versus the alternative $H_1: E(d_{t})\ne 0$. It can be shown that, under the null hypothesis and certain conditions (e.g. see Giacomini and White 2006), the following statistic is asymptotically standard normal:

$$\begin{aligned} DM=\frac{\bar{d}}{\sqrt{\hat{\sigma }^2/n}} \rightarrow N(0,1) \end{aligned}$$

(14)

where n is the forecast sample size, $\bar{d}=\frac{1}{n}\sum _{t=1}^{n}d_t$ and $\hat{\sigma }^2$ is a heteroskedasticity and autocorrelation-consistent variance estimator of $\sigma ^2=var(\sqrt{n}\bar{d})$.
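The statistic in (14) can be sketched as follows. The text does not say which heteroskedasticity and autocorrelation-consistent estimator is used, so this example assumes a Newey–West estimator with a Bartlett kernel and a user-chosen number of lags; with `lags=0` it reduces to the plain sample variance. Function names are illustrative.

```python
import math


def dm_statistic(d, lags=0):
    """Diebold-Mariano statistic for score differences d with a Bartlett-kernel HAC variance."""
    n = len(d)
    mean = sum(d) / n
    dev = [x - mean for x in d]
    lrv = sum(e * e for e in dev) / n  # lag-0 autocovariance
    for k in range(1, lags + 1):
        gamma_k = sum(dev[t] * dev[t - k] for t in range(k, n)) / n
        lrv += 2.0 * (1.0 - k / (lags + 1.0)) * gamma_k
    return mean / math.sqrt(lrv / n)


def two_sided_p_value(stat):
    """Two-sided p-value under the standard normal limit."""
    return math.erfc(abs(stat) / math.sqrt(2.0))
```

With the sign convention used for the score differences, a significantly negative statistic favours the first model in the difference.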

We apply the above three scores to evaluate and compare the density forecasts of the GAS and DCC models. For the variogram score, we present the results with different p values ($p=0.5, 1, 2$), as used in Scheuerer and Hamill (2015). The overall density forecast can be evaluated using the average score difference $\bar{d}$ over the whole out-of-sample period^{Footnote 6}, and the DM statistics are obtained using the log score in (11), the energy score in (12) and the variogram score in (13). The score difference $d_{t}$ is computed by subtracting the score of the DCC density forecast from the score of the GAS density forecast, such that negative values of $d_{t}$ indicate better predictive ability of the GAS model. Table 6 shows the average score differences $\bar{d}$ with the accompanying tests of equal predictive accuracy as in (14). These results clearly demonstrate that both energy and variogram scoring rules suggest superior density predictive ability of the GAS model. The large values of the average variogram score difference with $p=2$ are caused by the quadratic form of the score, and the results are in accord with the simulation studies by Scheuerer and Hamill (2015).

From the risk management point of view, it is also important to focus on the performance of density forecasts in the region of interest. Therefore, we compare the models in terms of correctly forecasting the 1% and 5% value-at-risk (VaR) at the 1-day horizon for both individual cryptocurrencies and different portfolios that can be constructed from the four cryptocurrencies. We define five different arbitrary portfolios, $p_{jt}=g_j'{\textbf {r}}_t$, for given $4\times 1$ weight vectors $g_j$, $j=1,2,3,4,5$. By ordering the cryptocurrencies as ETH, LTC, BTC and XRP, we construct the following long-only and long-short portfolios: $g_1=(1/4,1/4,1/4,1/4)$, $g_2=(1/4,1/4,1/4,-1/4)$, $g_3=(1/4,1/4,-1/4,1/4)$, $g_4=(1/4,-1/4,1/4,1/4)$ and $g_5=(-1/4,1/4,1/4,1/4)$. The long-short positions reflect relative value bets among these cryptocurrencies.

We simulate 10000 sample paths for ${\textbf {r}}_{t+1}=(r_1,r_2,r_3,r_4)'$, denoted by ${\textbf {r}}_{t+1}^s$ for $s=1,2,\cdots ,10000$ using the multivariate t distribution by the GAS and DCC models. We then construct the simulated individual returns $r^s_{i,t+1}$ for $i=1,2,3,4$ and portfolio returns $p^s_{j,t+1}=g_j'{} {\textbf {r}}_{t+1}^s$ for $j=1,2,3,4,5$. We use the sample of 10000 simulated paths to estimate the quantiles of the forecasting distribution at the 1-day horizon. The out-of-sample VaR accuracy is assessed through the unconditional coverage (UC) test (Kupiec 1995) and the conditional coverage (CC) test (Christoffersen 1998).
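The simulation step can be sketched as follows, assuming the one-step-ahead location vector `mu`, scale matrix `scale` and degrees of freedom `nu` have already been forecast by a fitted model. All numerical inputs in the demo are hypothetical placeholders, the function names are illustrative, and VaR is reported here as a return quantile (a negative number), which is one of several possible conventions. Note that `scale` is the scale matrix of the Student-t distribution, not its covariance, which equals `nu / (nu - 2)` times the scale matrix.

```python
import math
import random


def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                low[i][j] = math.sqrt(a[i][i] - s)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return low


def simulate_mvt(mu, scale, nu, n_sims, rng):
    """Draw n_sims vectors from a multivariate Student-t distribution."""
    low = cholesky(scale)
    k = len(mu)
    draws = []
    for _ in range(n_sims):
        z = [rng.gauss(0.0, 1.0) for _ in range(k)]
        # A chi-square(nu) variate is Gamma(shape=nu/2, scale=2).
        w = rng.gammavariate(nu / 2.0, 2.0)
        mix = math.sqrt(nu / w)
        draws.append([
            mu[i] + mix * sum(low[i][j] * z[j] for j in range(i + 1))
            for i in range(k)
        ])
    return draws


def empirical_quantile(values, alpha):
    """Empirical alpha-quantile (lower order statistic) of a sample."""
    ordered = sorted(values)
    idx = max(0, math.ceil(alpha * len(ordered)) - 1)
    return ordered[idx]


def portfolio_var(draws, weights, alpha):
    """alpha-quantile of the simulated portfolio return g' r."""
    port = [sum(w * r for w, r in zip(weights, draw)) for draw in draws]
    return empirical_quantile(port, alpha)


# Portfolio weights g_1, ..., g_5 from the text (order: ETH, LTC, BTC, XRP).
WEIGHTS = [
    [0.25, 0.25, 0.25, 0.25],
    [0.25, 0.25, 0.25, -0.25],
    [0.25, 0.25, -0.25, 0.25],
    [0.25, -0.25, 0.25, 0.25],
    [-0.25, 0.25, 0.25, 0.25],
]

if __name__ == "__main__":
    rng = random.Random(0)
    mu = [0.0, 0.0, 0.0, 0.0]  # hypothetical forecast location
    # Hypothetical forecast scale matrix with equal cross-correlation of 0.6.
    scale = [[0.0025 if i == j else 0.0015 for j in range(4)] for i in range(4)]
    draws = simulate_mvt(mu, scale, nu=5.0, n_sims=10000, rng=rng)
    for j, g in enumerate(WEIGHTS, start=1):
        print(j, portfolio_var(draws, g, 0.01), portfolio_var(draws, g, 0.05))
```

Individual VaR forecasts follow from the same draws by using a unit weight vector, for example `[1, 0, 0, 0]` for the first asset.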

Table 6 Average score differences and tests of equal predictive accuracy


Table 7 Results of out-of-sample VaR forecasting performance


Table 7 presents the UC and CC test statistics and the corresponding p values of the 5% and 1% VaR forecasts for both individual returns (upper panel) and four portfolios (lower panel). For the individual VaR forecasts, all results, except those for the BTC return series, suggest that the GAS model performs better than the DCC model at the 1% and 5% quantile levels. The GAS and DCC models provide the same results for the BTC return: the 1% VaR forecasts perform reasonably well, but the 5% VaR forecasts are rejected by both tests. Meanwhile, the GAS model generally outperforms the DCC model for all portfolios in the forecasting experiment. The only exceptions are the portfolios with weights $g_2=(1/4,1/4,1/4,-1/4)$ and $g_5=(-1/4,1/4,1/4,1/4)$ at the 5% level and the portfolio with weights $g_4=(1/4,-1/4,1/4,1/4)$ at the 1% level, for which both models perform poorly.
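For readers who want to reproduce this kind of backtest, the two likelihood-ratio tests can be sketched as below. The input `hits` is assumed to be a 0/1 sequence with a 1 on each day the realized return falls below the VaR forecast. The function names are illustrative, and the conditional coverage statistic is formed in the common way as the sum of the unconditional coverage and independence statistics. This is a sketch of the textbook tests, not the code behind Table 7.

```python
import math


def _xlogy(x, y):
    """x * log(y) with the convention 0 * log(0) = 0."""
    return 0.0 if x == 0 else x * math.log(y)


def kupiec_uc(hits, alpha):
    """Unconditional coverage LR test (Kupiec 1995), chi-square with 1 df."""
    n = len(hits)
    x = sum(hits)
    pi_hat = x / n
    ll_null = _xlogy(n - x, 1.0 - alpha) + _xlogy(x, alpha)
    ll_alt = _xlogy(n - x, 1.0 - pi_hat) + _xlogy(x, pi_hat)
    lr = max(0.0, -2.0 * (ll_null - ll_alt))
    # Survival function of chi-square(1): erfc(sqrt(x / 2)).
    return lr, math.erfc(math.sqrt(lr / 2.0))


def christoffersen_cc(hits, alpha):
    """Conditional coverage LR test (Christoffersen 1998), chi-square with 2 df."""
    counts = {(0, 0): 0, (0, 1): 0, (1, 0): 0, (1, 1): 0}
    for prev, curr in zip(hits[:-1], hits[1:]):
        counts[(prev, curr)] += 1
    n00, n01 = counts[(0, 0)], counts[(0, 1)]
    n10, n11 = counts[(1, 0)], counts[(1, 1)]
    # Transition probabilities of a first-order Markov chain for the hits.
    pi01 = n01 / (n00 + n01) if n00 + n01 else 0.0
    pi11 = n11 / (n10 + n11) if n10 + n11 else 0.0
    pi = (n01 + n11) / (n00 + n01 + n10 + n11)
    ll_null = _xlogy(n00 + n10, 1.0 - pi) + _xlogy(n01 + n11, pi)
    ll_alt = (_xlogy(n00, 1.0 - pi01) + _xlogy(n01, pi01)
              + _xlogy(n10, 1.0 - pi11) + _xlogy(n11, pi11))
    lr_ind = max(0.0, -2.0 * (ll_null - ll_alt))
    lr_uc, _ = kupiec_uc(hits, alpha)
    lr_cc = lr_uc + lr_ind
    # Survival function of chi-square(2): exp(-x / 2).
    return lr_cc, math.exp(-lr_cc / 2.0)
```

A small p value in the UC test indicates that the violation frequency differs from the nominal level, while the CC test additionally penalizes clustered violations.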

In Figs. 15 and 16, we show the 1% and 5% VaR estimates against the realized returns for portfolio 1, i.e. the long-only portfolio with equal weights for the four cryptocurrencies. We observe that typically the VaR estimates based on the DCC models are more extreme, confirming that the DCC model significantly overestimates the risk at both 5% and 1% quantile levels, especially when the return changes are large (e.g. April 2020 and May 2021). These results are in accordance with previous findings (Creal et al. 2011). The estimates of the DCC model are based on lagged squared returns and the forecasts thus move stochastically every day. However, the updating equation in the GAS model with the Student-t density provides a more moderate increase in the variance/correlation for a large absolute realization of return. The forecasts using the GAS model naturally inherit the return information. Overall, we conclude that the GAS model has better out-of-sample forecasting behavior.

4 Conclusion

We have investigated the co-dependence and portfolio VaR of cryptocurrencies using four popular virtual currencies (Bitcoin, Ethereum, Litecoin and Ripple). The results of the multivariate GAS model show strong dynamic interdependence among the cryptocurrencies throughout the sample period. Our out-of-sample forecasting period notably includes the COVID-19 outbreak period, which lasted from early 2020 to the end of 2021. The analysis thus sheds new light on the multivariate risk measures of cryptocurrencies for global investors.

We examine the out-of-sample predictive performance of the multivariate GAS model for a range of financial assets at various quantile levels. Using a battery of scoring rules and backtesting procedures, our results show that the GAS model significantly outperforms the traditional DCC-GARCH model. These results still hold if different cryptocurrencies are considered. There is plenty of room for future research on the analysis of cryptocurrencies, especially during financial turmoil. We can extend the existing scoring rules (especially in multivariate cases) to a more flexible form to cover a particular region of the density. An alternative extension could explore the safe-haven properties of cryptocurrencies, stablecoins and traditional assets. Under this framework, the dynamic correlations and the portfolio diversification can be studied systematically.

Notes

Since 2019, China’s central bank has announced that all transactions of cryptocurrencies are illegal, effectively banning digital tokens such as Bitcoin. As one of the consequences, the price of major cryptocurrencies has dropped sharply.
CryptoCompare’s real-time aggregate index methodology (CCCAGG) calculates the market price of cryptocurrency pairs traded across exchanges. Aggregating transaction data from more than 250 exchanges, CryptoCompare uses a 24-hour volume-weighted average for every currency pair.
Liu et al. (2015) find that it is difficult to significantly beat the realized volatility (RV) using 5-minute intervals.
We downloaded the intraday data from different exchanges (e.g. Bitstamp, Coinbase) and the results using these data are the same.
Andersen et al. (2001) introduced the realized covariance and the realized correlation comes from the realized covariance divided by the square roots of the realized volatilities in (7).
Note that all the scores discussed above are proper (Gneiting and Raftery 2007), which means that, for negatively oriented scores, no incorrect density forecast $\hat{f}_t$ receives a lower average score than the true density.

References

Amisano G, Giacomini R (2007) Comparing density forecasts via weighted likelihood ratio tests. J Bus Econ Stat 25:177–190
Andersen T, Bollerslev T, Diebold F, Labys P (2001) The distribution of realized exchange rate volatility. J Am Stat Assoc 96:42–55
Antonakakis N, Chatziantoniou I, Gabauer D (2019) Cryptocurrency market contagion: market uncertainty, market complexity, and dynamic portfolios. J Int Financ Markets, Inst Money 61:37–51
Apergis N (2021) COVID-19 and cryptocurrency volatility: evidence from asymmetric modelling. Financ Res Lett. https://doi.org/10.1016/j.frl.2021.102659
Ardia D, Boudt K, Catania L (2019) Generalized autoregressive score models in R: the GAS package. J Stat Softw 88(6):1–28
Aslanidis N, Bariviera AF, Martinez-Ibanez O (2019) An analysis of cryptocurrencies conditional cross correlations. Financ Res Lett 31:130–137
Avramov D, Chordia T, Goyal A (2006) The impact of trades on daily volatility. Rev Financ Stud 19:1241–1277
Blau B, Griffith T, Whitby R (2020) Comovement in the Cryptocurrency market. Econom Bull 40(1):448–455
Boako G, Tiwari AK, Roubaud D (2019) Vine copula-based dependence and portfolio value-at-risk analysis of the cryptocurrency market. Int Econom 158:77–90
Bollerslev T (1986) Generalized autoregressive conditional heteroskedasticity. J Econom 31:307–327
Bouri E, Gupta R (2017) Does Bitcoin hedge global uncertainty? Evidence from wavelet-based quantile-in-quantile regression. Financ Res Lett 23:87–95
Bouri E, Gupta R, Lau CKM, Roubaud D, Wang S (2018) Bitcoin and global financial stress: a copula-based approach to dependence and causality in the quantiles. Q Rev Econ Finance 69:297–307
Cheikh NB, Zaied YB, Chevallier J (2020) Asymmetric volatility in cryptocurrency markets: new evidence from smooth transition GARCH models. Financ Res Lett 35:101293
Chen R, Xu J (2019) Forecasting volatility and correlation between oil and gold prices using a novel multivariate GAS model. Energy Econom 78:379–391
Chi Y, Hao W (2021) Volatility models for cryptocurrencies and applications in the options market. J Int Financial Mark Inst Money 75:101421
Christoffersen PF (1998) Evaluating Interval Forecasts. Int Econ Rev 39(4):841–862
Chowdhury MSR, Damianov DS, Elsayed AH (2022) Bubbles and crashes in cryptocurrencies: interdependence, contagion, or asset rotation? Financ Res Lett 46:102494
Conlon T, McGee R (2020) Safe Haven or Risky Hazard? Bitcoin during the COVID-19 Bear Market. Financ Res Lett 35:3560361
Corbet S, Lucey B, Yarovaya L (2018) Datestamping the Bitcoin and Ethereum bubbles. Financ Res Lett 26:81–88
Creal DD, Koopman SJ, Lucas A (2011) A dynamic multivariate Heavy-Tailed model for time-varying volatilities and correlations. J Bus Econom Stat 29(4):552–563
Creal DD, Koopman SJ, Lucas A (2013) Generalized autoregressive score models with applications. J Appl Economet 28(5):777–795
Diebold FX, Mariano RS (1995) Comparing predictive accuracy. J Bus Econom Stat 13(3):253–263
Engle RF (1982) Autoregressive conditional heteroskedasticity with estimates of the variance of United Kingdom inflation. Econometrica 50(4):987–1008
Engle RF (2002) Dynamic conditional correlation: a simple class of multivariate generalized autoregressive conditional heteroskedasticity models. J Bus Econom Stat 20(3):339–350
Engle RF, Russell JR (1998) Autoregressive conditional duration: a new model for irregularly spaced transaction data. Econometrica 66(5):1127–1162
Ghabri Y, Guesmi K, Zantour A (2021) Bitcoin and liquidity risk diversification. Financ Res Lett 40:101679
Giacomini R, White H (2006) Tests of conditional predictive ability. Econometrica 74:1545–1578
Gneiting T, Raftery AE (2007) Strictly proper scoring rules, prediction, and estimation. J Am Stat Assoc 102(477):359–378
Good IJ (1952) Rational Decisions. J Royal Stat Soc B 14(1):107–114
Groen JJ, Paap R, Ravazzolo F (2013) Real-time inflation forecasting in a changing world. J Bus Econom Stat 31(1):29–44
Guesmi K, Saadi S, Abid I, Ftiti Z (2019) Portfolio diversification with virtual currency: evidence from bitcoin. Int Rev Financ Anal 63:431–437
Katsiampa P (2017) Volatility estimation for bitcoin: a comparison of GARCH models. Econ Lett 158:3–6
Katsiampa P (2019) An empirical investigation of volatility dynamics in the cryptocurrency market. Res Int Bus Financ 50:322–333
Katsiampa P, Corbet S, Lucey B (2019) High frequency volatility co-movements in cryptocurrency markets. J Int Financ Mark, Inst Money 62:35–52
Kupiec PH (1995) Techniques for verifying the accuracy of risk measurement models. J Deriv 3(2):73–84. https://doi.org/10.3905/jod.1995.407942
Liu L, Patton A, Sheppard K (2015) Does anything beat 5-minute RV? A comparison of realized measures across multiple asset classes. J Econom 187(1):293–311
Mitchell J, Hall SG (2005) Evaluating, comparing and combining density forecasts using the KLIC with an application to the Bank of England and NIESR ‘fan’ charts of inflation. Oxford Bull Econ Stat 67:995–1033
Nadarajah S, Chu J (2017) On the inefficiency of Bitcoin. Econ Lett 150:6–9
Pace PD, Rao J (2023) Comovement and instability in cryptocurrency markets. Int Rev Econ Financ 8:173–200
Palmer T (2012) Towards the probabilistic earth-system simulator: a vision for the future of climate and weather prediction. Q J R Meteorol Soc 138(665):841–861
Patton AJ (2011) Volatility forecast comparison using imperfect volatility proxies. J Econom 160(1):246–256
Pinson P, Tastu J (2013) Discrimination ability of the Energy score, Technical University of Denmark. DTU Compute-Technical Report-2013 No. 15
Scheuerer M, Hamill TM (2015) Variogram-based proper scoring rules for probabilistic forecasts of multivariate quantities. Mon Weather Rev 143(4):1321–1334
Stavroyiannis S, Babalos V (2017). Dynamic Properties of the Bitcoin and the US Market. https://doi.org/10.2139/ssrn.2966998
Syuhada K, Hakim A (2020) Modeling risk dependence and portfolio VaR forecast through vine copula for cryptocurrencies. PLoS ONE 15(12):e0242102
Tafakori L, Pourkhanali A, Fard FA (2018) Forecasting spikes in electricity return innovations. Energy 150:508–526
Tang K, Xiong W (2012) Index investment and the financialization of commodities. Financ Anal J 68(6):54–74
Tran VL, Leirvik T (2020) Efficiency in the markets of crypto-currencies. Financ Res Lett 35:101382

Acknowledgements

I am grateful to Robert Kunst and the anonymous referee, whose constructive and helpful comments have significantly improved the paper. This work is supported by the Faculty of Natural Sciences Research Development Fund, Keele University.

Author information

Authors and Affiliations

School of Computing and Mathematics, Keele University, MacKay Building, Keele, ST5 5BG, UK
Jie Cheng


Corresponding author

Correspondence to Jie Cheng.

Ethics declarations

Competing interest

The author has no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Cheng, J. Modelling and forecasting risk dependence and portfolio VaR for cryptocurrencies. Empir Econ 65, 899–924 (2023). https://doi.org/10.1007/s00181-023-02360-7


Received: 05 May 2022
Accepted: 02 January 2023
Published: 16 January 2023
Issue Date: August 2023
DOI: https://doi.org/10.1007/s00181-023-02360-7
