Abstract
The multi variate mixture dynamics model is a tractable, dynamical, arbitrage-free multivariate model characterized by transparency on the dependence structure, since closed form formulae for terminal correlations, average correlations and copula function are available. It also allows for complete decorrelation between assets and instantaneous variances. Each single asset is modelled according to a lognormal mixture dynamics model, and this univariate version is widely used in the industry due to its flexibility and accuracy. The same property holds for the multivariate process of all assets, whose density is a mixture of multivariate basic densities. This allows for consistency of single asset and index/portfolio smile. In this paper, we generalize the MVMD model by introducing shifted dynamics and we propose a definition of implied correlation under this model. We investigate whether the model is able to consistently reproduce the implied volatility of FX cross rates once the single components are calibrated to univariate shifted lognormal mixture dynamics models. We consider in particular the case of the Chinese Renminbi FX rate, showing that the shifted MVMD model correctly recovers the CNY/EUR smile given the EUR/USD smile and the USD/CNY smile, thus highlighting that the model can also work as an arbitrage free volatility smile extrapolation tool for cross currencies that may not be liquid or fully observable. We compare the performance of the shifted MVMD model in terms of implied correlation with those of the shifted simply correlated mixture dynamics model where the dynamics of the single assets are connected naively by introducing correlation among their Brownian motions. Finally, we introduce a model with uncertain volatilities and correlation. The Markovian projection of this model is a generalization of the shifted MVMD model.
1 Introduction to the multivariate mixture dynamics
The multi variate mixture dynamics model (MVMD) introduced by Brigo et al. (2004) and recently described in a deeper way in Brigo et al. (2014) is a tractable dynamical arbitrage-free model defined as the multidimensional version of the lognormal mixture dynamics model (LMD) in Brigo and Mercurio (2000, 2001) (see also Brigo et al. 2003). The single-asset LMD model is a no-arbitrage model widely used among practitioners because of its practical advantages in calibration and pricing (analytical formulae for European options, explicit expression for the local volatility) and of its flexibility and accuracy. In fact, a variant of this model is presently used in the calibration of implied volatility surfaces for single stocks and equity indices in the Bloomberg terminal (Bloomberg 2012), and in the subsequent pricing of European, American and path-dependent options on single assets and baskets of assets. However, LMD is a local volatility model, and as such it suffers from limitations of that class of models. Stochastic volatility models have been shown to be more suitable to capture the correct smile dynamics, especially as far as future volatility smiles implied by the model are concerned. Models like LMD can be useful for example in risk management, where one is more interested in tractability and the overall calibration at time zero rather than in fine tuning the dynamics for a very specific valuation and hedging task.
In the multidimensional setting, the main advantage of the MVMD over other multidimensional models, such as e.g., the Wishart model (Da Fonseca et al. 2007; Gourieroux 2007) is in its tractability and flexibility which allows the MVMD to calibrate index volatility smiles consistently with the univariate assets smiles. In addition, a full description of its dependence structure (terminal correlations, average correlations, copula functions) is available.
The MVMD model also enjoys some interesting properties of Markovian projection. First of all, the model can be seen as a Markovian projection of a model with uncertain volatilities denominated MUVM model. As a consequence, European option prices under the MVMD model can more easily be computed under the MUVM model instead. However, the MVMD model remains superior in terms of smoothness and dynamics. Secondly, the Geometric average basket under the MVMD model can be projected into a univariate lognormal mixture dynamics model. Consequently, European option prices on the basket can be easily computed through the Black and Scholes formula.
Finally, under the MVMD model, the terminal correlation between assets and squared volatilities is zero. This mitigates the common drawback of local volatility models of having perfect instantaneous correlation between assets and squared volatilities.
In this paper we generalize the MVMD model, including shifts to the dynamics of the single assets, and we study the correlation skew under this framework.
Before going into details, we recapitulate the definition of the MVMD model (in the non-shifted case), starting with the univariate LMD model and then generalizing to the multidimensional case.
1.1 The volatility smile mixture dynamics model for single assets
Given a maturity \(T>0\), we denote by P(0, T) the price at time 0 of the zero-coupon bond maturing at T, and by \((\Omega ,{{{\mathcal {F}}}},{\mathbb {P}})\) a probability space with a filtration \(({{{{\mathcal {F}}}}}_t)_{t\in [0,T]}\) which is \({\mathbb {P}}\)-complete and satisfying to the usual conditions. We assume the existence of a measure \({\mathbb {Q}}\) equivalent to \({\mathbb {P}}\), called risk–neutral or pricing measure, ensuring absence of arbitrage opportunities in the classical setup, for example, of Harrison and Kreps (1979) and Harrison and Pliska (1981). In this framework we consider N diffusion processes \(Y^i(t)\) with dynamics
and a deterministic initial value \(Y^i(0),\) marginal densities \(p_t^{i}\) and diffusion coefficient \(v_i\). The processes \(Y^i\) do not model financial variables but are introduced only to refer to their probability densities when building mixtures of distributions later on. In this sense we term these processes “instrumental”. We define \(S_t\) as the solution of
where s is a local volatility function, namely a deterministic function of t and S only, and it is computed so that the marginal density \(p_t\) of S(t) is a linear convex combination of the densities \(p_t^{i}\) of the instrumental processes (Brigo and Mercurio 2000, 2001, 2002):
In what follows we restrict ourselves to the case
with \(\sigma ^i\) deterministic. Note that we have denoted the density \(p^i\) with \(\ell ^i\) in this lognormal case. The parameter \(\mu \) is completely specified by \({\mathbb {Q}}\). If the asset is a stock paying a continuous dividend yield q and r is the time T constant risk-free rate, then \(\mu =r-q\). If the asset is an exchange rate and \(r_d\) and \(r_f\) are the (deterministic) domestic and foreign rates at time T, respectively, then \(\mu =r_d-r_f\). If the asset is a forward price, then \(\mu =0\).
Brigo and Mercurio (2001) proved that defining
and assuming a few additional nonstringent assumptions on the \(\sigma ^i\), the corresponding dynamics for \(S_t\) admits a unique strong solution.
Theorem 1
(Existence and uniqueness of solutions for the LMD model) Assume that all the real functions \(\sigma ^i(t)\), defined on the real numbers \(t \ge 0\), are once continuously differentiable and bounded from above and below by two positive real constants. Assume also that in a small initial time interval \(t \in [0, \epsilon ]\), \(\epsilon >0\), the functions \(\sigma ^i(t)\) have an identical constant value \(\sigma _0\). Then the lognormal mixture dynamics model (LMD) defined by
admits a unique strong solution and the forward Kolmogorov equation (Fokker Planck equation) for its density admits a unique solution satisfying (1.3), which is a mixture of lognormal densities.
An important consequence of the above construction is that European option prices on S can be written as linear combinations of Black-Scholes prices with weights \(\lambda ^i\). The same combination holds for the Greeks at time 0.
1.2 Combining mixture dynamics on several assets: SCMD
Consider now n different asset prices \(S_1 \dots S_n\) each calibrated to an LMD model, as in Eq. (1.6), and denote by \(\lambda _i^{k}\), \(\sigma _i^{k}\) the parameters relative to the k-th instrumental process of the asset i. There are two possible ways in order to connect the dynamics of the single assets into a multivariate model. The first more immediate way consists in introducing a non-zero quadratic covariation between the Brownian motions driving the LMD models of Eq. (1.6) for \(S_1 \dots S_n\) leading to the so-called SCMD model.
Definition 2
(SCMD model) We define the Simply Correlated multivariate Mixture Dynamics (SCMD) model for \({\underline{S}}= [S_1,\ldots ,S_n]\) as a vector of univariate LMD models, each satisfying Theorem 1 with diffusion coefficients \(s_1,\ldots ,s_n\) given by equation (1.6) and densities \(\ell _1,\ldots ,\ell _n\) applied to each asset, and connected simply through quadratic covariation \(\rho _{ij}\) between the Brownian motions driving assets i and j. This is equivalent to the following n-dimensional diffusion process where we keep the W’s independent and where we embed the Brownian covariation into the diffusion matrix \({\bar{C}}\), whose i-th row we denote by \({\bar{C}}_i\):
where T represents the transposition operator.
Assumption
We assume \(\rho =(\rho _{ij})_{i,j}\) to be positive definite.
It is evident from the previous construction that the SCMD is consistent with both the dynamics of the single assets \(S_i\) and the instantaneous correlation matrix \(\rho \). Moreover, we can easily simulate a path of S by exogenously computing \(\rho \) for example from historical data, assuming it constant over time and applying a naive Euler scheme. However an explicit expression for the density of \({\underline{S}}= [S_1,\ldots ,S_n]\) under the SCMD dynamics is not available. As a consequence, if we aim at computing prices of options whose payoff depends on the value at time T only we still need to simulate entire paths of \({\underline{S}}\) over the interval [0, T], which can be quite time consuming.
1.3 Lifting the mixture dynamics to asset vectors: MVMD
A different approach, still consistent with the single assets’ dynamics, lies in merging the dynamics of the single assets in such a way that the mixture property is lifted to the multivariate density and the corresponding model gains some further tractability property with respect to the SCMD model. This can be achieved by mixing in all possible ways the densities of the instrumental processes of each individual asset and by imposing the correlation structure \(\rho \) at the level of the single instrumental processes, rather than of the assets as we did for the SCMD model. This has important consequences on the actual structure of the correlation, see Brigo et al. (2004). Below we summarize the construction leading to the MVMD model, while referring to Brigo et al. (2014) for further details.
Assume we have calibrated an LMD model for each \(S_i(t)\): if \(p_{S_i(t)}\) is the density of \(S_i\), we write
where \((\ell _{i,t}^k)_{k}\) are the densities of \((Y_i^{k})_k\), instrumental processes for \(S_i\) evolving lognormally according to the stochastic differential equation:
For notational simplicity we assume the number of base densities \(N_i\) to be the same, N, for all assets. The exogenous correlation structure \(\rho _{ij}\) is given by the symmetric, positive–definite matrix \(\rho \).
Denote by \({{\underline{S}}}(t) = \left[ S_1(t),\ldots ,S_n(t)\right] ^T\) the vector of asset prices with
As we did for the one dimensional case, we look for a matrix A such that
or more explicitly
where \(\Xi ^{(k_1 \cdots k_n)}(t)\) is the integrated covariance matrix whose (i, j) element is
Computations show that if a solution exists, this must satisfy the definition below.
Definition 3
(MVMD model) The (Lognormal) multi variate mixture dynamics (MVMD) model is given by
\(\ell _{1,\ldots ,n;t}^{k_1,\ldots ,k_n}({\underline{x}}) := p_{\left[ Y_1^{k_1}(t),\ldots ,Y_n^{k_n}(t)\right] ^T}({\underline{x}})\) and defining B such that \(\rho =B B^T\), \(a = C B (C B)^T\),
where
From the previous definitions it is evident that the dynamics of the single assets \(S_i\) in the SCMD model are Markovian. On the other hand, under the MVMD model, while the dynamics of the whole vector S is Markovian, those of the single assets are not. This leads to more realistic dynamics.
Under mild assumptions, existence and uniqueness of a solution can be proved through the following Theorem.
Theorem 4
Assume that the volatilities \(\sigma _i^{k_i}(t)\) for all i are once continuously differentiable, uniformly bounded from below and above by two positive real numbers \({\tilde{\sigma }}\) and \({\hat{\sigma }}\) respectively, and that they take a common constant value \(\sigma _0\) for \(t \in [0,\epsilon ]\) for a small positive real number \(\epsilon \), namely
Assume also the matrix \(\rho \) to be positive definite. Then the MVMD n-dimensional stochastic differential equation (1.15) admits a unique strong solution. The diffusion matrix \(a(t,{\underline{x}})\) in (1.16) is positive definite for all t and x.
Before introducing the shifted version of the model, we would like to discuss briefly a physical measure dynamics associated with MVMD. If we replaced the drift parameters \(\mu \) with historical returns, leaving aside the fact that these are known to be difficult to estimate in a satisfactory way, we would immediately obtain a mixture dynamics under the physical measure. Gaussian mixtures for multivariate returns have been considered in the risk premia literature for risk parity portfolios, see for example Bruder at al. (2016) and references therein. In further work we could try to connect our approach to frameworks such as the mixture models used in Bruder at al. (2016).
2 Introducing a shift in MVMD
When modelling a one dimensional asset price through an LMD model, implied volatilities with minimum exactly at a strike equal to the forward asset price are the only possible. In order to gain greater flexibility and therefore move the smile minimum point from the ATM forward we can shift the overall density by a deterministic function of time, carefully chosen in order to preserve risk–neutrality and therefore guarantee no–arbitrage. This is the so–called shifted lognormal mixture dynamics model (Brigo and Mercurio 2002). Under this model the new asset-price process S is defined as
with \(\beta \) real constant and \(X_t\) satisfying (1.6). Under the assumption \(K- \beta e^{\mu T}>0\) the price at time 0 of a European call option with strike K and maturity T can be written as
and thus as a combination of Black and Scholes prices with strike \(K- \beta e^{\mu T}\). The model therefore preserves the same level of tractability as in the non shifted case with the advantage of gaining more flexibility.
Once each asset is calibrated to a shifted LMD model, we have two possibilities for reconstructing the dynamics of the multidimensional process. The first possibility is to reconnect the single assets by introducing a non-zero quadratic covariation between the Brownian motions (as we did for the SCMD model), leading to what we call the shifted SCMD model. The second possibility going on the same lines as the approach leading to the MVMD model, lies in applying the same shift \(\beta _i e^{\mu _i t}\) to each instrumental process \(Y_i^k\) of each asset \(X_i\)
where \(Y_i^k\) satisfies the dynamics in (1.10) (this is equivalent to applying the shift \(\beta _i e^{\mu _i t}\) directly to the i-th asset) and then mix the corresponding densities \(p_{S_i^k(t)}(x)\) in all possible ways. Computations similar to those for the non-shifted case show that if a solution exists, it must satisfy the definition below (details on the computations are shown in the “Appendix”).
Definition 5
(Shifted MVMD model) The shifted multi variate mixture dynamics model is given by
and defining B such that \(\rho =B B^T\), \({{\tilde{a}}}={\widetilde{C}} B ({\widetilde{C}} B)^T\),
with \({V}^{k_1,\ldots ,k_n}\) as in (1.17).
We now have all the instruments to introduce the correlation skew and study its behaviour under shifted SCMD and shifted MVMD dynamics.
3 The correlation skew
The aim of this section is to introduce a definition of correlation skew and to study its behaviour under shifted MVMD dynamics, in comparison with the correlation skew under shifted SCMD dynamics. It is observed in practice under normal market conditions that assets are relatively weakly correlated with each other. However during periods of market stress stronger correlations are observed. Furthermore, different pairs of assets will have different correlations in general. This suggests that a single time-constant correlation parameter for all options quoted on a basket of assets, or an index, say, may not be sufficient to reproduce all option prices on the basket/index for a given expiry. The situation might be improved by considering time-constant but heterogeneous correlations, changing value for each pair of assets, but even this may not be sufficient. We refer to works inferring a multidimensional dynamics from a set of single–asset dynamics for further discussion. Among others, this has been shown in Bakshi et al. (2003) for options on the S&P 100 index and in Langnau (2010) for options on the Euro Stoxx 50 index and on the DAX index.
When computing the implied volatility, European call prices (or equivalently put prices) are considered and the reference model is the benchmark (Black and Scholes 1973) model. It seems then natural to consider as multidimensional benchmark a model where the single assets follow geometric Brownian motions and constant correlation among the single Brownian shocks is introduced. However, when moving from the one-dimensional to the multidimensional framework a bigger variety of possible option instruments to use in order to compare prices under the reference model and the model under analysis appears, the particular choice depending on the specific product we are interested in. We discuss now briefly the two dimensional case. Austing (2014) recently provided a discussion on some of the most popular multi-assets products suggesting the use of composite options as benchmark on which defining the implied correlation. In this paper we adopt a different approach based on the comparison with options on \(S_1(t)\), \(S_2(t)\) with payoff
Assume that the pair \((S_1,S_2)\) follows a bi-dimensional Black and Scholes model, in other words \(S_1\) and \(S_2\) follow two geometric Brownian motions with correlation \(\rho \) and consider the payoff in Eq. (3.1). Given the Black and Scholes implied volatilities for \(S_1\) and \(S_2\), the value \(\rho _{impl}\) such that prices under the bi-dimensional Black and Scholes model are the same as market prices
is called implied correlation. If we try to match option prices for a given maturity T and two different strikes \(K_1\), \(K_2\), we will observe two different values of the implied correlation. This is contrary to the hypothesis of constant correlation in the bi-dimensional Black and Scholes model.
The curve \(K \rightarrow \rho _{impl}(K,T)\) is called correlation skew. Thus, the correlation skew can be considered as a descriptive tool/metric similar to the volatility smile in the one-dimensional case, with the difference that it primarily describes implied dependence instead of volatility.
3.1 Explaining the skew in MVMD with the single parameter \(\rho \) via MUVM
The aim of this section is to introduce a definition of implied correlation under shifted MVMD dynamics, when using options with payoff as in Eq. (3.1). This leads to a straightforward application in the foreign exchange market within the study of triangular relationships. Imagine, for example, that \(S_1\) and \(S_2\) represent the exchange rates USD/EUR and EUR/JPY, respectively. The cross asset \(S_3=S_1 S_2\) then represents the USD/JPY exchange rate, and the corresponding payoff in Eq. (3.1) is the payoff of a call option on the USD/JPY FX rate. In the following, we will investigate whether the shifted MVMD model is able to consistently reproduce the implied volatility of \(S_3\), once the single components \(S_1\), \(S_2\) are calibrated to univariate shifted LMD models. Consistency properties of this kind are important, for example, in order to reconstruct the time series of less liquid cross currency pairs from more liquid ones.
Before proceeding we make a remark on the interpretation of \(\rho \). Keeping in mind the definition of instantaneous local correlation in a bivariate diffusion model
and making use of Schwartz’s inequality, we obtain that the absolute value of the local correlation under the shifted MVMD model is smaller than the value under the shifted SCMD model. The result is contained in the Proposition below.
Proposition 6
(Local correlation in shifted MVMD and shifted SCMD) The instantaneous local correlation under the shifted SCMD model is \(\rho \), whereas for the shifted MVMD model we have
where \({{\tilde{\ell }}}_t^{(kk')}(x_1,x_2)\) is defined as in Eq. (2.4).
We see that \(\rho \) enters the formula for the instantaneous local correlation \(\rho _L\) in the MVMD model, even though the latter is more complex than the constant value \(\rho \). Our aim is to find a value of \(\rho \) matching prices of options with payoff as in Eq. (3.1) under shifted MVMD dynamics with market prices.
In order to do that we will make use of a model with uncertain parameters of which the shifted MVMD is a Markovian projection. Indeed, as shown in Brigo et al. (2014) the MVMD model as in Definition 3 (without shift) is a Markovian projection of the model defined below
where each \(Z_i\) is a standard one dimensional Brownian motion with \(d \left\langle Z_i, Z_j \right\rangle _t = \rho _{i,j} dt\), \(\mu _i\) are constants, \(\sigma ^I:= [\sigma ^{I_1}_1,\ldots ,\sigma ^{I_n}_n]^T\) is a random vector independent of Z and representing uncertain volatilities with \(I_1,\ldots ,I_n\) mutually independent. More specifically, each \(\sigma ^{I_i}_i\) takes values in a set of N deterministic functions \(\sigma _i^k\) with probability \(\lambda _i^k\). Thus, for all times in \((\varepsilon ,+\infty )\) with small \(\varepsilon \) we have
Now it is straightforward to show that if we add a shift to each component as follows
we obtain a model having the shifted MVMD model (2.3)–(2.5) as Markovian projection. This can be easily shown by Gyöngy’s lemma (Gyöngy 1986).
Theorem 7
The shifted MVMD model is a Markovian projection of the shifted MUVM model.
Proof
A straightforward application of Ito’s lemma shows that \(\tilde{\xi }(t)\) satisfies the system of SDEs below
where \(diag({\underline{\alpha }}(t))\) is a deterministic matrix whose i-th diagonal element is the shift \(\beta _i e^{\mu _i t}\) and \({A^I(t)}\) is the Cholesky decomposition of the covariance matrix \(\Sigma ^I_{i,j}(t) := \sigma ^{I_i}_i(t) \sigma ^{I_j}_j(t)\ \rho _{ij}.\)
Define \({{\tilde{v}}}(t,{\underline{\xi }}(t)) = diag(\underline{\tilde{\xi }}(t)-{\underline{\alpha }}(t)) {A^I(t)}\). In order to show that the MVMD model is a Markovian projection of the MUVM model, we need to show that
where \({{\tilde{\sigma }}}(t,{\underline{x}})=diag({\underline{x}}) \widetilde{C}(t, {\underline{x}})B\) and \( {{\widetilde{C}}}\) is defined as in (2.3).
Observing that
and performing simple matrix manipulations, Eq. (3.5) is easily obtained. \(\square \)
Since we will infer the value of \(\rho \) from prices of options with payoff as in (3.1) depending on the value of \((S_1,S_2)\) at time T only, we can make computations under the shifted MUVM rather then the shifted MVMD, as these two models have the same one-dimensional (in time) distributions. Computations under the shifted MUVM model are easier to do (with respect to the shifted MVMD case) since conditioning on \(\{I_i=j\}\), \(\xi _i\) follows a shifted geometric Brownian motion with volatility \(\sigma _i^j\).
In particular we will focus on the bidimensional specification in which case the shifted MUVM reduces to
where the Brownian motions \(W_1\), \(W_2\) have correlation \(\rho \).
Once we have calibrated \(S_1\) and \(S_2\) independently, each to a univariate shifted LMD model, we notice that the only parameter missing when computing prices of options having payoff as in (3.1) is \(\rho \).
Definition 8
We define the implied correlation parameter in the shifted MVMD model as the value \(\rho \) minimizing the squared percentage difference between implied volatilities from options with payoff (3.1) under the shifted MVMD model and market implied volatilities.
3.2 The correlation skew in SCMD via \(\rho \)
Now, assume to model the joint dynamics of \((S_1,S_2)\) as a shifted SCMD model instead. In this case
with
where the Brownian motions \(W_1\), \(W_2\) have correlation \(\rho \). In this case the parameter \(\rho \) really represents the true value of the instantaneous local correlation, as opposed to the MVMD case. We still define the implied correlation as the value \(\rho \) minimizing the squared percentage difference between implied volatilities from options with payoff (3.1) under the shifted SCMD model and market implied volatilities.
3.3 Pricing under the shifted MUVM
Now, we consider computing the price of options such as (3.1), namely options on cross FX rates, under the shifted model. In general one has a loss of tractability with respect to the non-shifted case. However, one can still express the price via a semi-analytic formula involving double integration:
where n(x; m, S) is the density function of a one-dimensional Gaussian random variable with mean m and standard deviation S,
and \(\Sigma ^{i,j}_{h,k}=\sigma _h^i \sigma _k^j T\) for \(h,k=1,2\) and \(i,j=1,\dots N\). This follows from the fact that the density of the product
can be written as
where
Now we focus on a single term in the summation (3.10) and for simplicity we drop the superscript i, j . Calling \(F_1(t)\), \(F_2(t)\) the t-forward asset prices and defining \(x_i=\ln \frac{\xi _i}{F_i(t)}+\frac{\Sigma _{i,i}}{2}\) we can rewrite the expectation as
with \(\alpha _i = \beta _i e^{\mu _i T}\), where \(n({\underline{x}};0,\Sigma )\) is the density of a bivariate normal distribution with mean equal to zero and covariance matrix \(\Sigma \) defined as below
Observing that \(n({\underline{x}};0,\Sigma )=n(x_1;0, \Sigma _{1,1})n(x_2-\rho x_1 \sqrt{\Sigma _{2,2}/\Sigma _{1,1}};0,(1-\rho ^2)\Sigma _{2,2})\), integrating with respect to \(x_2\) and replacing in (3.10) we obtain
from which Eq. (3.9) is easily derived.
4 Comparing correlation skews in shifted MVMD and SCMD
The aim of this section is to compare the shifted MVMD and the shifted SCMD models in terms of implied correlation, analysing their performance in reproducing triangular relationships.
4.1 Numerical case study with cross FX rates
Specifically, we consider the exchanges \(S_1=\) USD/EUR, \(S_2=\) EUR/JPY under a shifted MUVM model with 2 components
with
where \(r^{\EUR {} }\), \(r^{\$}\), \(r^{Y}\) are the euro, dollar and yen interest rates, respectively, and \(\sigma _1^{I_1}(t)\), \(\sigma _2^{I_2}(t)\) are as in Eq. (3.2). \(W_t^{1,\EUR {} }\) and \(W_t^{2,Y}\) indicate that we are considering the dynamics of \(S_1\) and of \(S_2\), each under its own domestic measure, that is the euro in the case of \(S_1\) and the yen in the case of \(S_2\).
We calibrate \(S_1\) and \(S_2\) independently, each to its own volatility curve, using 2 components and minimizing the squared percentage difference between model and market implied volatilities. Then, we look at the product \(S_1 S_2\), representing the cross exchange USD/JPY, and we check whether the model is able to reproduce the cross smile consistently with the smiles of the single assets. In particular, we find \(\rho \) that minimizes the squared percentage difference between implied volatilities from options on the basket \(S_3=S_1 S_2\) under the shifted MVMD model (the shifted SCMD model) and market implied volatilities. In other words, we look at the implied correlations under the shifted MVMD model and the shifted SCMD model.
When performing calibration on \(S_3\), we express both the dynamics of \(X_1\) and of \(X_2\) under the yen:
and then we calculate prices of options on
All the data for our numerical experiments are downloaded from a Bloomberg terminal. We start by considering data from 19th February 2015. The initial values of \(S_1,S_2\) are \(S_1(0)=0.878\), \(S_2(0)=135.44\). First we calibrate \(S_1\) and \(S_2\) using implied volatilities from options with maturity of 6 months. Denoting
the T-term volatilities of the instrumental processes of \(S_1\) and \(S_2\), respectively,
the vector of probabilities of each component and \(\beta _1\), \(\beta _2\) the shift parameters, we obtain
for the asset \(S_1\) and
for the asset \(S_2\). Then, we perform a calibration on the cross product \(S_3=\)USD/JPY using volatilities from call options with maturity of 6 months, finding the values:
for the shifted MVMD model and
for the shifted SCMD model. The higher value (in absolute terms) of the correlation parameter in the shifted MVMD model is due to higher state dependence in the diffusion matrix with respect to the shifted SCMD model. This is partly related to Proposition 6. In other words, in order to achieve the same local correlation as in the shifted SCMD model, the shifted MVMD model needs a higher absolute value of \(\rho \).
The corresponding prices and implied volatilities are plotted in Fig. 1 whereas Table 1 reports the absolute differences between market and model values corresponding to a few strikes. The reported plot shows that the shifted MVMD model is better at reproducing market prices than the shifted SCMD model. What is very remarkable in this example is that the shifted MVMD fits the whole correlation skew with just one value of \(\rho \).
As a second numerical experiment we repeat the calibration using prices with maturity of 9 months. Specifically, we first calibrate \(S_1\) and \(S_2\) obtaining the values
for the asset \(S_1\) and
for the asset \(S_2\). For \(S_2\), we observe that the higher volatility now has the highest probability as opposed to the results found for 6 months options. Then, we perform a calibration on the cross product \(S_3=\)USD/JPY using volatilities from call options with maturity of 9 months, finding the values:
for the shifted MVMD model and
for the shifted SCMD model. These values are comparable with those found for 6 months options. This shows that the model is quite consistent.
The corresponding prices and implied volatilities are shown in Fig. 2 whereas Table 2 reports some absolute differences between model and market values. Overall, also in this case the shifted MVMD model outperforms the shifted SCMD in terms of ability to reproduce market prices on the cross product.
5 Introducing random correlations in the mixture dynamics
A single correlation parameter \(\rho \) may not be enough to fit prices on the cross asset. To overcome this, we can allow for random correlations between the single assets in the shifted MUVM model (3.6). Specifically,
where the Brownian motions \(W_1^{I_1}\), \(W_2^{I_2}\) now have correlation \(\rho ^{I_1,I_2}\). The correlation parameter will therefore assume the value \(\rho ^{h,k}\) in correspondence with a couple \((\sigma ^{h}_1, \sigma ^{k}_2)\) with probability \(\lambda _h \lambda _k\).
This setting is reminiscent of regime switching models. If one adopted a similar setting under the physical measure, with the drift parameters \(\mu \) being replaced with drift parameters under the physical measure, one would obtain a kind of regime switching model. This would not be a full fledged regime switching model, however, since only one regime switch would occur. Once the random variables \(I_1\) and \(I_2\) are realized, there are no more switches. The I’s would need to be replaced by Markov chains, but this would immediately increase combinatorial complexity, leading potentially to combinatorial explosion. The link of our approach with the literature of regime switching models needs to be investigated further. A futher interesting issue to investigate would be assigning a dynamics to the correlation structure and discussing how this might change when moving from the physical measure to the pricing measure. This will be considered in further work. We now go back to the relationship between shifted MUVM model and shifted MVMD model.
Theorem 9
The shifted MUVM model with uncertain correlation parameter has, as Markovian projection, a shifted MVMD model solution of the SDE (2.3) but with equation (1.17) transformed into
Proof
The Markovian projection property can be easily shown by an application of Gyöngy’s lemma, in a similar was as in the proof of Theorem 7. \(\square \)
In other words, the correlation between two generic instrumental processes \(Y_i^k\), \(Y_j^h\) will depend not only on the assets \(S_i\), \(S_j\), but will correspond to a specific choice of the instrumental processes \(Y_i^k\), \(Y_j^h\) themselves.
5.1 Cross FX rates study for shifted MVMD with random correlations
As a numerical illustration we performed on the shifted MVMD model the same experiment as in Sect. 4. We used 6 months options from 7th September 2015. The initial values of the single FX rates are \(S_1(0)=0.8950\), \(S_2(0)=133.345\). In this case the calibration of the shifted SCMD model is much worse, to the point that there is no value of \(\rho \) that can fit any of the prices obtained through this model. On the other hand, in the case of the shifted MVMD model, in particular when introducing random correlations, the fit leads to quite good results. As in the previous cases we independently calibrate \(S_1\)=USD/EUR and \(S_2\)=EUR/JPY on the corresponding implied volatilities obtaining
and then we look at the cross exchange rate \(S_3=S_1 S_2\)=USD/JPY. When performing calibration using a shifted MVMD model with one correlation parameter only, we obtain
whereas when using random correlations, we have
Calibration of the MVMD model on 6 months options, relative to data from 7th September 2015. The calibration using one single correlation parameter is shown in the top part which corresponds to a fitted value equal to \(\rho =-0.6147\). In the bottom, calibration using the MVMD model with random correlations is presented. The corresponding fitted correlations are \(\rho ^{1,1}=-0.8717,\ \rho ^{1,2}=-0.1762,\ \rho ^{2,1}=-0.6591,\ \rho ^{2,2}= -0.2269. \)
The corresponding plots are shown in Fig. 3, in connection with Table 3. In this case we also see that using random correlations improves the fit with respect to the case with a single correlation parameter. Moreover, computing the expectation and the standard deviation for the random correlation under the risk-neutral measure \(\mathbb {Q}\), we obtain
satisfying \(\vert \mathbb {E}^{Q}(\rho ^{i,j}) - \rho \vert < \frac{ \text{ Std }^Q(\rho ^{i,j})}{2}\). In other words, the absolute difference between the \(\mathbb {Q}\)-expected random correlation and the deterministic correlation is smaller than half the \(\mathbb {Q}\)-standard deviation. This means that the random correlation is on average not that far from the deterministic value.
Finally, we repeat the same experiment using options with maturity of 9 months. We find:
When looking at the cross product \(S_1 S_2\)=USD/JPY, we obtain
in the case of one single correlation parameter, and
in the case where random correlations are introduced. Corresponding plots and absolute differences between market and model prices/implied volatilities can be found in Fig. 4 and Table 4, which show that the shifted MVMD model with random correlations outperforms the constant-deterministic correlation model in this case as well.
The values of expected random correlation and standard deviation under the \(\mathbb {Q}\) measure are
With respect to the case of 6 months options, we observe a movement of the \(\mathbb {Q}\)-expected random correlation away from the constant correlation. Moreover, if we look at the terminal correlations, that is the correlation between \(S_1(T)\) and \(S_2(T)\), for \(T=9\) months, we obtain
in case \(\rho \) is deterministic and
in case \(\rho \) is random. As a final observation, we remark that in case \(\rho \) is constant, an application of Schwartz’s inequality shows that the absolute value of the terminal correlation is always smaller than the absolute value of the instantaneous correlation, as verified by the results above. One may wonder whether the same inequality holds in case of random correlations, if we substitute the instantaneous value with the mean of the random correlations. In this case it is not possible to use Schwartz’s inequality as we did before and, indeed, the results obtained show that the inequality does not hold, at least not for the example considered above.
Calibration of the MVMD model on 9 months options, relative to data from 7th September 2015. The calibration using one single correlation parameter is shown in the top part which corresponds to a fitted value equal to \(\rho =-0.7488\). In the bottom, calibration using the MVMD model with random correlations is presented. The corresponding fitted correlations are \(\rho ^{1,1}=-0.8679,\ \rho ^{1,2}=-0.2208,\ \rho ^{2,1}=-0.8303,\ \rho ^{2,2}= -0.3270.\)
6 Potential use for smile extrapolation: a Renminbi case study
As a numerical illustration we performed on the shifted MVMD model analogous experiments as those in Sects. 4 and 5, using 6 months options relative to data from 18th December 2015. We first independently calibrate the exchange rates \(S_1\)=EUR/USD and \(S_2\)=USD/CNH on the corresponding implied volatilities, thus obtaining
where the initial values of the single FX rates are \(S_1(0)=1.0842\) and \(S_2(0)=6.55\). The corresponding plots are shown in Fig. 5 (top).
In order to test how the model perform when looking at the FX cross rate \(S_3=S_1 S_2=\)EUR/CNH, we first calibrate the shifted MVMD model using the atm call option only, thus obtaining
We then plot the whole volatility curve corresponding to the calibrated value of \(\rho \). This is represented in Fig. 5 (bottom, left) which shows a good fit. Moreover, if we try to calibrate using not only the atm option, but a certain number of different strikes, we obtained values of the calibrated correlations quite close to the previous value \(\rho _{MVMD}(6M)\).
As a second experiment, we perform a calibration of the shifted MVMD model, when introducing random correlations. The values obtained are
and the corresponding plot is shown in Fig. 5 (bottom, right). The values of expected random correlation and standard deviation under the \(\mathbb {Q}\) measure are
On top, calibration of the EUR/USD FX rate (left) and the USD/CNH FX rate (right) each independently on a shifted LMD model. We used data from 18 December 2015. On the bottom, calibration of the MVMD model on the EUR/CNH exchange. In the case on the left, the calibration is obtained by fitting the shifted MVMD model to the atm option only, obtaining \(\rho =-0.1205\). In the case on the right, the calibration is obtained by fitting the shifted MVMD model with random correlation. In the last case \(\rho _1=0.864\), \(\rho _2=-0.2843\), \(\rho _3=-0.3417\), \(\rho _4=-0.1006\)
7 Conclusions
We introduced a shifted MVMD model where each single asset follows shifted LMD dynamics which are combined so that the mixture property is lifted to a multivariate level, in the same way as for the non-shifted case (Brigo et al. 2004). In this framework, we analysed the implied correlation from cross exchange rates and compared the results with those in the shifted SCMD model where the single assets are connected by simply introducing instantaneous correlations among the Brownian motions driving each asset.
In further work we may extend our analysis and examples to calibration of several maturities at the same time.
Finally, we generalized the MUVM model in Brigo et al. (2004), having MVMD as a Markovian projection, to a shifted model with random correlation, achieving more flexibility. This allows one to capture the correlation skew better. Indeed, the numerical experiments which we have conducted show that this model may be able to consistently reproduce triangular relationships among FX cross rates, or in other words to reproduce the implied volatility of a cross exchange rate in a consistent way with the implied volatilities of the single exchange rates.
One possible further use of the models given here is in proxying the smile for illiquid cross FX rates resulting from the product of two liquid FX rates. While one would have to find the relevant correlation parameters, possibly based on historical estimation with some adjustments for risk premia, the models presented here allow us to infer the detailed structure of the cross FX rate smile in an arbitrage free way.
References
Austing, P. (2014). Smile pricing explained. London: Palgrave Macmillan.
Bakshi, G., Kapadia, N., & Madan, D. (2003). Stock return characteristics, skew laws, and the differential pricing of individual equity options. Review of Financial Studies, 16(1), 101–143.
Black, F., & Scholes, M. (1973). The pricing of options and corporate liabilities. Journal of Political Economy, 81(3), 637–659.
Bloomberg, L. (2012). Local volatility for equity underlyings, Tech. Rep., Retrieved on 9 December 2015 from a Bloomberg terminal.
Brigo, D., Mercurio, F., & Rapisarda, F. (2004). Connecting univariate smiles and basket dynamics: a new multidimensional dynamics for basket options. http://www.ima.umn.edu/talks/workshops/4-12-16.2004/rapisarda/MultivariateSmile.pdf.
Brigo, D., Rapisarda, F., & Sridi, A. (2014). The arbitrage-free multivariate mixture dynamics model: Consistent single-assets and index volatility smiles. Available on SSRN & arXiv.
Brigo, D., & Mercurio, F. (2000). A mixed-up smile. Risk, 13(9), 123–126.
Brigo, D., & Mercurio, F. (2001). Displaced and mixture diffusions for analytically-tractable smile models. In H. Geman, D. B. Madan, S. R. Pliska, & A. C. F. Vorst (Eds.), Mathematical finance—Bachelier congress 2000 (pp. 151–174). Berlin: Springer.
Brigo, D., & Mercurio, F. (2002). Lognormal-mixture dynamics and calibration to market volatility smiles. International Journal of Theoretical and Applied Finance, 5(4), 427–446.
Brigo, D., Mercurio, F., & Sartorelli, G. (2003). Alternative asset-price dynamics and volatility smile. Quantitative Finance, 3(3), 173–183.
Bruder, B., Kostyuchyk, N., & Roncalli, T. (2016). Risk parity portfolios with skewness risk: An application to factor investing and alternative risk premia. Lyxor asset management technical report.
Da Fonseca, J., Grasselli, M., & Tebaldi, C. (2007). Option pricing when correlations are stochastic: An analytical framework. Review of Derivatives Research, 10, 151–180.
Gourieroux, C. (2007). Continuous time Wishart process for stochastic risk. Econometric Reviews, 25(2), 177–217.
Gyöngy, I. (1986). Mimicking the one-dimensional marginal distributions of processes having an Itô differential. Probability Theory and Related Fields, 71(4), 501–516.
Harrison, J. M., & Kreps, D. M. (1979). Martingales and arbitrage in multiperiod securities markets. Journal of Economic Theory, 20(3), 381–408.
Harrison, J. M., & Pliska, S. R. (1981). Martingales and stochastic integrals in the theory of continuous trading. Stochastic Processes and Their Applications, 11(3), 215–260.
Langnau, A. (2010). A dynamic model for correlation. London: Risk Magazine.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
C. Pisani: Nordea. The views and opinions expressed in this article are those of the authors and do not necessarily reflect the opinion or position of Nordea. Examples of analysis performed within this article are only examples and should not be interpreted as investment advice. The research of Camilla Pisani leading to these results has received funding from the People Programme (Marie Curie Actions) of the European Union’s Seventh Framework Programme FP7/2007-2013/ under REA Grant Agreement No. 289032. This paper however reflects solely the Author’s personal opinion and the Union is not liable for any use that may be made of the information contained therein.
F. Rapisarda: Bloomberg. This paper reflects solely the Author’s personal opinion and does not represent the opinions of the author’s employers, present and past, in any way.
Appendix
Appendix
In this “Appendix” we provide the details leading to definition 5. We start by applying a shift to each component \(Y_i^k\) of each asset as follows
Keeping in mind that \(Y_i^k\) satisfies
we obtain, by applying Ito’s formula
The corresponding asset price \(S_i\) will therefore be a shifted LMD model with shift equal to \(\beta _i e^{\mu _i t}\). In order to find the dynamics of the whole multidimensional process S(t), that is the process corresponding to S(t) after having applied the shift, we look for an SDE of the type
where \(\rho =B B^T\) such that the corresponding density satisfies
In other words, the density \(p_{S(t)}\) is obtained by mixing the single densities \(p_{S_i^k(t)}(x)\) in all the possible ways.
In order to find the diffusion matrix \({\widetilde{C}}\), we compute the Fokker-Planck equations for \(p_{{\underline{S}}(t)}\) and \({\tilde{\ell }}_{1, \dots n;t}^{k_1, \dots , k_n}\). Defining \(\tilde{a}(t,{\underline{S}}(t))=({\widetilde{C}} B) ({\widetilde{C}} B)^T\) where \({\widetilde{C}}_i\) denotes the i-th row of \({\widetilde{C}}\), we obtain
and
Making use of Eq. (8.4) and the equation above
On the other hand, from Eq. (8.6)
Finally, comparing the two expressions obtained for \(\frac{\partial }{\partial t}p_{{\underline{S}}(t)}(x)\)
so that
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Brigo, D., Pisani, C. & Rapisarda, F. The multivariate mixture dynamics model: shifted dynamics and correlation skew. Ann Oper Res 299, 1411–1435 (2021). https://doi.org/10.1007/s10479-019-03239-6
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10479-019-03239-6
Keywords
- MVMD model
- Mixture of densities
- Multivariate local volatility
- Correlation skew
- Random correlation
- Calibration
- Cross exchange rates
- FX smile
- Index volatility smile
- Renminbi–USD smile
- Renminbi–EUR smile
- CNY–USD smile
- CNY–EUR smile
- SCMD model