1 Introduction to the multivariate mixture dynamics

The multi variate mixture dynamics model (MVMD) introduced by Brigo et al. (2004) and recently described in a deeper way in Brigo et al. (2014) is a tractable dynamical arbitrage-free model defined as the multidimensional version of the lognormal mixture dynamics model (LMD) in Brigo and Mercurio (2000, 2001) (see also Brigo et al. 2003). The single-asset LMD model is a no-arbitrage model widely used among practitioners because of its practical advantages in calibration and pricing (analytical formulae for European options, explicit expression for the local volatility) and of its flexibility and accuracy. In fact, a variant of this model is presently used in the calibration of implied volatility surfaces for single stocks and equity indices in the Bloomberg terminal (Bloomberg 2012), and in the subsequent pricing of European, American and path-dependent options on single assets and baskets of assets. However, LMD is a local volatility model, and as such it suffers from limitations of that class of models. Stochastic volatility models have been shown to be more suitable to capture the correct smile dynamics, especially as far as future volatility smiles implied by the model are concerned. Models like LMD can be useful for example in risk management, where one is more interested in tractability and the overall calibration at time zero rather than in fine tuning the dynamics for a very specific valuation and hedging task.

In the multidimensional setting, the main advantage of the MVMD over other multidimensional models, such as e.g., the Wishart model (Da Fonseca et al. 2007; Gourieroux 2007) is in its tractability and flexibility which allows the MVMD to calibrate index volatility smiles consistently with the univariate assets smiles. In addition, a full description of its dependence structure (terminal correlations, average correlations, copula functions) is available.

The MVMD model also enjoys some interesting properties of Markovian projection. First of all, the model can be seen as a Markovian projection of a model with uncertain volatilities denominated MUVM model. As a consequence, European option prices under the MVMD model can more easily be computed under the MUVM model instead. However, the MVMD model remains superior in terms of smoothness and dynamics. Secondly, the Geometric average basket under the MVMD model can be projected into a univariate lognormal mixture dynamics model. Consequently, European option prices on the basket can be easily computed through the Black and Scholes formula.

Finally, under the MVMD model, the terminal correlation between assets and squared volatilities is zero. This mitigates the common drawback of local volatility models of having perfect instantaneous correlation between assets and squared volatilities.

In this paper we generalize the MVMD model, including shifts to the dynamics of the single assets, and we study the correlation skew under this framework.

Before going into details, we recapitulate the definition of the MVMD model (in the non-shifted case), starting with the univariate LMD model and then generalizing to the multidimensional case.

1.1 The volatility smile mixture dynamics model for single assets

Given a maturity \(T>0\), we denote by P(0, T) the price at time 0 of the zero-coupon bond maturing at T, and by \((\Omega ,{{{\mathcal {F}}}},{\mathbb {P}})\) a probability space with a filtration \(({{{{\mathcal {F}}}}}_t)_{t\in [0,T]}\) which is \({\mathbb {P}}\)-complete and satisfying to the usual conditions. We assume the existence of a measure \({\mathbb {Q}}\) equivalent to \({\mathbb {P}}\), called risk–neutral or pricing measure, ensuring absence of arbitrage opportunities in the classical setup, for example, of Harrison and Kreps (1979) and Harrison and Pliska (1981). In this framework we consider N diffusion processes \(Y^i(t)\) with dynamics

$$\begin{aligned} d Y{^i(t)} = \mu Y{^i(t)} dt + v{^i}(t, Y{^i(t))}Y{^i(t)} dW(t) \end{aligned}$$
(1.1)

and a deterministic initial value \(Y^i(0),\) marginal densities \(p_t^{i}\) and diffusion coefficient \(v_i\). The processes \(Y^i\) do not model financial variables but are introduced only to refer to their probability densities when building mixtures of distributions later on. In this sense we term these processes “instrumental”. We define \(S_t\) as the solution of

$$\begin{aligned} dS(t) = \mu S(t) dt + s(t,S(t)) S(t) dW(t) \end{aligned}$$
(1.2)

where s is a local volatility function, namely a deterministic function of t and S only, and it is computed so that the marginal density \(p_t\) of S(t) is a linear convex combination of the densities \(p_t^{i}\) of the instrumental processes (Brigo and Mercurio 2000, 2001, 2002):

$$\begin{aligned} p_t = \sum _i \lambda ^i p_t^{i} \text{ with } \lambda ^i \ge 0, \quad \forall i \text{ and } \sum _i \lambda ^i = 1. \end{aligned}$$
(1.3)

In what follows we restrict ourselves to the case

$$\begin{aligned} \left\{ \begin{array}{l l} Y^i(0)&{}=S(0),\\ v_i(t,x) &{}= \sigma ^i(t), \\ V^i(t) &{}= \sqrt{\int _0^t \sigma ^i(s)^2 ds} \\ \ell ^i_t(x) &{}=p_t^{i}(x) = \frac{1}{\sqrt{2 \pi } x V^i(t)} \exp \left[ -\frac{1}{2 V_i^2(t)} \left( \ln \left( \frac{x}{S(0)}\right) - \mu t + \frac{1}{2} V^i(t)^2 \right) ^2 \right] \end{array} \right. \end{aligned}$$
(1.4)

with \(\sigma ^i\) deterministic. Note that we have denoted the density \(p^i\) with \(\ell ^i\) in this lognormal case. The parameter \(\mu \) is completely specified by \({\mathbb {Q}}\). If the asset is a stock paying a continuous dividend yield q and r is the time T constant risk-free rate, then \(\mu =r-q\). If the asset is an exchange rate and \(r_d\) and \(r_f\) are the (deterministic) domestic and foreign rates at time T, respectively, then \(\mu =r_d-r_f\). If the asset is a forward price, then \(\mu =0\).

Brigo and Mercurio (2001) proved that defining

$$\begin{aligned} s(t,x) = \left( \frac{\sum _{k=1}^N \lambda ^{k} \sigma ^k(t)^2 \ell ^k_t(x)}{\sum _{k=1}^N \lambda ^{k} \ell ^k_t(x)}\right) ^{1/2} \end{aligned}$$
(1.5)

and assuming a few additional nonstringent assumptions on the \(\sigma ^i\), the corresponding dynamics for \(S_t\) admits a unique strong solution.

Theorem 1

(Existence and uniqueness of solutions for the LMD model) Assume that all the real functions \(\sigma ^i(t)\), defined on the real numbers \(t \ge 0\), are once continuously differentiable and bounded from above and below by two positive real constants. Assume also that in a small initial time interval \(t \in [0, \epsilon ]\), \(\epsilon >0\), the functions \(\sigma ^i(t)\) have an identical constant value \(\sigma _0\). Then the lognormal mixture dynamics model (LMD) defined by

$$\begin{aligned} d S_t = \mu S_t dt + s(t,S_t) S_t dW_t, \ \ S_0, \ \ s(t,x) = \left( \frac{\sum _{k=1}^N \lambda ^{k} \sigma ^k(t)^2 \ell ^k_t(x)}{\sum _{k=1}^N \lambda ^{k} \ell ^k_t(x)}\right) ^{1/2}, \end{aligned}$$
(1.6)

admits a unique strong solution and the forward Kolmogorov equation (Fokker Planck equation) for its density admits a unique solution satisfying (1.3), which is a mixture of lognormal densities.

An important consequence of the above construction is that European option prices on S can be written as linear combinations of Black-Scholes prices with weights \(\lambda ^i\). The same combination holds for the Greeks at time 0.

1.2 Combining mixture dynamics on several assets: SCMD

Consider now n different asset prices \(S_1 \dots S_n\) each calibrated to an LMD model, as in Eq. (1.6), and denote by \(\lambda _i^{k}\), \(\sigma _i^{k}\) the parameters relative to the k-th instrumental process of the asset i. There are two possible ways in order to connect the dynamics of the single assets into a multivariate model. The first more immediate way consists in introducing a non-zero quadratic covariation between the Brownian motions driving the LMD models of Eq. (1.6) for \(S_1 \dots S_n\) leading to the so-called SCMD model.

Definition 2

(SCMD model) We define the Simply Correlated multivariate Mixture Dynamics (SCMD) model for \({\underline{S}}= [S_1,\ldots ,S_n]\) as a vector of univariate LMD models, each satisfying Theorem 1 with diffusion coefficients \(s_1,\ldots ,s_n\) given by equation (1.6) and densities \(\ell _1,\ldots ,\ell _n\) applied to each asset, and connected simply through quadratic covariation \(\rho _{ij}\) between the Brownian motions driving assets i and j. This is equivalent to the following n-dimensional diffusion process where we keep the W’s independent and where we embed the Brownian covariation into the diffusion matrix \({\bar{C}}\), whose i-th row we denote by \({\bar{C}}_i\):

$$\begin{aligned} d{\underline{S}}(t)= & {} diag({\underline{\mu }}) {\underline{S}}(t)dt + diag({\underline{S}}(t)) \bar{{C}}(t,{\underline{S}}(t))d{\underline{W}}(t), \ \ \ {\bar{a}}_{i,j}(t,{\underline{S}}) := \bar{{C}}_{i} \bar{{C}}_{j}^T \end{aligned}$$
(1.7)
$$\begin{aligned} {\bar{a}}_{i,j}(t,{\underline{S}})= & {} s_i(t,S_i) s_j(t,S_j) \rho _{ij}= \left( \frac{\sum _{k=1}^N \lambda _i^{k} \sigma _i^k(t)^2 \ell _{i,t}^k(S_i)}{\sum _{k=1}^N \lambda _{i}^{k} \ell _{i,t}^k(S_i)} \ \ \frac{\sum _{k=1}^N \lambda _j^k \sigma _{j}^k(t)^2 \ell _{j,t}^k(S_j)}{\sum _{k=1}^N \lambda _{j}^k \ell _{j,t}^k(S_j)}\right) ^{1/2} \rho _{ij}\nonumber \\ \end{aligned}$$
(1.8)

where T represents the transposition operator.

Assumption

We assume \(\rho =(\rho _{ij})_{i,j}\) to be positive definite.

It is evident from the previous construction that the SCMD is consistent with both the dynamics of the single assets \(S_i\) and the instantaneous correlation matrix \(\rho \). Moreover, we can easily simulate a path of S by exogenously computing \(\rho \) for example from historical data, assuming it constant over time and applying a naive Euler scheme. However an explicit expression for the density of \({\underline{S}}= [S_1,\ldots ,S_n]\) under the SCMD dynamics is not available. As a consequence, if we aim at computing prices of options whose payoff depends on the value at time T only we still need to simulate entire paths of \({\underline{S}}\) over the interval [0, T], which can be quite time consuming.

1.3 Lifting the mixture dynamics to asset vectors: MVMD

A different approach, still consistent with the single assets’ dynamics, lies in merging the dynamics of the single assets in such a way that the mixture property is lifted to the multivariate density and the corresponding model gains some further tractability property with respect to the SCMD model. This can be achieved by mixing in all possible ways the densities of the instrumental processes of each individual asset and by imposing the correlation structure \(\rho \) at the level of the single instrumental processes, rather than of the assets as we did for the SCMD model. This has important consequences on the actual structure of the correlation, see Brigo et al. (2004). Below we summarize the construction leading to the MVMD model, while referring to Brigo et al. (2014) for further details.

Assume we have calibrated an LMD model for each \(S_i(t)\): if \(p_{S_i(t)}\) is the density of \(S_i\), we write

$$\begin{aligned} p_{S_i(t)}(x) = \sum _{k=1}^{N_i} \lambda _{i}^{k} \ell _{i,t}^k(x), \text{ with } \lambda _{i}^k \ge 0,\quad \forall k \text{ and } \sum _k \lambda _{i}^k = 1, \end{aligned}$$
(1.9)

where \((\ell _{i,t}^k)_{k}\) are the densities of \((Y_i^{k})_k\), instrumental processes for \(S_i\) evolving lognormally according to the stochastic differential equation:

$$\begin{aligned} dY_i^{k}(t) = \mu _i Y_i^{k}(t) dt + \sigma _i^{k}(t) Y_i^{k}(t) dZ_i(t),\ \ \ d \langle Z_i, Z_j\rangle _t = \rho _{ij} dt,\ \ \ Y_i^k(0)=S_i(0).\nonumber \\ \end{aligned}$$
(1.10)

For notational simplicity we assume the number of base densities \(N_i\) to be the same, N, for all assets. The exogenous correlation structure \(\rho _{ij}\) is given by the symmetric, positive–definite matrix \(\rho \).

Denote by \({{\underline{S}}}(t) = \left[ S_1(t),\ldots ,S_n(t)\right] ^T\) the vector of asset prices with

$$\begin{aligned} d{\underline{S}}(t) = diag({\underline{\mu }}) {\underline{S}}(t)dt + diag({\underline{S}}(t)) {A}(t,{\underline{S}}(t))d{\underline{W}}(t). \end{aligned}$$
(1.11)

As we did for the one dimensional case, we look for a matrix A such that

$$\begin{aligned} p_{{\underline{S}}(t)}({\underline{x}})= \sum _{k_1,k_2,\ldots k_n=1}^N \lambda _1^{k_1} \cdots \lambda _n^{k_n} \ell _{1,\ldots ,n; t}^{k_1,\ldots ,k_n}({\underline{x}}), \ \ \ \ \ell _{1,\ldots ,n;t}^{k_1,\ldots ,k_n}({\underline{x}}) := p_{\left[ Y_1^{k_1}(t),\ldots ,Y_n^{k_n}(t)\right] ^T}({\underline{x}}), \nonumber \\ \end{aligned}$$
(1.12)

or more explicitly

$$\begin{aligned} \ell _{1,\ldots ,n; t}^{k_1,\ldots ,k_n}({\underline{x}}) = \frac{1}{(2 \pi )^{\frac{n}{2}} \sqrt{\det \Xi ^{(k_1 \cdots k_n)}(t)} \Pi _{i=1}^n x_i}&\exp \left[ -\frac{ {\tilde{x}}^{(k_1 \cdots k_n) T} \Xi ^{(k_1 \cdots k_n)}(t)^{-1} {\tilde{x}}^{(k_1 \cdots k_n)} }{2} \right] \end{aligned}$$

where \(\Xi ^{(k_1 \cdots k_n)}(t)\) is the integrated covariance matrix whose (ij) element is

$$\begin{aligned} \Xi _{ij}^{(k_1 \cdots k_n)}(t)= & {} \int _0^t \sigma _i^{k_i}(s) \sigma _j^{k_j}(s) \rho _{ij} ds \end{aligned}$$
(1.13)
$$\begin{aligned} {\tilde{x}}_i^{(k_1 \cdots k_n)}= & {} \ln x_i - \ln x_i(0) - \mu _i t + \int _0^t \frac{\sigma _i^{k_i^2}(s)}{2} ds. \end{aligned}$$
(1.14)

Computations show that if a solution exists, this must satisfy the definition below.

Definition 3

(MVMD model) The (Lognormal) multi variate mixture dynamics (MVMD) model is given by

$$\begin{aligned} d{\underline{S}}(t)= & {} diag({\underline{\mu }})\ {\underline{S}}(t)\ dt + diag({\underline{S}}(t))\ {C}(t,{\underline{S}}(t)) B \ d{\underline{W}}(t),\nonumber \\ C_i(t,{\underline{x}}):= & {} \frac{\sum _{k_1,\ldots ,k_n=1}^{N} \lambda _1^{k_1}...\lambda _n^{k_n}\ \sigma _i^{k_i}(t) \ \ell _{1,\ldots ,n;t}^{k_1,\ldots ,k_n}({\underline{x}})}{\sum _{k_1,\ldots ,k_n=1}^{N} \lambda _1^{k_1}...\lambda _n^{k_n}\ \ell _{1,\ldots ,n;t}^{k_1,\ldots ,k_n}({\underline{x}})}, \end{aligned}$$
(1.15)

\(\ell _{1,\ldots ,n;t}^{k_1,\ldots ,k_n}({\underline{x}}) := p_{\left[ Y_1^{k_1}(t),\ldots ,Y_n^{k_n}(t)\right] ^T}({\underline{x}})\) and defining B such that \(\rho =B B^T\), \(a = C B (C B)^T\),

$$\begin{aligned} a_{i,j}(t,{\underline{x}}) = \frac{\sum _{k_1,\ldots ,k_n=1}^{N} \lambda _1^{k_1}...\lambda _n^{k_n}\ {V}^{k_1,\ldots ,k_n}(t)\ \ell _{1,\ldots ,n;t}^{k_1,\ldots ,k_n}({\underline{x}})}{\sum _{k_1,\ldots ,k_n=1}^{N} \lambda _1^{k_1}...\lambda _n^{k_n}\ \ell _{1,\ldots ,n;t}^{k_1,\ldots ,k_n}({\underline{x}})} \end{aligned}$$
(1.16)

where

$$\begin{aligned} {V}^{k_1,\ldots ,k_n}(t) = \left[ \sigma _i^{k_i}(t)\ \rho _{i,j}\ \sigma _j^{k_j}(t)\right] _{i,j = 1,\ldots ,n}. \end{aligned}$$
(1.17)

From the previous definitions it is evident that the dynamics of the single assets \(S_i\) in the SCMD model are Markovian. On the other hand, under the MVMD model, while the dynamics of the whole vector S is Markovian, those of the single assets are not. This leads to more realistic dynamics.

Under mild assumptions, existence and uniqueness of a solution can be proved through the following Theorem.

Theorem 4

Assume that the volatilities \(\sigma _i^{k_i}(t)\) for all i are once continuously differentiable, uniformly bounded from below and above by two positive real numbers \({\tilde{\sigma }}\) and \({\hat{\sigma }}\) respectively, and that they take a common constant value \(\sigma _0\) for \(t \in [0,\epsilon ]\) for a small positive real number \(\epsilon \), namely

$$\begin{aligned}&{\tilde{\sigma }}=\inf _{t \ge 0} \left( \ \min _{i=1 \cdots n, k_i=1, \ldots N} \ \ (\sigma _i^{k_i}(t) ) \right) , \\&{\hat{\sigma }}=\sup _{t \ge 0} \left( \ \max _{i=1 \cdots n, k_i=1 \cdots N} \ \ (\sigma _i^{k_i}(t) ) \right) \\&\sigma _i^{k_i}(t) = \sigma _0 > 0 \ \ \text{ for } \text{ all } \ \ t \in [0, \epsilon ]. \end{aligned}$$

Assume also the matrix \(\rho \) to be positive definite. Then the MVMD n-dimensional stochastic differential equation (1.15) admits a unique strong solution. The diffusion matrix \(a(t,{\underline{x}})\) in (1.16) is positive definite for all t and x.

Before introducing the shifted version of the model, we would like to discuss briefly a physical measure dynamics associated with MVMD. If we replaced the drift parameters \(\mu \) with historical returns, leaving aside the fact that these are known to be difficult to estimate in a satisfactory way, we would immediately obtain a mixture dynamics under the physical measure. Gaussian mixtures for multivariate returns have been considered in the risk premia literature for risk parity portfolios, see for example Bruder at al. (2016) and references therein. In further work we could try to connect our approach to frameworks such as the mixture models used in Bruder at al. (2016).

2 Introducing a shift in MVMD

When modelling a one dimensional asset price through an LMD model, implied volatilities with minimum exactly at a strike equal to the forward asset price are the only possible. In order to gain greater flexibility and therefore move the smile minimum point from the ATM forward we can shift the overall density by a deterministic function of time, carefully chosen in order to preserve risk–neutrality and therefore guarantee no–arbitrage. This is the so–called shifted lognormal mixture dynamics model (Brigo and Mercurio 2002). Under this model the new asset-price process S is defined as

$$\begin{aligned} S_t= \beta e^{\mu t} +X_t \end{aligned}$$
(2.1)

with \(\beta \) real constant and \(X_t\) satisfying (1.6). Under the assumption \(K- \beta e^{\mu T}>0\) the price at time 0 of a European call option with strike K and maturity T can be written as

$$\begin{aligned} P(0,T) {\mathbb {E}}^T\{ (S_T-K)^{+}\}=P(0,T) {\mathbb {E}}^T\{ (X_T-[K- \beta e^{\mu T}])^{+}\} \end{aligned}$$
(2.2)

and thus as a combination of Black and Scholes prices with strike \(K- \beta e^{\mu T}\). The model therefore preserves the same level of tractability as in the non shifted case with the advantage of gaining more flexibility.

Once each asset is calibrated to a shifted LMD model, we have two possibilities for reconstructing the dynamics of the multidimensional process. The first possibility is to reconnect the single assets by introducing a non-zero quadratic covariation between the Brownian motions (as we did for the SCMD model), leading to what we call the shifted SCMD model. The second possibility going on the same lines as the approach leading to the MVMD model, lies in applying the same shift \(\beta _i e^{\mu _i t}\) to each instrumental process \(Y_i^k\) of each asset \(X_i\)

$$\begin{aligned} S_i^k(t)=Y_i^k(t)+\beta _i e^{\mu _i t} \end{aligned}$$

where \(Y_i^k\) satisfies the dynamics in (1.10) (this is equivalent to applying the shift \(\beta _i e^{\mu _i t}\) directly to the i-th asset) and then mix the corresponding densities \(p_{S_i^k(t)}(x)\) in all possible ways. Computations similar to those for the non-shifted case show that if a solution exists, it must satisfy the definition below (details on the computations are shown in the “Appendix”).

Definition 5

(Shifted MVMD model) The shifted multi variate mixture dynamics model is given by

$$\begin{aligned} d {\underline{S}}(t)= & {} diag({\underline{\mu }}){\underline{S}}(t) dt + diag({\underline{S}}(t)) {\widetilde{C}}(t, {\underline{S}}(t))B d {\underline{W}}(t), \\ {\widetilde{C}}_i(t,{\underline{x}}):= & {} \frac{\sum _{k_1,\ldots ,k_n=1}^{N} \lambda _1^{k_1}...\lambda _n^{k_n}\ \sigma _i^{k_i}(t) (x_i-\beta _ie^{\mu _i t}) \ \tilde{\ell }_{1,\ldots ,n;t}^{k_1,\ldots ,k_n}({\underline{x}})}{x_i \sum _{k_1,\ldots ,k_n=1}^{N} \lambda _1^{k_1}...\lambda _n^{k_n}\ \tilde{\ell }_{1,\ldots ,n;t}^{k_1,\ldots ,k_n}({\underline{x}})}, \nonumber \end{aligned}$$
(2.3)
$$\begin{aligned} {\tilde{\ell }}_{1, \dots n;t}^{k_1, \dots , k_n} ({\underline{x}})= & {} p_{[S_1^{k_1}(t), \dots , S_n^{k_n}(t)]^T}( {\underline{x}})=\ell _{1,\ldots ,n;t}^{k_1,\ldots ,k_n} ({\underline{x}}-{\underline{\beta }} e^{{\underline{\mu }} t}) \end{aligned}$$
(2.4)

and defining B such that \(\rho =B B^T\), \({{\tilde{a}}}={\widetilde{C}} B ({\widetilde{C}} B)^T\),

$$\begin{aligned} {{\tilde{a}}}_{ij}(t,{\underline{x}})=\frac{ \sum _{k_1,k_2,\dots k_n = 1}^N \lambda _1^{k_1} \cdots \lambda _n^{k_n} \ {V}^{k_1,\ldots ,k_n}(t) (x_i-\beta _i e^{\mu _i t})(x_j-\beta _j e^{\mu _j t}) {\tilde{\ell }}_{1, \dots n;t}^{k_1, \dots , k_n} ({\underline{x}})}{x_i x_j \sum _{k_1,k_2,\dots k_n = 1}^N \lambda _1^{k_1} \cdots \lambda _n^{k_n} {\tilde{\ell }}_{1, \dots n;t}^{k_1, \dots , k_n} ({\underline{x}})}\nonumber \\ \end{aligned}$$
(2.5)

with \({V}^{k_1,\ldots ,k_n}\) as in (1.17).

We now have all the instruments to introduce the correlation skew and study its behaviour under shifted SCMD and shifted MVMD dynamics.

3 The correlation skew

The aim of this section is to introduce a definition of correlation skew and to study its behaviour under shifted MVMD dynamics, in comparison with the correlation skew under shifted SCMD dynamics. It is observed in practice under normal market conditions that assets are relatively weakly correlated with each other. However during periods of market stress stronger correlations are observed. Furthermore, different pairs of assets will have different correlations in general. This suggests that a single time-constant correlation parameter for all options quoted on a basket of assets, or an index, say, may not be sufficient to reproduce all option prices on the basket/index for a given expiry. The situation might be improved by considering time-constant but heterogeneous correlations, changing value for each pair of assets, but even this may not be sufficient. We refer to works inferring a multidimensional dynamics from a set of single–asset dynamics for further discussion. Among others, this has been shown in Bakshi et al. (2003) for options on the S&P 100 index and in Langnau (2010) for options on the Euro Stoxx 50 index and on the DAX index.

When computing the implied volatility, European call prices (or equivalently put prices) are considered and the reference model is the benchmark (Black and Scholes 1973) model. It seems then natural to consider as multidimensional benchmark a model where the single assets follow geometric Brownian motions and constant correlation among the single Brownian shocks is introduced. However, when moving from the one-dimensional to the multidimensional framework a bigger variety of possible option instruments to use in order to compare prices under the reference model and the model under analysis appears, the particular choice depending on the specific product we are interested in. We discuss now briefly the two dimensional case. Austing (2014) recently provided a discussion on some of the most popular multi-assets products suggesting the use of composite options as benchmark on which defining the implied correlation. In this paper we adopt a different approach based on the comparison with options on \(S_1(t)\), \(S_2(t)\) with payoff

$$\begin{aligned} \left( S_1(T)S_2(T)-K\right) ^{+}. \end{aligned}$$
(3.1)

Assume that the pair \((S_1,S_2)\) follows a bi-dimensional Black and Scholes model, in other words \(S_1\) and \(S_2\) follow two geometric Brownian motions with correlation \(\rho \) and consider the payoff in Eq. (3.1). Given the Black and Scholes implied volatilities for \(S_1\) and \(S_2\), the value \(\rho _{impl}\) such that prices under the bi-dimensional Black and Scholes model are the same as market prices

$$\begin{aligned} \text{ MKT }\_\text{ Prices }(S_1(0),S_2(0),K,T)=\text{ BS }\_\text{ Prices }(S_1(0),S_2(0),K,T, \rho _{impl}(K,T)) \end{aligned}$$

is called implied correlation. If we try to match option prices for a given maturity T and two different strikes \(K_1\), \(K_2\), we will observe two different values of the implied correlation. This is contrary to the hypothesis of constant correlation in the bi-dimensional Black and Scholes model.

The curve \(K \rightarrow \rho _{impl}(K,T)\) is called correlation skew. Thus, the correlation skew can be considered as a descriptive tool/metric similar to the volatility smile in the one-dimensional case, with the difference that it primarily describes implied dependence instead of volatility.

3.1 Explaining the skew in MVMD with the single parameter \(\rho \) via MUVM

The aim of this section is to introduce a definition of implied correlation under shifted MVMD dynamics, when using options with payoff as in Eq. (3.1). This leads to a straightforward application in the foreign exchange market within the study of triangular relationships. Imagine, for example, that \(S_1\) and \(S_2\) represent the exchange rates USD/EUR and EUR/JPY, respectively. The cross asset \(S_3=S_1 S_2\) then represents the USD/JPY exchange rate, and the corresponding payoff in Eq. (3.1) is the payoff of a call option on the USD/JPY FX rate. In the following, we will investigate whether the shifted MVMD model is able to consistently reproduce the implied volatility of \(S_3\), once the single components \(S_1\), \(S_2\) are calibrated to univariate shifted LMD models. Consistency properties of this kind are important, for example, in order to reconstruct the time series of less liquid cross currency pairs from more liquid ones.

Before proceeding we make a remark on the interpretation of \(\rho \). Keeping in mind the definition of instantaneous local correlation in a bivariate diffusion model

$$\begin{aligned} \rho _L(t) := \frac{d \langle S_1, S_2 \rangle _t}{\sqrt{d \langle S_1, S_1 \rangle _t \ d \langle S_2, S_2 \rangle _t } } \end{aligned}$$

and making use of Schwartz’s inequality, we obtain that the absolute value of the local correlation under the shifted MVMD model is smaller than the value under the shifted SCMD model. The result is contained in the Proposition below.

Proposition 6

(Local correlation in shifted MVMD and shifted SCMD) The instantaneous local correlation under the shifted SCMD model is \(\rho \), whereas for the shifted MVMD model we have

$$\begin{aligned} \rho _L(t)&=\frac{ \rho \sum _{k,k'=1}^{N} {\lambda _1}^k {\lambda _2}^{k'} \sigma _1^{(k)} \sigma _2^{(k')} \tilde{\ell }_t^{(kk')}(x_1,x_2) }{ \sqrt{ \left( \sum _{k,k'=1}^{N} {\lambda _1}^k {\lambda _2}^{k'} \sigma _1^{(k)2} \tilde{\ell }_t^{(kk')}(x_1,x_2) \right) \left( \sum _{k,k'=1}^{N} {\lambda _1}^k {\lambda _2}^{k'} \sigma _2^{(k')2} \tilde{\ell }_t^{(kk')}(x_1,x_2) \right) } }, \\\vert \rho _L(t) \vert&\le \rho \end{aligned}$$

where \({{\tilde{\ell }}}_t^{(kk')}(x_1,x_2)\) is defined as in Eq. (2.4).

We see that \(\rho \) enters the formula for the instantaneous local correlation \(\rho _L\) in the MVMD model, even though the latter is more complex than the constant value \(\rho \). Our aim is to find a value of \(\rho \) matching prices of options with payoff as in Eq. (3.1) under shifted MVMD dynamics with market prices.

In order to do that we will make use of a model with uncertain parameters of which the shifted MVMD is a Markovian projection. Indeed, as shown in Brigo et al. (2014) the MVMD model as in Definition 3 (without shift) is a Markovian projection of the model defined below

$$\begin{aligned} d\xi _i(t) = \mu _i\ \xi _i(t) dt + \sigma ^{I_i}_i(t)\ \xi _i(t) dZ_i(t),\ \ i = 1,\ldots ,n, \end{aligned}$$
(3.2)

where each \(Z_i\) is a standard one dimensional Brownian motion with \(d \left\langle Z_i, Z_j \right\rangle _t = \rho _{i,j} dt\), \(\mu _i\) are constants, \(\sigma ^I:= [\sigma ^{I_1}_1,\ldots ,\sigma ^{I_n}_n]^T\) is a random vector independent of Z and representing uncertain volatilities with \(I_1,\ldots ,I_n\) mutually independent. More specifically, each \(\sigma ^{I_i}_i\) takes values in a set of N deterministic functions \(\sigma _i^k\) with probability \(\lambda _i^k\). Thus, for all times in \((\varepsilon ,+\infty )\) with small \(\varepsilon \) we have

$$\begin{aligned} (t \longmapsto \sigma ^{I_i}_i(t)) = \left\{ \begin{array}{l} (t \longmapsto \sigma _i^1(t))\ \ \text{ with } \ {\mathbb {Q}} \ \ \text{ probability }\ \ \lambda _i^1 \\ (t \longmapsto \sigma _i^2(t))\ \ \text{ with } \ {\mathbb {Q}} \ \ \text{ probability }\ \ \lambda _i^2 \\ \vdots \\ (t \longmapsto \sigma _i^N(t))\ \ \text{ with } \ {\mathbb {Q}} \ \ \text{ probability }\ \ \lambda _i^N \end{array} \right. \end{aligned}$$

Now it is straightforward to show that if we add a shift to each component as follows

$$\begin{aligned} {{\tilde{\xi }}}_i(t)=\xi _i(t)+\beta _i e^{\mu _i t} \end{aligned}$$
(3.3)

we obtain a model having the shifted MVMD model (2.3)–(2.5) as Markovian projection. This can be easily shown by Gyöngy’s lemma (Gyöngy 1986).

Theorem 7

The shifted MVMD model is a Markovian projection of the shifted MUVM model.

Proof

A straightforward application of Ito’s lemma shows that \(\tilde{\xi }(t)\) satisfies the system of SDEs below

$$\begin{aligned} d\underline{{{\tilde{\xi }}}}(t) = diag({\underline{\mu }})\ \underline{{{\tilde{\xi }}}}(t)\ dt + diag (\underline{{{\tilde{\xi }}}}(t)-{\underline{\alpha }}(t))\ {A^I(t)}\ d{\underline{W}}(t) \end{aligned}$$
(3.4)

where \(diag({\underline{\alpha }}(t))\) is a deterministic matrix whose i-th diagonal element is the shift \(\beta _i e^{\mu _i t}\) and \({A^I(t)}\) is the Cholesky decomposition of the covariance matrix \(\Sigma ^I_{i,j}(t) := \sigma ^{I_i}_i(t) \sigma ^{I_j}_j(t)\ \rho _{ij}.\)

Define \({{\tilde{v}}}(t,{\underline{\xi }}(t)) = diag(\underline{\tilde{\xi }}(t)-{\underline{\alpha }}(t)) {A^I(t)}\). In order to show that the MVMD model is a Markovian projection of the MUVM model, we need to show that

$$\begin{aligned} {\mathbb {E}}[{{\tilde{v}}} {{\tilde{v}}}^{T}|\underline{{{\tilde{\xi }}}}(t) = \underline{{{\tilde{x}}}}] = {{\tilde{\sigma }}} \ \tilde{\sigma }^T(t,{\underline{x}}). \end{aligned}$$
(3.5)

where \({{\tilde{\sigma }}}(t,{\underline{x}})=diag({\underline{x}}) \widetilde{C}(t, {\underline{x}})B\) and \( {{\widetilde{C}}}\) is defined as in (2.3).

Observing that

$$\begin{aligned}&{\mathbb {E}}[{{\tilde{v}}} {{\tilde{v}}}^{T}|{\underline{\xi }}(t) \in d {\underline{x}}] \\&\quad = \frac{{\mathbb {E}}[diag(\underline{{{\tilde{\xi }}}}(t)- {\underline{\alpha }}(t))\ \Sigma \ diag(\underline{{{\tilde{\xi }}}}(t)- {\underline{\alpha }}(t))\ 1_{\{\underline{{{\tilde{\xi }}}}(t) \in d{\underline{x}}\}}]}{{\mathbb {E}}[1_{\{\underline{{{\tilde{\xi }}}}(t) \in d{\underline{x}}\}}]}\\&\quad = \frac{diag({\underline{x}}-{\underline{\alpha }}(t)) \sum _{k_1,\ldots ,k_n = 1}^{N} \lambda _1^{k_1}...\lambda _n^{k_n}\ {V}^{k_1,\ldots ,k_n}(t) {{\tilde{\ell }}}_{1,\ldots ,n;t}^{k_1,\ldots ,k_n} ({\underline{x}})\ diag({\underline{x}}-{\underline{\alpha }}(t)) \ d{\underline{x}}}{\sum _{k_1,\ldots ,k_n = 1}^{N} \lambda _1^{k_1}...\lambda _n^{k_n}\ \tilde{\ell }_{1,\ldots ,n;t}^{k_1,\ldots ,k_n} ({\underline{x}}) \ d{\underline{x}} } \end{aligned}$$

and performing simple matrix manipulations, Eq. (3.5) is easily obtained. \(\square \)

Since we will infer the value of \(\rho \) from prices of options with payoff as in (3.1) depending on the value of \((S_1,S_2)\) at time T only, we can make computations under the shifted MUVM rather then the shifted MVMD, as these two models have the same one-dimensional (in time) distributions. Computations under the shifted MUVM model are easier to do (with respect to the shifted MVMD case) since conditioning on \(\{I_i=j\}\), \(\xi _i\) follows a shifted geometric Brownian motion with volatility \(\sigma _i^j\).

In particular we will focus on the bidimensional specification in which case the shifted MUVM reduces to

$$\begin{aligned} d S_1(t)= & {} \mu _1\ S_1(t) dt + \sigma ^{I_1}_1(t)\ (S_1(t)-\beta _1 e^{\mu _1 t}) dW_1(t)\nonumber \\ d S_2(t)= & {} \mu _2\ S_2(t) dt + \sigma ^{I_2}_2(t)\ (S_2(t)-\beta _2 e^{\mu _2 t}) dW_2(t) \end{aligned}$$
(3.6)

where the Brownian motions \(W_1\), \(W_2\) have correlation \(\rho \).

Once we have calibrated \(S_1\) and \(S_2\) independently, each to a univariate shifted LMD model, we notice that the only parameter missing when computing prices of options having payoff as in (3.1) is \(\rho \).

Definition 8

We define the implied correlation parameter in the shifted MVMD model as the value \(\rho \) minimizing the squared percentage difference between implied volatilities from options with payoff (3.1) under the shifted MVMD model and market implied volatilities.

3.2 The correlation skew in SCMD via \(\rho \)

Now, assume to model the joint dynamics of \((S_1,S_2)\) as a shifted SCMD model instead. In this case

$$\begin{aligned} \begin{array}{l l} d S_1(t) &{}= \mu S_1(t) dt + \nu _1(t,S_1(t)-\beta _1 e^{\mu t}) (S_1(t)-\beta _1 e^{\mu t}) dW_1(t), \\ d S_2(t) &{}= \mu S_2(t) dt + \nu _2(t,S_2(t)-\beta _2 e^{\mu t}) (S_2(t)-\beta _2 e^{\mu t}) dW_2(t) \end{array} \end{aligned}$$
(3.7)

with

$$\begin{aligned} \begin{array}{l l} \nu _1(t,x) &{}= \left( \frac{\sum _{k=1}^N \lambda _1^{k} \sigma _1^k(t)^2 \ell ^k_t(x)}{\sum _{k=1}^N \lambda _1^{k} \ell ^k_t(x)}\right) ^{1/2},\\ \nu _2(t,x) &{}= \left( \frac{\sum _{k=1}^N \lambda _2^{k} \sigma _2^k(t)^2 \ell ^k_t(x)}{\sum _{k=1}^N \lambda _2^{k} \ell ^k_t(x)}\right) ^{1/2} \end{array} \end{aligned}$$
(3.8)

where the Brownian motions \(W_1\), \(W_2\) have correlation \(\rho \). In this case the parameter \(\rho \) really represents the true value of the instantaneous local correlation, as opposed to the MVMD case. We still define the implied correlation as the value \(\rho \) minimizing the squared percentage difference between implied volatilities from options with payoff (3.1) under the shifted SCMD model and market implied volatilities.

3.3 Pricing under the shifted MUVM

Now, we consider computing the price of options such as (3.1), namely options on cross FX rates, under the shifted model. In general one has a loss of tractability with respect to the non-shifted case. However, one can still express the price via a semi-analytic formula involving double integration:

$$\begin{aligned}&e^{-r T} \mathbb {E}[(B-K)_{+}]= e^{-r T} \sum _{i,j=1}^N \lambda _1^i \lambda _2^j \int _{K}^{\infty } dB(B-K) \nonumber \\&\qquad \times \int _{-\infty }^{\log \left( \frac{B/\alpha _2 - \alpha _1}{F_1(T)} \right) +\frac{\Sigma _{1,1}^{i,i}}{2}} d x_1 \frac{n(x_1;0,\Sigma _{1,1}^{i, i})n(D^{i,j}(B,x_1);0,(1-\rho ^2) \Sigma _{2,2}^{j,j})}{B-\alpha _2F_1(T)e^{x_1- \Sigma _{1,1}^{i,i}/2}-\alpha _1 \alpha _2} \end{aligned}$$
(3.9)

where n(xmS) is the density function of a one-dimensional Gaussian random variable with mean m and standard deviation S,

$$\begin{aligned} D^{i,j}(B,x_1)=ln\left( \frac{B}{F_1(t)e^{x_1-\frac{\Sigma ^{i,i}_{1,1}}{2}}+\alpha _1} - \alpha _2\right) - ln (F_2(t))+\frac{\Sigma ^{j,j}_{2,2}}{2}-\rho x_1 \sqrt{\frac{\Sigma ^{j,j}_{2,2}}{\Sigma ^{i,i}_{1,1}}}, \end{aligned}$$

and \(\Sigma ^{i,j}_{h,k}=\sigma _h^i \sigma _k^j T\) for \(h,k=1,2\) and \(i,j=1,\dots N\). This follows from the fact that the density of the product

$$\begin{aligned} B=S_1 S_2=(\xi _1+\beta _1e^{\mu _1 T})(\xi _2+\beta _2e^{\mu _2 T}) \end{aligned}$$

can be written as

$$\begin{aligned} p_{B_T}(B)dB=\mathbb {Q}(B_T\in dB)={\mathbb {E}}[1_{\{B_T \in dB\}}]=\sum _{i,j=1}^N \lambda _1^i \lambda _2^j{\mathbb {E}}\left[ 1_{\{(\xi _1^i+\beta _1 e^{\mu _1 T}) (\xi _2^j +\beta _2 e^{\mu _2 T})\in dB\}}\right] \nonumber \\ \end{aligned}$$
(3.10)

where

$$\begin{aligned} \begin{array}{l l} d\xi _1(t) = \mu _1\ \xi _1(t) dt + \sigma ^{i}_1(t)\ \xi _1(t) dW_1(t)\\ d\xi _2(t) = \mu _2\ \xi _2(t) dt + \sigma ^{j}_2(t)\ \xi _2(t) dW_2(t) \end{array} \end{aligned}$$

Now we focus on a single term in the summation (3.10) and for simplicity we drop the superscript i, j . Calling \(F_1(t)\), \(F_2(t)\) the t-forward asset prices and defining \(x_i=\ln \frac{\xi _i}{F_i(t)}+\frac{\Sigma _{i,i}}{2}\) we can rewrite the expectation as

$$\begin{aligned}&\int dx_1 dx_2 1_{\{(F_1(t)e^{x_1-\Sigma _{1,1}/2}+\alpha _1)(F_2(t)e^{x_2-\Sigma _{2,2}/2}+\alpha _2)\in dB \}}n({\underline{x}};0,\Sigma )\\&\quad =\left( -\frac{d}{dB} \int _{D_B} dx_1 dx_2 n({\underline{x}};0,\Sigma ) \right) dB \end{aligned}$$

with \(\alpha _i = \beta _i e^{\mu _i T}\), where \(n({\underline{x}};0,\Sigma )\) is the density of a bivariate normal distribution with mean equal to zero and covariance matrix \(\Sigma \) defined as below

$$\begin{aligned} \Sigma =\begin{pmatrix} \Sigma _{1,1} &{} \rho \sqrt{\Sigma _{1,1} \Sigma _{2,2}} \\ \rho \sqrt{\Sigma _{1,1} \Sigma _{2,2}}&{} \Sigma _{2,2}\\ \end{pmatrix} \end{aligned}$$
(3.11)

Observing that \(n({\underline{x}};0,\Sigma )=n(x_1;0, \Sigma _{1,1})n(x_2-\rho x_1 \sqrt{\Sigma _{2,2}/\Sigma _{1,1}};0,(1-\rho ^2)\Sigma _{2,2})\), integrating with respect to \(x_2\) and replacing in (3.10) we obtain

$$\begin{aligned} p_{B_T}(B)=\sum _{i,j=1}^N \lambda _1^i \lambda _2^j \int _{-\infty }^{\log \left( \frac{B/\alpha _2 - \alpha _1}{F_1(T)} \right) +\frac{\Sigma _{1,1}^{i,i}}{2}} dx_1 \frac{n(x_1;0,\Sigma _{1,1}^{i, j})n(D^{i,j}(B,x_1);0,(1-\rho ^2) \Sigma _{2,2}^{i,j})}{B-\alpha _2F_1(T)e^{x_1- \Sigma _{1,1}^{i,j}}-\alpha _1 \alpha _2} \end{aligned}$$

from which Eq. (3.9) is easily derived.

4 Comparing correlation skews in shifted MVMD and SCMD

The aim of this section is to compare the shifted MVMD and the shifted SCMD models in terms of implied correlation, analysing their performance in reproducing triangular relationships.

4.1 Numerical case study with cross FX rates

Specifically, we consider the exchanges \(S_1=\) USD/EUR, \(S_2=\) EUR/JPY under a shifted MUVM model with 2 components

$$\begin{aligned} S_1(t)=X_1(t)+\beta _1e^{(r^{\EUR {} }-r^{\$})t}\\ S_2(t)=X_2(t)+\beta _2 e^{(r^{Y}-r^{\EUR {} })t} \end{aligned}$$

with

$$\begin{aligned} dX_1(t)=(r^{\EUR {} }-r^{\$})X_1(t)dt+\sigma _1^{I_1}(t) X_1(t)dW_t^{1,\EUR {} }\\ dX_2(t)=(r^{Y}-r^{\EUR {} })X_2(t)dt+\sigma _2^{I_2}(t) X_2(t)dW_t^{2,Y} \end{aligned}$$

where \(r^{\EUR {} }\), \(r^{\$}\), \(r^{Y}\) are the euro, dollar and yen interest rates, respectively, and \(\sigma _1^{I_1}(t)\), \(\sigma _2^{I_2}(t)\) are as in Eq. (3.2). \(W_t^{1,\EUR {} }\) and \(W_t^{2,Y}\) indicate that we are considering the dynamics of \(S_1\) and of \(S_2\), each under its own domestic measure, that is the euro in the case of \(S_1\) and the yen in the case of \(S_2\).

We calibrate \(S_1\) and \(S_2\) independently, each to its own volatility curve, using 2 components and minimizing the squared percentage difference between model and market implied volatilities. Then, we look at the product \(S_1 S_2\), representing the cross exchange USD/JPY, and we check whether the model is able to reproduce the cross smile consistently with the smiles of the single assets. In particular, we find \(\rho \) that minimizes the squared percentage difference between implied volatilities from options on the basket \(S_3=S_1 S_2\) under the shifted MVMD model (the shifted SCMD model) and market implied volatilities. In other words, we look at the implied correlations under the shifted MVMD model and the shifted SCMD model.

When performing calibration on \(S_3\), we express both the dynamics of \(X_1\) and of \(X_2\) under the yen:

$$\begin{aligned} dX_1(t)&=(r^{\EUR {} }-r^{\$}-\rho \sigma _1^{I_1}(t) \sigma _2^{I_2}(t))X_1(t)dt+\sigma _1^{I_1} X_1(t)dW_t^{1,Y}\\ dX_2(t)&=(r^{Y}-r^{\EUR {} })X_2(t)dt+\sigma _2^{I_2}(t) X_2(t)dW_t^{2,Y}, \end{aligned}$$

and then we calculate prices of options on

$$\begin{aligned} S_3(t)=(X_1(t)+\beta _1e^{(r^{\EUR {} }-r^{\$})t})(X_2(t)+\beta _2 e^{(r^{Y}-r^{\EUR {} })t}). \end{aligned}$$

All the data for our numerical experiments are downloaded from a Bloomberg terminal. We start by considering data from 19th February 2015. The initial values of \(S_1,S_2\) are \(S_1(0)=0.878\), \(S_2(0)=135.44\). First we calibrate \(S_1\) and \(S_2\) using implied volatilities from options with maturity of 6 months. Denoting

$$\begin{aligned} \eta _1=\left( \sqrt{\frac{\int _0^T \sigma _1^1(s)^2 ds}{T}}, \sqrt{\frac{\int _0^T \sigma _1^2(s)^2 ds}{T}}\right) \\ \eta _2=\left( \sqrt{\frac{\int _0^T \sigma _2^1(s)^2 ds}{T}}, \sqrt{\frac{\int _0^T \sigma _2^2(s)^2 ds}{T}}\right) \end{aligned}$$

the T-term volatilities of the instrumental processes of \(S_1\) and \(S_2\), respectively,

$$\begin{aligned} \lambda _1&=(\lambda _1^1,\lambda _1^2),\\ \lambda _2&=(\lambda _2^1,\lambda _2^2) \end{aligned}$$

the vector of probabilities of each component and \(\beta _1\), \(\beta _2\) the shift parameters, we obtain

$$\begin{aligned} \eta _1=(0.1952,0.0709),\ \lambda _1=(0.1402,0.8598),\ \beta _1=0.00068 \end{aligned}$$

for the asset \(S_1\) and

$$\begin{aligned} \eta _2=(0.1184,0.0962),\ \lambda _2=(0.2735,0.7265),\ \beta _2=0.9752 \end{aligned}$$

for the asset \(S_2\). Then, we perform a calibration on the cross product \(S_3=\)USD/JPY using volatilities from call options with maturity of 6 months, finding the values:

$$\begin{aligned} \rho _{MVMD}(6M)=-0.6015 \end{aligned}$$

for the shifted MVMD model and

$$\begin{aligned} \rho _{SCMD}(6M)=-0.5472 \end{aligned}$$

for the shifted SCMD model. The higher value (in absolute terms) of the correlation parameter in the shifted MVMD model is due to higher state dependence in the diffusion matrix with respect to the shifted SCMD model. This is partly related to Proposition 6. In other words, in order to achieve the same local correlation as in the shifted SCMD model, the shifted MVMD model needs a higher absolute value of \(\rho \).

The corresponding prices and implied volatilities are plotted in Fig. 1 whereas Table 1 reports the absolute differences between market and model values corresponding to a few strikes. The reported plot shows that the shifted MVMD model is better at reproducing market prices than the shifted SCMD model. What is very remarkable in this example is that the shifted MVMD fits the whole correlation skew with just one value of \(\rho \).

As a second numerical experiment we repeat the calibration using prices with maturity of 9 months. Specifically, we first calibrate \(S_1\) and \(S_2\) obtaining the values

$$\begin{aligned} \eta _1=(0.2236,0.0761),\ \lambda _1=(0.0262,0.9738),\ \beta _1=0.0100 \end{aligned}$$

for the asset \(S_1\) and

$$\begin{aligned} \eta _2=(0.1244,0.0497),\ \lambda _2=(0.7584,0.2416),\ \beta _2=0.7856 \end{aligned}$$

for the asset \(S_2\). For \(S_2\), we observe that the higher volatility now has the highest probability as opposed to the results found for 6 months options. Then, we perform a calibration on the cross product \(S_3=\)USD/JPY using volatilities from call options with maturity of 9 months, finding the values:

$$\begin{aligned} \rho _{MVMD}(9M)=-0.6199 \end{aligned}$$

for the shifted MVMD model and

$$\begin{aligned} \rho _{SCMD}(9M)=-0.5288 \end{aligned}$$

for the shifted SCMD model. These values are comparable with those found for 6 months options. This shows that the model is quite consistent.

The corresponding prices and implied volatilities are shown in Fig. 2 whereas Table 2 reports some absolute differences between model and market values. Overall, also in this case the shifted MVMD model outperforms the shifted SCMD in terms of ability to reproduce market prices on the cross product.

Fig. 1
figure 1

Calibration on 6 months options, relative to 19 February 2015. The implied correlation is \(\rho =-0.6015\) for the shifted MVMD model (top) and \(\rho =-0.5472\) for the shifted SCMD model (bottom)

Table 1 Calibration on 6 months options, relative to 19 February 2015
Fig. 2
figure 2

Calibration on 9 months options, relative to 19 February 2015. The implied correlation is \(\rho =-0.6199\) for the shifted MVMD model (top) and \(\rho =-0.5288\) for the shifted SCMD model (bottom), which are comparable with the values obtained using 6 months options

Table 2 Calibration on 9 months options, relative to 19 February 2015

5 Introducing random correlations in the mixture dynamics

A single correlation parameter \(\rho \) may not be enough to fit prices on the cross asset. To overcome this, we can allow for random correlations between the single assets in the shifted MUVM model (3.6). Specifically,

$$\begin{aligned} d S_1(t)= & {} \mu _1\ S_1(t) dt + \sigma ^{I_1}_1(t)\ (S_1(t)-\beta _1 e^{\mu _1 t}) dW^{I_1}_1(t)\nonumber \\ d S_2(t)= & {} \mu _2\ S_2(t) dt + \sigma ^{I_2}_2(t)\ (S_2(t)-\beta _2 e^{\mu _2 t}) dW^{I_2}_2(t) \end{aligned}$$
(5.1)

where the Brownian motions \(W_1^{I_1}\), \(W_2^{I_2}\) now have correlation \(\rho ^{I_1,I_2}\). The correlation parameter will therefore assume the value \(\rho ^{h,k}\) in correspondence with a couple \((\sigma ^{h}_1, \sigma ^{k}_2)\) with probability \(\lambda _h \lambda _k\).

This setting is reminiscent of regime switching models. If one adopted a similar setting under the physical measure, with the drift parameters \(\mu \) being replaced with drift parameters under the physical measure, one would obtain a kind of regime switching model. This would not be a full fledged regime switching model, however, since only one regime switch would occur. Once the random variables \(I_1\) and \(I_2\) are realized, there are no more switches. The I’s would need to be replaced by Markov chains, but this would immediately increase combinatorial complexity, leading potentially to combinatorial explosion. The link of our approach with the literature of regime switching models needs to be investigated further. A futher interesting issue to investigate would be assigning a dynamics to the correlation structure and discussing how this might change when moving from the physical measure to the pricing measure. This will be considered in further work. We now go back to the relationship between shifted MUVM model and shifted MVMD model.

Theorem 9

The shifted MUVM model with uncertain correlation parameter has, as Markovian projection, a shifted MVMD model solution of the SDE (2.3) but with equation (1.17) transformed into

$$\begin{aligned} {V}^{k_1,\ldots ,k_n}(t) = \left[ \sigma _i^{k_i}(t)\ \rho ^{k_i,k_j}_{i,j}\ \sigma _j^{k_j}(t)\right] _{i,j = 1,\ldots ,n}. \end{aligned}$$
(5.2)

Proof

The Markovian projection property can be easily shown by an application of Gyöngy’s lemma, in a similar was as in the proof of Theorem 7. \(\square \)

In other words, the correlation between two generic instrumental processes \(Y_i^k\), \(Y_j^h\) will depend not only on the assets \(S_i\), \(S_j\), but will correspond to a specific choice of the instrumental processes \(Y_i^k\), \(Y_j^h\) themselves.

5.1 Cross FX rates study for shifted MVMD with random correlations

As a numerical illustration we performed on the shifted MVMD model the same experiment as in Sect. 4. We used 6 months options from 7th September 2015. The initial values of the single FX rates are \(S_1(0)=0.8950\), \(S_2(0)=133.345\). In this case the calibration of the shifted SCMD model is much worse, to the point that there is no value of \(\rho \) that can fit any of the prices obtained through this model. On the other hand, in the case of the shifted MVMD model, in particular when introducing random correlations, the fit leads to quite good results. As in the previous cases we independently calibrate \(S_1\)=USD/EUR and \(S_2\)=EUR/JPY on the corresponding implied volatilities obtaining

$$\begin{aligned} \eta _1= & {} (0.1803,0.0916),\ \lambda _1=(0.0274,0.9726),\ \beta _1=0.0128 \\ \eta _2= & {} (0.1230,0.0501),\ \lambda _2=(0.6575,0.3425),\ \beta _2=0.1867 \end{aligned}$$

and then we look at the cross exchange rate \(S_3=S_1 S_2\)=USD/JPY. When performing calibration using a shifted MVMD model with one correlation parameter only, we obtain

$$\begin{aligned} \rho =-0.6147, \end{aligned}$$

whereas when using random correlations, we have

$$\begin{aligned} \rho ^{1,1}=-0.8717,\ \rho ^{1,2}=-0.1762,\ \rho ^{2,1}=-0.6591,\ \rho ^{2,2}= -0.2269. \end{aligned}$$
Fig. 3
figure 3

Calibration of the MVMD model on 6 months options, relative to data from 7th September 2015. The calibration using one single correlation parameter is shown in the top part which corresponds to a fitted value equal to \(\rho =-0.6147\). In the bottom, calibration using the MVMD model with random correlations is presented. The corresponding fitted correlations are \(\rho ^{1,1}=-0.8717,\ \rho ^{1,2}=-0.1762,\ \rho ^{2,1}=-0.6591,\ \rho ^{2,2}= -0.2269. \)

The corresponding plots are shown in Fig. 3, in connection with Table 3. In this case we also see that using random correlations improves the fit with respect to the case with a single correlation parameter. Moreover, computing the expectation and the standard deviation for the random correlation under the risk-neutral measure \(\mathbb {Q}\), we obtain

$$\begin{aligned} \mathbb {E}^{\mathbb {Q}}( \rho ^{i,j})= & {} -0.5144 \\ \text{ Std }^{\mathbb {Q}}(\rho ^{i,j})= & {} 0.2105 \end{aligned}$$

satisfying \(\vert \mathbb {E}^{Q}(\rho ^{i,j}) - \rho \vert < \frac{ \text{ Std }^Q(\rho ^{i,j})}{2}\). In other words, the absolute difference between the \(\mathbb {Q}\)-expected random correlation and the deterministic correlation is smaller than half the \(\mathbb {Q}\)-standard deviation. This means that the random correlation is on average not that far from the deterministic value.

Table 3 Calibration on 6 months options, relative to 7th September 2015

Finally, we repeat the same experiment using options with maturity of 9 months. We find:

$$\begin{aligned} \eta _1= & {} (0.2073,0.0936),\ \lambda _1=(0.0012,0.9988),\ \beta _1=0.0216 \\ \eta _2= & {} (0.1573,0.0689),\ \lambda _2=(0.3563,0.6437),\ \beta _2=0.1288. \end{aligned}$$

When looking at the cross product \(S_1 S_2\)=USD/JPY, we obtain

$$\begin{aligned} \rho =-0.7488 \end{aligned}$$

in the case of one single correlation parameter, and

$$\begin{aligned} \rho ^{1,1}=-0.8679,\ \rho ^{1,2}=-0.2208,\ \rho ^{2,1}=-0.8303,\ \rho ^{2,2}= -0.3270 \end{aligned}$$

in the case where random correlations are introduced. Corresponding plots and absolute differences between market and model prices/implied volatilities can be found in Fig. 4 and Table 4, which show that the shifted MVMD model with random correlations outperforms the constant-deterministic correlation model in this case as well.

The values of expected random correlation and standard deviation under the \(\mathbb {Q}\) measure are

$$\begin{aligned} \mathbb {E}^{\mathbb {Q}}( \rho ^{i,j})= & {} -0.5063 \\ \text{ Std }^{\mathbb {Q}}(\rho ^{i,j})= & {} 0.2411. \end{aligned}$$

With respect to the case of 6 months options, we observe a movement of the \(\mathbb {Q}\)-expected random correlation away from the constant correlation. Moreover, if we look at the terminal correlations, that is the correlation between \(S_1(T)\) and \(S_2(T)\), for \(T=9\) months, we obtain

$$\begin{aligned} {{\hat{\rho }}}(9M)=-0.6894 \end{aligned}$$

in case \(\rho \) is deterministic and

$$\begin{aligned} {{\hat{\rho }}}_{\text{ random }}(9M)=-0.5596 \end{aligned}$$

in case \(\rho \) is random. As a final observation, we remark that in case \(\rho \) is constant, an application of Schwartz’s inequality shows that the absolute value of the terminal correlation is always smaller than the absolute value of the instantaneous correlation, as verified by the results above. One may wonder whether the same inequality holds in case of random correlations, if we substitute the instantaneous value with the mean of the random correlations. In this case it is not possible to use Schwartz’s inequality as we did before and, indeed, the results obtained show that the inequality does not hold, at least not for the example considered above.

Fig. 4
figure 4

Calibration of the MVMD model on 9 months options, relative to data from 7th September 2015. The calibration using one single correlation parameter is shown in the top part which corresponds to a fitted value equal to \(\rho =-0.7488\). In the bottom, calibration using the MVMD model with random correlations is presented. The corresponding fitted correlations are \(\rho ^{1,1}=-0.8679,\ \rho ^{1,2}=-0.2208,\ \rho ^{2,1}=-0.8303,\ \rho ^{2,2}= -0.3270.\)

Table 4 Calibration on 9 months options, relative to 7th September 2015

6 Potential use for smile extrapolation: a Renminbi case study

As a numerical illustration we performed on the shifted MVMD model analogous experiments as those in Sects. 4 and  5, using 6 months options relative to data from 18th December 2015. We first independently calibrate the exchange rates \(S_1\)=EUR/USD and \(S_2\)=USD/CNH on the corresponding implied volatilities, thus obtaining

$$\begin{aligned} \eta _1= & {} (0.1132,0.0841),\ \lambda _1=(0.2209,0.7791),\ \beta _1=0.0063 \\ \eta _2= & {} (0.0447,0.1455),\ \lambda _2=(0.5956,0.4044),\ \beta _2=-0.0587 \end{aligned}$$

where the initial values of the single FX rates are \(S_1(0)=1.0842\) and \(S_2(0)=6.55\). The corresponding plots are shown in Fig. 5 (top).

In order to test how the model perform when looking at the FX cross rate \(S_3=S_1 S_2=\)EUR/CNH, we first calibrate the shifted MVMD model using the atm call option only, thus obtaining

$$\begin{aligned} \rho _{MVMD}(6M)=-0.1205. \end{aligned}$$

We then plot the whole volatility curve corresponding to the calibrated value of \(\rho \). This is represented in Fig. 5 (bottom, left) which shows a good fit. Moreover, if we try to calibrate using not only the atm option, but a certain number of different strikes, we obtained values of the calibrated correlations quite close to the previous value \(\rho _{MVMD}(6M)\).

As a second experiment, we perform a calibration of the shifted MVMD model, when introducing random correlations. The values obtained are

$$\begin{aligned} \rho ^{1,1}=0.864, \rho ^{1,2}=-0.2843,\ \rho ^{2,1}=-0.3417,\ \rho ^{2,2}=-0.1006 \end{aligned}$$

and the corresponding plot is shown in Fig. 5 (bottom, right). The values of expected random correlation and standard deviation under the \(\mathbb {Q}\) measure are

$$\begin{aligned} \mathbb {E}^{\mathbb {Q}}( \rho ^{i,j})= & {} -0.1020 \\ \text{ Std }^{\mathbb {Q}}(\rho ^{i,j})= & {} 0.3904. \end{aligned}$$
Fig. 5
figure 5

On top, calibration of the EUR/USD FX rate (left) and the USD/CNH FX rate (right) each independently on a shifted LMD model. We used data from 18 December 2015. On the bottom, calibration of the MVMD model on the EUR/CNH exchange. In the case on the left, the calibration is obtained by fitting the shifted MVMD model to the atm option only, obtaining \(\rho =-0.1205\). In the case on the right, the calibration is obtained by fitting the shifted MVMD model with random correlation. In the last case \(\rho _1=0.864\), \(\rho _2=-0.2843\), \(\rho _3=-0.3417\), \(\rho _4=-0.1006\)

7 Conclusions

We introduced a shifted MVMD model where each single asset follows shifted LMD dynamics which are combined so that the mixture property is lifted to a multivariate level, in the same way as for the non-shifted case (Brigo et al. 2004). In this framework, we analysed the implied correlation from cross exchange rates and compared the results with those in the shifted SCMD model where the single assets are connected by simply introducing instantaneous correlations among the Brownian motions driving each asset.

In further work we may extend our analysis and examples to calibration of several maturities at the same time.

Finally, we generalized the MUVM model in Brigo et al. (2004), having MVMD as a Markovian projection, to a shifted model with random correlation, achieving more flexibility. This allows one to capture the correlation skew better. Indeed, the numerical experiments which we have conducted show that this model may be able to consistently reproduce triangular relationships among FX cross rates, or in other words to reproduce the implied volatility of a cross exchange rate in a consistent way with the implied volatilities of the single exchange rates.

One possible further use of the models given here is in proxying the smile for illiquid cross FX rates resulting from the product of two liquid FX rates. While one would have to find the relevant correlation parameters, possibly based on historical estimation with some adjustments for risk premia, the models presented here allow us to infer the detailed structure of the cross FX rate smile in an arbitrage free way.