Abstract
We propose an affine term structure model that allows for tenordependence of yield curves and thus for different risk categories in interbank rates, an important feature of postcrisis interest rate markets. The model has a Nelson–Siegel factor loading structure and thus economically well interpretable parameters. We show that the model is tractable in terms of estimation and provides good insample fit and outofsample forecasting performance. The proposed model is arbitragefree across maturities and tenors, and thus perfectly suited for risk management and pricing purposes. We apply our framework to the pricing of caplets in order to illustrate its practical applicability and its suitability for stress testing.
Introduction
The global financial crisis of 2007/2008 has triggered some structural changes in interest rate markets that invalidated the classical notion of a single yield curve. In particular, in postcrisis interest rate markets, yields vary not only with respect to maturity but depend also on the length of the tenor structure of the interest rate derivative from which they are calculated. Thus, nowadays interest rate markets are characterized by multiple (tenor dependent) term structures that reflect different risk categories. This has important implications for pricing, portfolio allocation, risk management and monetary policy.
In this paper, we construct an affine term structure model that (1) allows for tenordependent yield curves, (2) is free of arbitrage across maturities and tenors, (3) is computationally tractable with (4) economically interpretable parameters, and (5) provides superior insample fit and outofsample forecasting performance within the Nelson–Siegel class of models. The model is suitable also for negative interest rate environments and allows for semiclosed pricing formulas for various interest rate derivatives.
Our paper relates to a large literature on term structure models. The models introduced in [28, 30] are heavily used by various central banks to fit daily yield curves (see e.g. [1, 9]). They are flexible enough to describe various different shapes of interest rate term structures and the dynamic extension of the Nelson–Siegel model introduced in [6] has been shown to be highly tractable and at the same time provides accurate forecasting results. While the Nelson–Siegel parameters (as well as the Svensson parameters) have convenient economic interpretations as level, slope and curvature factors, the model is not arbitragefree and hence not suitable for pricing purposes and of limited use for risk management. Affine arbitragefree termstructure models have been introduced e.g. in [5, 8] in the single curve setting. However, as shown in [7] the canonical affine models typically provide poor forecasting performance. This shortcoming is overcome in [2] who introduce an arbitragefree affine specification of the Nelson–Siegel model in the single curve setting.
The financial crisis of 2007/2008, however, has invalidated the classical notion of a single yield curve. Reference [20] points out the importance of taking different risk categories in interbank rates into account. In the past years, affine short rate models in the multiple curve setting have been introduced in [14, 16, 19, 25,26,27] among others. Affine Libor models for tenor dependent yields have been developed in [17, 18]. Reference [4] provides a very general framework to model multiple yield curves through affine processes which comprises short rate models, Libor market models and Heath–Jarrow–Morton (HJM) models of tenor dependent yield curves. In particular, their approach relies on a general numéraire process and the modelling of multiplicative spreads between tenor dependent interbank rates and Overnight Indexed Swap (OIS) rates as first suggested in [3].
Our approach builds on the idea developed in [2] and combines it with the general affine model introduced in [4] in order to develop an affine arbitragefree multiple curve version of the Nelson–Siegel model. The proposed model inherits advantages from both the affine models as well as the parsimonious Nelson–Siegel class of models: it can be estimated through standard techniques (Kalman filter) and presents good in and outofsample performances. Since it is free of arbitrage across maturities and tenors and has parameters with a sound economic interpretation, the model is well suited for derivative pricing and tailormade for scenario generation, stress testing and other risk management purposes. We illustrate this by applying our model to the pricing of caplets and by analysing the impact of level, slope and curvature shifts in the discount curve and the multiplicative log spreads on the caplet price.
The remainder of the paper is structured as follows. In Sect. 2 we specify the financial market setting of our model. Section 3 summarizes the general affine multiple curve model introduced in [4] adapted to our market setting. The specification of the arbitragefree multiple curve Nelson–Siegel model is then derived in Sect. 4. Estimation and forecasting results are provided in Sect. 5 while Sect. 6 discusses the pricing of interest rate derivatives in the proposed model. Section 7 concludes.
Financial market instruments
We consider a financial market in which Xibor rates are quoted for a finite and generic set of tenors \({\mathscr {D}}=\{\delta _1,\ldots ,\delta _m\}\) with \(\delta _1<\cdots <\delta _m\) for some \(m\in \mathbb {N}\) and we denote by \(L_t^{k}(t,t+\delta _{\mathrm{k}})\) the (spot) Xibor rate prevailing at time t for the time interval \([t,t+\delta _{\mathrm{k}}]\). The tenor \(\delta _{k}\) is typically equal to 1 week (1W) or several months (1M, 2M, 3M, 6M, or 12M). We assume that forward rate agreements with Xibor rates of different tenors as reference rate are traded in the market.
Definition 1
A forward rate agreement (FRA) with tenor \(\delta _{k}\), settlement date T and strike K, is a contract stating that the fixed interest rate K will apply to a certain nominal value N, which we assume to be normalized to one, over the specified future period \([T,T+\delta _{k}]\). The payoff at maturity \(T+\delta _{k}\) is then given by
The interest rate K which ensures that the FRA for period \([T,T+\delta _{k}]\) has value zero at time t is referred to as FRA rate and will be denoted by \(L^{k}_t(T,T+\delta _{k})\) in the sequel.
In addition, we assume that overnight indexed swaps are traded in the market. The EONIA rate is the reference rate for overnight borrowing in the interbank market in the Eurozone and represents the underlying of overnight indexed swaps in the Eurozone.
Definition 2
An overnight indexed swap (OIS) is a contract where an investor agrees to pay a predetermined fixed rate K on a notional N at some predetermined equidistant future dates \(\{T_0,\ldots , T_n\}\) with tenor \(\delta =T_iT_{i1}\) to the other party. In return, the investor receives interest rate payments at a floating interest rate on the same notional principal. The floating rate in an OIS is indexed to an overnight rate (such as the EONIA in the Eurozone) and is given by simply compounding the consecutive overnight rates between the dates \(T_i\) and \(T_i+\delta \). In the following, we again normalize the notional N to one. The fixed rate K such that the OIS contract for the time period \([T_0,T_n]\) has zero value at time \(t\le T_0\) is called the time t forward swap rate and will be denoted by \(K_t^{\mathrm{OIS}}(T_0,T_n)\).
Applying appropriate bootstrapping techniques (see e.g. [15]) to the quoted OIS forward swap rates, we can extract the term structure \(T\mapsto P^{d}(t,T)\) of OIS zerocoupon bond prices \(P^{d}(t,T)\) at time t and we define the simple compounded OIS spot rate at time t for the time period \([t,t+\delta ]\) as
From the bootstrapped OIS zerocoupon bond prices, we can extract the simple compounded forward OIS rate \(L_t^d(T,T+\delta )\) at time t for the time period \([T,T+\delta ]\) which is defined via
The OIS rates will be considered as riskfree. This is motivated by the wide dissemination of collateral agreements to reduce the counterparty risk associated with OTCtraded derivatives.^{Footnote 1} Collateralisation is based on discounting at an overnight rate and reflects a funding and hedging mechanism. As a consequence, prices of OTCtraded derivatives quoted in the interbank market can be considered free of credit and liquidity risk.
We then have the following definition of a multiple curve financial market (compare [4, 13]).
Definition 3
For a fixed time horizon \(\mathbb {T}<\infty \) we consider a financial market consisting of the following basic traded assets

1.
OIS zerocoupon bonds for all maturities \(T\in [0,\mathbb {T}]\), and

2.
FRA contracts for all maturities \(T\in [0,\mathbb {T}]\) and for all tenors \({\delta _1,\ldots , \delta _m}\).
The riskfree term structure can be derived from the OIS bonds. The tenordependent (risky) yield curves reflect different levels of credit and liquidity risk in the interbank market and can be extracted from quoted FRA and swap rates of corresponding tenors. Since the fair swap rate \(S_t^k\) at time \(t\le T_1\) of a swap with tenor \(\delta _k\) and cashflow dates \(T_1,\ldots ,T_n\) can be represented through the relation
as a function of FRA rates for some fixed strike rate \(K\in \mathbb {R}\) and tenor length \(\delta _k\), it suffices to assume that FRA contracts for all maturities and tenors are traded in the market.
Continuoustime affine model for tenordependent term structures
Next, we set up the continuous time affine model for multiple yield curves. Therefore, we first fix the dynamics of the riskfree short rate. Afterwards, we specify the dynamics of the spreads between tenordependent (risky) yields and the riskfree yields in order to obtain the risky term structures.
In the first step, we follow the classical approach of [8] in modelling the term structure of riskfree interest rates as an affine function of underlying latent factors. We suppose that prices are discounted at the OIS rate, i.e., the numéraire \(B=(B_t)_{0\le t\le \mathbb {T}}\) is given by the OIS bank account
for \(t\in [0,\mathbb {T}],\) where \(r^{d}_t\) denotes the overnight short rate at time t. Further, we denote by \(\mathbb {Q}\) the associated (spot) martingale measure such that prices of traded assets can be calculated as conditional expectations of Bdiscounted payoffs under \(\mathbb {Q}\). We consider the probability space \((\Omega ,{\mathscr {F}},\mathbb {Q})\) with filtration \(({\mathscr {F}}_t)_{t\ge 0}\) generated by a standard Brownian motion \((W_t)_{t\ge 0}\) on \(\mathbb {R}^{n}\) under \(\mathbb {Q}\). We model the shortrate \(r^d\) as an affine function of a state variable \(X_t\) that follows an affine process^{Footnote 2} on some open subset \(M\subset \mathbb {R}^n\). More specifically, we assume that \(X_t\) satisfies the stochastic differential equation
under \(\mathbb {Q}\), where \(\theta :[0,T]\rightarrow \mathbb {R}^n\) and \(K:[0,T]\rightarrow \mathbb {R}^{n\times n}\) are bounded, continuous functions. Moreover, the matrix \(\Sigma :[0,T]\rightarrow \mathbb {R}^{n\times n}\) is assumed to be a bounded, continuous function and \(D:\mathbb {R}^n\times [0,T]\rightarrow \mathbb {R}^{n\times n}\) has diagonal structure,
for \(\gamma =(\gamma ^1,\ldots ,\gamma ^n):[0,T]\rightarrow \mathbb {R}^n\) and \(\vartheta =(\vartheta _i^j)_{i,j=1,\ldots ,n}:[0,T]\rightarrow \mathbb {R}^{n\times n}\) bounded and continuous functions. We assume that the overnight (discount) short rate \((r_t^{d})_{t\ge 0}\) is an affine function of the state variable, i.e.,
for a bounded and continuous function \(\rho _0^{d}:[0,\mathbb {T}]\rightarrow \mathbb {R}\) and a vector \(\rho _1^{d}\in \mathbb {R}^n.\)
In the second step, in order to introduce tenordependent term structures, we follow [4] and model the multiplicative spot spreads
between Xibor rates and simple compounded OIS spot rates for each tenor k as exponentially affine in \(X_t\). Therefore, we assume that
for a bounded and continuous function \(\rho _0^{k}:[0,\mathbb {T}]\rightarrow \mathbb {R}\) and a vector \(\rho _1^{k}\in \mathbb {R}^n.\) Note that the spot spreads can be directly calculated from the quoted Xibor and OIS rates. Moreover, we define the multiplicative forward spreads \(S^{k}(t,T)\) for \(0\le t\le T\le \mathbb {T}\) and tenor k by
The Tforward measure equivalent to the pricing measure \(\mathbb {Q}\) is defined via the RadonNikodym derivative
with \(P^d(t,T)=\mathbb {E}^{\mathbb {Q}}[B_t/B_T{\mathscr {F}}_t]\) the time t price of a discount (OIS) bond with maturity T. As shown in [4, Prop. 2.5], the fair FRA rate for tenor \(\delta _{k}\) can then be expressed as conditional expectation of the (spot) Xibor rate of tenor \(\delta _{k}\), i.e.,
for all \(0\le t\le T\le \mathbb {T}.\) Similarly, the multiplicative (forward) spread satisfies
for all \(0\le t\le T\le \mathbb {T}.\) It can be shown that X generates exponentially affine discount bond prices \(P^{d}(t,T)\) and forward spreads \(S^{k}(t,T)\). More explicitly, by applying Prop. 3.16 in [4], the discount bond prices and forward multiplicative spreads for tenor k are given by
for all \(0\le t\le T\le \mathbb {T}\) and all \(k=1,\ldots ,m\), where
with \(\phi \) and \(\psi \) denoting the characteristic exponents of the process \(Y=(X,\int _0^{\cdot } X_sds)\), which are given as solutions to the system of ordinary differential equations (ODEs)
with boundary conditions \(\phi (0,u,v)=0\) and \(\psi (0,u,v)=u\). We refer to “Appendix A” for the derivation of these ODEs.
Arbitragefree dynamic tenordependent Nelson–Siegel model
As a special case of the affine multiple term structure model outlined above, we develop in this section an arbitragefree tenordependent Nelson–Siegel model. For simplicity of notation, we consider a model for only two term structures here, the discount curve and a risky curve of tenor k. Therefore, we consider unobservable state variables \(X_t=(X_t^{1},\ldots , X^{6}_t)\in \mathbb {R}^6\) where the first three factors correspond to level, slope and curvature factors of the overnight (discount) curve, and the remaining variables influence the level, slope and curvature of the tenordependent term structure for tenor k through their impact on the multiplicative spreads. Following [2] we suppose that the overnight (discount) short rate is of the form
i.e., \(r_t^d\) is determined by the sum of the level and slope factors for the discount curve and the parameters \(\rho _0^{d}\equiv 0\) and \(\rho _1^{d}=(1,1,0,0,0,0)^\top \) in the notation of Sect. 3. Further, we assume that the state variables have the following \(\mathbb {Q}\)dynamics
with \(\theta =(\theta _1,\ldots ,\theta _6)\) and matrix \(K\in \mathbb {R}^{6\times 6}\) of triangular form
for \(\lambda _d,\lambda _k>0.\) In the following we will denote the upper left \(3\times 3\) submatrix of K by \(K^{d}\) and the lower right \(3\times 3\) submatrix by \(K^{k}\). Further, suppose that \(\Sigma \in \mathbb {R}^{6\times 6}\) has triangular form
Hence, the first three factors \((X_t^1,X_t^2,X_t^3)\), which influence the discount curve, are independent of the last three factors \((X_t^4,X_t^5,X_t^6)\), which affect the tenor spreads, and vice versa. In analogy to [2] and by application of Eq. (2), we can express the riskfree zerocoupon bond prices in this model as follows
where we have
because \(\rho _0^{d}\equiv 0\) and \(\gamma _j\equiv 1\) in this setting, and
as \(\vartheta ^j\equiv 0\) in this setting. The latter equation can be rewritten as
Due to the boundary condition \(\psi (0,u,v)=u\) we obtain that the fifth and sixth component of the vector \(\psi (Tt,0,\rho _1^{d})\) are equal to zero and we have
Thus, due to the special choice of the matrices K and \(\Sigma \), we can rewrite the discount bond price as
where the functions \(\psi ^d(t,T)=(\psi _1^d(t,T),\psi _2^d(t,T),\psi _3^d(t,T))\) with \(\psi _i^d(t,T)=\psi _i(Tt,0,\rho _1^d)\), for \(i=1,2,3\), are solutions to the system of Riccati equations
and the function \({\mathscr {A}}^{d}\) solves
where \(\theta ^\mathrm{d}=(\theta ^{d}_1,\theta ^{d}_2,\theta ^{d}_3)^\top \). Hence, the discount bond prices are determined solely by the first three latent factors. The boundary conditions are \(\psi _1^d(T,T)=\psi _2^d(T,T)=\psi _3^d(T,T)={\mathscr {A}}^{d}(T,T)=0.\) Similarly to [2], we obtain that the solutions to the ODEs are equal to the Nelson–Siegel factor loadings
and \({\mathscr {A}}^{d}\) is given by
Therefore, the riskfree (discount) yields \(y^{d}_t(\tau )\) at time t for timetomaturity \(\tau =Tt>0\) are described by a function of the form
Thus, factor loadings agree with those in the Nelson–Siegel model but there is an additional "yieldadjustment term" \({\mathscr {A}}^{d}(t,T)/(Tt)\) which depends on the maturity of the bond. As has been shown in [2], by taking this additional term into account, the classical dynamic Nelson–Siegel model developed in [6] can be turned into an arbitragefree term structure model. In other words, the yieldadjustmentterm ensures absence of arbitrage across maturities in our model.
Next, we turn to the term structure of interest rates for tenor k. Therefore, we assume that the multiplicative spot spread for tenor k is of the form
i.e. \(\rho _0^{k}=0\) and \(\rho _1^{k}=(0,0,0,1,1,0)^\top \) in the notation of Sect. 3. We obtain from Eq. (3) that the tenordependent multiplicative forward spreads satisfy
with
since \(\rho _0^{k}\equiv 0\) and \(\gamma _j\equiv 1\) in this setting. Moreover, we have
as \(\vartheta ^j\equiv 0\). The latter equation implies that
where we use the fact that \(\psi _i(Tt,0,\rho _1^{d})=0\) for \(i=4,5,6\) (compare Eq. (5)) and the fact that we have \(\psi _i(Tt,\rho _1^{k},\rho _1^{d})=\psi _i(Tt,0,\rho _1^{d})\) for the first three components \(i=1,2,3\), as implied by the boundary condition \(\psi (0,u,v)=u\) for \(u=\rho _1^{k}=(0,0,0,1,1,0)^\top \). Hence, we can express the multiplicative forward spreads as
where the functions \(\psi ^k(t,T)=(\psi ^k_1(t,T),\psi _2^k(t,T),\psi _3^k(t,T))\) with \(\psi _i^k(t,T)=\psi _{3+i}(Tt,\rho _1^k,\rho _1^d)\) for \(i=1,2,3\), are solutions to the system of Riccati equations
and the function \({\mathscr {A}}^{k}\) solves
where \(\theta ^{k}=(\theta _4,\theta _5,\theta _6)^\top .\) Hence, the multiplicative forward spreads are solely determined by the last three latent factors. The boundary conditions are given by \(\psi ^k(T,T)=(1,1,0)\) and \({\mathscr {A}}^{k}(T,T)=0\). The solution of this system of equations is given by
and
Thus, the bond prices and multiplicative spreads are both exponentially affine in \(X_t\) with coefficients that have a similar structure as the Nelson–Siegel factor loadings. We now consider the risky (tenordependent) term structure for tenor k. The oneperiod continuously compounded yields \(y^{k}_t(t+\delta _{k})\) can be calculated from the simple compounded spot Xibor rates as follows
where we used relation (1). The multiperiod yields \(y^{k}_t(\tau )\) at time t for tenor k and timetomaturity \(\tau =Tt=N\delta _k\) for some \(N\in \mathbb {N}\), are obtained as follows
where we again applied relation (1). Since the discount yields are affine in \(X_t\) (compare Eq. (6)) and the multiplicative forward spreads are exponentially affine in \(X_t\) (compare Eq. (7)), we obtain that the tenordependent yields are also affine in \(X_t\). Moreover, we observe that the tenordependent term structure is also of Nelson–Siegel shape due to the special structure of the discount short rate and the multiplicative spot spreads. As the multiplicative spreads are usually greater than one in the post crisis period, the level, slope and curvature factors for the spreads lead to a positive shift of the overnight (discount) curve so that the risky term structures are strictly larger than the discount curve. The terms \({\mathscr {A}}^k(t,t+n\delta _k)/\tau \) in the expression for the tenordependent yields represent “tenoradjustment terms” which ensure absence of arbitrage across tenors.
Numerical results
In this section we illustrate the numerical performances of the proposed model by fitting it to real data. We start by illustrating how the historical time series of yields and tenor dependent multiplicative spreads are built, then we show insample and outofsample performances in terms of estimation and forecasting. Computations are done using Matlab^{®}(Version R2019b) in Microsoft Windows 10^{®}running on a machine equipped with Intel(R) Core(TM) i79750HQ CPU @2.60GHz and 16 GB of RAM.
Data
Our data set consists of daily bootstrapped discount and three month yield curves for maturities up to 10 years. More specifically, we use European market data provided by Bloomberg for the time period from September 2nd, 2005, until May 22nd, 2018. We bootstrap the discount curves from Overnight Indexed Swap (OIS) rates indexed to the EONIA rate with maturities ranging from one week up to 10 years. Further, we construct the risky 3 month yield curves from market quotes of deposit rates for 3 month maturity and swap rates with maturities between 6 months and 10 years. All yields have been bootstrapped following the procedure in [15]. In total we consider 3316 observations for yields and multiplicative spreads of 3 months tenor. From the constructed yield curves we only consider the interest rates with maturities \(\tau \in \{0.25,0.5,1,3,5,7,10\}\) (expressed in years) in the insample estimation and outofsample forecasting. These have been directly bootstrapped from market data and hence do not allow for any arbitrage opportunities that might arise due to interpolation methods. Including interest rates of further maturities has only a minor impact on the results.
Yield curve fitting
In Sect. 4, we introduced an affine term structure model which is free of arbitrages across tenors and maturities and takes into account correlations between factors. We will refer to this model as correlated arbitragefree multiple curve Nelson Siegel (AFMCNS) model. We provide a statespace formulation of the AFMCNS model and describe the Kalman filter estimation of the model in “Appendix B”. Parameters can then be estimated by maximizing numerically the loglikelihood function in (14), for which we use the builtin Matlab^{®} function fmincon.
To evaluate the model’s in and outofsample performances, we consider three submodels which are closely related to the correlated AFMCNS model for comparison. The first is the independent AFMCNS model, which is equivalent to the correlated one with the obvious exception that the offdiagonal elements in the matrices \(K^{\mathbb {P}}\) and \(\Sigma \) are equal to 0. The estimation results for the correlated as well as for the independent version of the AFMCNS model are presented below. Second, by forcing the yield and tenor adjustment terms to be equal to 0, we obtain a dynamic Nelson–Siegel model which is not arbitragefree but adapted for a multiplecurve setting. We refer to this model as MCDNS model. It can be written in statespace form as
where \(X_t = \{X^1_t, X^2_t, X^3_t, X^4_t, X^5_t, X^6_t\}\), \(y_t = \{y^d_t(\tau _n), \log S^k_t(\tau _n)\}_{n=1}^{7}\), B is the matrix of the Nelson–Siegel factor loading coefficients, and A is a diagonal matrix in the independent MCDNS and a full matrix in the correlated MCDNS. Finally,
where H is a diagonal matrix and \(Q=q q'\) is a diagonal matrix in the independent MCDNS and a full matrix in the correlated MCDNS. The Kalman filter for the MCDNS model is set up following [2, Section 4.1]. Estimation results for the MCDNS model are reported in Table 1.
AFMCNS model estimation
Table 2 shows the estimation results for the AFMCNS model. For the oneday conditional meanreversion matrix \(\exp \left( K^{\mathbb {P}} /252\right) \) the elements on the main diagonal are similar in the independent and correlated case, with factors related to the discount yields more persistent than those related to the logspreads. Offdiagonal elements appear non negligible, especially for those elements corresponding to \(\{X_t^i\}_{i=4}^{6}\) for which we find a lot of values higher than 0.01 in absolute value. Concerning the longrun mean \(\theta ^{\mathbb {P}}\) there is not much difference between the two models. However, the values are shifted when compared to \(\mu \) (in the MCDNS) which can be explained by the fact that we are adding the adjustment term. In fact, the yield and tenor adjustment terms are negative for all maturities and decreasing in maturity (compare Fig. 1). Since we subtract the adjustment terms in the measurement equation, the mean level in the AFMCNS model is smaller than in the MCDNS model. Moreover, the tenor adjustment term is much smaller in size compared to the yield adjustment term (roughly by a factor 0.1) which explains why the mean level of the log spreads is less affected than the mean level of the discount yields.
When converting the volatility matrix \(\Sigma \) into a oneday conditional covariance matrix^{Footnote 3}
we obtain:
in case of independence and
in the correlated case. We find that factors related to the discount yields are more volatile than those related to the logspreads. This is comparable in case of the MCDNS model.^{Footnote 4} From the covariance estimates, we calculate correlation coefficients between factors
and find that correlations are not negligible. E.g. there is a very strong negative correlation of 0.9665 (resp. 0.8350) between \(X_1\) and \(X_2\) (resp. between \(X_4\) and \(X_6\)). This indicates that for the discount yields level shifts often go along with slope changes while for the log spreads level shifts mostly go along with curvature changes.
When comparing the loglikelihood values reported in the caption of Table 2, it is evident that the correlated version of the AFMCNS model performs better insample than the independent specification. This is not only due to the higher number of parameters, as largely confirmed by the likelihood ratio (LR) test for which we obtain a very high LR of 10755.21, but this is also confirmed by the results in Table 3, where root mean squared errors (RMSEs) are consistently smaller in the correlated case than in the independent factor case. RMSEs and mean errors (ME) displayed in Table 3 are computed from the differences between the historical yields and model estimated yields. Figure 2 shows a plot of these time series. This figure also appreciates the ability of the proposed model to replicate the time series of observed yields. Further, comparing the loglikelihood ratios of the MCDNS and the AFMCNS models (reported in the captions of Tables 1 and 2) confirms superiority of the AFMCNS model. This is also confirmed by the smaller RMSEs in Table 3.
Forecasting
We evaluate the forecasting performance of our proposed model by comparing six and twelvemonths ahead forecasts of discount and tenor dependent yields with its nonarbitrage free counterpart. For convenience of exposition, we denote \(y_t := \{y^d_t(\tau _n), y^k_t(\tau _n)\}_{n=1}^{7}\). Forecasting yields in the AFMCNS model consists of two steps:

(1)
Estimate model parameters over the sample period ending at time T as in Sect. 5.2.

(2)
Compute optimal h months ahead forecasts as
where \(E^{\mathbb {P}}_T[X_{T+h}] = (I  \exp \left( K^{\mathbb {P}} h)\right) )\theta ^{\mathbb {P}} + \exp \left( K^{\mathbb {P} }h\right) X_{T}\). We are interested in computing 6 and 12 months ahead forecasts which are typically better achieved using monthly data instead of daily data. Therefore, we split our data set with daily observations in 21 datasets with monthly observations by taking one data point each month. More specifically, we construct historical time series of yields taking every 21st observation. We do so starting from each day of the first month of the dataset. In this way, we get 21 historical time series with monthly observations. Then, on each subdataset, we estimate the model on a 6 years (72 months) rolling window and forecast yields using Eq. (10). This produces 73 forecasts for each subdataset and in total \(73 \times 21 = 1533\) six and twelve months ahead forecasts. In Table 4 we report the root mean squared forecasting errors (RMSFE) for the proposed arbitrage free multiple curve Nelson Siegel model (AFMCNS) and for the multiple curve dynamic Nelson Siegel (MCDNS) model for benchmark comparison. Note that we only consider here the independent versions of those models. In fact, in unreported tests, we obtained superior out of sample performances of those models with respect to their correlated counterparts. This finding is consistent with [2, Table 7]. Results in Table 4 are striking and show that the AFMCNS is better in forecasting than the MCDNS throughout all the various yields and maturities with a RMSFE around 4060% smaller than the benchmark model for short maturities and 1240% for higher maturities.
Derivative pricing
As an application, we illustrate the pricing of derivative instruments under the proposed AFMCNS model. More specifically, we consider the price at time t of a caplet with notional N, reset date T, and settlement date \(T+ \delta _k\). Its payoff at the settlement date is given by \(N\delta _k(L(T, T+\delta _k)  K )^+\). Following [4] the time t price of the caplet can be derived in semiclosed form. Therefore, we first define
Then Eqs. (2) and (3) imply that
where we used that \(\rho _0^d=\rho _0^k\equiv 0\) in our Nelson–Siegel setting as well as \(\phi (0,u,v)=0\) and \(\psi (0,u,v)=u\). Then, the modified moment generating function of \(Y_T\) can be calculated (using Eq. (2) and \(\rho _0^d=\rho _0^k\equiv 0\)) as
with \(\phi (Tt, u, v)\) and \(\psi (Tt, u, v)\) given by the solution to the system of ODEs in Eq. (4) adapted to our Nelson–Siegel setting, i.e.
with boundary conditions \(\phi (0,u,v)=0\) and \(\psi (0,u,v)=u\). This ODE system can be solved analytically and the solution for \(\psi (Tt, u, v)\) is given by^{Footnote 5}
where \(v_i\) and \(u_i\) are \(i^\mathrm{th}\) entries of the vectors u and v.
The price of the caplet can then be expressed as
Using the above modified moment generating function, we thus obtain (compare [4], Proposition 4.2)
where
This illustrates the applicability of our model to the pricing of caplets, which also holds in a negative interest rate environment. To further confirm the practical relevance of the model for derivative pricing we propose a simple calibration exercise. We build a surface of caplet prices on 15 Sep 2016 using discount bond values and cap implied volatilities, following the procedure outlined in [10]. We end up with market caplet prices for \(n_T = 15\) maturities \(T=\{2.5, 3, 3.5, 4, 4.5, 5, 5.5, 6, 6.5, 7, 7.5, 8, 8.5, 9, 9.5\}\) years and \(n_K = 5\) strikes \(K = \{0.005, 0.0013, 0.0025, 0.01, 0.02\}\). Then, we calibrate the proposed model by solving numerically the following minimization problem:
where \(\Theta \) is the model parameters vector, \(\Pi ^{mkt}\) is the market caplet prices surface and \(\Pi (\Theta )\) is the model caplet prices surface (where we have put in evidence the dependence on the model parameters) computed as in (11). Let us consider for this numerical illustration the independent version of the AFMCNS model. We obtain the following estimates for the model parameters: \(\lambda ^d = 0.3540\), \(\lambda ^k = 0.4680\), \(X_0 =\{0.0121, 0.0237,0.0212,0.0003, 0.0004, 0.0004\}\) and
Figure 3 shows the final calibration output. We find that the model is able to replicate correctly observed caplet prices with a mean absolute error (MAE) around 1.10E04 (and squared pricing error of order \(10^{7}\)).
The pricing of swaptions in our framework can in principle be done by adapting the results in [4, Section 4.2] to the proposed AFMCNS model along the same lines to what has been done in the case of caplets. It should be pointed out, however, that the pricing problem is more delicate since the semiclosed formulas in the mentioned paper rely on an approximation of the exercise boundary by an event which is defined in terms of an affine function of the driving process. Finally, for what concerns derivative instruments with more involved payoff structure, we remark that, since \(X_t\) evolves according to a multidimensional Gaussian OrnsteinUhlenbeck process, which can be simulated efficiently using for example the Euler scheme or, alternatively, the state transition Eq. (13), interest rate derivatives can be efficiently priced under the proposed model via Monte Carlo simulation. Other analytical or semiclosed pricing formulas for interest rate derivatives in the multiple curve setting are derived e.g. in [12] using continuousstate branching processes with immigration as driving processes or in [11] using timeinhomogeneous Lévy processes to model forward swap rates.
As the proposed model has a Nelson–Siegel factor loading structure, the latent variables have convenient economic interpretations as level, slope and curvature factors. Due to this feature and the fact that the model is free of arbitrage, our approach is tailormade for risk management purposes. In the sequel, we illustrate such an application by studying the price of the caplet for varying initial state variables \((X^1_t,\ldots , X_t^6)\). Results reported in Fig. 4 show that the caplet price is most severely affected by shifts in the level \(X_t^1\) of the discount curve, followed by slope changes \(X_t^2\), and finally by curvature shifts \(X_t^3\). Corresponding changes in \(X_t^4,X_t^5,\) and \(X_t^6\) representing shifts in level, slope and curvature of the multiplicative log spreads have a smaller but nonnegligible impact on the caplet price. This is particularly interesting when considering the fact that during the global financial crisis, spreads between interbank rates and overnight (discount) rates increased from less than 10bps to levels up to 250bps at the peak of the crisis. In this way, our proposed approach has important implications for risk management as economically meaningful stress scenarios can be easily simulated and due to absence of arbitrage can be used for calculating portfolio values under adverse market situations.
Conclusions
In this paper, we proposed an arbitragefree affine term structure model for multiple yield curves that has a Nelson–Siegel factor loading structure. Our numerical results document superior insample and outofsample performance of our approach within the Nelson–Siegel class of models. Due to the sound economic interpretation of the latent variables of our model and the absence of arbitrage, the setting is very well suited for risk management purposes. In particular, it allows to study the sensitivity of a portfolio of interest rate related products with respect to level, slope and curvature shocks to the riskfree yield curve and/or to the tenor spreads. We illustrated this by applying the proposed model to the pricing of caplets. Since the valuation of various interest rate derivatives relies on forward looking interest rates, such as LIBOR, and hence on potential spread adjustments to overnight rates, we believe that our results remain relevant also beyond a discontinuation of LIBOR after 2021.
Data availability
Data is available through Bloomberg. Code snippets are available upon request.
Notes
That is, a timehomogeneous Markov process whose characteristic function \(\mathbb {E}^{\mathbb {Q}}[e^{\langle u, X_t\rangle }]=e^{{\tilde{\phi }}(t,u)+{\tilde{\psi }}(t,u)^\top X_0}\) is exponentially affine in the initial state \(X_0\).
The small magnitude of the components of Q is explained by the fact that we are using daily data. Those numbers are consistent with [2, Eq. 12 and 13] which use monthly data.
Results are available upon request.
\(\phi (Tt, u, v)\) has been computed with Mathematica^{®}, code snippets are available upon requests.
References
Bank for International Settlements: Zerocoupon yield curves: technical documentation. BIS papers no. 25 (2005)
Christensen, J.H.E., Diebold, F.X., Rudebusch, G.D.: The affine arbitragefree class of Nelson–Siegel term structure models. J. Econom. 164, 4–20 (2011)
Cuchiero, C., Fontana, C., Gnoatto, A.: A general HJM framework for multiple yield curve modelling. Finance Stoch. 20, 267–320 (2016)
Cuchiero, C., Fontana, C., Gnoatto, A.: Affine multiple yield curve models. Math. Finance 29, 568–611 (2019)
Dai, Q., Singleton, K.J.: Specification analysis of affine term structure models. J. Finance 55, 1943–1978 (2000)
Diebold, F.X., Li, C.: Forecasting the term structure of government bond yields. J. Econom. 130, 337–364 (2006)
Duffee, G.: Term premia and interest rate forecasts in affine models. J. Finance 57, 405–443 (2002)
Duffie, D., Kan, R.: A yieldfactor model of interest rates. Math. Finance 6, 379–406 (1996)
ECB, Technical notes on Eurozone yield calibration. European Central Bank Technical Documentation, vol. 29, pp. 568–611 (2015)
Eberlein, E., Gerhart, C.: A multiplecurve Lévy forward rate model in a twoprice economy. Quant. Finance 18(4), 537–561 (2018)
Eberlein, E., Gerhart, C., Lütkebohmert, E.: A multiple curve Lévy swap market model. Appl. Math. Finance 27(5), 396–421 (2020)
Fontana, C., Gnoatto, A., Szulda, G.: Multiple yield curve modelling with CBI processes. Math. Financ. Econ. 15, 579–610 (2021)
Fontana, C., Grbac, Z., Gümbel, S., Schmidt, T.: Term structure modeling for multiple curves with stochastic discontinuities. Finance Stoch. 24, 465–511 (2020)
Filipović, D., Trolle, A.B.: The term structure of interbank risk. J. Financ. Econ. 109, 707–733 (2013)
Gerhart, C., Lütkebohmert, E.: Empirical analysis and forecasting of multiple yield curves. Insur. Math. Econ. 95, 59–78 (2020)
Grasselli, M., Miglietta, G.: A flexible spot multiplecurve model. Quant. Finance 6, 1465–1477 (2016)
Grbac, Z.: Credit risk in Lévy LIBOR modeling: rating based approach. Ph.D. thesis, vol. 109, pp. 191–226. University of Freiburg (2010)
Grbac, Z., Papapantoleon, A., Schoenmakers, J., Skovmand, D.: Affine LIBOR models with multiple curves: theory, examples and calibration. SIAM J. Financ. Math. 6, 984–1025 (2015)
Grbac, Z., Meneghello, L., Runggaldier, W.: Derivative pricing for a multicurve extension of the Gaussian, exponentially quadratic short rate model. Innov. Deriv. Mark. 109, 191–226 (2016)
Henrard, M.: The irony in the derivatives discounting. Wilmott Mag. 1, 92–98 (2007)
International Swaps and Derivatives Association, ISDA Margin Survey 2012. Available at https://www.isda.org/category/research/surveys (2012)
International Swaps and Derivatives Association, ISDA Margin Survey 2014. Available at https://www.isda.org/category/research/surveys (2014)
KellerRessel, M.: Affine processes—theory and applications in finances. Ph.D. thesis, Vienna University of Technology (2009)
KellerRessel, M., Mayerhofer, E.: Exponential moments of affine processes. Ann. Appl. Probab. 25, 714–752 (2015)
Kenyon, C.: Postshock shortrate pricing. Risk Mag. 1, 83–87 (2010)
Kijima, M., Tanaka, K., Wong, T.: A multiquality model of interest rates. Quant. Finance 2, 133–145 (2009)
Morino, L., Runggaldier, W.: On multicurve models for the term structure. In Dieci, R., He, X.Z., Hommes, C. (eds) Nonlinear economic dynamics and financial modelling: essays in honour of Carl Chiarella. Springer International, pp. 275–290 (2014)
Nelson, C.R., Siegel, A.F.: Parsimonious modelling of yield curves. J. Bus. 60, 473–489 (1987)
Singleton, K.J.: Empirical Dynamic Asset Pricing. Princeton University Press, Princeton (2006)
Svensson, L.E.: Estimating and interpreting forward interest rates: Sweden 1992–1994. Centre for Economic Policy Research: Discussion paper, vol. 60, pp. 473–489 (1994)
Acknowledgements
We thank the editor and an anonymous referee for carefully reading the manuscript and for several constructive and detailed comments that helped to improve our paper. This work was supported by the German Research Foundation (DFG) through the Grant LU 1186/41. Financial support is gratefully acknowledged by the first and the third author.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no relevant financial or nonfinancial interests to disclose.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
A Derivation of characteristic exponents of Y
To derive explicit expressions for the characteristic exponents \(\phi \) and \(\psi \) of the process \(Y=(X, \int _0^{\cdot } X_sds)\), we first calculate the characteristic exponents of the process X starting in \(X_0.\) Therefore, suppose that
for functions \({\tilde{\phi }}(t,u)\) and \({\tilde{\psi }}(t,u)\). Define the process
with \({\tilde{\phi }}(0,u)=0\) and \({\tilde{\psi }}(0,u)=u\). Then
and
if M is a martingale. In that case, we then obtain
and (12) indeed gives the correct characteristic function. Thus, we need to prove that \(f(t,X_t)\) is a martingale. Denoting the time derivative by \('\) and applying Itô’s formula to \(f(t,X_t)\), we obtain
Hence, \(f(t,X_t)\) is a local martingale if
i.e. if we have
with boundary conditions \({\tilde{\phi }}(0,u)=0\) and \({\tilde{\psi }}(0,u)=u\). We define the functional characteristics \(({\tilde{F}},{\tilde{R}})\) of X via
We can then apply Theorem 4.10 in [23] to infer the functional characteristics (F, R) of Y as
Thus, the characteristic exponents \(\phi \) and \(\psi \) of the process Y can be written as
with boundary conditions \(\phi (0,u,v)=0\) and \(\psi (0,u,v)=u\) (see also [24], Sec. 3).
B Statespace formulation and estimation framework
We fit the parameters of the model under the realworld measure \(\mathbb {P}\). The relationship of the process dynamics between the realworld and the riskneutral measure are characterised by Girsanov’s Theorem. In particular we have
where the risk premium \(b_t(\omega )\) is a predictable process of affine form
with \(b_0 \in \mathbb {R}^6, B \in \mathbb {R}^{6 \times 6}\). This specification ensures the affine structure of the state variables X under \(\mathbb {P}\) (see [7]). Hence, we are able to choose any vector \(\theta ^{\mathbb {P}}\) and matrix \(K^{\mathbb {P}}\) under \(\mathbb {P}\) and still ensure the required \(\mathbb {Q}\)dynamics of the model.
Following [2], the state transition equation is given by
with \(X_t=(X_t^1,\ldots ,X_t^6)^{\top }\) and the measurement equation is
where the transition and measurement errors are assumed to be orthogonal to the initial state and
with diagonal matrix H and
Following [29], we set the mean level \(\theta =(\theta _1,\ldots , \theta _6)\) under \(\mathbb {Q}\) equal to zero. Therefore we obtain
and
where the coefficients are given by \(d_1=\sigma _{11}^2, d_2=\sigma _{21}^2 + \sigma _{22}^2, d_3= \sigma _{31}^2+ \sigma _{32}^2+\sigma _{33}^2, d_4=\sigma _{11}\sigma _{21}, d_5=\sigma _{11}\sigma _{31}, d_6=\sigma _{21}\sigma _{31}+ \sigma _{22}\sigma _{32}\) and \(c_1=\sigma _{44}^2, c_2=\sigma _{54}^2 + \sigma _{55}^2, c_3=\sigma _{64}^2 + \sigma _{65}^2 + \sigma _{66}^2, c_4=\sigma _{44}\sigma _{54}, c_5=\sigma _{44}\sigma _{64}, c_6=\sigma _{54}\sigma _{64} + \sigma _{55}\sigma _{65}\).
The Kalman filter can be now set up in analogy to [2]. Therefore, the filter is initialised at the unconditional mean and variance of the state variables under \(\mathbb {P}\): \(X_0 = \theta ^\mathbb {P}\) and \(\Sigma _0 = \int _{0}^{\infty } e^{K^{\mathbb {P} }s} \Sigma \Sigma ' e^{(K^{\mathbb {P}})' s} ds\). The prediction step is
where \(\Phi _t = (I  \exp \left( K^{\mathbb {P} }\Delta t \right) ) \theta ^{\mathbb {P}}\), \(\Psi _t = \exp \left( K^{\mathbb {P}} \Delta t\right) \) where \(\Delta t\) is the time between observations, which we set equal to 1/252 (respectively, 1/12) wherever we deal with daily (monthly) data. \(X_t\) is updated at time t via
where
The Gaussian loglikelihood is then computed according to:
where N is the number of observed yields and \(\det (\cdot )\) denotes the determinant of a square matrix.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Brignone, R., Gerhart, C. & Lütkebohmert, E. Arbitragefree Nelson–Siegel model for multiple yield curves. Math Finan Econ 16, 239–266 (2022). https://doi.org/10.1007/s1157902100308y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s1157902100308y
Keywords
 Affine processes
 Dynamic factor model
 Multiple term structures
 Nelson–Siegel curve
JEL Classification
 E43
 G12
 G17