Modeling dependent credit rating transitions: a comparison of coupling schemes and empirical evidence

Boreiko, D. V.; Kaniovski, Y. M.; Pflug, G. Ch.

doi:10.1007/s10100-015-0415-6

Modeling dependent credit rating transitions: a comparison of coupling schemes and empirical evidence

Original Paper
Open access
Published: 09 September 2015

Volume 24, pages 989–1007, (2016)
Cite this article

Download PDF

You have full access to this open access article

Central European Journal of Operations Research Aims and scope Submit manuscript

Modeling dependent credit rating transitions: a comparison of coupling schemes and empirical evidence

Download PDF

D. V. Boreiko¹,
Y. M. Kaniovski¹ &
G. Ch. Pflug²

1715 Accesses
7 Citations
Explore all metrics

Abstract

Three coupling schemes for generating dependent credit rating transitions are compared and empirically tested. Their distributions, the corresponding variances and default correlations are characterized. Using Standard and Poor’s data for OECD countries, parameters of the models are estimated by the maximum likelihood method and MATLAB optimization software. Two pools of debtors are considered: with 5 and with 12 industry sectors. They are classified into two non-default credit classes. First portfolio mimics the Dow Jones iTraxx EUR market index. The default correlations evaluated for 12 industry sectors are confronted with their counterparts known for the US economy.

Structural Credit Risk Models: Endogenous Versus Exogenous Default

Modeling stochastic recovery rates and dependence between default rates and recovery rates within a generalized credit portfolio framework

Article 01 June 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Credit risk analysis requires modeling of dependent defaults. A classical approach, due to Merton (1974), employed a stochastic process to describe the (latent) value of a firm. A default event here is triggered by breaching some specified threshold. Termed as structural models, they treat credit risk correlation between two debtors as the correlation between the respective stochastic processes determining values of the firms. For example, within the CreditMetrics approach, see Gupton et al. (1997), dependent defaults of several firms are modeled by using a multivariate Gaussian distribution.

A more realistic and technically sophisticated setting for generating dependent defaults, so-called reduced form models, allows the default probability to depend on several economic factors. Some of them are latent while the others may be observable. The total risk is typically decomposed into an idiosyncratic part and a common component. The latter is often interpreted as a systemic factor. The relative strength of the components and, consequently, correlations between assets are parameterized by deterministic weights. Different types of copulae are used. A variety of distributions have been considered. There are models formulated in discrete as well as in continuous time. For particular examples, see among others Li (2000), Jarrow and Yu (2001), Hull and White (2001), Bangia et al. (2002), Lando (2004), Hull and White (2004), McNeil and Wendin (2007), Stefanescu et al. (2009), Choroś-Tomczyk et al. (2013). Frey and McNeil (2003) analyze and classify the existing approaches to generating dependent defaults.

Within the CreditMetrics approach, “where migration analysis is a corner stone, that is, the study of changes in the credit quality of names through time” (see Gupton et al. 1997, page iv), a (discrete-time) Markovian transition matrix is estimated. It governs the evolution of a representative debtor through credit classes.

While credit risk models concentrate, typically, on dependent defaults, studies on systemic risk attempt to analyze the events that precede a default. See Upper (2011) for a comprehensive analysis of simulation methods in systemic risk analysis. In other words, the whole interdependent migration process of the debtors has to be considered.

The Markovian property of the credit rating migration process, its time-homogeneity and the discrete-time setting have been criticized and several refinements of it have been suggested. See Altman (1998), Bangia et al. (2002), Lando and Skødeberg (2002), Frydman and Schuermann (2008), Korolkiewicz and Elliott (2008), Stefanescu et al. (2009), Xing et al. (2012) among others. Conceptually, dependence of transition probabilities upon macroeconomic factors has been introduced and the corresponding models have been empirically tested. In technical terms, these refinements relay on hidden Markov models and they employ a variety of estimation techniques. In order to render models of the migration process more realistic, continuous time settings have been introduced and estimated.

Taking a credit rating Markovian transition matrix as a marginal distribution, a joint distribution of the whole pool of debtors can be obtained by a coupling scheme. This possibility of introducing dependence among credit rating migrations of the debtors constituting a portfolio is analyzed in Kaniovski and Pflug (2007) and in Wozabal and Hochreiter (2012).

In both cases, transition probabilities are modified according to binary unobserved tendency variables, that can be interpreted in a context of business cycles. Every migration is governed by an idiosyncratic term and a common component. Unlike in the case of a reduced form model, the weights that determine the relative strength of the components are random. A tendency variable affects the distribution of the corresponding common component in the following way. For a credit class, conditional on “favorable” realizations of the corresponding tendency variables, migrations towards a better credit quality become more likely, whereas worsening of the credit quality will be less probable.

In Kaniovski and Pflug (2007), the common component remains the same for all debtors belonging to a credit class irrespective of their industry sectors. Wozabal and Hochreiter (2012) introduced an alternative coupling scheme. It implies much weaker dependence among the debtors. In their case, conditional on realizations of the corresponding tendency variables, the common tendencies affecting a pool of debtors characterized by a combination of a credit class and an industry sector are identically distributed and independent.

In what follows next, the model by Kaniovski and Pflug (2007) is referred to as Scheme 1, Index 2 is assigned to its modification by Wozabal and Hochreiter (2012) and the coupling techniques introduced here is labeled by 3.

The distributions corresponding to these three coupling techniques are compared. It is shown that variances of the number of defaults and correlations of credit events are the largest for first scheme whereas they are the smallest for second one. Consequently, the coupling scheme suggested here takes an intermediate position regarding the known techniques.

While for one-year correlations of credit events there are explicit formulas, estimating multi-year credit events’ correlations bootstrapping has to be used. In the latter case, repeated Monte-Carlo runs of the model generate transition sample paths, that are treated then by a standard statistical algorithm for sample correlation.

Using a Standard and Poor’s (S&P’s) data set, parameters of these coupling models are estimated. There are two portfolios considered: with 5 and with 12 industry sectors. The debtors are classified into two non-default credit classes.

The maximum likelihood estimates are obtained by MATLAB optimization software: Interior Point algorithm (IP) and Sequential Quadratic Programming (SQP) method.

2 Coupling schemes

Consider a portfolio containing debtors that are non-homogeneous in their credit ratings and industry sectors. Let there be $M\ge 2$ non-default credit classes. Numbering them in a descending order, we assign 1 to the most secure assets, while the next to default credit class is indexed by M. Defaulted firms receive the index $M+1$. There are $S\ge 1$ industry sectors. Following the CreditMetrics approach, see Gupton et al. (1997), it is assumed that credit rating migrations are governed by an $M\times (M+1)$ Markovian transition matrix P with elements $p_{i,j}$. That is, $p_{i,j}$ stands the probability of a transition within one year, from ith credit rating to jth. Since $M+1$ is an absorbing state of the corresponding Markov chain, $p_{M+1,i}=\mathbb {I}_{\{i=M+1\}}$. Here $\mathbb {I}_{\{A\}}$ denotes the indicator function of a statement A,

$$\begin{aligned} \mathbb {I}_{\{A\}}=\left\{ \begin{array}{l} 1 \quad \text {if } A \; \text {holds true}, \\ 0 \quad \text {if } A \; \text {is false}.\\ \end{array} \right. \end{aligned}$$

The credit rating migrations occur at times $t=1,2,\ldots $. Set $N^{k,i}(t)$ for the number of debtors from industry sector k in credit class i at time t. At the beginning there are $\mathcal {N}(1)=\sum _{i=1}^{M} \sum _{k=1}^{S}N^{k,i}(1)$ debtors in the portfolio.

The coupling techniques generate counts $N^{k,i}(t),\;t>1$ such that:

the evolutions of debtors through credit classes are dependent;
the corresponding random process of credit rating transitions is time homogeneous and every individual migration is governed by the same Markovian transition matrix P.

Assign a number $n=1, 2, \dots , \mathcal {N}(1)$ to every debtor in the portfolio at time $t=1$. Set $X_n(t)$ for the credit rating at time $t\ge 1$ of the firm numbered by n. Then $X_n(t)$ is a discrete-time Markov chain with $M+1$ states. Its transient states are $1,2,\ldots , M$.

Denote by s(n) the industry sector of firm n. The rating randomly changes in time, becoming $X_{n}(2)$ at time $t=2$, while the assignment to the sector s(n) remains constant. The evolution of the whole portfolio is captured by a multi-dimensional random process $\vec {X}(t)=(X_{1}(t),X_{2}(t),\ldots ,X_{\mathcal {N} (1)}(t))$ whose components are identically distributed and dependent. Let us look at a transition from time $t=1$ to time $t=2$.

First introduce independent random variables $\xi _{n}, n=1,2,\ldots ,\mathcal {N}(1)$. Each of them assumes values $1,2,\ldots ,M+1$. The corresponding probabilities read:

$$\begin{aligned} \mathbb {P}\{\xi _{n}=j\}=p_{X_{n}(1),j}. \end{aligned}$$

Conceptually, $\xi _{n}$ represents an idiosyncratic component of a move from $X_n(1)$ to $X_n(2)$. Its impact is determined by a Bernoulli random variable $\delta _{n}$ according to the formula:

$$\begin{aligned} X_{n}(2)=\delta _{n}\xi _{n}+(1-\delta _{n}) \eta _{n}. \end{aligned}$$

Here $\eta _{n}$ stands for a common component in the transition from $X_n(1)$ to $X_n(2)$. It introduces a dependence mechanism among $X_n(2)$. Random variables $\{\xi _{n}\}$, $\{\eta _{n}\}$ and $\{\delta _{n}\}$ are independent. Since all debtors are assumed to be governed by the same Markovian transition matrix,

$$\begin{aligned} \mathbb {P}\{\eta _{n}=j\}=p_{X_{n}(1),j}. \end{aligned}$$

Random variables $\delta _n$ are independent in n and $\mathbb {P}\{\delta _{n}=1\}=q_{X_{n}(1),s(n)}$.

Denote by $\{0,1\}^M$ the set of all possible vectors $\vec {\chi }=(\chi _1, \dots , \chi _M)$, where $\chi _i=0$ or 1. Introduce a random vector $\vec {\Pi }=(\Pi _1,\ldots ,\Pi _M)$ with values in $\{0,1\}^M$, termed as a tendency vector. Denote by $\pi (\cdot )$ the distribution of $\vec {\Pi }$, i.e.

$$\begin{aligned} \mathbb {P}\{\vec {\Pi } = \vec {\chi } \} = \pi (\vec {\chi }) \end{aligned}$$

for all $\vec {\chi } \in \{0,1\}^M$. The distribution $\pi (\cdot )$ is given as an input parameter for the simulation and may be determined by estimation from observed data.

The common component has the following structure. When $\chi _i=1$, all of the random variables $\eta _n$, such that $X_n(1)=i$, cannot assume values larger than i. If the credit class transitions of every debtor belonging to credit class i were governed exclusively by the corresponding $\eta _n$, this would mean that the credit rating of such debtors may not worsen. For this reason, the situation when $\chi _i=1$ is termed as a non-deteriorating tendency. In the same way, $\chi _i=0$ implies that all of the random variables $\{\eta _n\}$, such that $X_n(1)=i$, take on exclusively the values exceeding i. This is a deterioration (of their credit ratings).

There are three possibilities for the dependence mechanism. For Scheme 2, Wozabal and Hochreiter (2012) assume that, conditionally on $\vec {\Pi }$, $\eta _n$ are independent in n. A unique common component governs all debtors belonging a credit class and these random variables are conditionally on $\vec {\Pi }$ independent for different credit classes in Scheme 1, see Kaniovski and Pflug (2007). (More formally: random variables $\eta _n$ and $\eta _l$ are conditionally on $\vec {\Pi }$ independent for $X_n(1)\not =X_l(1)$, while $\eta _n=\eta _l$ for $X_n(1)=X_l(1)$.) Here an intermediate variant is introduced. For Scheme 3 it is assumed that all debtors which belong to the same combination of credit rating and industry sector are affected by the same common component and these random variables are independent for different combinations. (In short: $\eta _n=\eta _l$ if $X_n(1)=X_l(1)$ and $s(n)=s(l)$, otherwise random variables $\eta _n$ and $\eta _l$ are conditionally on $\vec {\Pi }$ independent.)

The conditional distribution of $\eta _n$ is defined as follows:

$$\begin{aligned} \mathbb {P}\{\eta _{n}=j\mid \vec {\chi }\}=p_{X_n(1),j}(\chi _{X_n(1)}), \end{aligned}$$

where conditional probabilities $p_{i,j}(\cdot )$ read:

$$\begin{aligned} p_{i,j}(1) =\left\{ \begin{array}{ll} p_{i,j}/p_{i}^{+} &{} \quad \text {if }j\le i, \\ 0 &{} \quad \text {if }j>i; \end{array} \right. \;\text {and}\;\; p_{i,j}(0)=\left\{ \begin{array}{ll} p_{i,j}/p_{i}^{-} &{}\quad \text {if }j>i, \\ 0 &{} \quad \text {if }j\le i. \end{array} \right. \end{aligned}$$

Here $p_i^+=p_{i,1}+p_{i,2}+\ldots +p_{i,i}$ and $p_1^-=1-p_i^+$.

Counts $N^{k,i}(2)$ at time $t=2$ are obtained by the following formula:

$$\begin{aligned} N^{k,i}(2)=\sum _{n=1}^{{\mathcal {N}}(1)}\mathbb {I}_{\{X_{n}(2)=i,s(n)=k\}}. \end{aligned}$$

Denote by $D^{k,i}(2)$ the number of debtors from industry sector k defaulted at time 2 that had credit rating i at time 1. Then

$$\begin{aligned} D^{k,i}(2)=\sum _{n=1}^{\mathcal {N}(1)}\mathbb {I}_{\{s(n)=k\}}\mathbb {I}_{\{X_n(1)=i\}}\mathbb {I}_{\{X_n(2)=M+1\}}. \end{aligned}$$

Since $\vec {X}(t)$ is a time-homogeneous random process, $\vec {X}(t)$ (as well as the corresponding counts $N^{k,i}(t)$ and $D^{k,i}(t)$) can be defined analogously for $t\ge 3$. We summarize the three models in Table 1.

Table 1 The three models

Full size table

3 Input parameters

In order to run the model, the following inputs are required:

a $M\times (M+1)$ Markovian transition matrix P;
a distribution $\pi (\cdot )$ of the tendency vector;
a $M\times S$ matrix Q whose entries $q_{i,s}$ are probabilities of success of Bernoulli random variables $\{\delta _n\}$.

Since

$$\begin{aligned} p_i^+=\sum _{\vec {\chi }\in \{0,1\}^M: \chi _i=1} \pi (\vec {\chi }), \end{aligned}$$

(1)

P and $\pi (\cdot )$ are related. However these M relations are not sufficient neither to identify a Markovian matrix P given a distribution $\pi (\cdot )$ nor for finding a $\pi (\cdot )$ given a P. See Bahadur (1961) for an exhaustive characterization of distributions on binary strings.

Given a Markovian matrix P and a $M\times M$ matrix of correlation coefficients

$$\begin{aligned} c_{i,j}=\frac{\sum _{\vec {\chi }\in \{0,1\}^M: \chi _i=\chi _j=1}\pi (\vec {\chi })-p_i^+p_j^+}{\sqrt{p_i^+(1-p_i^+)p_j^+(1-p_j^+)}}, \end{aligned}$$

Kaniovski and Pflug (2007) introduced a quadratic optimization problem in order to find a distribution $\pi (\cdot )$. Note that only for $M=2$ there is an explicit formula for $\pi (\cdot )$, because

$$\begin{aligned} \pi (1,1)=p_1^+ p_2^+ +c_{1,2}\sqrt{p_1^+(1-p_1^+)p_2^+(1-p_2^+)}. \end{aligned}$$

Given migration counts, Wozabal and Hochreiter (2012), employing a heuristic global optimization technique, identify $\pi (\cdot )$ for a given P by the maximum likelihood method. Since rating agencies report their (annual) Markovian transition matrices, conventionally a transition matrix P is assumed to be known and all estimation efforts concentrate on finding Q and $\pi (\cdot )$ as factors determining dependencies among components of a portfolio.

4 Distributions of defaults

Let us denote by $\overrightarrow{{Mul}}(N,p_1,\ldots ,p_k)$ a multinomial distribution with probabilities of success $p_i$ and number of trials N as well as a k-dimensional random vector with this distribution. At time $t=2$, debtors are allocated to credit classes according to a randomization of the following distributions:

Model 2: a convolution,
$$\begin{aligned}&\sum _{k=1}^S\sum _{i=1}^M\overrightarrow{{Mul}}(N^{k,i}(1),q_{i,k}p_{i,1}+(1-q_{i,k})p_{i,1}(\chi _i),\ldots ,q_{i,k}p_{i,M+1}\\&\quad +\,(1-q_{i,k})p_{i,M+1}(\chi _i)); \end{aligned}$$
Model 3: a convolution of mixtures,
$$\begin{aligned}&\sum _{k=1}^S\sum _{i=1}^M\sum _{j=1}^{M+1}p_{i,j}(\chi _i)\overrightarrow{{ Mul}}(N^{k,i}(1),q_{i,k}p_{i,1},\ldots ,q_{i,k}p_{i,j-1},q_{i,k}p_{i,j}\\&\quad +1-q_{i,k},q_{i,k}p_{i,j+1},\ldots , q_{i,k}p_{i,M+1}); \end{aligned}$$
Model 1: a convolution of mixtures of convolutions,
$$\begin{aligned}&\sum _{i=1}^M\sum _{j=1}^{M+1}p_{i,j}(\chi _i)\sum _{k=1}^S\overrightarrow{{ Mul}}(N^{k,i}(1),q_{i,k}p_{i,1},\ldots ,q_{i,k}p_{i,j-1},q_{i,k}p_{i,j}\\&\quad +1-q_{i,k},q_{i,k}p_{i,j+1},\ldots , q_{i,k}p_{i,M+1}). \end{aligned}$$

The corresponding weights are $\pi (\vec {\chi })$.

In order to compare variances of these randomized distributions, observe that the contributions due to debtors with credit rating i to the variances of j-th coordinate are related as follows:

$$\begin{aligned} \mathbb {V}ar_{3}^{i,j}(\chi _i)= & {} \mathbb {V}ar_{2}^{i,j}(\chi _i)+ p_{i,j}(\chi _i)[1-p_{i,j}(\chi _i)]\sum _{k=1}^S(1-q_{i,k})^2 N^{k,i}(1)[N^{k,i}(1)-1],\\ \mathbb {V}ar_{1}^{i,j}(\chi _i)= & {} \mathbb {V}ar_{3}^{i,j}(\chi _i)+ 2\left\{ p_{i,j}(\chi _i)\sum _{k=1}^S(q_{i,k}p_{i,j}+ 1-q_{i,k})N^{k,i}(1)\sum _{r=k+1}^S(q_{i,r}p_{i,j}\right. \\&\left. +1-q_{i,r})N^{r,i}(1)+ [1-p_{i,j}(\chi _i)]p_{i,j}^2\sum _{k=1}^Sq_{i,k}N^{k,i}(1)\sum _{r=k+1}^Sq_{i,r}N^{r,i}(1)\right\} , \end{aligned}$$

where $i=1,\ldots ,M$, $j=1,\ldots , M+1$. Consequently, Scheme 2 (1) implies the smallest (largest) variances.

5 Likelihood functions and optimization problems

The likelihood function for Scheme 2 is given in Wozabal and Hochreiter (2012) by

$$\begin{aligned} I\times L_{2}(\pi (\cdot ),Q), \end{aligned}$$

where

$$\begin{aligned} I= & {} \prod _{t=1}^{T}\prod _{s=1}^{S}\prod _{m_1=1}^M\prod _{m_2=1}^{M+1}p_{m_1,m_2}^{I^t(s,m_1,m_2)},\\ L_{2}(\pi (\cdot ),Q)= & {} \prod _{t=1}^{T}\sum _{\vec {\chi }\in \{0,1\}^M}\pi (\vec {\chi })\prod _{s=1}^{S} \prod _{m_1=1}^M \prod _{m_2=1}^{M+1} f(s,\vec {\chi },m_1,m_2,Q)^{I^t(s,m_1,m_2)},\\ f(s,\vec {\chi }, m_1,m_2,Q)= & {} \left\{ \begin{array}{l} \frac{1-q_{m_1,s}p_{m_1}^{-}}{p_{m_1}^{+}}, \quad \text {if }\quad m_1\ge m_2,\; \chi _{m_1}=1, \\ \frac{1-q_{m_1,s}p_{m_1}^{+}}{p_{m_1}^{-}}, \quad \text {if }\quad m_1 < m_2,\; \chi _{m_1}=0, \\ q_{m_1,s}, \quad \quad \;\, \text {otherwise}. \end{array} \right. \end{aligned}$$

Time instants from $t=1$ through $t=T$ correspond to the period of observation. $I^t(s,m_1,m_2)$ denotes the number of companies in sector s that have moved from credit class $m_1$ to credit class $m_2$ in period t. Containing no unknowns, the multiplier I cannot affect the outcome of maximization the likelihood function. It is ignored in the calculations reported next.

By a similar argument that is sketched in “Appendix”, likelihood functions for models 1 and 3 are obtained as

$$\begin{aligned} I\times L_{1}(\pi (\cdot ),Q), \end{aligned}$$

and

$$\begin{aligned} I\times L_{3}(\pi (\cdot ),Q), \end{aligned}$$

respectively. Here

$$\begin{aligned} L_{1}(\pi (\cdot ),Q)= & {} \prod _{t=1}^{T}\sum _{\vec {\chi }\in \{0,1\}^M}\pi (\vec {\chi })\prod _{m_1=1}^M g(t,\vec {\chi },m_1,Q),\\ L_{3}(\pi (\cdot ),Q)= & {} \prod _{t=1}^{T}\sum _{\vec {\chi }\in \{0,1\}^M}\pi (\vec {\chi })\prod _{s=1}^{S}\prod _{m_1=1}^M v(t,s,\vec {\chi },m_1,Q),\\ g(t,\vec {\chi },m_1,Q)= & {} \sum _{i=1}^{M+1}p_{m_1,i}(\chi _{m_1})\prod _{s=1}^S \left( q_{m_1,s}+\frac{1-q_{m_1,s}}{p_{m_1,i}}\right) ^{I^t(s,m_1,i)}\prod _{m_2=1,m_2\not =i}^{M+1}q_{m_1,s}^{I^t(s,m_1,m_2)},\\ v(t,s,\vec {\chi },m_1,Q)= & {} \sum _{m_2=1}^{M+1}p_{m_1,m_2}(\chi _{m_1}) \left( q_{m_1,s}+\frac{1-q_{m_1,s}}{p_{m_1,m_2}}\right) ^{I^t(s,m_1,m_2)}\prod _{j=1,\,j\not =m_2}^{M+1}q_{m_1,s}^{I^t(s,m_1,j)}. \end{aligned}$$

The components of Q and $\pi (\cdot )$ belong to [0, 1]. There are linear constrains:

$$\begin{aligned}&\sum _{\vec {\chi }\in \{0,1\}^M}\pi (\vec {\chi })=1, \end{aligned}$$

(2)

$$\begin{aligned}&\sum _{\vec {\chi }\in \{0,1\}^M, \;\;\chi _i=1}\pi (\vec {\chi })=p_i^+,\quad i=1,2,\ldots ,M. \end{aligned}$$

(3)

The first one states that the values $\pi (\cdot )$ form a probability distribution, while the remaining equalities are relations (1). Conceptually they mean that ith coordinate of a feasible tendency vector takes value 1 with probability $p_i^+$.

6 Input data

Using a S&P’s data set covering 10,413 firms from 30 OECD countries for $T=16$ years, from 1991 through 2006, two cases, with $S=5$ and with $S=12$ industry sectors, are analyzed. There are $M=2$ non-default credit classes: investment grade and non-investment grade debtors. The investment grade debtors are characterized by S&P’s ratings from AAA to BBB, while the non-investment grade ones occupy ratings from BB downward. An investment grade debtor, a non-investment grade one and a defaulted debtor are indexed by 1, 2 and 3, respectively.

The first pool of debtors mimics the portfolio generating the Dow Jones iTraxx EUR market index. It comprises a part of the data set represented by debtors belonging to the following industry sectors:

1 –
auto and industrial
2 –
consumer
3 –
energy with utilities
4 –
finance and insurance
5 –
telecommunications, media and technology.

The second pool contains all debtors of the data set, classified into the following industries:

1 –
aero, auto, capital goods, metal
2 –
consumer, service
3 –
energy, natural resources
4 –
financial institutions
5 –
forest and building products, homebuilders
6 –
health care, chemicals
7 –
high technology, computers, office equipment
8 –
insurance, real estate
9 –
leisure time, media
10 –
telecommunications
11 –
transportation
12 –
utilities.

The same list of twelve industry sectors was analyzed by Nagpal and Bahar (2001), who dealt with a S&P’s data set covering American firms for the period from 1991 through 1999. They reported one-, five- and seven-year default correlations and suggested practical applications to credit risk analysis based on them. Using a traditional statistical technique, these authors encountered a natural pitfall: “too few defaults (seven) to draw any meaningful conclusions” in sector of telecommunications. See Nagpal and Bahar (2001), p. 94. Their results serve as a benchmark for the estimates of credit event correlations suggested here.

7 Estimates and their interpretation

Applying time averages, the following Markovian matrix is obtained:

$$\begin{aligned} P=\left( \begin{array}{ccc} 0.9733\;(0.1613) &{} \quad 0.0257\; (0.1582) &{} \quad 0.0010\;(0.0322) \\ 0.0882\;(0.2836) &{} \quad 0.8865\; (0.3172) &{} \quad 0.0253\;(0.1571) \end{array} \right) . \end{aligned}$$

The values in parentheses are standard deviations of the respective probabilities.

Logarithms of $L_{i}(\pi (\cdot ),Q)$ have to be maximized in a unit hypercube subject to constraints (2) and (3). According to Allman et al. (2009), statistical estimation problems of this kind have typically multiple solutions. Given this, a variety of methods and initial approximations have to be tried, including the use of a solution obtained by one of the methods as a starting point for the rest.

Unlike Wozabal and Hochreiter (2012), who maximized $L_{2}(\pi (\cdot ),Q)$ by a heuristic global optimization method tailored for this case, here standard constrained optimization programs of MATLAB are used. The package contains two suitable methods: IP and SQP algorithms. In all cases the optimal values and the corresponding solutions were identical for both algorithms. Each time a maximum point was found in a couple of seconds. The gradient and the Hessian matrix were estimated numerically. In order to find a solution, the SQP method required some 30 % less iterations than the IP algorithm. This is consistent with what is reported in the literature on constrained optimization. See, for example, Nocedal and Wright (2006). Given an initial point, the (local) maximum value found by the SQP algorithm, was at least as good as the solution of the IP algorithm. That is, typically a maximum point reported below was “discovered” by the SQP algorithm and then “confirmed” by the IP method.

Five industry sectors. For Scheme 2 the following $Q^{(2)}$ and $\pi ^{(2)}(\cdot )$ came out:

$$\begin{aligned}&\left( \begin{array}{ccccc} 1.0000 &{} \quad 1.0000 &{} \quad 1.0000 &{} \quad 1.0000 &{} \quad 1.0000\\ 0.6224 &{} \quad 0.4532 &{} \quad 0.6607 &{} \quad 0.6139 &{} \quad 1.0000\\ \end{array} \right) ,\\&(\pi ^{(2)}(1,1),\pi ^{(2)}(1,0),\pi ^{(2)}(0,1),\pi ^{(2)} (0,0))=(0.9733,0.0000,0.0014,0.0253). \end{aligned}$$

Also $c^{(2)}_{1,2}=0.9727$, where $c^{(2)}_{1,2}$ stands for $\mathbb {C}orr(\Pi _1^{(2)},\Pi _2^{(2)})$. Probabilities $q_{i,s}^{(3)}$ and $\pi ^{(3)}(\cdot )$ read:

$$\begin{aligned}&\left( \begin{array}{ccccc} 0.9560 &{} \quad 0.9852 &{} \quad 0.9270 &{} \quad 0.9774 &{} \quad 0.9984\\ 0.6240 &{} \quad 0.4584 &{} \quad 0.5967 &{} \quad 0.6140 &{} \quad 0.8155\\ \end{array} \right) ,\\&(0.9480,0.0253,0.0267,0.0000)\; \text { with}\;\;c^{(3)}_{1,2}=-0.0267. \end{aligned}$$

For Scheme 1 the estimates are:

$$\begin{aligned}&\left( \begin{array}{cccccc} 1.0000 &{} \quad 1.0000 &{} \quad 1.0000 &{} \quad 1.0000 &{} \quad 0.9693\\ 1.0000 &{} \quad 1.0000 &{} \quad 1.0000 &{} \quad 1.0000 &{} \quad 0.8155\\ \end{array} \right) ,\\&(0.9483,0.0251,0.0264,0.0002) \;\; \text { with}\;\; c^{(1)}_{1,2}=-0.0170. \end{aligned}$$

Since for two Bernoulli random variables lack of correlation is equivalent to independence, small in absolute value $c^{(1)}_{1,2}$ and $c^{(3)}_{1,2}$ mean that coordinates of $\vec {\Pi }^{(1)}$ and $\vec {\Pi }^{(3)}$ are almost independent. In other words, hidden tendencies governing investment grade debtors depend very weakly on the corresponding tendencies for non-investment grade debtors. The sign minus may indicate a mismatch among the trends.

Turning to matrices $Q^{(i)}$, note that the larger $q_{X_n(1),s(n)}$ is, the weaker will be the impact of the common tendency on the evolution of debtor n. Investment grade debtors appear to be affected almost exclusively by idiosyncratic factors. Second and third schemes seem to imply a stronger dependence on common factors than first one. This conclusion does not contradict to the claim that Scheme 1 (2) generates the strongest (weakest) dependence pattern for a fixed set of parameters. In fact, here distributions $\pi ^{(i)}(\cdot )$ are different for all cases and this prevents a comparison of matrices $Q^{(i)}$. Moreover, the reported estimates represent a “reaction” of the corresponding model to the actually observed counts. That is, if distributions $\pi ^{(i)}(\cdot )$ were the same for all schemes, in order to reproduce a given dependence pattern, Scheme 1 could have required a weaker impact of common components and, consequently, larger entries of matrix $Q^{(1)}$ than Schemes 2 and 3.

The quality of the estimates for $\pi (\cdot )$ and Q depends upon the following three factors. First, entries of P can be evaluated with errors. Their magnitude depends upon numbers of migrations in the data set. In fact, some of the standard deviations quoted above do not look negligible in comparison with the corresponding transition probabilities. Facing a sample of counts, one has to ignore this factor taking matrix P as a given input. Second, it is not guaranteed that the numerical methods, if even both of them arrive at the same result, find a global maximum point. Third, sixteen years of observation may be insufficient for achieving a good precision even if the (global) maximum points were found correctly. In fact, the (sample) likelihood function in hand is based on a finite sample of counts $I^t(s,m_1,m_2)$. Consequently, there can be a bias between the true parameter values and the numerically found maximum points.

For the above transition matrix P, simulations have been run for different values of Q and $\pi (\cdot )$, each time 100 trials for 5 industry sectors and 16 years. At the beginning of every time instant t in each credit class i and in each industry sector k there were $N^{k,i}(t)=100$ debtors. In other words, new firms were added into the portfolio in order to substitute defaulted ones. In every run both of the optimization algorithms were used to improve each other. For schemes in hand, the deviation between the estimates and the true values was approximately 0.1. The bias appears to be attributable to a finite sample size rather than to an error in finding a global maximum point of a (sample) likelihood function. Carreira-Perpiñán and Renals (2000) in a similar numerical study demonstrate that the known theoretical complexity of such statistical settings does not prevent from successful applications of them.

Twelve industry sectors. For Scheme 2 the following $\pi ^{(2)}(\cdot )$ and $Q^{(2)}$ were found:

$$\begin{aligned}&\left( \begin{array}{cccccccccccc} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000\\ 0.6224 &{} 0.4532 &{} 0.3936 &{} 0.9002 &{} 0.5101 &{} 0.4427 &{} 0.7762 &{} 1.0000 &{} 0.4050 &{} 1.0000 &{} 0.5883 &{} 1.0000\\ \end{array} \right) , \\&(0.9733, 0.0000,0.0014,0.0253),\quad c^{(2)}_{1,2}=0.9727. \end{aligned}$$

In the case of Scheme 3 the same distribution $\pi (\cdot )$ came out, while $Q^{(3)}$ reads:

$$\begin{aligned} \left( \begin{array}{cccccccccccc} 1.0000 &{} 1.0000 &{} 0.5819 &{} 0.2135 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 0.7287 &{} 1.0000 &{} 0.7662\\ 0.6240 &{} 0.4584 &{} 0.3975 &{} 0.9265 &{} 0.5252 &{} 0.4429 &{} 0.7804 &{} 0.8176 &{} 0.4051 &{} 1.0000 &{} 0.5904 &{} 0.7265\\ \end{array} \right) . \end{aligned}$$

For Scheme 1 the results are:

$$\begin{aligned}&\left( \begin{array}{cccccccccccc} 1.0000 &{} 1.0000 &{} 0.5819 &{} 0.2135 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 1.0000 &{} 0.7287 &{} 1.0000 &{} 0.7662\\ 0.9758 &{} 0.9852 &{} 0.9836 &{} 0.9916 &{} 0.9622 &{} 0.9810 &{} 0.9062 &{} 0.8428 &{} 0.9834 &{} 0.8671 &{} 0.9514 &{} 0.9553\\ \end{array} \right) , \\&(0.9480, 0.0253,0.0267,0.0000),\quad c^{(1)}_{1,2}=-0.0267. \end{aligned}$$

As compared with the case of five industries, only for Scheme 3, distributions $\pi ^{(3)}(\cdot )$ differ profoundly. In fact, a slight mismatch transforms into nearly perfect synchronicity of trends.

Matrices $Q^{(i)}$ estimated here give rise to a wider scope of conceptual interpretations than those in the situation with five industries. In general, the contribution of common tendencies seems to be stronger. Because the corresponding distributions $\pi (\cdot )$ coincide, matrices $Q^{(2)}$ and $Q^{(3)}$ can be compared.

Since the observed dependence pattern, as characterized by the transition counts, is identical for both schemes, intuitively the entries of $Q^{(2)}$ should not exceed their counterparts of $Q^{(3)}$. (Remember, an identical parametrization implies a stronger, as compared with Scheme 2, dependence for Scheme 3 only between debtors belonging to the same credit class and the same industry sector. Then, in order to produce the observed “strength” of dependence, one would expect the corresponding elements of $Q^{(2)}$ to be smaller than their analogs of $Q^{(3)}$.) Among 24 entries of $Q^{(3)}$, 6 or 25 % are not consistent with this intuition. Considering the two credit classes separately, reveals that the situation is better, 2 out 12 against 4 out 12 values, respectively 17 and 33 %, in the case of non-investment grade debtors. However, interpreting these values, one has to take into account that they are obtained numerically, using a procedure, where precision of the estimates cannot be guaranteed. Among the most evident factors affecting the final result are different numbers of counts for different industries. In particular, counts in industry sector 8 exceed 40 times counts in sector 10.

Whole economy. Since Nagpal and Bahar (2001) analyze default correlations for the whole economy, in order to obtain counterparts of their estimates, $\pi (\cdot )$ and Q characterizing the whole data set were found. That is, here $S=1$ and the required counts obtain by summing up the corresponding numbers of transitions over all industries. Scheme 2:

$$\begin{aligned} \left( \begin{array}{c} 1.0000\\ 1.0000\\ \end{array} \right) ,\;\,(0.9733,0.0000,0.0014,0.0253),\; c^{(2)}_{1,2}=0.9727; \end{aligned}$$

Schemes 1 and 3 coincide in the case of a single industry sector. The corresponding estimates read:

$$\begin{aligned} \left( \begin{array}{c} 0.9845\\ 0.8601\\ \end{array} \right) ,\;\,(0.9480,0.0253,0.0267,0.0000),\; c^{(1)}_{1,2}=-0.0267. \end{aligned}$$

Since the distributions corresponding to the three schemes are different, it is not possible to decide which of them fits the best to the data set. However, assuming that one of them is the true distribution, the likelihood ratio can be used in order to rank the remaining two according to their similarity to the true one.

For estimates $\pi ^{(i)}(\cdot )$ and $Q^{(i)}$ given above set

$$\begin{aligned} l_{i,j}=\log L_i(\pi ^{(i)}(\cdot ),Q^{(i)})-\log L_i(\pi ^{(j)}(\cdot ),Q^{(j)}). \end{aligned}$$

These values for five and twelve industry sectors as well as for the whole economy, separated by a slash, are given in the following table (Table 2):

Table 2 Logarithm of likelihood ratio

Full size table

Intuitively, considering a true statistical model and its alternatives, the smallest likelihood ratio, or, equivalently, its logarithm, can indicate the most similar of them to the true one. In particular, if Scheme 1 is the correct model, Scheme 3 fits data in hand better than Scheme 2. In the same way, considering Scheme 2 as the true model, we see that Scheme 3 would be more suitable than Scheme 1. Finally, if Scheme 3 were the correct model, Scheme 1 would be preferred to Scheme 2. This informal argument based on likelihood ratios shows, that, once again, Scheme 3 takes an intermediate position between Schemes 1 and 2.

8 Correlations of credit events

Set $A_{i,I}^k(T)$ the event that a debtor in sector k moves from credit class i to credit class I in T-year time. Then

$$\begin{aligned} \mathbb {C}orr \left( \mathbb {I}_{\{A_{i,I}^k(T)\}},\mathbb {I}_{\{A_{j,J}^l(T^\prime )\}}\right) = \frac{\mathbb {E} \mathbb {I}_{\{A_{i,I}^k(T)\}}\mathbb {I}_{\{A_{j,J}^l(T^\prime )\}}- \mathbb {E}\mathbb {I}_{\{A_{i,I}^k(T)\}}\mathbb {E}\mathbb {I}_{\{A_{j,J}^l(T^\prime )\}}}{\sqrt{\mathbb {E}\mathbb {I}_{\{A_{i,I}^k(T)\}}[1-\mathbb {E}\mathbb {I}_{\{A_{i,I}^k(T)\}}] \mathbb {E}\mathbb {I}_{\{A_{j,J}^l(T^\prime )\}}[1-\mathbb {E}\mathbb {I}_{\{A_{j,J}^l(T^\prime )\}}]}} \end{aligned}$$

is termed as correlation of credit events $A_{i,I}^k(T)$ and $A_{j,J}^l(T^\prime )$. If $I=J=M+1$ this is a default correlation. If $T=T^\prime $, let us denote it by $\rho _{i,j}^{k,l}(T)$.

For debtors numbered by n and r,

$$\begin{aligned} \rho _{i,j}^{k,l}(1)=\frac{(1-q_{i,k})(1-q_{j,l})\left( \mathbb {E}\mathbb {I}_{\{\eta _n=I\}}\mathbb {I}_{\{\eta _r=J\}}-p_{i,I}p_{j,J}\right) }{\sqrt{p_{i,I}(1-p_{i,I})p_{j,J}(1-p_{j,J})}}, \end{aligned}$$

(4)

if the following relations hold true:

$$\begin{aligned} s(n)=k, s(r)=l, X_n(1)=i, X_r(1)=j. \end{aligned}$$

Note that

$$\begin{aligned} \mathbb {E}\mathbb {I}_{\{\eta _n=I\}}\mathbb {I}_{\{\eta _r=J\}}= & {} \mathbb {E}(\mathbb {I}_{\{\eta _n=I\}}\mathbb {I}_{\{\eta _r=J\}}\mid \chi _i=1,\,\chi _j=1)\mathbb {P}\{\Pi _i=1,\,\Pi _j=1\}\\&+\,\mathbb {E}(\mathbb {I}_{\{\eta _n=I\}}\mathbb {I}_{\{\eta _r=J\}}\mid \chi _i=1,\,\chi _j=0)\mathbb {P}\{\Pi _i=1,\,\Pi _j=0\}\\&+\, \mathbb {E}(\mathbb {I}_{\{\eta _n=I\}}\mathbb {I}_{\{\eta _r=J\}}\mid \chi _i=0,\,\chi _j=1)\mathbb {P}\{\Pi _i=0,\,\Pi _j=1\}\\&+\, \mathbb {E}(\mathbb {I}_{\{\eta _n=I\}}\mathbb {I}_{\{\eta _r=J\}}\mid \chi _i=0,\,\chi _j=0)\mathbb {P}\{\Pi _i=0,\,\Pi _j=0\}. \end{aligned}$$

Schemes 1–3 imply that, respectively:

$$\begin{aligned}&\mathbb {E}\mathbb {I}_{\{\eta _n=M+1\}}\mathbb {I}_{\{\eta _r=M+1\}}=\left\{ \begin{array}{ll} \frac{p_{i,M+1}p_{j,M+1}}{p_i^-p_j^-}\mathbb {P}\{\Pi _i=0,\,\Pi _j=0\}, &{}\quad \text {if }i\not =j,\\ \frac{p_{i,M+1}^2}{p_i^-}, &{}\quad \text {if }i=j; \end{array} \right. \\&\mathbb {E}\mathbb {I}_{\{\eta _n=M+1\}}\mathbb {I}_{\{\eta _r=M+1\}}=\left\{ \begin{array}{ll} \frac{p_{i,M+1}p_{j,M+1}}{p_i^-p_j^-}\mathbb {P}\{\Pi _i=0,\,\Pi _j=0\}, &{}\quad \text {if }i\not =j, \\ \frac{p_{i,M+1}^2}{p_i^-}, &{}\quad \text {if }i=j,\,k\not =s, \\ p_{i,M+1}, &{}\quad \text {if } i=j,\, k=s; \end{array} \right. \end{aligned}$$

and

$$\begin{aligned} \mathbb {E}\mathbb {I}_{\{\eta _n=M+1\}}\mathbb {I}_{\{\eta _r=M+1\}}=\left\{ \begin{array}{ll} \frac{p_{i,M+1}p_{j,M+1}}{p_i^-p_j^-}\mathbb {P}\{\Pi _i=0,\,\Pi _j=0\},&{}\quad \text {if }i\not =j,\\ p_{i,M+1}, &{}\quad \text {if } i=j. \end{array} \right. \end{aligned}$$

For fixed P, Q and $\pi (\cdot )$, these relations imply that $\rho _{i,j}^{k,l}(1)$ coincide for the three coupling schemes as long as debtors have different credit ratings, that is, $i\not =j$. If these correlations are not equal to zero, then they will be positive of negative depending upon the sign of

$$\begin{aligned} \mathbb {P}\{\Pi _i=0,\,\Pi _j=0\}-p_i^-p_j^-. \end{aligned}$$

If debtors have the same credit rating, then all $\rho _{i,i}^{k,l}(1)$ are non-negative: the largest for Scheme 1 and the smallest for Scheme 2. Scheme 3 is characterized by intermediate values: if debtors are from different industry sectors, default correlations are identical to those for Scheme 2, while if debtors belong to the same industry sector, the correlations coincide with their counterparts for Scheme 1.

Turning to the case in hand, note that whenever an entry of Q equals 1, all default correlations involving the corresponding debtors will be 0. Moreover, if $\pi (0,0)=0$, as it is the case for first coupling scheme, then

$$\begin{aligned} \rho _{i,j}^{k,l}(1)=\left\{ \begin{array}{ll} -(1-q_{i,k})(1-q_{j,l})\sqrt{\frac{p_{i,M+1}p_{j,M+1}}{(1-p_{i,M+1})(1-p_{j,M+1})}}, &{}\quad \text {if }i\not =j,\\ (1-q_{i,k})(1-q_{i,l}),&{}\quad \text {if }i=j. \end{array} \right. \end{aligned}$$

Having a triple P, Q and $\pi (\cdot )$ and substituting these inputs in relations (4), one-year default correlations can be found, while multi-year events correlations can be estimated by a traditional statistical technique based on repeated runs of the model, in other words, Monte-Carlo simulations. Note that the actual iTraxx portfolio contains only investment grade titles.

Default correlations corresponding to the triples estimated for coupling Scheme 3 are summarized in the next three tables. The columns One year, formula contain the values obtained by formula (4), while the other correlations are estimated using averages based on 100000 independent observations of the respective random variables (Tables 3, 4, 5).

Table 3 Default correlations expressed in percent, whole database

Full size table

Table 4 Default correlations expressed in percent, five industry sectors

Full size table

Table 5 Default correlations expressed in percent, twelve industry sectors

Full size table

The tables demonstrate, that analytically evaluated default correlations follow very well their sample counterparts, as long as these values are not too small. In particular, this is the case for non-investment grade debtors. The poor match for the correlations close to 0 is caused by the multiplier $1-q_{i,k}$. It makes the correlations evaluated according to formula (4) equal to 0, if $q_{i,k}$ is sufficiently close to 1.

9 Conclusions

Distributions, their variances, in particular variances of the number of defaults, as well as default correlations were compared for three coupling methods: for the one introduced by Wozabal and Hochreiter (2012) (Kaniovski and Pflug (2007)) the variances and the correlations were the smallest (largest), the scheme suggested in this paper takes an intermediate position. Using real data concerning OSCD countries, parameters of the models were estimated by standard optimization methods available in MATLAB for two portfolios. One of them mimics the Dow Jones iTraxx EUR market index. The other one, covering the same industry sectors as a study of Nagpal and Bahar (2001), who analyzed American firms, allows a quantitative comparison of the corresponding dependence patters in these two economic environments. A bootstrap procedure was suggested in order to estimate correlations of credit events. The corresponding Monte-Carlo estimates match their counterparts obtained analytically.

References

Allman EL, Matias C, Rhodes JA (2009) Identifiability of parameters in latent structure with many observed variables. Ann Stat 37(6A):3099–3132
Article Google Scholar
Altman EI (1998) The importance and subtlety of credit rating migration. J Bank Finance 22(10–11):1231–1247
Article Google Scholar
Bahadur RR (1961) A representation of the joint distribution of responses to $n$ dichotomous items. In: Solomon H (ed) Studies in item analysis and prediction. Stanford University Press, USA, pp 158–168
Google Scholar
Bangia A, Diebold FX, Kronimus A, Schagen C, Schuermann T (2002) Ratings migration and the business cycle, with applications to credit portfolio stress testing. J Bank Finance 26(2/3):445–474
Article Google Scholar
Carreira-Perpiñán M, Renals S (2000) Practical identifiability of finite mixtures of multivariate Bernoulli distributions. Nueral Comput 2(1):141–152
Article Google Scholar
Choroś-Tomczyk B, Härdle W, Okhrin O (2013) Valuation of collateralized debt obligations with hierarchical archimedean copulae. J Empir Finance 24(1):42–62
Article Google Scholar
Frey R, McNeil AJ (2003) Dependent defaults in models of portfolio credit risk. J Risk 6(1):59–92
Article Google Scholar
Frydman H, Schuermann T (2008) Credit rating dynamics and Markov mixture models. J Bank Finance 32(6):1062–1075
Article Google Scholar
Gupton GM, Finger ChC, Bhatia M (1997) CreditMetrics—technical document. J.P. Morgan Inc., New York
Google Scholar
Hull J, White A (2001) The general Hull-White model and supercalibration. Financ Anal J 57(6):34–43
Article Google Scholar
Hull J, White A (2004) Valuation of a CDO and an $n$-th to default CDS without Monte Carlo simulation. J Deriv 12(2):8–23
Article Google Scholar
Jarrow R, Yu F (2001) Counterparty risk and the pricing of defaultable securities. J Finance 56(5):1765–1799
Article Google Scholar
Kaniovski YM, Pflug GCh (2007) Risk assessment for credit portfolios: a coupled Markov chain model. J Bank Finance 31(8):2303–2323
Article Google Scholar
Korolkiewicz M, Elliott R (2008) A hidden Markov model of credit quality. J Econ Dyn Control 32(12):3807–3819
Article Google Scholar
Lando D (2004) Credit risk modeling: theory and applications. Princeton University Press, Princeton
Google Scholar
Lando D, Skødeberg TM (2002) Analyzing rating transitions and rating drift with continuous observations. J Bank Finance 26(2/3):423–444
Article Google Scholar
Li DX (2000) On default correlation: a copula approach. J Fixed Income 9(4):43–54
Article Google Scholar
McNeil A, Wendin J (2007) Bayesian inference for generalized linear mixed models of portfolio credit risk. J Empir Finance 14(2):131–149
Article Google Scholar
Merton RC (1974) On the pricing of corporate debt: the risk structure of interest rates. J Finance 29(2):449–470
Google Scholar
Nagpal K, Bahar R (2001) Measuring default correlation. Risk 14(3):129–132
Google Scholar
Nocedal J, Wright SJ (2006) Numerical optimization, 2nd edn. Springer Series in Operations Research, New York
Google Scholar
Stefanescu C, Tunary R, Turnbull S (2009) The credit rating process and estimation of transition probabilities: a Bayesian approach. J Empir Finance 16(2):216–234
Article Google Scholar
Upper C (2011) Simulations methods to assess the danger of contagion in interbank markets. J Financ Stab 7(3):111–125
Article Google Scholar
Wozabal D, Hochreiter R (2012) A coupled Markov chain approach to credit risk modeling. J Econ Dyn Control 36(3):403–415
Article Google Scholar
Xing H, Sun N, Chen Y (2012) Credit rating dynamics in the presence of unknown structural breaks. J Bank Finance 36(1):78–89
Article Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Economics and Management, Free University of Bozen-Bolzano, Piazza Università 1, 39100, Bolzano, Italy
D. V. Boreiko & Y. M. Kaniovski
Department of Statistics and Decision Support Systems, University of Vienna, Universitätstraße 5, 1090, Vienna, Austria
G. Ch. Pflug

Authors

D. V. Boreiko
View author publications
You can also search for this author in PubMed Google Scholar
Y. M. Kaniovski
View author publications
You can also search for this author in PubMed Google Scholar
G. Ch. Pflug
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Y. M. Kaniovski.

Additional information

The current version of this paper has benefited from comments and suggestions of an anonymous referee.

Appendix

In both cases the argument follows very closely the proof by Wozabal and Hochreiter (2012). The Markov property implies the product in t. Consequently, it is enough to consider a single time instant t. The sum in $\vec {\chi }$ accounts for all possible realizations of the tendency vector. Fix a realization $\vec {\chi }$. Consider migrations starting at a credit rating $m_1$.

In the case of Scheme 1, if the corresponding common component assumes value i, all credit ratings different from i in all industry sectors are reachable from $m_1$ only through idiosyncratic moves. By independence of such moves, this event occurs with probability

$$\begin{aligned} \prod _{s=1}^S\prod _{m_2=1,\;m_2\not =i}^{M+1}(q_{m_1,s}p_{m_1,m_2})^{I^t(s,m_1,m_2)}. \end{aligned}$$

On the other hand, each of $I^t(s,m_1,i)$ debtors in industry sector s, migrating to rating i, is driven either by the common or by the idiosyncratic component. The corresponding events occur with probabilities $1-q_{m_1,s}$ or $q_{m_1,s}p_{m_1,i},$ respectively. Since these migrations are independent in s, all transitions from $m_1$ to i occur with probability

$$\begin{aligned} \prod _{s=1}^S (q_{m_1,s}p_{m_1,i}+1-q_{m_1,s})^{I^t(s,m_1,i)}. \end{aligned}$$

By these two observations, probability of all transitions starting at $m_1$ reads

$$\begin{aligned}&\sum _{i=1}^{M+1}p_{m_1,i}(\chi _{m_1})\prod _{s=1}^S (q_{m_1,s}p_{m_1,i}+1-q_{m_1,s})^{I^t(s,m_1,i)}\\&\quad \times \prod _{m_2=1,\;m_2\not =i}^{M+1}(q_{m_1,s}p_{m_1,m_2})^{I^t(s,m_1,m_2)}. \end{aligned}$$

Since, given a realization $\vec {\chi }$, common components are independent in $m_1$, the corresponding terms have to be multiplied. Then the whole evolution at time t takes place with probability

$$\begin{aligned}&\pi (\vec {\chi })\prod _{m_1=1}^M\sum _{i=1}^{M+1}p_{m_1,i}(\chi _{m_1})\prod _{s=1}^S (q_{m_1,s}p_{m_1,i}+1- q_{m_1,s})^{I^t(s,m_1,i)}\\&\qquad \times \prod _{m_2=1,\; m_2\not =i}^{M+1}(q_{m_1,s}p_{m_1,m_2})^{I^t(s,m_1,m_2)}\\&\quad = I(t)\times \pi (\vec {\chi })\prod _{m_1=1}^M\sum _{i=1}^{M+1}p_{m_1,i}(\chi _{m_1})\prod _{s=1}^S \left( q_{m_1,s}+\frac{1-q_{m_1,s}}{p_{m_1,i}}\right) ^{I^t(s,m_1,i)}\\&\qquad \times \prod _{m_2=1,\;m_2\not =i}^{M+1}q_{m_1,s}^{I^t(s,m_1,m_2)}. \end{aligned}$$

Here

$$\begin{aligned} I(t)=\prod _{s=1}^{S}\prod _{m_1=1}^M\prod _{i=1}^{M+1}p_{m_1,i}^{I^t(s,m_1,i)}. \end{aligned}$$

In the case of Scheme 3, common components are independent in s and i. Therefore the products over all industry sectors and over all non-default credit classes come to exist. For industry s, the sum in $m_2$ corresponds to mutually exclusive events

$$\begin{aligned} A_{m_2}=\{\text {The respective common component assumes the value}\quad m_2.\} \end{aligned}$$

Conditional on $A_{m_2}$, credit rating $m_2$ is reachable either, with probability $q_{m_1,s}p_{m_1,m_2}$, through an idiosyncratic move, or, with probability $1-q_{m_1,s}$, through the common component. All other credit ratings, $j\not =m_2$, are reachable in this case only by idiosyncratic moves and the corresponding probabilities are $(q_{m_1,s}p_{m_1,j})^{I^t(s,m_1,j)}$. By independence, multiplying the respective probabilities, the following term is obtained

$$\begin{aligned}&\pi (\vec {\chi })\prod _{s=1}^S\prod _{m_1=1}^M\sum _{m_2=1}^{M+1}p_{m_1,m_2}(\chi _{m_1}) (q_{m_1,s}p_{m_1,m_2}+1- q_{m_1,s})^{I^t(s,m_1,m_2)}\\&\qquad \times \prod _{j=1,\; j\not =m_2}^{M+1}(q_{m_1,s}p_{m_1,j})^{I^t(s,m_1,j)}\\&\quad =I(t)\times \pi (\vec {\chi })\prod _{s=1}^S\prod _{m_1=1}^M\sum _{m_2=1}^{M+1}p_{m_1,m_2}(\chi _{m_1}) \left( q_{m_1,s}+\frac{1- q_{m_1,s}}{p_{m_1,m_2}}\right) ^{I^t(s,m_1,m_2)}\\&\qquad \times \prod _{j=1,\; j\not =m_2}^{M+1}q_{m_1,s}^{I^t(s,m_1,j)}. \end{aligned}$$

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Boreiko, D.V., Kaniovski, Y.M. & Pflug, G.C. Modeling dependent credit rating transitions: a comparison of coupling schemes and empirical evidence. Cent Eur J Oper Res 24, 989–1007 (2016). https://doi.org/10.1007/s10100-015-0415-6

Download citation

Published: 09 September 2015
Issue Date: December 2016
DOI: https://doi.org/10.1007/s10100-015-0415-6

Keywords

Mathematical Subject Classification

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Modeling dependent credit rating transitions: a comparison of coupling schemes and empirical evidence

Abstract

Similar content being viewed by others

Structural Credit Risk Models: Endogenous Versus Exogenous Default

Structural Credit Risk Models: Endogenous Versus Exogenous Default

Modeling stochastic recovery rates and dependence between default rates and recovery rates within a generalized credit portfolio framework

1 Introduction

2 Coupling schemes

3 Input parameters

4 Distributions of defaults

5 Likelihood functions and optimization problems

6 Input data

7 Estimates and their interpretation

8 Correlations of credit events

9 Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix

Rights and permissions

About this article

Cite this article

Keywords

Mathematical Subject Classification

JEL Classification

Navigation

Modeling dependent credit rating transitions: a comparison of coupling schemes and empirical evidence

Abstract

Similar content being viewed by others

Structural Credit Risk Models: Endogenous Versus Exogenous Default

Structural Credit Risk Models: Endogenous Versus Exogenous Default

Modeling stochastic recovery rates and dependence between default rates and recovery rates within a generalized credit portfolio framework

1 Introduction

2 Coupling schemes

3 Input parameters

4 Distributions of defaults

5 Likelihood functions and optimization problems

6 Input data

7 Estimates and their interpretation

8 Correlations of credit events

9 Conclusions

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematical Subject Classification

JEL Classification

Search

Navigation