1 Introduction

I build on the work of Krause and Lahmann (2017) and use it to extend their analysis for a de- and re-levering procedure for equity betas. I include interest and principal prioritization as additional cases, and I also discuss the case of constant leverage and an infinite horizon.

The main objective and contribution of this paper is to show how betas can be de- and re-levered when debt is risky and when different assumptions on the treatment of a cancellation of debt (COD) are made. The usual equations on de- and re-levering betas with risky debt implicitly assume that a COD is taxed as for example in Arzac and Glosten (2005). But as Krause and Lahmann (2017) mention, there are cases in which a COD remains untaxed. Furthermore, the assumption of a taxed COD is often not formally stated. However, presented equations change significantly when the assumption of taxes on a COD is dropped and when debt is risky.

The pricing framework builds mainly on the findings of Modigliani and Miller (1958) and Modigliani and Miller (1963). Additionally, Miles and Ezzell (1980) and Miles and Ezzell (1985) derive risk-adjusted discount rates in a multiperiod setting with corporate taxes and with constant leverage. In the more recent literature, several authors include the taxation of a COD into their analyses of corporations with risky debt [see e.g., (Kruschwitz and Löffler 2006; Cooper and Nyborg 2008; Blaufus and Hundsdoerfer 2008)].

Eventually, the de- and re-levering procedure relies on an expected return equation from an asset pricing model. This model does not need to be the classic mean-variance CAPM. A more general framework that relies on a stochastic discount factor is presented in Cochrane (2005) and can also be used.

I continue to introduce the basic notation for the single-period case. I present the beta equation for risk-free debt. Continuing with risky debt, I divide into the case of a taxation of a COD and the one without a taxation of a COD. At the end of the next section, I summarize the equations, discuss them and give a short example on possible miscalculations through applying the incorrect equation. In Sect. 3, I briefly discuss the infinite horizon case with constant leverage, which, under simplifying assumptions, is similar to the single-period case. Section 4 summarizes the paper.

2 Levered and unlevered beta in a single-period setting

2.1 On COD taxation

According to Schwartzman and Brandstetter (2015), a US company’s forgiven or cancelled amount of debt from a bankruptcy or insolvency has to be recognized as gross income.Footnote 1 The COD is included in taxable income. However, there are several important exceptions from the general rule, most importantly bankruptcy and insolvency exclusions.

For bankruptcy, which is defined as a case under Title 11 of the US Code, discharges of indebtedness under “Chapter 11 reorganization, Chapter 7 liquidations, and Chapter 13 bankruptcy proceedings” [see Schwartzman and Brandstetter (2015)] are excluded from taxable COD income. The discharge has to be ordered by a court or approved by a court. The COD is not included in gross income, but so called tax attributes “certain losses, credits, and basis of property must be reduced by the amount of excluded income (but not below zero)” (IRS 2012, p. 26).

Insolvency, defined as liabilities in excess of the company’s market value at the time right before the discharge, is also excluded from COD income. Only the insolvent amount can be excluded, and, again, certain tax attributes must be reduced. The reduction of tax attributes leads to a partial postponment of the tax on a COD instead of a full forgiveness IRS (2012) p. 26 ff.

The items mentioned above can only give a general picture. For more information please refer to the documentation from the Internal Revenue Service, for example, to IRS (2012) or to Schwartzman and Brandstetter (2015). Following the prior literature on pricing with and without a COD taxation, I will separately analyze the two extreme cases, the one in which a COD is taxed whenever debt is cancelled and the one in which it is never taxed. More elaborate models can be developed upon what is provided here.

2.2 Basic equations

I start with a simple setting with two points in time as in Krause and Lahmann (2017). I use the same notation and assumptions as in the mentioned paper. The single-period analysis has the advantages that additional assumptions on what happens after default are not necessary and that a simple notation without time subscripts is sufficient. Using simplifying assumptions, the infinite horizon version does not differ from the single period results as will be shown later. The starting point of the single-period analysis is the identity of (expected) cash flows:

$$\begin{aligned} E[{\text {FCF}}^{\text {L}}]=E[{\text {ECF}}] + E[{\text {Int}} + {\text {PP}}]=E[{\text {FCF}}] + E[{\text {TS}}]. \end{aligned}$$
(1)

Levered free cash flows \({\text {FCF}}^{\text {L}}\) are equal to the sum of equity cash flows ECF, as well as debt cash flows, which, in turn, consist of interest payments Int and principal payments PP. Alternatively, levered free cash flows are equal to unlevered free cash flows FCF and tax savings TS. Taking expected values through the operator \(E[\cdot ]\) keeps the identity. Equation (1) can be restated using values and returns

$$\begin{aligned} S\times E[R^{\text {E}}] + D\times E[R^{\text {D}}] = (S + D - {\text {VTS}})\times E[R^{\text {U}}] + E[{\text {TS}}], \end{aligned}$$
(2)

in which

$$\begin{aligned} V^{\text {U}}=V^{\text {L}} - {\text {VTS}}=S + D - {\text {VTS}}, \end{aligned}$$
(3)

here \(V^{\text {L}}\) is the value of the levered firm, \(V^{\text {U}}\) the value of the unlevered firmFootnote 2, S the value of equity, D the value of debt, and VTS the value of tax savings. Furthermore, \(R^{\text {E}}\) is the return on (levered) equity, \(R^{\text {D}}\) is the return on debt, and \(R^{\text {U}}\) is the return on unlevered equity. In the expected value operator they are the respective expected returns. I use R for gross returns and r for net returns, where \(R=1+r\). Values can be obtained by discounting expected cash flows, with \(S=\frac{E[{\text {ECF}}]}{E[R^{\text {E}}]}\) for equity, \(D=\frac{E[{\text {Int}} + {\text {PP}}]}{E[R^{\text {D}}]}\) for debt and \(V^{\text {U}}=\frac{E[{\text {FCF}}]}{E[R^{\text {U}}]}\) for the value of the unlevered firm. Having defined the basic notation and relations, I turn to the risk-free case.

2.3 Risk-free debt and risk-free tax savings

I start with the simple but least relevant practical case for corporate valuations: risk-free debt. Since a COD will never happen with risk-free debt, its tax treatment does not play a role. I assume that the firm generates enough taxable income before interest payments to be able to fully deduct interest payments and to qualify for full interest tax savings. Then, all tax savings are equal to the risk-free interest payments times the tax rate on corporate profits \(\tau\): \({\text {TS}}=\tau \times {\text {Int}}=\tau \times r^{\text {f}} \times D\). Debt yields the risk-free interest rate \(r^{\text {f}}\). I stress here that risk-free debt is a very special case, in which the firm is able to pay off all of its debt obligations in any future state. Leverage potentially affects the firms profitability so that, with higher leverage, risk-free debt becomes more and more unlikely. Compare Krause and Lahmann (2015) for a numerical example.

When debt is risk-free and the mean-variance CAPM is used to obtain expected returns, the following equation shows the relation of the levered and unlevered beta, i.e., of \(\beta _{{\text {E}},{\text {M}}}\) and \(\beta _{{\text {U}},{\text {M}}}\):

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}} = \left( 1 + \frac{D}{S} \times \frac{1 + r^{\text {f}} \times (1-\tau )}{R^{\text {f}}}\right) \times \beta _{{\text {U}},{\text {M}}}. \end{aligned}$$
(4)

Appendix 1 shows the derivation Eq. (4). Equation (32) in Arzac and Glosten (2005) is similar to Eq. (4) when their \(r^{\text {D}}\), a discount rate for debt, is replaced by \(r^{\text {f}}\). Their derivation is actually done in an infinite horizon setting. However, Eq. (4) is the result of a single-period analysis. Adding periods does not add much to the analysis.

The term \(\frac{1 + r^f \times (1-\tau )}{R^f}\) is due to tax savings. With a zero tax rate this term equals one. For \(r^f>0\) and with the tax rate between zero and one (inclusive) this term is less than one. For \(r^f<0\) the term is greater than one.

Equation (4) also shows that higher debt relative to equity, i.e., a higher leverage, scales up the levered beta. With higher leverage relatively more debt payments have to be made from the cash flows coming from the firm’s assets. The remaining diminished cash flows go to equity holders. The return of those cash flows have a higher absolute value of the beta than before the increase in leverage. For positive unlevered beta the levered beta increases. If the unlevered beta is negative, the levered beta decreases. Appendix 1 shows how this occurs.

A higher tax rate decreases the levered beta. A higher tax rate increases tax savings and the value of them, which counters the effect of leverage, i.e., systematic risk is decreased through the presence of risk-free tax savings. Given that debt is risk-free, the risk-free tax savings are earned by equity holders. Therefore, tax savings increase the value of equity.

Equation (4) can also be stated in terms of leverage. This gives

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}} = \frac{1}{1-l} \times \left( 1 - l \times \frac{\tau \times r^{\text {f}}}{R^{\text {f}}}\right) \times \beta _{{\text {U}},{\text {M}}}. \end{aligned}$$
(5)

A zero risk-free rate would also take out the tax effect. In this case interest payment would be zero so that there are no tax savings—just as for a zero tax rate:

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}} = \frac{1}{1-l} \times \beta _{{\text {U}},{\text {M}}}. \end{aligned}$$
(6)

After this introduction for risk-free debt, I turn to the more relevant case in the coporate world, the case of risky debt.

2.4 Risky debt

For risky debt the COD treatment is important. From Krause and Lahmann (2017), I restate the pricing equations for the value of the tax savings VTS for the different cases. Without (superscript NC) and with (superscript C) a tax on debt cancellation the tax savings are, respectively,

$$\begin{aligned} {\text {TS}}^{{\text {NC}}}&=\tau \times {\text {Int}}, \end{aligned}$$
(7)
$$\begin{aligned} {\text {TS}}^{{\text {C}}}&=\tau \times {\text {Int}} - \tau \times C, \end{aligned}$$
(8)

here \(\tau\) is the deterministic tax rate on corporate profits, and C is the amount of debt that is cancelled, i.e., the losses on the principal D—the COD. When C is taxed, the whole tax savings are reduced by \(\tau \times C\). Here it is assumed that \(\tau \times C\) can actually be paid by the defaulting firm. Furthermore, interest payments can be expressed as

$$\begin{aligned} {\text {Int}}=r^{\text {c}} \times D - (L - C), \end{aligned}$$
(9)

in which L are the total losses on debt, i.e., losses on interest \(L^{{\text {Int}}}\) and on principal payments C, and \(r^{\text {c}}\) is the coupon rate.Footnote 3 According to Krause and Lahmann (2017), the values of the tax savings are then

$$\begin{aligned} {\text {VTS}}^{{\text {NC}}}&=\tau \times \frac{r^{\text {f}} \times D}{R^{\text {f}}} + \tau \times p(C)\end{aligned}$$
(10)
$$\begin{aligned} {\text {VTS}}^{{\text {C}}}&=\tau \times \frac{r^{\text {f}} \times D}{R^{\text {f}}}. \end{aligned}$$
(11)

The term p(C) is the price of the COD, i.e., \(p(\cdot )\) is used as a pricing operator. The basis for further derivations is the identity

$$\begin{aligned} S \times E[R^{\text {E}}] + D\times E[R^{\text {D}}] = (S + D - {\text {VTS}}) \times E[R^{\text {U}}] + E[{\text {TS}}]. \end{aligned}$$
(12)

I continue with the case of taxation of a COD.

2.4.1 Taxes on debt cancellation

I use Eq. (12) and substitute in the respective expression for the value of the tax shield and the tax savings from Eqs. (11) and (8):

$$\begin{aligned} S\times E[R^{\text {E}}] + D\times E[R^{\text {D}}] = \left( S + D - \tau \times \frac{r^{\text {f}} \times D}{R^{\text {f}}}\right) \times E[R^{\text {U}}] + \tau \times (r^{\text {c}} \times D - E[L - C]) - \tau \times E[C]. \end{aligned}$$
(13)

The \(\tau \times E[C]\) terms cancel on the rhs and \(r^{\text {c}} \times D - E[L]=D \times E[r^{\text {D}}]\) so that the equation turns to

$$\begin{aligned} S \times E[R^{\text {E}}] + D \times (1 + E[r^{\text {D}}] \times (1-\tau )) = \left( S + D - \tau \times \frac{r^{\text {f}} \times D}{R^{\text {f}}}\right) \times E[R^{\text {U}}]. \end{aligned}$$
(14)

I use the expected return equation for the mean-variance CAPMFootnote 4 of the form

$$\begin{aligned} E[R^{\text {i}}]=R^{\text {f}} + \beta _{{\text {i}},{\text {M}}} \times (E[{\text {R}}^{\text {M}}] - R^{\text {f}}), \end{aligned}$$
(15)

for the expected return, in which i stands for the return on levered equity, on unlevered equity, on tax savings or on debt. The return \(R^{\text {M}}\) is the return on the market portfolio. I obtain

$$\begin{aligned}&S \times (R^{\text {f}} + \beta _{{\text {E}},{\text {M}}}\times (E[R^{\text {M}}] - R^{\text {f}})) + D\times (1 + (R^{\text {f}} + \beta _{{\text {D}},{\text {M}}}\times (E[{\text {R}}^{\text {M}}] - R^{\text {f}})-1)\times (1-\tau ))\nonumber \\&\quad = \left( S + D - \tau \times \frac{r^f \times D}{R^f}\right) \times (R^f + \beta _{U,M}\times (E[R^M] - R^f)). \end{aligned}$$
(16)

This expression can be simplified to

$$\begin{aligned} S \times \beta _{{\text {E}},{\text {M}}} + D\times \beta _{{\text {D}},{\text {M}}}\times (1-\tau ) = \left( S + D - \tau \times \frac{r^{\text {f}} \times D}{R^{\text {f}}}\right) \times \beta _{{\text {U}},{\text {M}}}. \end{aligned}$$
(17)

Rearranging yields

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}} = \left( 1 + \frac{D}{S}\times \frac{1 + r^{\text {f}} \times (1-\tau )}{R^{\text {f}}}\right) \times \beta _{{\text {U}},{\text {M}}} -\beta _{{\text {D}},{\text {M}}}\times \frac{D}{S}\times (1-\tau ). \end{aligned}$$
(18)

When debt has no systematic risk, i.e., when \(\beta _{{\text {D}},{\text {M}}}=0\), the equation reduces to the one with risk-free debt. Again, Arzac and Glosten (2005) have the same result in their Equation (32). However, there is no reference to the treatment of debt cancellation. The following will show that the assumption of taxes on a COD is crucial to obtain this equation. Without it, valuations change and with them the de- und re-levering procedure. It is reasonable to expect the beta for debt to have the same sign as the one for equity. Furthermore, we usually expect positive equity betas together with positive debt betas. That means when the market is doing well as whole, debt contracts will also do better. When the market is not doing well, for example in recessions, there will be more defaults and the return on debt will also be lower. With a positive beta for debt, the additional debt-related term in Eq. (18) reduces the levered beta, which counters the effect of the first term on the rhs of the equation, which increases the levered beta with more leverage.

2.4.2 No taxes on debt cancellation

Proportional loss distribution according to contractual debt payments Krause and Lahmann (2017) derive an equation for the tax shield value for the case of a proportional loss distribution. Proportional loss distribution means that total losses L are distributed proportionally or pro rata according to the contractually agreed debt payments. Losses on interest and principal payments are, respectively, \(L^{{\text {Int}}}=L \times \frac{r^{\text {c}}}{R^{\text {c}}}\) and \(C=L\times \frac{1}{R^{\text {c}}}\). The equation for the value of the tax shield is

$$\begin{aligned} {\text {VTS}}^{{\text {NC}}}&=\tau \times \frac{r^{\text {c}} \times D}{R^{\text {c}}}. \end{aligned}$$
(19)

Tax savings are given by

$$\begin{aligned} {\text {TS}}=\tau \times \left( r^{\text {c}}\times D - L\times \frac{r^{\text {c}}}{R^{\text {c}}}\right) . \end{aligned}$$
(20)

I substitute both equations into Eq. (12) and obtain:

$$\begin{aligned} S\times E[R^{\text {E}}] + D\times E[R^{\text {D}}] = \left( S + D - \tau \times \frac{r^{\text {c}} \times D}{R^{\text {c}}}\right) \times E[R^{\text {U}}] + \tau \times r^{\text {c}} \times D - \tau \times E[L] \times \frac{r^{\text {c}}}{R^{\text {c}}}. \end{aligned}$$
(21)

I rewrite the term on the rhs: \(\tau \times r^c \times D - \tau \times E[L] \times \frac{r^c}{R^c}=\tau \times r^c \times D \times \frac{E[R^D]}{R^c}\). I substitute this into the prior equation to obtain

$$\begin{aligned} S\times E[R^{\text {E}}] + D\times E[R^{\text {D}}]\times \left( 1-\frac{\tau \times r^{\text {c}}}{R^{\text {c}}}\right) = \left( S + D - \tau \times \frac{r^{\text {c}} \times D}{R^{\text {c}}}\right) \times E[R^{\text {U}}]. \end{aligned}$$
(22)

Now, I use the expected return equations from the mean-variance CAPM:

$$\begin{aligned} \nonumber&S\times (R^{\text {f}} + \beta _{{\text {E}},{\text {M}}}\times (E[R^{\text {M}}] - R^{\text {f}})) + D\times (R^{\text {f}} + \beta _{{\text {D}},{\text {M}}}\times (E[R^{\text {M}}] - R^{\text {f}}))\times \left( 1-\frac{\tau \times r^{\text {c}}}{R^{\text {c}}}\right) \nonumber \\&\quad = \left( S + D - \tau \times \frac{r^{\text {c}} \times D}{R^{\text {c}}}\right) \times (R^{\text {f}} + \beta _{{\text {U}},{\text {M}}}\times (E[R^{\text {M}}] - R^{\text {f}})). \end{aligned}$$
(23)

This simplifies to

$$\begin{aligned} S\times \beta _{{\text {E}},{\text {M}}} + D\times \beta _{{\text {D}},{\text {M}}}\times \left( 1-\frac{\tau \times r^{\text {c}}}{R^{\text {c}}}\right) = \left( S + D - \tau \times \frac{r^{\text {c}} \times D}{R^{\text {c}}}\right) \times \beta _{{\text {U}},{\text {M}}}. \end{aligned}$$
(24)

Rearranging for the levered beta I obtain

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}} = \left( 1 + \frac{D}{S}\times \frac{1 + r^{\text {c}} \times (1-\tau )}{R^{\text {c}}}\right) \times \beta _{{\text {U}},{\text {M}}} - \beta _{{\text {D}},{\text {M}}}\times \frac{D}{S}\times \frac{1 + r^{\text {c}} \times (1-\tau )}{R^{\text {c}}}. \end{aligned}$$
(25)

or

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}} = \beta _{{\text {U}},{\text {M}}} + (\beta _{{\text {U}},{\text {M}}} - \beta _{{\text {D}},{\text {M}}})\times \frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}}. \end{aligned}$$
(26)

In most cases, the beta for the unlevered firm is bigger than the one for debt because of priorities of debt cash flows to be paid to debt holders. That means \(\beta _{{\text {U}},{\text {M}}} - \beta _{{\text {D}},{\text {M}}}\) is usually positive and with that the levered beta is greater than the unlevered beta—something that one would intuitively expect. In the less likely case, if \(\beta _{{\text {U}},{\text {M}}} < \beta _{{\text {D}},{\text {M}}}\), then \(\beta _{{\text {U}},{\text {M}}} - \beta _{{\text {D}},{\text {M}}}<0\), and the levered beta is less than the unlevered beta.

Loss distribution not proportional to contractual debt payments As Krause and Lahmann (2015) show, with a pro rata loss distribution according to contractual interest and principal payments, the expected rate of return on debt, i.e., the discount rate on debt \(E[R^{\text {D}}]\), is equal to a weighted average of the expected rates of return on its components, i.e., the one on interest \(E[R^{{\text {Int}}}]\) and the one on principal payments \(E[R^{{\text {PP}}}]\). Notice that \(p({\text {Int}})=\frac{E[{\text {Int}}]}{E[R^{{\text {Int}}}]}\) and \(p({\text {PP}})=\frac{E[{\text {PP}}]}{E[R^{{\text {PP}}}]}\) define the discount rates for interest and principal payments. Krause and Lahmann (2015) show that, with interest or principal prioritization, expected rates of return on debt, interest and principal payments regularly differ. Without a COD taxation, tax savings are just interest payments scaled by the tax rate. Thus, the rate of return and the expected rate of return on interest payments and on tax savings are equal: \(E[R^{{\text {TS}}}]=E[R^{{\text {Int}}}]\).Footnote 5 The expected return on debt as a weighted average of the expected returns on interest and principal payments is

$$\begin{aligned} E[R^{\text {D}}]&= \frac{E[{\text {Int}}] + E[{\text {PP}}]}{D}\nonumber \\&= E[R^{{\text {Int}}}]\times \frac{p({\text {Int}})}{D} + E[R^{{\text {PP}}}]\times \frac{p({\text {PP}})}{D}. \end{aligned}$$
(27)

Since \(D=p({\text {Int}}) + p({\text {PP}})\) and values are positive, the weights \(\frac{p({\text {Int}})}{D}\) and \(\frac{p({\text {PP}})}{D}\) are positive and add up to one. Due to this relation, possible relations of the three expected returns on debt cash flows are:

$$\begin{aligned} E[R^{{\text {TS}}}] = E[R^{{\text {Int}}}]< E[R^{\text {D}}] < E[R^{{\text {PP}}}] \end{aligned}$$
(28)
$$\begin{aligned} E[R^{{\text {TS}}}] = E[R^{{\text {Int}}}] = E[R^{\text {D}}] = E[R^{{\text {PP}}}] \end{aligned}$$
(29)
$$\begin{aligned} E[R^{{\text {TS}}}] = E[R^{{\text {Int}}}]> E[R^{\text {D}}] > E[R^{{\text {PP}}}]. \end{aligned}$$
(30)

In the mean-variance CAPM, the only parameter that leads to different expected returns between different assets is the beta of an asset. The risk-free rate and the equity premium are not dependent on what kind of asset is regarded. Therefore, the respective betas must follow the same ordering as the expected returns:Footnote 6

$$\begin{aligned} \beta _{{\text {TS}},{\text {M}}} = \beta _{{\text {Int}},{\text {M}}}< \beta _{{\text {D}},{\text {M}}} <\beta _{{\text {PP}},{\text {M}}}\end{aligned}$$
(31)
$$\begin{aligned} \beta _{{\text {TS}},{\text {M}}} = \beta _{{\text {Int}},{\text {M}}}=\beta _{{\text {D}},{\text {M}}} = \beta _{{\text {PP}},{\text {M}}} \end{aligned}$$
(32)
$$\begin{aligned} \beta _{{\text {TS}},{\text {M}}} =\beta _{{\text {Int}},{\text {M}}}>\beta _{{\text {D}},{\text {M}}} > \beta _{{\text {PP}},{\text {M}}}. \end{aligned}$$
(33)

To derive equations for betas, I use Eq. (12). I write it down in the form

$$\begin{aligned} S\times E[R^{\text {E}}] + D\times E[R^{\text {D}}] = (S + D - {\text {VTS}}) \times E[R^{\text {U}}] + {\text {VTS}} \times E[R^{{\text {TS}}}]. \end{aligned}$$
(34)

Rearranging, simplifying, and using the CAPM equations leads to

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}}= \frac{S + D}{S}\times \beta _{{\text {U}},{\text {M}}} - \frac{D}{S}\times \beta _{{\text {D}},{\text {M}}} + \frac{{\text {VTS}}}{S}\times (\beta _{{\text {TS}},{\text {M}}} - \beta _{{\text {U}},{\text {M}}}). \end{aligned}$$
(35)

This equation allows the beta of the tax savings to be different than the one for total debt payments. In what follows, I establish equations that are comparable to the case with the pro rata loss distribution. I write the equation for the tax savings as

$$\begin{aligned} \nonumber {\text {VTS}}&=p({\text {TS}})=\tau \times p({\text {Int}})\nonumber \\&=\tau \times p(D \times r^{\text {c}} - L^{{\text {Int}}})\nonumber \\&=\tau \times \left( \frac{D\times r^{\text {c}}}{R^{\text {f}}} - p(L^{{\text {Int}}})\right) \nonumber \\&=\tau \times \frac{D\times r^{\text {c}}}{R^{\text {c}}}\times\left( \frac{R^{\text {c}}}{R^{\text {f}}} - \frac{p(L^{{\text {Int}}})}{D}\times \frac{R^{\text {c}}}{r^{\text {c}}}\right) \nonumber \\&=\tau \times \frac{D r^{\text {c}}}{R^{\text {c}}} + \tau \times \left( p(L)\times\frac{r^{\text {c}}}{R^{\text {c}}} - p(L^{{\text {Int}}}) \right) . \end{aligned}$$
(36)

The first term in the last equality is the equation for the value of the tax savings for a pro rata loss distribution according to contractual debt payments. In case of a pro rata loss distribution, the second term is always zero because then \(p(L^{{\text {Int}}})=p(L)\times\frac{R^{\text {c}}}{r^{\text {c}}}\). With loss distributions not proportional to contractual debt payments, the second term is usually not zero. Using that in Eq. (35), I obtain

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}}=&\left( 1 + \frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}} \right) \times \beta _{{\text {U}},{\text {M}}} - \frac{D}{S}\times \beta _{{\text {D}},{\text {M}}} + \frac{D}{S} \times \frac{\tau \times r^{\text {c}}}{R^{\text {c}}}\times \beta _{{\text {TS}},{\text {M}}} \nonumber \\&+ \frac{F}{S}\times (\beta _{{\text {TS}},{\text {M}}} - \beta _{{\text {U}},{\text {M}}}), \end{aligned}$$
(37)

with \(F=\tau \times \left( p(L)\times \frac{r^{\text {c}}}{R^{\text {c}}} - p(L^{{\text {Int}}}) \right)\). It turns out that additional information is needed. The beta of the returns on tax savings, i.e., on interest payments is needed as well as the price of losses on interest payments.Footnote 7 I define \(\beta _{\Delta {\text {TS}},{\text {M}}}=\beta _{{\text {TS}},{\text {M}}} - \beta _{{\text {D}},{\text {M}}}\). I use this relation and restate Eq. (37) as

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}}=&\left( 1 +\frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}}\right) \times \beta _{{\text {U}},{\text {M}}} - \beta _{{\text {D}},{\text {M}}}\times \frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}} \nonumber \\&+\left( \frac{D}{S} \times \frac{\tau \times r^{\text {c}}}{R^{\text {c}}} + \frac{F}{S}\right) \times \beta _{\Delta TS,M} + \frac{F}{S}\times (\beta _{{\text {D}},{\text {M}}} - \beta _{{\text {U}},{\text {M}}}). \end{aligned}$$
(38)

This allows for a better comparability with Eq. (26), i.e., the equation for the pro rata loss distribution according to contractual principal and interest payments. I continue with two prominent cases of loss distributions: interest and principal prioritization.

Loss distribution not proportional to contractual debt payments—interest prioritization A reasonable non-proportional loss distribution is the case of interest prioritization. Interest prioritization means that principal payments will incur losses first. Only if losses are greater than principal payments, interest will incur losses as well. Relation (28) is usually what we expect in this case.Footnote 8 I simplify further. I assume that interest payments will never incur losses. This is a reasonable assumption as long as interest payments are small relative to principal payments, which is what we mainly observe in practice. Under this assumption, interest payments are risk-free so that \(L^{{\text {Int}}}=0\) in any state. The price of losses on interest payments must be zero as well. It follows that the beta of tax savings is zero. The equation for interest payments turns to

$$\begin{aligned} {\text {Int}}=r^{\text {c}} \times D. \end{aligned}$$
(39)

and tax savings are

$$\begin{aligned} {\text {TS}}^{{\text {NC}}}&=\tau \times r^{\text {c}}\times D. \end{aligned}$$
(40)

I discount this risk-free quantity at the risk-free rate, i.e., \(E[R^{{\text {TS}}}]=R^{\text {f}}\), to obtain the value of the tax savings

$$\begin{aligned} {\text {VTS}}^{{\text {NC}}}&=\tau \times \frac{r^{\text {c}}\times D}{R^{\text {f}}}. \end{aligned}$$
(41)

Equation (37) condenses to

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}} = \left( 1 + \frac{D}{S}\times \frac{R^{\text {f}} - \tau \times r^{\text {c}} }{R^{\text {f}}}\right) \times \beta _{{\text {U}},{\text {M}}} - \beta _{{\text {D}},{\text {M}}}\times \frac{D}{S}. \end{aligned}$$
(42)

As for the pro rata distribution the levered beta is also a combination of the unlevered beta and a scalar as well as the debt beta and a scalar. However, the scalars differ here.

Loss distribution not proportional to contractual debt payments—principal prioritization The other extreme in terms of prioritization is principal payment prioritization. In this case losses are first assigned to interest payments. Only if loses are greater than interest payments, the excess amount of losses is assigned to principal payments. In this case a sensible assumption such as for interest prioritization is not available. Instead, I use Eq. (38) and assumptions on F and the betas.Footnote 9

Given total losses on debt L, the losses on interest payments \(L^{{\text {Int}}}\) must be greater than the pro rata share of total losses: \(L\times\frac{r^{\text {c}}}{R^{\text {c}}} < L^{{\text {Int}}}\). Since this is true for any state in which losses occur, the factor F is less than zero: \(F=\tau \times \left( p(L)\times\frac{r^{\text {c}}}{R^{\text {c}}} - p(L^{{\text {Int}}}) \right) <0\). To parameterize, I assume that the price of losses on interest payments is equal to \(p(L^{{\text {Int}}}) = p(L)\times\left( \frac{r^{\text {c}}}{R^{\text {C}}} + \alpha \right)\), with \(\alpha \in \left( 0, 1-\frac{r^{\text {c}}}{R^{\text {c}}}\right)\) as the percentage that the price of interest losses is higher than the pro rata share of the price of total losses. Using this parameter in the equation for F, I obtain

$$\begin{aligned} F&=-\tau \times \alpha \times p(L) \end{aligned}$$
(43)
$$\begin{aligned}&=-\tau \times \alpha \times D\times \frac{r^{\text {c}} - r^{\text {f}}}{R^{\text {f}}}. \end{aligned}$$
(44)

The second equality follows from the equation for the coupon rate,Footnote 10 which can be rearranged for the price of losses. Equation (38) turns to

$$\begin{aligned} \beta _{{\text {E}},{\text {M}}}=&\left( 1 +\frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}}\right) \times \beta _{{\text {U}},{\text {M}}} - \beta _{{\text {D}},{\text {M}}}\times \frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}} \nonumber \\&+ \tau \times \frac{D}{S}\times \left( \frac{r^{\text {c}}}{R^{\text {c}}} - \alpha \times \frac{r^{\text {c}} - r^{\text {f}}}{R^{\text {f}}}\right) \times \beta _{\Delta {\text {TS}},{\text {M}}} + \tau \times \alpha \times \frac{r^{\text {c}} - r^{\text {f}}}{R^{\text {f}}}\times \frac{D}{S}\times (\beta _{{\text {U}},{\text {M}}} - \beta _{{\text {D}},{\text {M}}}). \end{aligned}$$
(45)

Notice that with \(\alpha \in \left( 0, 1-\frac{r^{\text {c}}}{R^{\text {c}}}\right)\), it follows, for the term attached to \(\beta _{\Delta {\text {TS}},{\text {M}}}\), that \(\frac{r^{\text {f}}}{R^{\text {f}}}<\left( \frac{r^{\text {c}}}{R^{\text {c}}} - \alpha \times \frac{r^{\text {c}} - r^{\text {f}}}{R^{\text {f}}}\right) < \frac{r^{\text {c}}}{R^{\text {c}}}\).Footnote 11 Furthermore, it is reasonable to assume that \(\beta _{\Delta {\text {TS}},{\text {M}}}>0\), i.e., the beta of the returns on tax savings is greater than the one for the returns on debt. Since the return on tax savings and on interest payments are the same, the betas of the two figures are the same as well. With losses first assigned to interest payments, their returns will regularly have a higher beta than the beta for returns on principal payments and the one for returns on debt payments as a whole.Footnote 12

In the next subsection, I will compare the differences of the equations more in detail.

2.5 Overview and discussion

Table 1 shows an overview of the different cases treated in this paper. Some remarks are in order. For debt returns uncorrelated with market returns \(\beta _{{\text {D}},{\text {M}}}=0\), the expected return on debt is equal to the risk free rate: \(E[R^{\text {D}}]=R^{\text {f}}\). However, with risky debt, the coupon rate must be greater than the risk-free rate because the coupon rate must account for the default risk: \(r^{\text {c}}>r^{\text {f}}\). This can also be restated as \(r^{\text {c}}=r^{\text {f}} + \kappa\), in which \(\kappa >0\) is a credit risk premium. Thus, for \(\beta _{{\text {D}},{\text {M}}}=0\), the equations for the risk-free case and for the risky case with taxes on a COD are the same. Krause and Lahmann (2017) found that the value of the tax savings for risky debt with taxed COD is the same as if debt is risk-free. However, without taxes on a COD the equations to adjust betas differ through the involvement of the coupon rate and possible other parameters.

Table 1 Summary of cases and equations

A comparison of the equations in Table 1 makes most sense for the purpose of picking the correct equation given the case of the tax treatment and loss distribution. Then, one can evaluate the bias of picking an incorrect equation.

It does not make sense to pick a set of parameters and then try out all of the equations for the different cases intending to evaluate what the outcome would be given the parameters. The reason is that the different cases may be consistent only with different sets of inputs such as leverage and the coupon rate, because the tax assumption may influence the loss distribution.

I look at the factors scaling up and down the unlevered equity betas and the debt betas.

For the first three equations and the fifth equation notice that \(1+\frac{D}{S}\times \frac{1 + r^{\text {f}}\times (1-\tau )}{R^{\text {f}}}>1+\frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}}\). To see that I rewrite the equations to \(-\tau \times \frac{r^{\text {f}}}{R^{\text {f}}}>-\tau \times \frac{r^{\text {c}}}{R^{\text {c}}}\) and divide by \(-\tau\) to obtain \(\frac{r^{\text {f}}}{R^{\text {f}}}<\frac{r^{\text {c}}}{R^{\text {c}}}\). I multiply by \(R^{\text {f}}\) and \(R^{\text {c}}\), which are both positive. That leads to \(r^{\text {f}} \times (R^{\text {f}} + \kappa )<(r^{\text {f}} + \kappa )\times R^{\text {f}}\) which simplifies to \(0<\kappa\). The credit risk premium must be positive so that the inequality always holds.

Regarding equations three and four, the inequality \(1+\frac{D}{S}\times \frac{1 + r^{\text {c}} \times (1-\tau )}{R^{\text {c}}}>1+\frac{D}{S}\times \frac{R^{\text {f}} - \tau \times r^{\text {c}}}{R^{\text {f}}}\) holds only for \(r^{\text {c}}>0\).

Regarding the first two equations and the forth one, the inequality \(1+\frac{D}{S}\times \frac{1 + r^{\text {f}} \times (1-\tau )}{R^{\text {f}}}>1+\frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {f}}}\) always holds. This can be seen using the same simplifying steps as before.

I continue with the terms attached to the debt betas in equation two and three. For those equations \(-\frac{D}{S}\times (1-\tau )>-\frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}}\). To see this, I simplify to \(\tau >\tau \times \frac{r^{\text {c}}}{R^{\text {c}}}\), divide by \(\tau\) and multiply by \(R^{\text {c}}\) to obtain \(R^{\text {c}}>r^{\text {c}}\) or \(1>0\), which always holds. Regarding equations three and four, for \(r^{\text {c}}>0\) the inequality \(-\frac{D}{S}<-\frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}}\) holds.

Thus, for a positive coupon rate

$$\begin{aligned} 1+\frac{D}{S}\times \frac{1 + r^{\text {f}} \times (1-\tau )}{R^{\text {f}}}>1+\frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}}>1+\frac{D}{S}\times \frac{R^{\text {f}} - \tau \times r^{\text {c}}}{R^{\text {f}}}, \end{aligned}$$
(46)

and

$$\begin{aligned} -\frac{D}{S}\times (1-\tau )>-\frac{D}{S}\times \frac{1 + r^{\text {c}}\times (1-\tau )}{R^{\text {c}}}>-\frac{D}{S}. \end{aligned}$$
(47)

That means for a positive coupon rate and for positive betas for debt and unlevered equity, the levered beta will be smaller going down the equations in Table 1 until the forth equation.

For positive betas debt betas may be larger than unlevered equity betas. For high debt betas and since the term including the debt beta is subtracted, the beta for levered equity may even be less than the one for unlevered equity, i.e., \(\beta _{{\text {E}},{\text {M}}}<\beta _{{\text {U}},{\text {M}}}\). I will have a quick look at \(\beta _{{\text {D}},{\text {M}}}\) when \(\beta _{{\text {E}},{\text {M}}}=\beta _{{\text {U}},{\text {M}}}\). For the second case, the case with a taxation of a COD, I obtain

$$\begin{aligned} \beta _{{\text {D}},{\text {M}}}=\beta _{{\text {E}},{\text {M}}}\times\frac{1 + r^{\text {f}}\times (1 - \tau )}{R^{\text {f}} (1 - \tau )}, \end{aligned}$$
(48)

in which \(\frac{1 + r^{\text {f}}\times (1 - \tau )}{R^{\text {f}}\times (1 - \tau )}>1\). For example for \(r^{\text {f}}=0.02\) and \(\tau =0.3\) the term is about 1.4 so that \(\beta _{{\text {D}},{\text {M}}}\) has to be more than 1.4 times bigger than \(\beta _{{\text {E}},{\text {M}}}\) to obtain \(\beta _{{\text {E}},{\text {M}}}<\beta _{{\text {U}},{\text {M}}}\).

For the pro rata case the debt beta for \(\beta _{{\text {E}},{\text {M}}}=\beta _{{\text {U}},{\text {M}}}\) is

$$\begin{aligned} \beta _{{\text {D}},{\text {M}}}=\beta _{{\text {E}},{\text {M}}}, \end{aligned}$$
(49)

so that \(\beta _{{\text {D}},{\text {M}}}\) has to be greater than \(\beta _{{\text {E}},{\text {M}}}\) to obtain \(\beta _{{\text {E}},{\text {M}}}<\beta _{{\text {U}},{\text {M}}}\).

For interest prioritization with the assumption of certain interest payments, I obtain

$$\begin{aligned} \beta _{{\text {D}},{\text {M}}}=\beta _{{\text {E}},{\text {M}}}\times\frac{R^{\text {f}} - \tau \times r^{\text {c}}}{R^{\text {f}}}, \end{aligned}$$
(50)

in which \(\frac{R^{\text {f}} - \tau \times r^{\text {c}}}{R^{\text {f}}}=1 - \tau \times r^{\text {c}}/R^{\text {f}}\) is less than one for \(r^{\text {c}}>0\). For example, for \(r^{\text {f}}=0.02\), \(\tau =0.3\) and \(r^{\text {c}}=0.06\) the term is 0.98. Thus, \(\beta _{{\text {D}},{\text {M}}}\) does not even have to be greater than the beta of unlevered equity for \(\beta _{{\text {E}},{\text {M}}}<\beta _{{\text {U}},{\text {M}}}\).

The fifth equation treats principal prioritization. The equation requires the additional parameters \(\alpha\) and \(\beta _{\Delta {\text {TS}},{\text {M}}}\). It was constructed so that the first part of the equation is the same as the equation for the pro rata case. Thus, the two additional terms in the equation for principal prioritization versus the equation for the pro rata case determine whether the resulting levered equity beta is greater, less than or equal to the levered equity beta for the pro rata distribution, when equal inputs are used. As was mentioned earlier, \(\beta _{\Delta {\text {TS}},{\text {M}}}>0\) is a reasonable assumption for principal prioritization. Furthermore, for a positive risk-free rate the term \(\tau \times \frac{D}{S}\times \left( \frac{r^{\text {c}}}{R^{\text {c}}} - \alpha \times \frac{r^{\text {c}} - r^{\text {f}}}{R^{\text {f}}}\right)\) is always positive. Assuming that \(\beta _{{\text {U}},{\text {M}}}>\beta _{{\text {D}},{\text {M}}}\), makes the two additional terms positive. This leads to a levered equity beta that is greater than the one under the assumption of a pro rata distribution.

I provide a simple numerical example with the objective to observe the potential error from picking the incorrect procedure, i.e., the incorrect equation, given that the betas for unlevered equity and for debt are known. For the example, I choose the following parameters: \(\tau =0.3\), \(l=0.6\) (implying \(D/S=1.5\)), \(\beta _{{\text {U}},{\text {M}}}=0.9\), \(\beta _{{\text {D}},{\text {M}}}=0.4\), \(r^{\text {f}}=0.02\), \(\alpha =0.2\), \(\beta _{\Delta {\text {TS}},{\text {M}}}=0.3\) and \(r^{\text {c}}=0.06\). Table 2 shows that the betas for levered equity vary from 2.24 down to 1.63. That would imply expected returns for unlevered equity from 15.45 % down to 11.76 % for this example. The table shows that picking the incorrect equation has the potential for substantial errors in discount rates, and thus for substantial misvaluations. Especially assuming risk-free debt when debt is actually risky has a big impact in this example because the additional term with the debt beta, which reduces the levered beta, is not used for risk-free debt. The assumption of a COD taxation also has a significant impact on the beta and the associated expected return. In turn, the figures presented without a COD taxation do not differ markedly. The reason is that for those equations and for a moderate risk-free rate as well as coupon rate, all the factors multiplied with \(\beta _{{\text {E}},{\text {M}}}\) and with \(\beta _{{\text {D}},{\text {M}}}\) do not differ a lot. Furthermore, the two additional terms in the equation for principal prioritization are small for reasonable parameters, such as the ones used in the example.

Table 2 Summary of cases and equations

3 Levered and unlevered beta in an infinite period setting with constant leverage

I take a simple approach and assume constant leverage and independently and identically distributed (i.i.d.) returns. At any point in time and in any state debt and equity cash flows have the same return distribution. The risk-free rate is assumed to be constant. The firm issues only single-period debt. It adjusts debt and equity at each point in time to keep leverage constant. Even after a default leverage is kept constant by whoever is the (new) owner of the company.

The returns are now defined as \(R^{\text {E}}_{t+1}=\frac{{\text {ECF}}_{t+1} + S_{t+1}}{S_t}\) for the return on levered equity, \(R^{\text {U}}_{t+1}=\frac{{\text {FCF}}_{t+1} + V^{\text {U}}_{t+1}}{V^{\text {U}}_t}\) for the return on unlevered equity, and \(R^{\text {D}}_{t+1}=\frac{{\text {DCF}}_{t+1} + D_{t+1}}{D_t}\) for the return on debt.

The following identity must hold at any time:

$$\begin{aligned} {\text {ECF}}_{t+1} + S_{t+1} + {\text {DCF}}_{t+1} + D_{t+1} = {\text {FCF}}_{t+1} + V^{\text {U}}_{t+1} + {\text {TS}}_{t+1} + {\text {VTS}}_{t+1}. \end{aligned}$$
(51)

The identity holds the same way taking conditional expectations:

$$\begin{aligned}&E_t[{\text {ECF}}_{t+1}] + E_t[S_{t+1}] + E_t[{\text {DCF}}_{t+1}] + E_t[D_{t+1}]\nonumber \\&\quad = E_t[{\text {FCF}}_{t+1}] + E_t[V^{\text {U}}_{t+1}] + E_t[{\text {TS}}_{t+1}] + E_t[{\text {VTS}}_{t+1}]. \end{aligned}$$
(52)

I use the definitions of returns to obtain

$$\begin{aligned} S_t \times E_t[R^{\text {E}}_{t+1}] + D_t \times E_t [R^{\text {D}}_{t+1}] = (S_t + D_t - {\text {VTS}}_t)\times E_t[R^{\text {U}}_{t+1}] + E_t[{\text {TS}}_{t+1}] + E_t[{\text {VTS}}_{t+1}]. \end{aligned}$$
(53)

I represent the value of the tax savings as discounted next period’s tax savings and discounted next period’s value of all future tax savings, in which I assign two different discount rates to the respective figures:

$$\begin{aligned} {\text {VTS}}_t=\frac{E_t[{\text {TS}}_{t+1}]}{E_t[R^{{\text {TS}}}_{t+1}]} + \frac{E_t[{\text {VTS}}_{t+1}]}{E_t[R^{{\text {VTS}}}_{t+1}]}. \end{aligned}$$
(54)

The first part, \(\frac{E_t[{\text {TS}}_{t+1}]}{E_t[R^{{\text {TS}}}_{t+1}]}\), is the value of the tax savings of the next period. This value can be represented through the equations presented in the single-period model—only the time subscripts need to be adjusted. The second part, \(\frac{E_t[{\text {VTS}}_{t+1}]}{E_t[R^{{\text {VTS}}}_{t+1}]}\), is the value at time t of all tax savings incurred after \(t+1\). An important result of Miles and Ezzell (1985) is that for constant leverage the value of the tax shield is discounted at the discount rate for unlevered equity. Krause and Lahmann (2015) find that this is due to the tax shield value being proportional to the value of the unlevered firm when leverage is constant. They also find that the result holds in an i.i.d. setting with risky debt.

Since I assume constant leverage here, the equation for the tax shield value can be restated with \(E_t[R^{{\text {VTS}}}_{t+1}]=E_t[R^{{\text {U}}}_{t+1}]\).

$$\begin{aligned}&S_t \times E_t[R^{\text {E}}_{t+1}] + D_t \times E_t [R^{\text {D}}_{t+1}] \nonumber \\&\quad =\left( S_t + D_t - \left[ \frac{E_t[{\text {TS}}_{t+1}]}{E_t[R^{{\text {TS}}}_{t+1}]} + \frac{E_t[{\text {VTS}}_{t+1}]}{E_t[R^{\text {U}}_{t+1}]}\right] \right) \times E_t[R^{\text {U}}_{t+1}] + E_t[{\text {TS}}_{t+1}] + E_t[{\text {VTS}}_{t+1}], \end{aligned}$$
(55)

Simplifying, the equation condenses to the same equation as for the single period [Eq. (12)]:

$$\begin{aligned} S_t \times E_t[R^{\text {E}}_{t+1}] + D_t \times E_t [R^{\text {D}}_{t+1}] = \left( S_t + D_t - \frac{E_t[{\text {TS}}_{t+1}]}{E_t[R^{{\text {TS}}}_{t+1}]} \right) \times E_t[R^{\text {U}}_{t+1}] + E_t[{\text {TS}}_{t+1}]. \end{aligned}$$
(56)

Thus, the same equations for the de- and re-levering procedures of betas must follow for the infinite horizon case with constant leverage in an i.i.d. world. Equations of Table 1 can also be applied for this case.

4 Conclusion

I provide equations to re- and de-lever betas under different assumptions regarding the riskiness of debt and the taxation of a COD. When a COD is not taxed, the distribution of losses on interest and principal payments becomes important. I look at five cases: the risk-free case, the risky case with a taxed COD, the risky case without a taxed COD and pro rata distributed losses, the risky case without a taxed COD and interest prioritization, and the risky case without a taxed COD and principal prioritization. I find that equations differ substantially so that the application of the incorrect procedure potentially leads to big errors in determining the discount rate. Additionally, the same equations for de- and re-levering betas for a single-period setting hold for a simple i.i.d. infinite horizon setting with constant leverage.