Spectral risk measure of holding stocks in the long run

We investigate how the spectral risk measure associated with holding stocks rather than a risk-free deposit, depends on the holding period. Previous papers have shown that within a limited class of spectral risk measures, and when the stock price follows speciﬁc processes, spectral risk becomes negative at long periods. We generalize this result for arbitrary exponential Lévy processes. We also prove the same behavior for all spectral risk measures (including the important special case of Expected Shortfall) when the stock price grows realistically fast and when it follows a geometric Brownian motion or a ﬁnite moment log stable process. This result would suggest that holding stocks for long periods has a vanishing downside risk. However, using realistic models, we ﬁnd numerically that spectral risk initially increases for a signiﬁcant amount of time and reaches zero level only after several decades. Therefore, we conclude that holding stocks has spectral risk for all practically relevant periods.


Introduction
There has been tremendous interest both from practitioners and researchers on the question of how risky it is to hold stocks rather than a risk-free deposit in the long run. The existence of time diversification, meaning that the risk of holding stocks decreases with the time horizon, has profound effects on all long-term investments such as pension savings and target date funds. Industry practitioners hold the common wisdom that the longer the intended holding period for a portfolio, the more it should contain risky, rather than risk-free investments.
The academic literature on the topic is more controversial (Bennyhoff 2009). Siegel (1998) showed that for investment horizons of at least 15 years, the risk of holding stocks (measured by realized variance) is lower than the risk of holding bonds or Treasury bills, and the risk of holding stocks is decreasing over time. However, Pástor and Stambaugh (2012) argue that stocks are more volatile in the long run from the perspective of an investor. The reason is that investors do not know the parameters of the stock price process (especially not the conditional expected return), and by making noisy predictions, at a 50-year horizon they face about 1.3 times higher return variance per year than at a 1-year horizon. Modeling the investors' beliefs, Avramov et al. (2017) find that there could be investors who perceive that the mean reversion of stocks is weaker, and hence they consider stocks riskier in the long run.
One can consider the probability that at time t the stock investment value S(t) is smaller than the risk free deposit value S 0 e rt , where S 0 is the initial investment and r is a constant riskfree rate. This probability decreases with t, supporting the notion that the risk of the stock investment decreases with the holding period. However, as noted by Harlow (1991), this concept does not take into account the difference between S 0 e rt and S(t), and this difference can, of course, be larger and larger as t increases. Bodie (1995) approached the concept of risk differently by quantifying risk as the price of a put option that guarantees the risk-free payout S 0 e rt at time t. The price of the corresponding put option was found to be increasing as t increases, which can be interpreted as the risk of holding stocks increases over time. Wilkie (2001) criticized the methodology of Bodie (1995) by saying that if we wish to guarantee that we get at least the proceeds of a risk-free investment, then (since buying the option also has some costs) we have to invest everything in the risk-free investment, and pricing the option makes no sense. Moreover, Ferguson and Dean (1996) argued that the method of Bodie (1995) forces one to conclude that the risk of an asset relative to the risk of any other asset increases with the investment horizon, leading to a paradox.
Following Treussard (2006); Nguyen et al. (2012), we consider the risk of holding a stock rather than a risk-free deposit over time. More precisely, we analyze the spectral risk measure (Acerbi 2002) is positive (negative), then the stock performs poorly (well), in comparison with the risk free money market account. Spectral risk measures are coherent (Artzner et al. 1999(Artzner et al. , 2007 and can be expressed as a weighted average of losses, with increasing weights for higher losses. Thus we can think of the spectral risk of Y (t) as a weighted average regret value associated with having bought the stock, rather than investing in a money market account.
As a special spectral risk, we consider Expected Shortfall (ES) (Acerbi and Tasche 2002;Rockafellar and Uryasev 2002), which can be computed and estimated efficiently (Acerbi and Tasche 2002) and is backtestable (Acerbi and Szekely 2014). Moreover, in 2013 the Basel Committee of Banking Supervision took the decision (Basel Committee 2013) of using ES for capital adequacy internal models, underscoring the importance of ES as a practical risk measure.
To describe the possible evolution of stock prices, we consider different stochastic processes. Besides the standard geometric Brownian motion (GBM) model, we analyze the exponential Lévy model (see for instance Mandelbrot (1963), Bertoin (1996), McCulloch (1996, Cai andKou (2011), Fu andYang (2012), Egami and Oryu (2015), and Fusai and Kyriakou (2016)). The exponential Lévy process, of which GBM is a special case, offers more flexibility to model the jumps and the heavy tail behavior of stock prices. As an illustration, we will investigate a particular exponential Lévy subclass called the Finite Moment Log Stable (FMLS) model (Carr and Wu 2003). In the FMLS model, moments of the logarithmic return exist on the gain side, but are infinite on the loss side. However, all moments exist for the price process, as the name FMLS indicates. Thus the FMLS price process can capture heavier tails for losses, while free from complications due to infinite moments.
There are two distinct regimes where the long-time behavior of risk is different. Depending on the parameters of the stock price process (the drift μ and the convexity adjustment κ), we distinguish between a high growth regime (μ > r + κ) and a medium growth regime (r + κ ≥ μ > r ). Our results extend those of Nguyen et al. (2012), since we treat the exponential Lévy model in general and the FMLS model in particular, and we have results for Expected Shortfall both in the GBM and in the FMLS models. We prove in case of a GBM stock process that in the high growth regime all spectral risk measures (including Expected Shortfall) will be negative at high t. We also investigate general exponential Lévy models and provide conditions under which in both at the high growth and at the medium growth regimes spectral risk will be negative at high t. For the FMLS model, we obtain the same results as for the GBM model, that is, in the high growth regime all spectral risk measures will be negative at high t.
This result would suggest that holding stocks for long periods has a vanishing downside risk. However, using realistic price process models, we find numerically that the spectral risk initially increases for a significant amount of time and reaches zero level only after several decades. This result indicates that holding stocks has spectral risk for all practically relevant periods. This conclusion is found to be insensitive to reasonable changes in parameters.
The structure of the paper is as follows. Spectral risk measures and their Choquet representation are given in Sect. 2. Section 3 contains the investigated stochastic processes, Sect. 4 provides the analytical results. Section 5 contains numerical results, Sect. 6 concludes.

Spectral risk measures
Consider the random variable X that represents the loss distribution of a portfolio at the future time t, as seen at the current time. Thus positive values of X are representing losses, negative values of X are representing profits. The set of the random loss variables is denoted by X .
Definition 2.1 The risk measure ρ : X → R is called a coherent risk measure (Artzner et al. 1999) if it satisfies the following properties: • monotonicity: where e −rt is the discount factor at time t.
Throughout this paper we will assume a constant risk-free rate r . The discount factor e −rt is omitted in some papers, but it is especially important for us here, as we will investigate long horizons.
Coherent risk measures have a sound axiomatic foundation (Artzner et al. 1999), can be related to Bellman's principle, are supported by general equilibrium theory (Csóka et al. 2007) and they can also capture liquidity risk (Acerbi and Scandolo 2008). Moreover, coherent risk measures have also been used as an objective function in inventory problems (Ahmed et al. 2007;Philpott and De Matos 2012) considered the incorporation of a time-consistent coherent risk measure into a multi-stage stochastic programming model.
As a prominent coherent risk measure, Expected Shortfall (ES) (Acerbi and Tasche 2002;Rockafellar and Uryasev 2002) can be computed and estimated efficiently (Acerbi and Tasche 2002), and it is backtestable (Acerbi and Szekely 2014). Moreover, in 2013 the Basel Committee of Banking Supervision took the decision (Basel Committee 2013) of using ES for capital adequacy internal models. Given a random loss variable X ∈ X and a confidence level α ∈ [0, 1], ES α (X ) is the average of the loss in the worst 100(1 − α)% cases. As an example, α = 90%, and ES 90% is the average of the loss in the worst 10% of the cases.
To define ES formally in our setting, we need some more notation. Given a random loss variable

Definition 2.2 Given a random loss variable
For α = 1, ES 1 , that is the (discounted) maximum loss, is defined as where ess. sup{X } represents the essential supremum of the random loss variable X . Spectral risk measures (Acerbi 2002) are also coherent and they are generalizing ES to get the weighted average of the losses with increasing weights as follows. Definition 2.3 Given a random loss variable X ∈ X , the spectral risk measure of X , ρ φ (X ) is defined as where φ ∈ L 1 ([0, 1]), and the following properties are true: Note again that all the usual formulas are written for losses, not gains, and they are adjusted with the e −rt discount factor. As it is clear from its definition, ES is a spectral risk measure that we will use as a specific example in our numerical calculations. Interestingly, Miller and Ruszczyński (2008) considered a portfolio optimization problem with different risk measures including spectral risk measures. We will work with spectral risk measures throughout this paper, but we will use an equivalent definition derived as a special case of Choquet integrals (Choquet 1954). This formulation also appears in decision theory to model uncertainty (see for instance Grabisch and Labreuche (2010) and Labreuche and Grabisch (2018)). Choquet integral risk measures (see for instance Sriboonchita et al. (2009)) are defined as follows.

Definition 2.4 Given a random loss variable
where h, the so-called distortion function (Wang 1996) is non-decreasing and satisfies h(0) = 0 and h(1) = 1. Nguyen et al. (2012) notes that for concave distortion functions h there exists a connection between Choquet integral risk measures and spectral risk measures In this work, we will refer to spectral risk measures and Choquet integral risk measures interchangeably. A distortion function h is non-degenerate, if there exists p > 0 such that h ( p) > 0, or, equivalently, there exists q < 1 such that φ(q) > 0.
Following Treussard (2006); Nguyen et al. (2012), we consider Y (t) = S 0 e rt − S(t) as the loss variable whose spectral risk we want to calculate using Definition 2.4. If Y (t) is positive (negative), then the stock performs poorly (well), in comparison with the risk-free deposit. We can also think of Y (t) as a regret value associated with having bought the stock, rather than holding our investment in a risk-free deposit.
Throughout the paper, we will restrict our attention to non-degenerate distortion functions. This means that we preclude the degenerate maximum loss risk measure from further investigation.

Investigated models
Let {S(t)} t≥0 be a positive valued stochastic process which represents the stock price on the market. As usual, this stochastic process is assumed to be defined on a complete filtered probability space ( , F , {F t } t≥0 , P).
In the geometric Brownian motion model (GBM), the stock price follows the stochastic process where μ > 0 is the drift, σ > 0 is the volatility, and {W t } t≥0 is a Wiener process. The standardized log return of this process {S(t)} t≥0 is distributed as where [z] is the standard normal distribution function. Exponential Lévy processes, of which GBM is a special example, offer more flexibility to model heavier tail behaviour. In the exponential Lévy model, the stock price follows the stochastic process where X t is a Lévy process fully determined by its Lévy triplet T, μ is the drift, and κ X is the convexity adjustment term that can be calculated as The convexity adjustment κ X plays the role of an exponential compensator in the sense that the process {e −κ X t+X t } t≥0 is an ({F t } t≥0 , P)-exponential-martingale. Note that κ X does not depend on time. We will denote the distribution function of the Lévy process at a given time t by F t .
We will consider a specific exponential Lévy model, called Finite Moment Log Stable (FMLS) model. In the FMLS model, the stock price movement follows the stochastic process (Carr and Wu 2003) where μ is the drift, σ is the scale of volatility, L α t is alpha stable distributed with shape parameters α ∈ (1, 2) and β = −1, with scale parameter t 1 α , and with zero mean, that is L α t ∼ S(α, −1, t 1 α , 0) (see more details in Carr and Wu (2003)). Note that the only remaining free parameter for the distribution is α. The α → 2 limit corresponds to the GBM model. Decreasing α results in heavier tails for the losses. As before, κ is the convexity adjustment term for the FMLS model, given as Note that κ is positive due to the properties of the sec function, as α ∈ (1, 2). Now the log return of the S(t) process becomes ln therefore its distribution is S(α, −1, σ t 1 α , (μ − κ)t) for a fixed t. Stable distributions have the following important property (see for example page 24 of Janicki and Weron (1994)).

Analytical results
The main question of this paper is the long horizon behavior of the risk curve ρ h (Y (t)). To give some insights, let us discuss some general qualitative features of the risk curve. At t = 0, ρ h (Y (0)) = 0, since initially the stock price is deterministic. Roughly speaking, there are two quantities affecting spectral risk, the mean and dispersion of the stock price distribution. Greater mean leads to lower risk, while greater dispersion results in higher risk. For short times, the dispersion grows much faster than the mean, leading to initially increasing risk curves. The question is whether risk starts to decrease at a certain time or keeps rising. If ρ h (Y (t)) becomes negative for a large t, it must have decreased from its positive maximum. The above described qualitative features of the risk curve will be demonstrated with numerical examples in Sect. 5. As discussed in the literature (see Hull (2012) and Ross (2015)), we will work under the statistical (real-world) measure in this paper. There are two distinct regimes where the long-time behavior of risk is different. The case when μ > r + κ we call the high growth regime. When r + κ ≥ μ > r , we are in the medium growth regime. The case μ ≤ r (low growth regime) is not realistic, as it would mean that the risky asset grows slower on average than the risk-free asset. Nguyen et al. (2012) investigated two exponential Lévy Processes, GBM and exponential Poisson. They proved that h > c > 0 is a sufficient condition in both regimes for the risk to be negative at high t. In the high growth regime, they also proved this behavior if h is continuous. We reinvestigated the GBM model and proved the new result that in the high growth regime all spectral risk measures would be negative at high t. Moreover, for general exponential Lévy models, we proved that in both high growth and medium growth regimes if h > c > 0, then spectral risk will be negative at high t. We also investigated a specific exponential Lévy model, the FMLS model, and we obtained the same results as for the GBM model, that is in the high growth regime all spectral risk measures will be negative at high t.

GBM model
In this section, we assume that the stock process S(t) is a geometric Brownian motion.
The formula derived by Nguyen et al. (2012) for the spectral risk, adjusted to our definition that takes into account discounting, is: We have the following own result.

Theorem 4.1 In the GBM stock price model, ρ h (Y (t)) is negative for a large t if μ > r + σ 2 2 .
Proof Since h is a monotonically decreasing function in a closed interval, it can only have a countable number of jump points, so the function is almost surely continuous everywhere.
Since we only consider non-degenerate distortion functions, where c > 0. For the integral in (9), we obtain the following lower bound. Since h ≥ 0 everywhere, we can assess that where the last equation follows from integrating the density function of a normally distributed random variable with mean σ √ t and standard deviation 1 on the interval [a , b]. Using the mean value theorem, for a d ∈ [a , b] we assess that Therefore, we can deduce that The last expression goes to infinity if t goes to infinity, since we assumed that μ−r − σ 2 2 > 0. It follows that ρ h (Y (t)) in (9) goes to minus infinity and hence becomes negative for a large t.

Arbitrary exponential Lévy models
In this section, we assume that the stock process S(t) is an exponential Lévy process. For a brief introduction, see Sect. 3. First, we derive an own formula for the spectral risk in this model.

Theorem 4.2 In an arbitrary exponential Lévy model, the spectral risk of Y (t) is
where F t is the distribution function of the Lévy process at a given time t, and f t is its density function.
Proof Using the Choquet representation in Definition 2.4, we get that We conduct a substitution to express this formula with the distribution function of the Lévy process. To do so, we further shape 1 − F Y (t) (x). We know that the log return of the stock in this model can be written as Using Eq. (16), we obtain the following In Eq. (15), performing the substitution z = ln(e rt − x (18) where C = (r − μ + κ X )t. Integrating by parts, we obtain that Substituting the values on the boundary points using h(0) = 0, h(1) = 1 we can assert that Applying Theorem 4.2, we prove the same long-time behavior for arbitrary Lévy models as was proven by Nguyen et al. (2012) in the case of GBM.

Theorem 4.3 In an arbitrary exponential Lévy model ρ h (Y (t)) is negative for a large t if h (·) is bounded below with a c > 0 constant, and if μ > r.
Proof Due to the definition of κ X and by the assumption of the theorem and therefore, goes to infinity as t goes to infinity, implying that ρ h (Y (t)) also goes to minus infinity and ρ h (Y (t)) becomes negative for a large t.

FMLS model
In this section, we assume that the stock process S(t) is an FMLS process. For a brief introduction, see Sect. 3.

Theorem 4.4 In the FMLS stock price model, the spectral risk of Y (t) is
Proof The proof is similar to that of Theorem 4.2. The main difference appears in the variable substitution. Due to the properties of the stable variables (Proposition 3.1) we can express the risk without time dependency.
Performing the substitution z = integrating by parts and utilising h(0) = 0 and h(1) = 1, we obtain ρ h (Y (t)) as stated in the theorem.

Theorem 4.5 In the FMLS stock price model, ρ h (Y (t)) is negative for a large t if h (·) is bounded below with a c > 0 constant, and if μ > r.
Proof This is a special case of Theorem 4.3, since the FMLS process is an exponential Lévy process.

Theorem 4.6 In the FMLS stock price model, ρ h (Y (t)) is negative for a large t if μ > r + κ.
Proof Since h is a monotonically decreasing function in a closed interval, it can only have a countable number of jump points, so the function is almost surely continuous everywhere.
Since we only consider non-degenerate distortion functions, where c > 0. As for the integral in (22), we obtain the following lower bound.
Next, we will show that α (z) is positive on an appropriate subinterval [a , b ] of [−∞, b]. We know that α (z) is the density function of a stable distributed random variable with parameters (α, −1, 1, 0). Moreover, on [−∞, b] the cumulative distribution function of this random variable is not constant, hence the density function is surely positive at some points. According to Proposition 3.12 on page 102 of Cont and Tankov (2004), if a Lévy process is infinitely active, then the density function of this process is continuous for every t. Carr and Wu (2003) showed that the stable process is infinitely active in case 1 < α < 2, meaning that it has an infinite number of jumps over any time interval. Hence we can assess that the process L α,−1 Denote the minimum value of α (z) on [a , b ] by c > 0. Since h ≥ 0 everywhere, we obtain that The last expression goes to infinity if t goes to infinity, since we assumed that μ > r + κ. It follows that ρ h (Y (t)) in (22) goes to minus infinity and hence becomes negative for a large t.

Numerical results
To illustrate the implications of our theoretical calculations, in this section we present numerical results performed with realistic model parameters. We obtained the daily closing prices on the dividend-adjusted SP500 index to represent the risky "stock" in our model. The cash deposit rate was estimated using data on the annualized returns on 3-month US treasury bills. Both time series were taken from 1960 to 2016. Calibrating the historical data to the GBM model yielded the following estimation for the parameters: r = 0.048, μ = 0.089, To illustrate the effect of stronger tails in the loss distribution, and to offer some robustness test, we also calibrated a set of FMLS models with four different α parameters: α = 1.8, 1.85, 1.9, 1.95. We used the same r = 0.048, μ = 0.089 values as for the GBM model. At each α, we set σ α such that the difference between the first and the third yearly return quartiles are matched with that of the GBM model. This procedure ensures consistency between the different models in terms of short-term volatility. All five of these models represent realistic test cases for long-term investments in a well-diversified US equity portfolio.
All the models are in the "high growth regime" as defined in Sect. 4. Using our results from Sect. 4, we computed the Expected Shortfall at the 90% level as an illustrative risk measure. We performed the calculation for all five calibrated models for different holding periods, up to one hundred years, with monthly steps. The value of the initial stock investment was set to unity. The distribution functions of the normal and stable distributions are readily available in MatLab, the necessary integrations were performed with numerical quadrature, also available in MatLab. The results are plotted in Fig. 1. The risk curves start at zero, increase up to a maximum value, then, in accordance with our analytical results, drop down to and below zero. At any given holding period, the GBM model produces the lowest risk levels, while the FMLS curves, especially the ones with relatively low α parameters, show higher risk values. This is due to the heavier tails in the FMLS distribution on the loss side.
As discussed in the seminal paper of Artzner et al. (1999), coherent risk measures can be interpreted as the amount of reserve cash necessary to lower market exposure to a tolerable level. Although the numerical results shown underscore our analytic conclusion that the risk measure becomes zero at some holding time for all the investigated models, note the time scale on the x axis. The necessary reserve cash becomes zero only at around, or beyond, one hundred years. The risk of holding stocks remains significant even after one hundred years, when the stock return distribution has heavy tails (see the curves with the smallest alpha values).
On the other hand, at long but realistic holding periods (a few decades), we need quite a sizable cushion to mitigate risk. According to Fig. 1, the necessary cash reserve amounts to about half of the initial stock value.  To perform some sensitivity tests, let us consider the effect of changing the parameters. In particular, we will explore parameter modifications that lower the risk measure. For most users, decreasing the 90% confidence level for the Expected Shortfall is not an option. Increasing the confidence level would lead to higher risks, since then the average of even worse outcomes would be calculated. To decrease risk, volatility should be decreased or the difference between μ and r should be increased in our models. In Fig. 2, we show risk curves with a reduced volatility of σ G B M = 0.125. In Fig. 3, the curves are calculated with the increased drift of μ = 0.11. All other parameters are the same as in Fig. 1. Our models with these modified parameters represent optimistic, but still realistic market views. Although the risk curves with the modified parameters shifted down as expected, our conclusions remain essentially the same: Risk remains significant for decades.

Conclusion
For long-run investors, it is crucial to know how to evaluate the risk associated with holding stocks rather than a risk-free deposit. We analyzed spectral risk, which can be interpreted as a weighted average regret value associated with having bought the stock, rather than investing in a money market account.
We show that for arbitrary exponential Lévy processes and when all possible outcomes have strictly positive weight, spectral risk becomes negative at long periods. We also prove the same behavior for all spectral risk measures (including the important special case of Expected Shortfall) when the stock price is in the high-growth regime and when it follows a GBM or an FMLS process.
We leave it as an open question whether Expected Shortfall becomes negative for arbitrary exponential Lévy processes in the high growth regime. Moreover, in a more realistic model, as well as in standard industry practice, for such long time horizons a stochastic interest rate model would be necessary. Since coherent measures of risk are extended to a dynamic setting with stochastic interest rates by Riedel (2004), it would be interesting to consider a model with a correlated joint distribution of the future stock price and the discount factor.
Our results show that in the long run, the steady growth of the expected stock price overrides its uncertainty in terms of the risk involved. However, how long is the long run in practice? To answer this question, we calibrated the models to historical market data and performed numerical calculations. We find that the risk as defined in our paper indeed becomes negative, but it remains significantly positive for decades, even for optimistic market views. Increasing the tails in the FMLS process, or adding model or parameter uncertainty makes the situation even worse. Also supported by the sensitivity tests, our final conclusion is that holding stocks is risky for all relevant holding periods.