Multi-period power utility optimization under stock return predictability

Bodnar, Taras; Ivasiuk, Dmytro; Parolya, Nestor; Schmid, Wolfgang

doi:10.1007/s10287-023-00434-6

Multi-period power utility optimization under stock return predictability

Original Paper
Open access
Published: 09 February 2023

Volume 20, article number 4, (2023)
Cite this article

Download PDF

You have full access to this open access article

Computational Management Science Aims and scope Submit manuscript

Multi-period power utility optimization under stock return predictability

Download PDF

Taras Bodnar ORCID: orcid.org/0000-0001-7855-8221¹,
Dmytro Ivasiuk²,
Nestor Parolya³ &
…
Wolfgang Schmid²

1676 Accesses
1 Citation
Explore all metrics

Abstract

In this paper, we derive an analytical solution to the dynamic optimal portfolio choice problem in the case of an investor equipped with a power utility function of wealth. The results are established by solving the Bellman backward recursion under the assumption that the vector of asset returns follows a vector-autoregressive process with predictable variables. In an empirical study, the performance of the derived solution is compared with the one obtained by applying the numerical method. The comparison is performed in terms of the final wealth and its expected utility. It is documented that the application of the analytical solution to the multi-period portfolio choice problem leads to higher values of both the final wealth and the expected utility.

Dynamic optimal mean-variance portfolio selection with stochastic volatility and stochastic interest rate

Article 01 September 2022

The Exact Solution of Multi-period Portfolio Choice Problem with Exponential Utility

Time-consistent risk-constrained dynamic portfolio optimization with transactional costs and time-dependent returns

Article 01 August 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

With the development of the monetary system and the rise of market trading capacity, portfolio theory has appeared to be an important approach in explaining the behaviour of stock prices and has provided a quantitative tool for strategic investment. Introduced in 1952 by Harry M. Markowitz, the optimal portfolio choice problem has become a popular topic in finance and applied statistics (see, e.g., Bodnar and Okhrin 2013; Bonaccolto et al. 2018; Consigli et al. 2019; Muhinyuza et al. 2020; Bodnar et al. 2021; Mariani et al. 2022). It aims to minimize the variance of the portfolio given some predefined level of the expected portfolio return called the mean-variance paradigm (see, Markowitz 1952; Merton 1972).

The mean-variance approach of Markowitz provides a simple way how to diversify the investment portfolio that brings a trade-off between the expected return and the variance manually settable by an investor. On the other hand, there are some challenges that the Markowitz optimization problem meets. Namely, it can be only presented as an expected utility maximization of the quadratic or exponential utility functions and it may be considered as a second-order approximation for an arbitrary utility. Besides that, it provides a single-period solution, whereas most investment strategies have a multi-period setup with a portfolio rebalancing during the investment horizon (see, Brandt 2009; Meucci and Nicolosi 2016; Zhao et al. 2022).

In this paper, we concentrate on finding the multi-period portfolio strategy for the expected utility optimization problem. The concept of the expected utility of wealth has become and remains an intensively discussed branch of portfolio theory (see Brandt 2009; Pennacchi 2008; Campbell and Viceira 2002; Bodnar et al. 2018). The utility is defined as an increasing concave function, which fulfills four axioms and implies an investor who benefits from the maximization of its expected value (see, Pennacchi 2008; Campbell and Viceira 2002). As such, several approaches can be used to specify the utility function and the investor has to choose one of them. To classify and understand different utility functions, one can employ the concept of “risk premium” which is usually presented by absolute risk aversion (ARA) and relative risk aversion (RRA). For instance, as discussed by J. Pratt (see, Pratt (1964)), the constant absolute risk aversion represents an individual willing to pay some fixed amount of money to avoid risk. On the other side, a willingness to sacrifice a percentage of the investor wealth characterizes the concept of relative risk aversion (Pennacchi 2008; Campbell and Viceira 2002). The exponential utility is a common example of the utility function with constant absolute risk aversion (CARA), while the power utility represents the constant relative risk aversion (CRRA) utility (see, Brandt 2009; Pennacchi 2008). Chronopoulos et al. (2011) developed a utility-based framework for optimal investment and made use of the CRRA utility function to determine the optimal time of investment. Finally, the other popular utility function used in the financial literature is the quadratic utility which is neither CARA nor CRRA (see, Li and Ng 2000).

The application of the power utility in portfolio selection problems has some advantages over other utility functions. For example, it obeys a decreasing ARA, which implies that with increasing wealth the investor would increase the number of dollars in the risky assets held in the portfolio. This is not the case, however, for the quadratic utility, which exhibits an increasing ARA, causing an unrealistic behaviour of investors on the market. Although the exponential utility obeys a constant ARA, which is more plausible than the increasing ARA in the case of the quadratic utility, it still lies behind the power utility because of the experimental and empirical evidences, which mostly justify a decreasing ARA (see, Friend and Blume (1975)). Moreover, the power utility investor would have an optimal strategy which is independent of the initial value of the wealth. This constitutes another very appealing property of the power utility. There are also some criticisms on the power utility, e.g., it seems to have an impact on the equity premium puzzle (see, Mehra and Prescott (1985)) and it is unbounded, while the exponential utility does not possess such issues (see, Alpanda and Woglom (2007) for more details). Even though we have focused solely on the power utility, we believe that a proper combination of both exponential and power utilities can potentially lead to a “better” utility function. Moreover, the methodology developed in the paper can be generalized to a more general class of utility functions, like hyperbolic absolute risk aversion (HARA) utility. This important and challenging problem has been left for future research.

Most of the results derived in the literature about the dynamic optimal portfolio choice problem are obtained in the continuous time scenario by using a Markov decision process with the asset prices that follow a geometric Brownian motion (see Vigna 2009; Çanakoğlu and Özekici 2012; Chronopoulos et al. 2011; Lioui and Poncet 2013; Moreno-Bromberg et al. 2013). Besides that, the quadratic utility function has appeared to be the most popular utility (see, Li and Ng (2000), Bodnar et al. (2015a)), while the exponential utility is another popular choice which is usually accompanied by the assumption that the asset returns are normally distributed (cf., Liu (2004), Soyer and Tanyeri (2006), Bodnar et al. (2015b)). On the other side, Li et al. (2001) argued that solving a dynamic optimal portfolio choice problem in the discrete-time setup is a considerably more difficult task. In the current paper, we present the solution for the power utility in the discrete-time case. Our findings complement the existent analytical solutions for multi-period portfolio choice problems derived when the expected quadratic utility function (cf., Li and Ng (2000), Bodnar et al. (2015a)) or the exponential utility (see, e.g., Bodnar et al. (2015b)) are used. Also, it generalizes the numerical solution which is obtained by applying the Taylor series expansion to the utility or value function (see, Brandt et al. 2005; Garlappi and Skoulakis 2011).

In the derivation of the theoretical results, it is assumed that the asset returns follow a vector auto-regressive model of order one with Gaussian error terms. The latter is a popular approach for modelling the behaviour of stock returns in financial literature (Brandt et al. 2005; Campbell et al. 2003). Applying the Bellman equation we obtain the dynamic solution of the multi-period maximization problem. In an empirical illustration, the new approach is compared to the well-known numerical solution suggested in Brandt et al. (2005), where the weights were obtained by the Taylor series expansion of the value function. The predictable model is fitted to weekly stock returns, while we consider an investor with an investment horizon of 1 month (4 weeks), 2 months, 3 months and 4 months who rebalances the holding portfolio on a weekly basis. The results of the simulation study show that the new solution provides a higher average utility compared to the numerical one. These results hold independently of the value of the risk-aversion coefficient and the investment horizon. Moreover, when the investment horizon increases, our findings become more pronounced. In particular, for the investment horizon of 3 and 4 months, we find that with a probability of about 80% the expected utility of the final wealth is larger than $-0.1$ when the closed-form solution is used, and the probability of such an event is close to zero for the numeric approximation. As such, the dynamic investment strategy based on the closed-form solution outperforms the investment strategy based on the numeric approximation. On the other side, it should be noted that the closed-form investment strategy is obtained under some model assumptions imposed on the data-generating process. The violation of these assumptions may lead to the strategy, which is not optimal any longer. Thus, an investor should first validate the model before applying the derived optimal portfolio strategy in practice. To this end, we also point out that both the closed-form strategy and the one based on the numeric approximation are explicitly derived for the investor who maximizes the power utility function. The solutions of dynamic optimal portfolio choice problems under other utility functions, like the quadratic utility and exponential utility, can be found in Çanakoğlu and Özekici (2010), Li and Ng (2000), Bodnar et al. (2015a), Bodnar et al. (2015b), among others.

The rest of the paper is structured as follows. Section 2 presents a framework and a problem formulation considered in the paper. The analytical solution of the utility maximization problem based on the power utility function is deduced in Theorem 1 of Sect. 3. Section 4 provides an empirical study investigating the practical relevance of the derived results. Section 5 summarises the theoretical and empirical findings obtained in the paper, while the proof of Theorem 1 can be found in the “Appendix” (Sect. 6).

2 Framework

Let ${\varvec{r}}_t$ denotes a k-dimensional random vector of simple returns on risky assets and let $r^{f}_t$ be the non-random simple return on a risk-free asset at time t. We also consider predictable variables ${{\mathbf{z}}}_t$ as a n-dimensional vector and assume that the combined vector $X_t=\left( {\varvec{r}}_t^{\prime },{{\mathbf{z}}}_t^{\prime }\right) ^{\prime }$ follows a vector autoregressive process (VAR) of order one given by

$$\begin{aligned} X_{t}={\tilde{\varphi }}+{\tilde{\Phi }}X_{t-1}+{\tilde{\varepsilon }}_t. \end{aligned}$$

(1)

It is assumed that $\{{\tilde{\varepsilon }}_t\}$ are independent random vectors with ${\tilde{\varepsilon }}_t\sim N({{\mathbf{0}}},{\tilde{{\varvec{\Sigma }}}}_{t})$ where ${\tilde{{\varvec{\Sigma }}}}_t$ defines $(k+n)\times (k+n)$-dimensional positive definite deterministic covariance matrix possibly dependent on t. In practice, this matrix can be fitted by employing the results from Xu and Phillips (2008) and Bodnar and Zabolotskyy (2011), who considered autoregressive models with time-varying unconditional variances. Moreover, Xu and Phillips (2008) pointed out that the time series models with time-depending variances are more flexible and they usually provide a better fit in practice. The predictable variable is often interpreted as the most correlated element with ${\varvec{r}}_t$ (see, Brandt and Santa-Clara 2006; Campbell et al. 2003; Bodnar et al. 2015b). It could be, for example, dividend yield, term spread, or even another asset return. Here we assume that the whole wealth is solely invested in the risky assets ${\varvec{r}}_t$ and in the risk-free asset, but not in the vector of predictable variables ${{\mathbf{z}}}_t$.

To recover the vector of returns one can multiply $X_t$ by the dimension-reduction matrix ${{\mathbf{L}}}=[{{\mathbf{I}}}_k{{\mathbf{O}}}_{k,n}]$ which consists of the k-dimensional identity matrix ${{\mathbf{I}}}_k$ and the $k\times n$-dimensional zero matrix ${{\mathbf{O}}}_{k,n}$. Let $\varphi ={{\mathbf{L}}}{\tilde{\varphi }}$, $\Phi ={{\mathbf{L}}}{\tilde{\Phi }}$ and $\varepsilon _t={{\mathbf{L}}}{\tilde{\varepsilon }}_t$. Then, it holds that

$$\begin{aligned} {\varvec{r}}_t={{\mathbf{L}}}X_{t}={{\mathbf{L}}}{\tilde{\varphi }}+{{\mathbf{L}}}{\tilde{\Phi }}X_{t-1}+{{\mathbf{L}}}{\tilde{\varepsilon }}_t={\varphi }+{\Phi }X_{t-1}+{\varepsilon }_t. \end{aligned}$$

(2)

We denote the conditional mean vector of ${\varvec{r}}_t$ by

$$\begin{aligned} {\varvec{\mu }}_t={{\mathbf{L}}}{\tilde{{\varvec{\mu }}}}_t={{\mathbf{L}}}E_{t-1}\left[ {X}_t\right] =E_{t-1}\left[ {\varvec{r}}_t\right] =\varphi +\Phi X_{t-1} \end{aligned}$$

and its conditional covariance matrix by

$$\begin{aligned} {\varvec{\Sigma }}_{t}=Var_{t-1}\left[ {\varvec{r}}_t\right] ={{\mathbf{L}}}{\tilde{{\varvec{\Sigma }}}}_t{{\mathbf{L}}}^{\prime }\end{aligned}$$

given the information set ${\mathcal{F}}_{t-1}$ available at a time point $t-1$. Hence, we get ${\varvec{r}}_t|{\mathcal{F}}_{t-1}\sim N({\varvec{\mu }}_t,{\varvec{\Sigma }}_t)$. Note that ${\varvec{\Sigma }}_{t}$ is assumed to be deterministic, while ${\varvec{\mu }}_t$ can be random as a measurable function of ${\mathcal{F}}_{t-1}$.

By ${\varvec{\omega }}_{t}$ we denote the vector of portfolio weights at time t. Let $r_{P,t}$ denote the portfolio simple return at time t and let ${\tilde{r}}_{P,t}$ be the portfolio log return at time t, i.e., ${\tilde{r}}_{P,t}=\ln (1+r_{P,t})$. Then the wealth evolution process of the investor is given by

$$\begin{aligned}W_t=W_{t-1}(1+r_{P,t})=W_{t-1}\exp ({\tilde{r}}_{P,t}),\end{aligned}$$

or using the approximation of the portfolio log return by the simple asset returns we get

$$\begin{aligned} W_t=W_{t-1}\exp \left( r^{f}_{t}+{\varvec{\omega }}_{t-1}^{\prime }\left( {\varvec{r}}_t-r^{f}_{t}{\textbf{1}}\right) \right) , \end{aligned}$$

(3)

for $t=1,\dots ,T$. Consequently,

$$\begin{aligned} W_T=W_{0}\exp \left( \sum \limits _{t=1}^T\left( r^{f}_{t}+{\varvec{\omega }}_{t-1}^{\prime }\left( {\varvec{r}}_t-r^{f}_{t}{\textbf{1}}\right) \right) \right) , \end{aligned}$$

where $W_0$ stands for the initial wealth.

Remark 1

It has to be noted that (3) holds only approximately and it is based on the approximation of the portfolio log return by the corresponding linear combination of the asset log returns. This approximation works usually very well in practice and it is a standard approach in financial literature (see, Tsay 2005, p.5).

The classical multi-period portfolio choice problem aims to maximize the expected utility $U\left( \cdot \right)$ of the final wealth $W_T$ received after the last investment period T. Namely,

$$\begin{aligned} V_0=\max _{\{ \varvec{\omega }_s \}_{s=0}^{T-1} } E_0\left[ U\left( W_T\right) \right] . \end{aligned}$$

(4)

Let

$$\begin{aligned} V_t=\max _{\{ \varvec{\omega }_s\}_{s=t}^{T-1} } E_t\left[ U\left( W_T\right) \right] . \end{aligned}$$

(5)

Then it holds

$$\begin{aligned} V_T= & {} U\left( W_T\right) , \end{aligned}$$

(6)

$$\begin{aligned} V_{T-1}= & {} \max _{ \varvec{\omega }_{T-1} } E_{T-1}\left[ U\left( W_T\right) \right] . \end{aligned}$$

(7)

In the literature, $V_t$ is often called the value function.

According to Brandt (2009) and Pennacchi (2008) problem (4) can be solved using the Bellman equation expressed as

$$\begin{aligned} V_{t}=\max _{\{ \varvec{\omega }_s \}_{s=t}^{T-1} } E_{t}\left[ U\left( W_T\right) \right] =\max _{ \varvec{\omega }_{t} } E_{t}\left[ \max _{\{ \varvec{\omega }_{s}\}_{s=t+1}^{T-1} } E_{t+1}\left[ U\left( W_T\right) \right] \right] =\max _{ \varvec{\omega }_{t} } E_{t}\left[ V_{t+1}\right] . \end{aligned}$$

(8)

Equation (8) determines a backward recursive scheme where the optimal weights are calculated by starting from the last investment period and ending up with the first one.

In this paper, we consider the utility of wealth given by a power function, also known as constant relative risk aversion (CRRA) utility, which belongs to the HARA family (see, LiCalzi and Sorato 2006). The power utility is defined by

$$\begin{aligned} U(W_T)=\frac{W^{1-\gamma }_T}{1-\gamma },\quad \gamma >0. \end{aligned}$$

(9)

It should also be noted that in the limiting case, i.e., when $\gamma =1$, then $U(W_T)$ becomes the logarithmic utility function given by

$$\begin{aligned} U(W_T)=\ln W_T. \end{aligned}$$

The CRRA utility implies the behaviour of an investor such that his/her decisions are independent of the initial wealth, which appears to be a very attractive property.

The multi-period optimization problem for the power utility is solved in the continuous-time setting by assuming the geometric Brownian motion for the stock prices and the Markov processes for the market states (see, Çanakoğlu and Özekici 2010, 2012). Moreover, in the case of short time intervals and in the case of the single-period investment the log-normal distribution is used to model the portfolio return (see, Campbell and Viceira 2002; Bodnar et al. 2020). In our framework we consider the discrete-time case assuming that the asset returns are modelled by the vector autoregressive process (1) and provide the analytic solution of the multi-period optimal portfolio choice problem (8).

3 Multi-period optimal portfolio: closed-form solution

Under the power utility function (9) and the assumption of VAR process (1) imposed on the asset returns, the multi-period optimal portfolio selection problem has a solution presented in Theorem 1. Henceforward, we will call it the "closed-form" solution.

Theorem 1

Let the join vector of asset returns and predictable variables follow a VAR(1) process as defined in (1). Then for $\gamma >1$ the optimal portfolio weights, which solve (4), are given by

$$\begin{aligned} {\varvec{\omega }}_{t-3}= & {} \frac{1}{\gamma -1}{{\mathbf{L}}}\Big ({\tilde{{\varvec{\Sigma }}}}^{-1}_{t-2}\left( {\tilde{{\varvec{\mu }}}}_{t-2}-r^{f}_{t-2}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) \nonumber \\{} & {} -{\tilde{\Phi }}^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{t-1}\left( {\tilde{\varphi }}+r^{f}_{t-2}{\tilde{\Phi }}{{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{t-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) \Big ) \end{aligned}$$

(10)

for $t\in 3,\dots ,T$,

$$\begin{aligned} {\varvec{\omega }}_{T-2}= & {} \frac{1}{\gamma -1}{{\mathbf{L}}}\Big ({\tilde{{\varvec{\Sigma }}}}^{-1}_{T-1}\left( {\tilde{{\varvec{\mu }}}}_{T-1}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) \nonumber \\{} & {} -\Phi ^{\prime }{\varvec{\Sigma }}^{-1}_{T}\left( \varphi +r^{f}_{T-1}\Phi {{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{T}{\textbf{1}}\right) \Big ), \end{aligned}$$

(11)

and

$$\begin{aligned} {\varvec{\omega }}_{T-1}=\frac{1}{\gamma -1}{\varvec{\Sigma }}^{-1}_{T}\left( {\varvec{\mu }}_T-r^{f}_{T}{\textbf{1}}\right) . \end{aligned}$$

(12)

The proof of Theorem 1 is given in the “Appendix” (see, Sect. 6). The following two corollaries formulate the results obtained under several important special cases of model (1).

Corollary 1

If no predictable variable is present in (1), the optimal weights in Theorem 1 can be simplified to

$$\begin{aligned} {\varvec{\omega }}_{t-2}=\frac{1}{\gamma -1}\left( {{\varvec{\Sigma }}}^{-1}_{t-1}\left( {{\varvec{\mu }}}_{t-1}-r^{f}_{t-1}{\textbf{1}}\right) -\Phi ^{\prime }{\varvec{\Sigma }}^{-1}_{t}\left( \varphi +r^{f}_{t-1}\Phi {\textbf{1}}-r^{f}_{t}{\textbf{1}}\right) \right) \end{aligned}$$

(13)

for $t\in 2,\dots ,T$, and

$$\begin{aligned} {\varvec{\omega }}_{T-1}=\frac{1}{\gamma -1}{\varvec{\Sigma }}^{-1}_{T}\left( {\varvec{\mu }}_T-r^{f}_{T}{\textbf{1}}\right) . \end{aligned}$$

(14)

Corollary 1 corresponds the statement of Theorem 1 with $n=0$, i.e., when the matrix ${\textbf{L}}$ is replaced by the identity matrix and, thus, ${\tilde{{\varvec{\Sigma }}}}_t={\varvec{\Sigma }}_t,\ {\tilde{\varphi }}=\varphi$, and ${\tilde{\Phi }}=\Phi$.

Corollary 2

Let the vectors of asset returns ${\varvec{r}}_t$ be stochastically independent over time. Then, the optimal weights for the multi-period portfolio choice problem (4) with $\gamma >1$ are given by

$$\begin{aligned} {\varvec{\omega }}_{t-1}=\frac{1}{\gamma -1}{\varvec{\Sigma }}^{-1}_{t}\left( {\varvec{\mu }}_t-r^{f}_{t}{\textbf{1}}\right) \end{aligned}$$

(15)

for $t\in 1,\dots ,T$.

The statement of Corollary 2 follows from Corollary 1 by setting $\Phi ={{\mathbf{0}}}$.

As expected, we receive that all the formulas for the optimal portfolio weights do not depend on the wealth component, implying that the optimal portfolio weights of the investor equipped with the power utility function are independent of the amount of initial wealth (see Campbell and Viceira 2002; Pennacchi 2008). One should also point out that the result obtained in Corollary 2 has a similar form to the optimal portfolio rule for a power utility derived by Marín-Solano and Navas (2010), where the asset prices are assumed to follow a geometric Brownian motion. On the other side, under a weaker assumption of the vector auto-regressive process, an additional term is appeared caused by the model-specific dependence structure.

4 Empirical illustration

In order to present the relevance of the obtained results in practice, we use the VAR(1) model estimated by the weekly returns of five stock market indices, i.e., Belgium, Germany, Japan, the UK and the USA, from January 4, 2002, to December 4, 2009 (see, Bodnar et al. 2015b).

The parameters of the model (1) for the considered data are estimated in (see Bodnar et al. 2015b, p.532) and they are given by

$$\begin{aligned} {\tilde{\varphi }}= & {} \begin{bmatrix} 4.83\times 10^{-4}\\ 1.20\times 10^{-3}\\ 6.74\times 10^{-4}\\ 5.54\times 10^{-4}\\ 2.79\times 10^{-5} \end{bmatrix},\\ {\tilde{\Phi }}= & {} \begin{bmatrix} 0.2011&{}\quad &{}\quad -0.1592&{}\quad &{}\quad 0.01892&{}\quad &{}\quad -0.196&{}\quad &{}\quad 0.455\\ 0.3139&{}\quad &{}\quad -0.1231&{}\quad &{}\quad -0.00191&{}\quad &{}\quad -0.511&{}\quad &{}\quad 0.434\\ 0.0487&{}\quad &{}\quad 0.0888&{}\quad &{}\quad -0.12131&{}\quad &{}\quad -0.224&{}\quad &{}\quad 0.343\\ 0.1829&{}\quad &{}\quad -0.0889&{}\quad &{}\quad 0.00988&{}\quad &{}\quad -0.441&{}\quad &{}\quad 0.382\\ 0.0766&{}\quad &{}\quad -0.0643&{}\quad &{}\quad -0.03049&{}\quad &{}\quad -0.114&{}\quad &{}\quad 0.113 \end{bmatrix},\\ {\tilde{{\varvec{\Sigma }}}}_t= & {} \begin{bmatrix} 13.08519&{}\quad &{}\quad 10.54450&{}\quad &{}\quad 4.36575&{}\quad &{}\quad 9.12037&{}\quad &{}\quad 6.78129\\ 10.54450&{}\quad &{}\quad 13.83354&{}\quad &{}\quad 5.64824&{}\quad &{}\quad 10.21854&{}\quad &{}\quad 8.33231\\ 4.36575&{}\quad &{}\quad 5.64824&{}\quad &{}\quad 7.99434&{}\quad &{}\quad 4.73337&{}\quad &{}\quad 3.66701\\ 9.12037&{}\quad &{}\quad 10.21854&{}\quad &{}\quad 4.73337&{}\quad &{}\quad 10.17679&{}\quad &{}\quad 6.92725\\ 6.78129&{}\quad &{}\quad 8.33231&{}\quad &{}\quad 3.66701&{}\quad &{}\quad 6.92725&{}\quad &{}\quad 7.24223 \end{bmatrix}\times 10^{-4}. \end{aligned}$$

The fitted autoregression matrix ${\tilde{\Phi }}$ displays larger values of the coefficients which correspond to the lagged values of the U.S. market return in comparison to the lagged values of the return on each capital market itself. As such, the return of the U.S. stock market is considered to be a predictable variable in our model and we use the returns of the other four capital markets to construct a dynamic four-dimensional optimal portfolio. Without loss of generality, we set the initial wealth to be equal one, i.e., $W_0=1$, and the weekly return of a hypothetical risk-free asset $r^{f}_{t}$ is assumed to be 0.06%, corresponding to the annual return of around 3%.

Table 1 The sample mean, the sample median, the mean absolute deviation and the median absolute deviation of the power utility of the final wealth for the numerical solution and the closed-form strategy for $\gamma \in \{4,6,9,12\}$ and $T\in \{4,8,12,16\}$

Full size table

The performance of the closed-form solution derived in Theorem 1 is compared to the one obtained by the numerical solution presented in Brandt et al. (2005). The numerical approximation of the weights of the dynamic optimal portfolio is obtained by the 4th order Taylor series expansion of the value function (see, Ivasiuk 2019). The comparison is done by simulating the return path and calculating the weights corresponding to the numerical and closed-form solutions, which then are used in the computation of the final utility. All values reported in the table and figures are based on 10000 independent repetitions. The simulations are performed for several combinations of the relative risk aversion parameter $\gamma \in \{4,6,9,12\}$ and time horizon $T\in \{4,8,12,16\}$. The choice of the risk aversion coefficient is motivated by the findings documented in Barsky et al. (1997), Elminejad et al. (2022), and Pennacchi (2008, p.21). In particular, Barsky et al. (1997) and Pennacchi (2008) pointed out that the median relative risk aversion is approximately equal to 7, while Elminejad et al. (2022) argued that the most common values of the risk aversion coefficient lie between 2 and 7 in the financial applications. Motivated by these results, we set $\gamma =4$ and $\gamma =6$, and also add two further values corresponding to a more risk-averse investor.

The results are depicted in Table 1 and in Figs. 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11 and 12. Table 1 demonstrates the values of the sample mean and sample median of the power utility of the final wealth together with the mean absolute deviation and the median absolute deviation, respectively, for several values of $\gamma$ and T described above. Both the sample median and the sample median absolute deviations are robust estimators for the location and scale parameters of a probability distribution. On the other side, the sample mean and the mean absolute deviation can be affected by extreme observations. As such, to robustify these two measures, their 95% trimmed versions are used.

We observe that all the sample means and the sample medians of the final utility corresponding to the numerical solution are always smaller than those obtained by using the analytical solution derived in Theorem 1. The differences between the values increase as the investment horizon becomes larger. Very pronounced results are documented when the sample median is used as the centrality measure. In this case, for $T=4$ the application of the analytical solution leads to the values of the final utility which are three times as larger as the corresponding values computed by applying the numerical solution, while for $T=16$ they are more than 200 times larger indicating a considerable improvement when the analytical solution is used. The values of the median absolute deviation are also significantly smaller in the case when the closed-form solution is used with the exception for $T\in \{12, 16\}$. For the sample means, the differences are not so large, although even in this case the closed-form solution outperforms the strategy based on the numerical solution. On the other side, the mean standard deviations computed for the closed-form solution are considerably larger than the ones obtained for the numerical solution. Such a result is also observable in Figs. 1, 2, 3, and 4, where the histograms of final wealth are plotted for the considered values of $\gamma$ and T. In all plots, the histograms created for the closed-form strategies are right skewed with the range of final wealth that includes values considerably larger than one, the point where the histograms for the numerical solution are concentrated. As such, large values of the final wealth have a strong impact on the computed mean absolute deviation in the case of the closed-form solution. To this end, we also compute the p-values of the two sample t-test for comparing the means and the medians of the two samples. Interestingly, all the p-values appear to be close to zero leading to the conclusion that the differences in the means and the medians are statistically significant in all of the considered cases. Finally, the increase of the investment horizon leads to opposite effects on the two dynamic trading strategies. While the numerical solution becomes more volatile, the application of the closed-form solution considerably reduces the uncertainty in the stochastic behaviour of the final utility.

The point is further investigated in Figs. 5, 6, 7, 8, 9, 10, 11 and 12, where the empirical cumulative distribution functions (ecdf) of the final wealth obtained by the application of the two considered dynamic trading strategies are depicted in Figs. 5, 6, 7 and 8, while the ecdfs of the corresponding utilities are shown in Figs. 9, 10, 11 and 12. In Figs. 5, 6, 7 and 8, we observe that the final wealth obtained by applying the numerical solution is tightly spread around one, whereas the closed-form portfolio has its wealth mostly larger with some amount of values being smaller. More precisely, the intersection points on the curves of two ecdfs reveal that the probability of receiving larger final wealth is larger when the closed-form solution is used. The impact becomes more pronounced when both T and $\gamma$ increase.

A similar relationship between the performance of the two considered dynamic trading strategies is observed in Figs. 9, 10, 11 and 12 with ecdfs constructed for the final utility. This result holds uniformly over the considered values for the risk aversion coefficient $\gamma$ and the investment horizon T, with higher impact when T and $\gamma$ increase. In contrast to the figures with the final wealth, we observe more volatile behavior for the utility of the final wealth computed by using the numerical solution. Also, the ecdfs of the utility of the final wealth obtained by employing the closed-form solution are concave and show high concentration of probability around the maximum value of the expected utility. Especially, when the investment horizon is large, like $T=12$ and $T=16$, with probability of about $80\%$ the value of the utility is larger than $-0.1$ when the closed-form trading strategy is used. For the trading strategy based on the numerical approximation the probability of such an event is less than $30\%$ when $\gamma <12$.

Next, we investigate the computational time of both the numerical and analytical solutions in Fig. 13. The analytical solution shows a clear advantage over the numerical one even in the case of small time horizons. The logarithmic computational time of the analytical solution always stays negative, which indicates that the computational cost is still low for time horizon $T=60$. This is not the case for the numerical strategy where the logarithmic computational time is always positive. Moreover, it is greater than one and reaches four in the case of $T=60$. This justifies that the numerical method is probably not the best choice for increasing time horizon. Using more summands of the Taylor series expansion in the case of the numerical solution would increase the accuracy, however, the computational time will increase considerably as well.

At last, we compare the proposed analytical solution obtained for the power utility function with the one of Bodnar et al. (2015b) which was derived under the exponential utility expressed as

$$\begin{aligned} U(W_T)=-e^{-\alpha W_T}~~\text {with }\alpha >0. \end{aligned}$$

Although the comparison is not completely fair because both utility functions represent different risk behaviour of the investors, the solutions seem to be quite similar. So, a question arises which of the two multi-period optimal portfolio strategies can generate a higher amount of the final wealth. This question is answered in Fig. 14. To keep the comparison fair we distinguish between low- and high-risk situations. The low-risk scenario will correspond to $\gamma =9$ and $\alpha =2$ for the power and exponential utilities, respectively. Similarly, the high-risk one will arise for $\gamma =4$ and $\alpha =0.8$. In both cases, we consider the investment horizon of $T=52$ and $T=104$. We observe a striking property of the analytic solution derived under the power utility function. Namely, it can generate considerably higher wealth levels in comparison to the solution derived under the exponential utility (stochastic dominance of the first order). The only case, where the situation is a bit unclear is the low-risk situation with $T=52$. However, one can still observe the stochastic dominance of the second order for the power utility solution in this case (the difference in the areas between the curves). To conclude, one can notice a clear dominance of the power utility over the exponential one in terms of final wealth when the time horizon is large enough.

5 Summary

In this paper, we present the solution to the multi-period portfolio selection problem for the investor who makes the investment decision based on maximizing the expected power utility. Using the Bellman back-propagation method, the closed-form expression of the dynamic optimal portfolio weights is derived under the assumption that the vector of asset returns follows a vector-autoregressive process of order one with Gaussian errors. In the derivation of the theoretical results the relationship between the portfolio log returns and the simple asset returns is employed which is widely used in finance and is known to provide a very good approximation in practice (cf., Tsay (2005, p.5)). The solution of the multi-period optimization problem is deduced for the investor whose relative risk aversion $\gamma$ is assumed to be larger than 1 which corresponds to the recent survey study documented that the median relative risk aversion of investor is 7 (see, e.g., Pennacchi (2008)). Moreover, $\gamma \in (0,1)$ indicates an extremely risky behaviour of an investor that is a rare case in practice.

The obtained theoretical findings are implemented in the empirical study, where the performance of the derived closed-form solution is compared to the numerical approximation suggested in Brandt et al. (2005). The results of the simulation study show that the new closed-form solution of the dynamic portfolio choice problem under the power utility considerably outperforms the numerical approach by providing significantly larger values of the final wealth and utility. This observation holds uniformly for all the considered values of the relative risk aversion coefficient and the investment horizons. Moreover, it is shown that the application of the derived closed-form solution leads to larger both the final wealth and its utility. In particular, when the investment horizon is set to be equal to 12 or 16 weeks, then with a probability of about $80\%$ the closed-form strategy leads to the value of the utility which is larger than $-0.1$. In contrast, if the numerical approximation is used, then the probability of receiving the utility larger than $-0.1$ is close to zero. To this end, we note that the performance of the numerical approximation can be improved when more than four summands are used in the Taylor series expansion of the value function. However, this will increase the computational time considerably, which is always dominated by the analytic solution. At last, one can observe that the proposed strategy dominates the solution given by the exponential utility for increasing time horizon.

The analytical results derived in the paper do not include the transaction costs which appear when the holding portfolio is rebalanced. The inclusion of the transaction costs in the derivation is a very challenging task, which is left for future research.

References

Alpanda S, Woglom GR (2007) The case against power utility and a suggested alternative: resurrecting exponential utility. SSRN Working paper
Barsky RB, Juster FT, Kimball MS, Shapiro MD (1997) Preference parameters and behavioral heterogeneity: an experimental approach in the health and retirement study. Q J Econ 112(2):537–579
Article Google Scholar
Bodnar T, Okhrin Y (2013) Boundaries of the risk aversion coefficient: should we invest in the global minimum variance portfolio? Appl Math Comput 219(10):5440–5448
Google Scholar
Bodnar T, Zabolotskyy T (2011) Estimation and inference of the vector autoregressive process under heteroscedasticity. Theory Probab Math Stat 83:27–45
Article Google Scholar
Bodnar T, Parolya N, Schmid W (2015) A closed-form solution of the multi-period portfolio choice problem for a quadratic utility function. Ann Oper Res 229(1):121–158
Article Google Scholar
Bodnar T, Parolya N, Schmid W (2015) On the exact solution of the multi-period portfolio choice problem for an exponential utility under return predictability. Eur J Oper Res 246(2):528–542
Article Google Scholar
Bodnar T, Okhrin Y, Vitlinskyy V, Zabolotskyy T (2018) Determination and estimation of risk aversion coefficients. CMS 15(2):297–317
Article Google Scholar
Bodnar T, Ivasiuk D, Parolya N, Schmid W (2020) Mean-variance efficiency of optimal power and logarithmic utility portfolios. Math Financ Econ 14(4):675–698
Article Google Scholar
Bodnar T, Lindholm M, Thorsén E, Tyrcha J (2021) Quantile-based optimal portfolio selection. CMS 18(3):299–324
Article Google Scholar
Bonaccolto G, Caporin M, Paterlini S (2018) Asset allocation strategies based on penalized quantile regression. CMS 15(1):1–32
Article Google Scholar
Brandt M (2009) Portfolio choice problems. Handbook of financial econometrics 1:269–336
Google Scholar
Brandt MW, Santa-Clara P (2006) Dynamic portfolio selection by augmenting the asset space. J Financ 61(5):2187–2217
Article Google Scholar
Brandt MW, Goyal A, Santa-Clara P, Stroud JR (2005) A simulation approach to dynamic portfolio choice with an application to learning about return predictability. Rev Financ Stud 18(3):831–873
Article Google Scholar
Campbell JY, Viceira LM (2002) Strategic asset allocation: portfolio choice for long-term investors. Clarendon lectures in economic
Campbell JY, Chan YL, Viceira LM (2003) A multivariate model of strategic asset allocation. J Financ Econ 67:41–80
Article Google Scholar
Çanakoğlu E, Özekici S (2010) Portfolio selection in stochastic markets with HARA utility functions. Eur J Oper Res 201(2):520–536
Article Google Scholar
Çanakoğlu E, Özekici S (2012) HARA frontiers of optimal portfolios in stochastic markets. Eur J Oper Res 221(1):129–137
Article Google Scholar
Chronopoulos M, De Reyck B, Siddiqui A (2011) Optimal investment under operational flexibility, risk aversion, and uncertainty. Eur J Oper Res 213(1):221–237
Article Google Scholar
Consigli G, Hitaj A, Mastrogiacomo E (2019) Portfolio choice under cumulative prospect theory: sensitivity analysis and an empirical study. CMS 16(1):129–154
Article Google Scholar
Elminejad A, Havranek T, Irsova Z (2022) Relative risk aversion: a meta-analysis
Friend I, Blume ME (1975) The demand for risky assets. Am Econ Rev 65(5):900–922
Google Scholar
Garlappi L, Skoulakis G (2011) Taylor series approximations to expected utility and optimal portfolio choice. Math Financ Econ 5(2):121–156
Article Google Scholar
Ivasiuk D (2019) An approximate solution for the power utility optimization under predictable returns. arXiv preprint arXiv:1911.06552
Li D, Ng W-L (2000) Optimal dynamic portfolio selection: multiperiod mean-variance formulation. Math Financ 10(3):387–406
Article Google Scholar
Li P, Xia J, Yan J-A et al (2001) Martingale measure method for expected utility maximization in discrete-time incomplete markets. Ann Econ Financ 2:445–465
Google Scholar
LiCalzi M, Sorato A (2006) The pearson system of utility functions. Eur J Oper Res 172(2):560–573
Article Google Scholar
Lioui A, Poncet P (2013) Optimal benchmarking for active portfolio managers. Eur J Oper Res 226(2):268–276
Article Google Scholar
Liu H (2004) Optimal consumption and investment with transaction costs and multiple risky assets. J Financ 59(1):289–338
Article Google Scholar
Mariani F, Polinesi G, Recchioni MC (2022) A tail-revisited Markowitz mean-variance approach and a portfolio network centrality. CMS 19:425–455
Article Google Scholar
Marín-Solano J, Navas J (2010) Consumption and portfolio rules for time-inconsistent investors. Eur J Oper Res 201(3):860–872
Article Google Scholar
Markowitz H (1952) Portfolio selection. J Finance 7(1):77–91
Google Scholar
Mathai A, Provost SB (1992) Quadratic forms in random variables: theory and applications. M. Dekker
Google Scholar
Mehra R, Prescott EC (1985) The equity premium: a puzzle. J Monet Econ 15(2):145–161
Article Google Scholar
Merton RC (1972) An analytic derivation of the efficient portfolio frontier. J Financ Quant Anal 7:1851–1872
Article Google Scholar
Meucci A, Nicolosi M (2016) Dynamic portfolio management with views at multiple horizons. Appl Math Comput 274:495–518
Google Scholar
Moreno-Bromberg S, Pirvu TA, Reveillac A (2013) CRRA utility maximization under dynamic risk constraints. Commun Stochast Anal 7(2):3
Google Scholar
Muhinyuza S, Bodnar T, Lindholm M (2020) A test on the location of the tangency portfolio on the set of feasible portfolios. Appl Math Comput 386:125519
Google Scholar
Pennacchi GG (2008) Theory of asset pricing. Pearson/Addison-Wesley Boston
Google Scholar
Pratt JW (1964) Risk aversion in the small and in the large. Econometrica 32(1/2):122–136
Article Google Scholar
Soyer R, Tanyeri K (2006) Bayesian portfolio selection with multi-variate random variance models. Eur J Oper Res 171(3):977–990
Article Google Scholar
Tsay RS (2005) Analysis of financial time series, vol 543. Wiley
Book Google Scholar
Vigna E (2009) Mean-variance inefficiency of CRRA and CARA utility functions for portfolio selection in defined contribution pension schemes. Collegio Carlo Alberto notebook 108, or CeRP wp 89, p 9
Xu K-L, Phillips PC (2008) Adaptive estimation of autoregressive models with time-varying variances. J Econ 142(1):265–280
Article Google Scholar
Zhao D, Bai L, Fang Y, Wang S (2022) Multi-period portfolio selection with investor views based on scenario tree. Appl Math Comput 418:126813
Google Scholar

Download references

Acknowledgements

The authors would like to thank Professor Stein-Erik Fleten and three anonymous Reviewers for their helpful suggestions.

Funding

Open access funding provided by Stockholm University.

Author information

Authors and Affiliations

Department of Mathematics, Stockholm University, Stockholm, Sweden
Taras Bodnar
Department of Statistics, European University Viadrina, Frankfurt (Oder), Germany
Dmytro Ivasiuk & Wolfgang Schmid
Delft Institute of Applied Mathematics, Delft University of Technology, Delft, The Netherlands
Nestor Parolya

Authors

Taras Bodnar
View author publications
You can also search for this author in PubMed Google Scholar
Dmytro Ivasiuk
View author publications
You can also search for this author in PubMed Google Scholar
Nestor Parolya
View author publications
You can also search for this author in PubMed Google Scholar
Wolfgang Schmid
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Taras Bodnar.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

In this section the proof of Theorem 1 is given.

Proof of Theorem 1

The Bellman equation is solved by deriving the expressions of the value functions starting at time $T-1$, which is expressed as

$$\begin{aligned} \begin{aligned} V_{T-1}&=\max _{{\varvec{\omega }}_{T-1}} E_{T-1}\left[ {U}\left( W_{T}\right) \right] =\max _{{\varvec{\omega }}_{T-1}} E_{T-1}\left[ \frac{W_{T}^{1-\gamma }}{1-\gamma }\right] \\&=\max _{{\varvec{\omega }}_{T-1}} \frac{W_{T-1}^{1-\gamma }}{1-\gamma }\exp ((1-\gamma )r^{f}_{T}) E_{T-1}\left[ \exp \left( (1-\gamma ){\varvec{\omega }}_{T-1}^{\prime }\left( {\varvec{r}}_T-r^{f}_{T}{{\mathbf{1}}}\right) \right) \right] \end{aligned} \end{aligned}$$

Since ${\varvec{r}}_T -r^{f}_{T}{{\mathbf{1}}}|{\mathcal{F}}_{t-1}\sim N({\varvec{\mu }}_T-r^{f}_{T}{{\mathbf{1}}},{\varvec{\Sigma }}_T)$, the conditional expectation is equal to the value of the moment-generating function defined for the multivariate normal distribution at $(1-\gamma ){\varvec{\omega }}_{T-1}$. Thus, for $\gamma > 1$ we get

$$\begin{aligned} \begin{aligned} V_{T-1}&=\max _{{\varvec{\omega }}_{T-1}}W_{T-1}^{1-\gamma }\exp ((1-\gamma )r^{f}_{T}) \\&\quad \times \frac{1}{1-\gamma } \exp \left( (1-\gamma ){\varvec{\omega }}_{T-1}^{\prime }\left( {\varvec{\mu }}_T-r^{f}_{T}{{\mathbf{1}}}\right) + \frac{(1-\gamma )^2}{2}{\varvec{\omega }}_{T-1}^{\prime }{\varvec{\Sigma }}_T{\varvec{\omega }}_{T-1}\right) , \end{aligned} \end{aligned}$$

which is maximized at

$$\begin{aligned} {\varvec{\omega }}_{T-1}=\frac{1}{\gamma -1}{\varvec{\Sigma }}^{-1}_{T}\left( {\varvec{\mu }}_T-r^{f}_{T}{\textbf{1}}\right) =\frac{1}{\gamma -1}{\varvec{\Sigma }}^{-1}_{T}\left( \varphi +\Phi X_{T-1}-r^{f}_{T}{\textbf{1}}\right) . \end{aligned}$$

As a result, the value function at time $T-1$ is given by

$$\begin{aligned} V_{T-1}=c_{T} \frac{W_{T-1}^{1-\gamma }}{1-\gamma } \exp \left( -\frac{1}{2}\left( \varphi +\Phi X_{T-1}-r^{f}_{T}{\textbf{1}}\right) ^{\prime }{\varvec{\Sigma }}^{-1}_T\left( \varphi +\Phi X_{T-1}-r^{f}_{T}{\textbf{1}}\right) \right) , \end{aligned}$$

(16)

where $c_{T}=\exp ((1-\gamma )r^{f}_{T})$.

The application of (16) leads to the value function at time $T-2$ expressed as

$$\begin{aligned}{} & {} V_{T-2}=\max _{{\varvec{\omega }}_{T-2}} E_{T-2}\left[ V_{T-1}\right] \nonumber \\{} & {} \quad =\max _{{\varvec{\omega }}_{T-2}} c_{T} \frac{W_{T-2}^{1-\gamma }}{1-\gamma }\exp ((1-\gamma )r^{f}_{T-1}) E_{T-2}\Bigg [\exp \left( (1-\gamma ){\varvec{\omega }}_{T-2}^{\prime }\left( {\varvec{r}}_{T-1}-r^{f}_{T-1}{{\mathbf{1}}}\right) \right) \nonumber \\{} & {} \qquad \times \exp \left( -\frac{1}{2}\left( \varphi +\Phi X_{T-1}-r^{f}_{T}{\textbf{1}}\right) ^{\prime }{\varvec{\Sigma }}^{-1}_T\left( \varphi +\Phi X_{T-1}-r^{f}_{T}{\textbf{1}}\right) \right) \Bigg ]. \end{aligned}$$

(17)

Let ${\tilde{X}}_t=X_t-r^{f}_{t}{{\mathbf{L}}}^\prime {{\mathbf{1}}}$. Then taking into account that

$$\begin{aligned} {{\mathbf{I}}}_k={{\mathbf{L}}}{{\mathbf{L}}}^{\prime }\text { and }{\varvec{r}}_t-r^{f}_{t}{\textbf{1}}={{\mathbf{L}}}\left( X_t-r^{f}_{t}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) ={{\mathbf{L}}}{\tilde{X}}_t, \end{aligned}$$

the conditional expectation in (17) can be rewritten in the following way

$$\begin{aligned}{} & {} E_{T-2}\Bigg [\exp \left( (1-\gamma ){\varvec{\omega }}_{T-2}^{\prime }{{\mathbf{L}}}{\tilde{X}}_{T-1}\right) \nonumber \\{} & {} \qquad \times \exp \Bigg (-\frac{1}{2}\left( \varphi +\Phi {\tilde{X}}_{T-1}+r^{f}_{T-1}\Phi {{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{T}{\textbf{1}}\right) ^{\prime }\nonumber \\{} & {} \qquad \times {\varvec{\Sigma }}^{-1}_T \left( \varphi +\Phi {\tilde{X}}_{T-1}+r^{f}_{T-1}\Phi {{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{T}{\textbf{1}}\right) \Bigg ) \Bigg ]\nonumber \\{} & {} \quad =E_{T-2}\left[ \exp \left( -{\tilde{X}}_{T-1}^{\prime }A_{T-1}{\tilde{X}}_{T-1}-a_{T-1}({\varvec{\omega }}_{T-2})^{\prime }{\tilde{X}}_{T-1}-d_{T-1}\right) \right] \end{aligned}$$

(18)

with

$$\begin{aligned} \begin{aligned} A_{T-1}&=\frac{1}{2}\Phi ^{\prime }{\varvec{\Sigma }}^{-1}_T\Phi ,\\ a_{T-1}({\varvec{\omega }}_{T-2})^{\prime }&=\left( \varphi +r^{f}_{T-1}\Phi {{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{T}{\textbf{1}}\right) ^{\prime }{\varvec{\Sigma }}^{-1}_T\Phi +(\gamma -1){\varvec{\omega }}_{T-2}^{\prime }{{\mathbf{L}}},\\ d_{T-1}&=\frac{1}{2}\left( \varphi +\Phi r^{f}_{T-1}{\textbf{1}}-r^{f}_{T}{\textbf{1}}\right) ^{\prime }{\varvec{\Sigma }}^{-1}_T\left( \varphi +\Phi r^{f}_{T-1}{\textbf{1}}-r^{f}_{T}{\textbf{1}}\right) . \end{aligned} \end{aligned}$$

Finally, the application of Theorem 3.2a.1 in Mathai and Provost (1992) leads to

$$\begin{aligned} \begin{aligned}&V_{T-2}=\max _{{\varvec{\omega }}_{T-2}} c_{T} \frac{W_{T-2}^{1-\gamma }}{1-\gamma }\exp ((1-\gamma )r^{f}_{T-1})\left| {{\mathbf{I}}}+2A_{T-1}{\tilde{{\varvec{\Sigma }}}}_{T-1}\right| ^{-\frac{1}{2}}\\&\qquad \times \exp \left( -\frac{1}{2}\left( \left( {\tilde{{\varvec{\mu }}}}_{T-1}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) ^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{T-1} \left( {\tilde{{\varvec{\mu }}}}_{T-1}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) +2d_{T-1}\right) \right) \\&\qquad \times \exp \Bigg [\frac{1}{2}\left( {\tilde{{\varvec{\mu }}}}_{T-1}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}-{\tilde{{\varvec{\Sigma }}}}_{T-1}a_{T-1}({\varvec{\omega }}_{T-2})\right) ^{\prime }\\&\qquad \times \left( {\tilde{{\varvec{\Sigma }}}}_{T-1}+2{\tilde{{\varvec{\Sigma }}}}_{T-1}A_{T-1}{\tilde{{\varvec{\Sigma }}}}_{T-1}\right) ^{-1}\left( {\tilde{{\varvec{\mu }}}}_{T-1}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}-{\tilde{{\varvec{\Sigma }}}}_{T-1}a_{T-1}({\varvec{\omega }}_{T-2})\right) \Bigg ], \end{aligned} \end{aligned}$$

which is maximized when

$$\begin{aligned} {\tilde{{\varvec{\mu }}}}_{T-1}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}-{\tilde{{\varvec{\Sigma }}}}_{T-1}a_{T-1}({\varvec{\omega }}_{T-2})={{\mathbf{0}}}. \end{aligned}$$

Substituting $a_{T-1}$ and solving with respect to ${\varvec{\omega }}_{T-2}$ gives us the optimal weights for the value function at time $T-2$ expressed as

$$\begin{aligned} \begin{aligned} {\varvec{\omega }}_{T-2}=&\frac{1}{\gamma -1}{{\mathbf{L}}}\Big ({\tilde{{\varvec{\Sigma }}}}^{-1}_{T-1}\left( {\tilde{{\varvec{\mu }}}}_{T-1}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) -\Phi ^{\prime }{\varvec{\Sigma }}^{-1}_{T}\left( \varphi +r^{f}_{T-1}\Phi {{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{T}{\textbf{1}}\right) \Big ). \end{aligned} \end{aligned}$$

Moreover,

$$\begin{aligned} V_{T-2}= c_{T-1}c_{T} \frac{W_{T-2}^{1-\gamma }}{1-\gamma } \exp \left( -\frac{1}{2}\left( {\tilde{{\varvec{\mu }}}}_{T-1}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) ^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{T-1} \left( {\tilde{{\varvec{\mu }}}}_{T-1}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) \right) , \end{aligned}$$

where

$$\begin{aligned} c_{T-1}=\exp ((1-\gamma )r^{f}_{T-1}-d_{T-1})\left| {{\mathbf{I}}}+2A_{T-1}{\tilde{{\varvec{\Sigma }}}}_{T-1}\right| ^{-\frac{1}{2}}. \end{aligned}$$

The value function at time $T-3$ is given by

$$\begin{aligned}{} & {} V_{T-3}=\max _{{\varvec{\omega }}_{T-3}} E_{T-3}\left[ V_{T-2}\right] \nonumber \\{} & {} \quad =\max _{{\varvec{\omega }}_{T-3}} c_{T-1} c_{T} \frac{W_{T-3}^{1-\gamma }}{1-\gamma }\exp ((1-\gamma )r^{f}_{T-2}) \nonumber \\{} & {} \qquad \times E_{T-3}\Bigg [\exp \left( (1-\gamma ){\varvec{\omega }}_{T-3}^{\prime }\left( {\varvec{r}}_{T-2}-r^{f}_{T-2}{{\mathbf{1}}}\right) \right) \nonumber \\{} & {} \qquad \times \exp \left( -\frac{1}{2}\left( {\tilde{\varphi }}+{\tilde{\Phi }}X_{T-2}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) ^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{T-1} \left( {\tilde{\varphi }}+{\tilde{\Phi }}X_{T-2}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) \right) \Bigg ]\nonumber \\{} & {} \quad = \max _{{\varvec{\omega }}_{T-3}} c_{T-1} c_{T} \frac{W_{T-3}^{1-\gamma }}{1-\gamma }\exp ((1-\gamma )r^{f}_{T-2}) \nonumber \\{} & {} \qquad \times E_{T-3}\left[ \exp \left( -{\tilde{X}}_{T-2}A_{T-2}{\tilde{X}}_{T-2}-a_{T-2}({\varvec{\omega }}_{T-3})^{\prime }{\tilde{X}}_{T-2}-d_{T-2}\right) \right] \end{aligned}$$

(19)

with

$$\begin{aligned} \begin{aligned} A_{T-2}&=\frac{1}{2}{\tilde{\Phi }}^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{T-1}{\tilde{\Phi }},\\ a_{T-2}({\varvec{\omega }}_{T-3})^{\prime }&=\left( {\tilde{\varphi }}+r^{f}_{T-2}{\tilde{\Phi }}{{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) ^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{T-1}{\tilde{\Phi }}+(\gamma -1){\varvec{\omega }}_{T-3}^{\prime }{{\mathbf{L}}},\\ d_{T-2}&=\frac{1}{2}\left( {\tilde{\varphi }}+r^{f}_{T-2}{\tilde{\Phi }}{{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) ^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{T-1} \left( {\tilde{\varphi }}+r^{f}_{T-2}{\tilde{\Phi }}{{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) . \end{aligned} \end{aligned}$$

Using that the conditional expectation in (19) is given by (see, Mathai and Provost 1992, Theorem 3.2a.1)

$$\begin{aligned} \begin{aligned}&\left| {{\mathbf{I}}}+2{\tilde{A}}_{T-2}{\tilde{{\varvec{\Sigma }}}}_{T-2}\right| ^{-\frac{1}{2}}\\&\qquad \times \exp \left( -\frac{1}{2}\left( \left( {\tilde{{\varvec{\mu }}}}_{T-2}-r^{f}_{T-2}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) ^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{T-2} \left( {\tilde{{\varvec{\mu }}}}_{T-2}-r^{f}_{T-2}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) +2d_{T-2}\right) \right) \\&\qquad \times \exp \Bigg [\frac{1}{2}\left( {\tilde{{\varvec{\mu }}}}_{T-2}-r^{f}_{T-2}{{\mathbf{L}}}^{\prime }{\textbf{1}}-{\tilde{{\varvec{\Sigma }}}}_{T-2}a_{T-2}({\varvec{\omega }}_{T-3})\right) ^{\prime }\\&\qquad \times \left( {\tilde{{\varvec{\Sigma }}}}_{T-2}+2{\tilde{{\varvec{\Sigma }}}}_{T-2}A_{T-2}{\tilde{{\varvec{\Sigma }}}}_{T-2}\right) ^{-1}\left( {\tilde{{\varvec{\mu }}}}_{T-2}-r^{f}_{T-2}{{\mathbf{L}}}^{\prime }{\textbf{1}}-{\tilde{{\varvec{\Sigma }}}}_{T-2}a_{T-2}({\varvec{\omega }}_{T-3})\right) \Bigg ], \end{aligned} \end{aligned}$$

the maximum in (19) is attained at

$$\begin{aligned} {\varvec{\omega }}_{T-3}= & {} \frac{1}{\gamma -1}{{\mathbf{L}}}\Bigg ({\tilde{{\varvec{\Sigma }}}}^{-1}_{T-2}\left( {\tilde{{\varvec{\mu }}}}_{T-2}-r^{f}_{T-2}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) \\{} & {} -{\tilde{\Phi }}^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{T-1}\left( {\tilde{\varphi }}+r^{f}_{T-2}{\tilde{\Phi }}{{\mathbf{L}}}^{\prime }{\textbf{1}}-r^{f}_{T-1}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) \Bigg ). \end{aligned}$$

Finally, the value function at time $T-3$ is equal to

$$\begin{aligned} V_{T-3}= & {} c_{T-2}c_{T-1}c_{T}\frac{W_{T-3}^{1-\gamma }}{1-\gamma }\\{} & {} \times \exp \left( -\frac{1}{2}\left( {\tilde{{\varvec{\mu }}}}_{T-2}-r^{f}_{T-2}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) ^{\prime }{\tilde{{\varvec{\Sigma }}}}^{-1}_{T-2} \left( {\tilde{{\varvec{\mu }}}}_{T-2}-r^{f}_{T-2}{{\mathbf{L}}}^{\prime }{\textbf{1}}\right) \right) \end{aligned}$$

with

$$\begin{aligned} c_{T-2}=\exp \left( (1-\gamma )r^{f}_{T-2}-d_{T-2}\right) \left| {{\mathbf{I}}}+2A_{T-2}{\tilde{{\varvec{\Sigma }}}}_{T-2}\right| ^{-\frac{1}{2}}. \end{aligned}$$

Since the value function at time $T-3$ has the same structure as the value function at time $T-2$, the rest of the steps in the Bellman backward recursions are performed in the same way leading to the expressions of the optimal weights presented in the statement of the theorem. $\square$

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Bodnar, T., Ivasiuk, D., Parolya, N. et al. Multi-period power utility optimization under stock return predictability. Comput Manag Sci 20, 4 (2023). https://doi.org/10.1007/s10287-023-00434-6

Download citation

Received: 02 May 2022
Accepted: 28 December 2022
Published: 09 February 2023
DOI: https://doi.org/10.1007/s10287-023-00434-6

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Multi-period power utility optimization under stock return predictability

Abstract

Similar content being viewed by others

Dynamic optimal mean-variance portfolio selection with stochastic volatility and stochastic interest rate

The Exact Solution of Multi-period Portfolio Choice Problem with Exponential Utility

Time-consistent risk-constrained dynamic portfolio optimization with transactional costs and time-dependent returns

1 Introduction