Constructing Optimal Portfolio Rebalancing Strategies with a Two-Stage Multiresolution-Grid Model

Dai, Tian-Shyr; Chen, Bo-Jen; Sun, You-Jia; Yang, Dong-Yuh; Wu, Mu-En

doi:10.1007/s10614-024-10555-y

Constructing Optimal Portfolio Rebalancing Strategies with a Two-Stage Multiresolution-Grid Model

Open access
Published: 16 February 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Computational Economics Aims and scope Submit manuscript

Constructing Optimal Portfolio Rebalancing Strategies with a Two-Stage Multiresolution-Grid Model

Download PDF

1072 Accesses
Explore all metrics

Abstract

Sophisticated predetermined ratios are used to allocate portfolio asset weights to strike a good trade-off between profitability and risk in trading. Rebalancing these weights due to market fluctuations without incurring excessive transaction costs and tracking errors is a vital financial engineering problem. Rebalancing strategies can be modeled by discretely enumerating portfolio weights to form a grid space and then optimized via the Bellman equation. Discretization errors are reduced by increasing the grid resolution at the cost of increased computational time. To minimize errors with constrained computational resources (e.g., grid nodes), we vary the grid resolution according to the probability distribution of asset weights. Specifically, a grid space is first divided into several areas, and each area’s probability is estimated. Then, the discretization error’s upper bound is minimized by inserting an adequate number of grid nodes determined by Lagrange multipliers in a non-uniform fashion. In experiments, the proposed multiresolution rebalancing outperforms traditional uniform-resolution rebalancing and popular benchmark strategies such as the periodic, tolerance-band, and buy-and-hold strategies.

Optimal portfolio selection with volatility information for a high frequency rebalancing algorithm

Article Open access 25 March 2024

Heuristics for Portfolio Selection

Portfolio Optimization Via Online Gradient Descent and Risk Control

Article 30 June 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

One important topic in financial engineering and risk management is striking a balance between investment returns and risk by properly allocating portfolio weights. Estimating the target investment ratio (i.e., the optimal proportion of each asset’s value in a portfolio that maximizes the holder’s utility) has been widely studied using mean–variance analysis (Markowitz, 1952) and its extensions (e.g., Black and Litterman, 1990, De Prado, 2016), among other methods. Useful concepts introduced in the literature have been widely adopted. For example, Amenc et al. (2012) compare the performance of portfolio weights located on the efficient frontier by the global minimum variance and/or the maximum Sharpe ratio. They show that the former and the latter approaches perform better in bear and bull markets, respectively. Bodnar et al. (2020) analyze optimal portfolios under power and logarithmic utilities given that the gross returns of portfolios are approximately log-normally distributed. Ramírez-Hassan and Guerra-Urzola (2020) use a Bayesian estimator to reduce the estimation error of the minimum-variance portfolio. The asset allocation concept has also been adopted or extended in machine learning studies such as Yeh et al. (2014), Raffinot (2017), Almahdi and Yang (2017), Jain and Jain (2019), Filos (2019), Guan and Liu (2022), and Cong et al. (2021). Although the initial portfolio can be set to the target ratio, the value weights of the portfolio’s assets change continuously due to asset price evolution. Thus, portfolio weights should be frequently rebalanced to prevent significant divergence from the target ratio. However, continuous portfolio adjustments to coincide with the target ratio incur significant transaction costs (see Bregu, 2020). In contrast, overly infrequent adjustments cause portfolio weights to significantly deviate from the target ratio, which incurs high tracking errors. In wealth management, an optimal rebalancing strategy is needed which strikes a balance between transaction costs and tracking errors.

Popular static rebalancing strategies found in financial textbooks include buy-and-hold, periodic, and tolerance band rebalancing (Qian, 2020). In the first strategy, the portfolio is never adjusted regardless of market price fluctuations. This can incur significant losses due to black swan events like the financial tsunami of 2008. Periodic rebalancing, the second strategy, adjusts the portfolio weight to the target ratio at a predetermined frequency. It keeps the portfolio unadjusted at other time points regardless of the market status (stable or volatile). Tolerance band rebalancing, the third strategy, adjusts the portfolio weight when the weight of any of the assets deviates from the target weight by more than a predetermined tolerance level.

The literature has explored sophisticated ways to rebalance the portfolio dynamically without incurring substantial transaction costs. Instead of explicitly modeling the utility function over wealth, Leland (1999) evaluates the cost of tracking errors by measuring the divergence from the current portfolio ratio to the desired target portfolio ratio in terms of the difference of the mean–variance utility, as stated in footnote 7 of his paper, after which he estimates the total cost as the discounted expectation of tracking errors plus transaction costs. To minimize the total cost, a portfolio weight is adjusted only to the so-called “no-trade region” boundary once it falls out of this region. Partial differential equations are derived to solve this free boundary problem but the number of boundary conditions grows exponentially with the number of invested assets. Instead of solving the intractable free boundary problem, Muthuraman and Kumar (2006) solve the value function that represents the cost of a non-optimal no-trade region and then update the region with the value function until the region converges to the optimal one. Muthuraman and Zha (2008) improve the computational speed by evaluating the value function using simulation. Donohue and Yip (2003) numerically verify the superiority of Leland ’s no-trade-region model over the aforementioned static rebalancing methods. They show that the region’s shape and size are strongly affected by correlations among assets, transaction cost magnitudes, and the risk preferences of investors. Dichtl et al. (2014) and Marti et al. (2021) study whether dynamic rebalancing strategies outperform traditional rebalancing strategies. Dynamic rebalancing has also been used to explain how the capital gain tax (Gallmeyer et al., 2006) and asset liquidity (Kinlaw et al., 2013) influence investor behavior.

Much of the literature, such as Petronio et al. (2014) and Yun et al. (2021), models asset return or price processes as a Markov process. The optimal rebalancing problem can thus be modeled as a Markov decision process, as proposed in Sun et al. (2006). All possible portfolio weights are discretely enumerated as a multi-dimensional, uniformly distributed grid. Changes in portfolio weights due to changes in asset values or portfolio rebalancing are represented by movement from one grid node to another. To determine an optimal rebalancing strategy that minimizes the costs of transactions and tracking errors,^{Footnote 1} they solve the Bellman equation to determine the corresponding optimal rebalancing action for each grid node. Their method is widely referenced in recent studies of dynamic portfolio management, such as Tahar et al. (2007), Brito (2008), Branger et al. (2010), Israelov and Katz (2011), Holden and Holden (2013), and Carroll et al. (2017). In addition, Kritzman and Myrgren (2009) and Brown and Smith (2011) address and alleviate the curse of dimensionality problem due to increments of assets. This article suggests that the traditional uniformly distributed grid can be improved by allocating the grid points in a non-uniform fashion according to their importance. Specifically, the discrete enumeration of possible portfolio weights incurs discretization error that depends on the distance between adjacent grid nodes and the probability distribution. Given the non-uniform probability distribution of portfolio weights due to the return distribution of assets^{Footnote 2} and the portfolio rebalancing strategies, this article allocates grid nodes in a non-uniform fashion according to the occurrence probabilities of portfolio weights. The upper bound of the discretization errors is estimated and minimized by allocating grid nodes optimally determined by the Lagrange multiplier. Thus the proposed method is called a multiresolution grid (MRG) due to the above optimal adjustments to the grid resolution. Similar ideas are found in the derivative pricing literature: the grid resolution is adjusted according to the derivative’s payoff function to reduce pricing errors (see Figlewski and Gao, 1999, Dai et al., 2005, Dai and Lyuu, 2007).

The proposed two-stage MRG algorithm is described as follows. The first stage divides the space of portfolio weights into several areas and estimates the probability of the portfolio weight moving from the target ratio to each area via Monte Carlo simulation. The second stage allocates a fair number of grids to each area to minimize the upper bound of the discretization error of the total cost. Note that any arbitrary portfolio weight is categorized into nearby grid nodes. Thus the distance between two adjacent grid nodes reflects the upper-bound discretization error due to the Lipschitz continuity property of the cost function, as proved in Appendix B. Allocating a higher number of grid nodes into an area implies a higher resolution and a lower discretization error in that area. Note that as the computational time is proportional to the total number of grid nodes, it is infeasible to allocate infinite nodes to each area to reduce the error in an unlimited manner. To balance efficiency and accuracy, the lump sums of the upper bounds of discretization errors contributed by all areas are minimized under the constraint of a fixed number of total grid nodes (or computational resources) by substituting the probabilities of the aforementioned areas into the Lagrange multiplier. The optimal allocation number for each area is obtained by solving the Lagrange multiplier. Then, the multiresolution grid model is constricted by uniformly distributing the predetermined allocation number of nodes to each area. Finally, modified policy iteration (see Puterman and Shin (1978)) is applied to solve the cost and the optimal rebalancing action for each grid node. Experimental results show that the proposed MRG significantly improves on the traditional uniformly-distributed grid approach. MRG also outperforms other popular rebalancing strategies found in financial textbooks by backtesting with real transaction data from 2000 to 2020.

The remainder of this article proceeds as follows. Section 2 introduces the required background knowledge to construct a portfolio rebalancing strategy. Section 3 discusses the construction of MRG. Section 4 compares various rebalancing methods with simulated and real transaction data. Section 5 concludes the article.

2 Preliminaries

This section will introduce the required economic and financial background knowledge and algorithms for solving the Bellman equation. The mean–variance analysis and the construction of target ratios will be discussed in Sect. 2.1. Section 2.2 will introduce how tracking errors can be converted into risk-adjusted costs via the concepts of utility and certainty equivalence proposed in Bernoulli (1954). The total rebalancing cost comprises the risk-adjusted costs and the transaction costs. Section 2.3 will model the relations between future expected discount rebalancing costs and rebalancing strategies via the Bellman equation that will be solved by the modified policy iteration in Sect. 2.4.

2.1 Target Ratio Construction

Assume an investment portfolio comprises n strategic assets,^{Footnote 3} including stock and bond indexes, as detailed in Appendix A. The returns of these strategic assets are assumed to follow a multivariate normal distribution. The expected return vector $\mathbf {\mu }$ and the covariance matrix $\varvec{\Lambda }$ can be obtained by calibrating the historical daily closing prices of these indexes, as mentioned in financial textbooks such as Luenberger (2013). Here we follow Sun et al. (2006) by determining target ratios as the portfolio weight vector w that maximizes the investor’s mean–variance utility function^{Footnote 4} defined as

$$\begin{aligned} \mathbf {w^*}=\mathop {\arg \max }_{{\textbf{w}}}\left[ \mathbf {w^T\mu }-\frac{\alpha }{2}\mathbf {w^T}\varvec{\Lambda } \textbf{w}\right] , \end{aligned}$$

(1)

where portfolio weight vectors ${\textbf{w}}$ and $\mathbf {w^*}$ are n-dimensional vectors^{Footnote 5} that represent the value proportion of each asset’s value in a portfolio (the sum of the weights vector equals 1) and $\alpha $ denotes a constant risk aversion coefficient.^{Footnote 6} Here the utility is calculated as the expected return $\mathbf {w^T}\mu $ minus $\alpha /2$ multiplied by the covariance of the portfolio return $\mathbf {w^T}\Lambda {\textbf{w}}$.

2.2 Costs of Tracking Errors and Transactions

The total rebalancing costs can be decomposed into two parts: transaction costs, which reflect the cost to adjust the portfolio weights, and tracking errors, which reflect the loss of an investor due to non-optimal asset allocation. The transaction cost TC(${\textbf{w}}, \mathbf {w'}$) is defined as

$$\begin{aligned} \texttt {TC}({\textbf{w}}, \mathbf {w'}) = \left| {\textbf{w}} - \mathbf {w'}\right| \times {\textbf{c}}, \end{aligned}$$

(2)

where the portfolio weight vectors ${\textbf{w}}$ and $\mathbf {w'}$ denote the weights of assets before and after the portfolio rebalancing, respectively. Each element of vector ${\textbf{c}}$ denotes the transaction fee for the corresponding strategic asset.

The tracking error can be converted into risk-adjusted return by the certainty equivalent approach of Bernoulli (1954) illustrated in Fig. 1. The x- and y-axes denote the expected value and standard derivation of the portfolio return, respectively. The utility for each point located at the solid (or dashed) indifference curve is the same. The optimal portfolio with weight $\mathbf {w^*}$ and a non-optimal one with weight ${\textbf{w}}$ are denoted by the gray and black circles, respectively. The utility for investing the portfolio with the weight $\mathbf {w^{*}(w)}$ equals the utility for investing a riskless asset with return $r^{*}(r)$. The utility for investing the portfolio with the optimal weight allocation $\mathbf {w^*}$ is $\mathbf {w^{*T}\mu }-\frac{\alpha }{2}\mathbf {w^{*T}}\varvec{\Lambda } \textbf{w}^*$; a non-optimal allocation with weight ${\textbf{w}}$ reduces the utility to $\mathbf {w^T\mu }-\frac{\alpha }{2}\mathbf {w^T}\varvec{\Lambda } \textbf{w}$. To express the loss of utility in terms of cost without uncertainty, we calculate the riskless returns $r^*$ and r that have the same utilities for investing with allocation weights $\mathbf {w^*}$ and ${\textbf{w}}$, respectively, as

$$\begin{aligned} r^* \equiv \mathbf {w^{*T}\mu }-\frac{\alpha }{2}\mathbf {w^{*T}}\varvec{\Lambda } \textbf{w}^*,\ r \equiv \mathbf {w^{T}\mu }-\frac{\alpha }{2}\mathbf {w^{T}}\varvec{\Lambda } \textbf{w}. \end{aligned}$$

Following Roll (1992), the tracking error can be expressed as

$$\begin{aligned} r^{*}-r. \end{aligned}$$

(3)

2.3 Bellman Equation for Minimizing Costs

The Bellman equation decomposes the “value” of a decision problem at a state as the sum of the current payoff determined by the portfolio weight state and the decided rebalancing actions, plus the expected discounted values contributed by the following state transitions and corresponding rebalancing actions. In this rebalancing problem, the portfolio weight space ${\mathbb {W}}$ is composed of discretely enumerated portfolio weights represented by the grid, as in Sun et al. (2006). Each rebalancing strategy $\pi $ from the strategy space ${\mathbb {S}}$ is defined as a function that maps any portfolio weight ${\textbf{w}} \in {\mathbb {W}}$ to a portfolio weight ${\textbf{w}}'\in {\mathbb {W}}$ indicating that $\pi $ adjusts the portfolio weight from ${\textbf{w}}$ to ${\textbf{w}}'$. Specifically, $\pi ({\textbf{w}})={\textbf{w}}'$. Optimization of the rebalancing strategy problem can then be accomplished by minimizing the lump sum of the expected present values of rebalancing costs for each state ${\textbf{w}}\in {\mathbb {W}}$ using dynamic programming, as articulated by the Bellman equation:

$$\begin{aligned} J({\textbf{w}}) \equiv {\min _{\pi \in {\mathbb {S}}}\left( \text {Cost}({\textbf{w}},\pi ({\textbf{w}}))+ \gamma \sum _{\mathbf {w'}\in {\mathbb {W}}} {\mathbb {P}}(\pi ({\textbf{w}}),\mathbf {w'})J(\mathbf {w'})\right) }. \end{aligned}$$

(4)

Here the value function $J({\textbf{w}})$ represents the minimized expected present value of the rebalancing cost given the current portfolio weight ${\textbf{w}}$. Terms inside the minimum operators are composed of the current rebalancing cost and the lump sum of future expected discounted rebalancing costs. $\text {Cost}({\textbf{w}},\pi ({\textbf{w}}))$ represents the rebalancing cost incurred when adjusting the portfolio weight vector ${\textbf{w}}$ to the portfolio weight $\pi ({\textbf{w}})$. This cost comprises the transaction cost and tracking error evaluated by Eqs. (2) and (3), respectively. $\gamma $ is the discount factor.^{Footnote 7} After adjusting the portfolio weight to $\pi ({\textbf{w}})$, the portfolio weight changes to $\mathbf {w'}$ with transition probability ${\mathbb {P}}(\pi {({\textbf{w}})},\mathbf {w'})$ at the next time step to reflect changes in asset prices due to market fluctuation. Transition probabilities are evaluated using Monte Carlo simulation by assuming that the returns of assets follow the multivariate normal distribution with mean $\mu $ and covariance matrix $\Lambda $.

A one-time step portfolio weight evolution and an example of rebalancing are illustrated in Fig. 2. $\mathbf {w_0}$, $\mathbf {w_1}$, $\mathbf {w_2}$, $\mathbf {w^*}$, $\mathbf {w_3}$, $\mathbf {w_4}$, and $\mathbf {w_5}$ denote the discretized portfolio weights, where $\mathbf {w^*}$ denotes the optimal portfolio weight defined in Eq. (1). $t_i$ and $t'_i$ denote the time immediately before and after portfolio weight rebalances, for $i=0$ and 1. For every ${\textbf{w}}\in {\textbf{W}}$, the best rebalancing strategy $\pi ^*$ selects a best adjusted portfolio weight $\pi ^*({\textbf{w}})$ to minimize the value $J({\textbf{w}})$ in Eq. (4). For example, assume the initial portfolio weight is $\mathbf {w_1}$ at time $t_0$. Instead of directly rebalancing the portfolio back to the optimal $\mathbf {w^*}$, we adopt the best rebalancing strategy $\mathbf {\pi ^*}({\textbf{w}}_1)=\mathbf {w_2}$ (denoted in bold blue) to rebalance the portfolio weight to $\mathbf {w_2}$. The goal of this strategy is to minimize $J(\mathbf {w_1})$ defined in Eq. (4), ensuring a balance between transaction costs and tracking errors while considering future expenses. Portfolio weight changes stemming from asset price shifts due to market fluctuations are reflected by transition branches between time $t'_0$ and $t_1$. For example, the thin blue branches that emit from the portfolio weight $\mathbf {w_2}$ at time $t'_0$ reflect the weight’s move to $\mathbf {w_0}, \mathbf {w_1}, \ldots $ at time $t_1$ with transitional probabilities ${\mathbb {P}}(\mathbf {w_2},\mathbf {w_0}), {\mathbb {P}}(\mathbf {w_2},\mathbf {w_1}), \ldots $. Unlike Leland (1999), who analytically solves the boundaries of the no-trade region where the best rebalancing strategy is not to adjust, we instead evaluate the best rebalancing strategy for each portfolio weight to identify this region to avoid solving the intractable free boundary problem. For example, we solve the best rebalancing strategy $\mathbf {\pi ^*}$ for every portfolio weight via Eq. (4) and find that the best rebalancing action for the three following portfolio weights is not to adjust: $\mathbf {\pi ^*}(\mathbf {w_2})=\mathbf {w_2}$, $\mathbf {\pi ^*}(\mathbf {w^*})=\mathbf {w^*}$, and $\mathbf {\pi ^*}(\mathbf {w_3})=\mathbf {w_3}$; the corresponding no-trade region is composed of {$\mathbf {w_2},\mathbf {w^*}, \mathbf {w_3}$} and is marked in red in Fig. 2.

2.4 Modified Policy Iteration

The above rebalancing problem finds an optimal rebalance portfolio weight $\mathbf {a^*}({\textbf{w}})$ for each state ${\textbf{w}}$ and can be interpreted as a Markov decision problem. We solve this problem by evaluating the value function J by the modified policy iteration algorithm proposed by Puterman and Shin (1978), as illustrated in Algorithm 1, since it is more efficient than value iteration or policy iteration. This method repeatedly uses partial policy evaluation and policy improvement to approximate the value function J and the rebalancing strategy with $J_n$ and $\pi _n$,^{Footnote 8} respectively. We first initialize the value function $J_0$ as 0 and the initial strategy $\pi _{0}$ to adjust all portfolio weights to the target ratio. The first step calculates the n-th round estimation of the value function, denoted as $\tilde{J_{n}}$, by repeatedly applying rebalancing strategy $\pi _n$ (line 6). Next, in the policy improvement step, we use $\tilde{J_{n}}$ to estimate the $(n+1)$-th round rebalancing strategy $\pi _{n+1}$ and value function $J_{n+1}$, respectively (lines 10 and 11). This iterative procedure stops when the value function or the rebalancing strategy converges (line 14). Here the tolerance level $\epsilon $ is set to 0.001, and the infinity norm $\left| {\mathbb {J}}\right| _{\infty } \equiv \max (J(\mathbf {w_1}),J(\mathbf {w_2}),\ldots ,J(\mathbf {w_{\left| {\mathbb {W}}\right| })})$, where $\left| {\mathbb {W}}\right| $ denotes the number of states in state space ${\mathbb {W}}$.

3 Construction of Two-Stage Multiresolution Grid Algorithm

To find the optimal portfolio rebalancing strategy that minimizes the overall costs of transaction and tracking errors, a Markov decision process is used to model portfolio weights with discrete states; changes in portfolio weights due to market price oscillations or rebalancing are represented by transitions between states. The optimal rebalancing strategy for each state is evaluated by solving the resulting Bellman equation. Although dividing the space of portfolio weights into finer partitions increases the accuracy of rebalancing solutions, it also increases the number of states and hence the running time of the rebalancing algorithm. It is thus critical to decrease the aforementioned discretization errors in a computationally tractable manner.

Instead of using ordinary dynamic programming with the uniform resolution mechanism (ODP), this article improves the rebalancing decision problem by a novel two-stage multiresolution grid algorithm (MRG) that varies state allocation resolution according to area importance. Specifically, portfolio weights move around the target ratio with high probability and are frequently rebalanced back to approximately this ratio, making the probability of staying near the ratio higher than the probability of straying far from the ratio. To decrease the expected discretization error due to categorizing portfolio weights into nearby states, we put more states in high-probability areas (i.e., high resolution) and fewer states in low-probability areas (low resolution). The first stage of MRG divides the space of portfolio weights into areas and determines the probability of each area and the upper bounds of the discretization errors as in Sect. 3.1. Area resolutions are determined by the Lagrange multiplier to minimize the upper bound of discretization errors in Sect. 3.2.

3.1 Area Probabilities and Discretization Errors

In the first stage, the portfolio weight space is divided into several areas and the probability of staying in each area is calculated as in Fig. 3. Note that allocations or rebalancing of $\Theta +1$ assets can be modeled by a $\Theta $-dimensional space of portfolio weights. For example, $\Theta =2$ in Fig. 3. The x- and y-coordinates denote the weights of the first and second asset, respectively. As the sum of the weights of all assets is 1, the weight of the third asset is one minus the weights of the first two assets.

The space of portfolio weights is divided into several even square areas with side length $\Delta $. The target ratio E denoted by the red dot is centered in one of the square areas.

The center of each square area, say A, B, C, D, E, F, G, H, or I, denotes the state, or the representative portfolio weight of that area. Instead of measuring an infinite number of transitions between all possible portfolio weights (due to market price changes and portfolio rebalancing), the proposed algorithm considers the transitions between states to prevent computational intractability. Discretization errors, therefore, occur since all portfolio weights in the square area are represented by the corresponding state.

To estimate the importance of each area, the transition probability from the target ratio E to each area i, $\texttt {P}(i)$, is estimated via Monte Carlo simulation. Specifically, the mean vector and the covariance matrix of the asset returns are calibrated by historical data as in Luenberger (2013); simulated asset returns are then generated to estimate the changes in portfolio weights and hence transition probabilities between areas (or between corresponding states).

We illustrate this using a real number example denoted by the x- and y-axes of Fig. 3. The portfolio weight for state E is (20%,20%,60%), where the weight of the third asset is determined by $60\%=100\%-20\%-20\%$. Assume the one-time step returns for these three assets are $5\%$, $0\%$, and $-5\%$. Then the portfolio return is $20\%\times 5\%+20\%\times 0\%+60\%\times (-5\%)=-2\%$, and the portfolio weight is changed to

$$\begin{aligned}\left( \frac{20\%\times (1+5\%)}{1-2\%},\frac{20\%\times (1+0\%)}{1-2\%},\frac{60\%\times (1-5\%)}{1-2\%}\right) \approx (21.4\%,20.4\%,58.2\%).\end{aligned}$$

This weight is represented by state F with weight (22%, 20%, 58%).

The estimation of the upper bound of the discretization error and the impact of changing the resolutions of an area are represented in Fig. 4. The side length is divided into n pieces (3 in this example) and the area is evenly cut into $n^\Theta $ subareas ($3^2$ in this example) denoted by red dashed lines. Note that any portfolio weight belonging to a subarea is adjusted to the center of the subarea, say the blue/green point. Since cost function J satisfies the Lipschitz continuity condition, as proved in Appendix B, the upper-bound discretization error of the cost changes due to portfolio weight adjustment is proportional to the Euclidean distance between the corner and the center of the subarea:

$$\begin{aligned} \frac{\sqrt{\Theta }\times \Delta }{2\times n}. \end{aligned}$$

(5)

Note that the runtime for each iteration in Algorithm 1 depends on the number of states $\left| {\mathbb {W}}\right| $. Given limited computational resources (i.e., a fixed number of states), the performance of these algorithms can be further improved by allocating states in a non-uniform fashion according to area importance. $\left| {\mathbb {W}}\right| $ states (such as the red, blue, and green nodes in Figs. 3 and 4 ) are allocated so as to minimize the overall expected upper-bound discretization error of the portfolio weight adjustment cost Expected_Error as

$$\begin{aligned} \min _{n_i,\forall i\in {\mathbb {A}}} \texttt {Expected\_Error} \nonumber \\ s.t. \ \ \left| {\mathbb {W}}\right| - \sum _{i\in {\mathbb {A}}}n_i^{\Theta }=0, \end{aligned}$$

(6)

where ${\mathbb {A}}$ denotes the set of all areas. As the side length of area i is cut into $n_i$ pieces, $n_i^{\Theta }$ states are allocated to area i. Expected_Error is the lump sum of the upper-bound discretization error due to portfolio weight adjustment for each area i and is proportional to the sum of Eq. (5) multiplied by the probability to reach the area:

$$\begin{aligned} \texttt {Expected\_Error} \propto \sum _{i\in {\mathbb {A}}}\frac{\sqrt{\Theta }*\Delta }{2n_i}\times \texttt {P}(i). \end{aligned}$$

(7)

3.2 Lagrange Multiplier Method

The resolution for (or the number of states allocated to) each area is optimally solved as depicted in Fig. 5 to minimize the Expected_Error under the constraint defined in Eq. (6) via Lagrange multipliers. Since the $\frac{\sqrt{\Theta }*\Delta }{2}$ term in Eq. (7) is a constant, it suffices to solve

$$\begin{aligned}{} & {} \min _{n_i, \forall i\in {\mathbb {A}}} \sum _{i\in {\mathbb {A}}}\frac{1}{n_i}\times \texttt {P}(i) \nonumber \\{} & {} s.t. \ \ \left| {\mathbb {W}}\right| - \sum _{i\in {\mathbb {A}}}n_i^{\Theta }=0. \end{aligned}$$

(8)

Without loss of generality, the space of portfolio weights is assumed to be divided into nine areas, A, B, $\ldots $, I as in Fig. 3 to simplify the subsequent derivation. The optimization problem defined in Eq. (6) can be simplified by the property in Eq. (8) as

$$\begin{aligned} \min _{\left[ n_A,\ldots ,n_I \right] } \{P(A)\frac{1}{n_A}+P(B)\frac{1}{n_B}+\cdots +P(I)\frac{1}{n_I}\}, \nonumber \\ s.t. \ \ n_A^{\Theta }+n_B^{\Theta }+\cdots +{n_I^{\Theta }}=|{\mathbb {W}} |. \end{aligned}$$

(9)

The Lagrange multiplier function is derived as

$$\begin{aligned} L(n_A,\ldots ,n_I,\lambda ) = P(A)\frac{1}{n_A}+P(B)\frac{1}{n_B}+\cdots +P(I)\frac{1}{n_I}+ \lambda (n_A^{\Theta }+n_B^{\Theta }+\cdots +{n_I^{\Theta }}-|{\mathbb {W}} |). \end{aligned}$$

(10)

Differentiating Eq. (10) with respect to $n_i$ for every $i\in \{A,B, \ldots ,I \}$ and equating the results to zero yields

$$\begin{aligned} \frac{\partial }{\partial n_i}L(n_A,\ldots ,n_I,\lambda ) = -P(i)\frac{1}{n_i^2}+ \lambda \theta n_i^{\Theta -1}=0. \end{aligned}$$

(11)

$n_i$ is then solved to be

$$\begin{aligned} n_i = \left[ \frac{P(i)}{\lambda \Theta } \right] ^\frac{1}{\Theta +1}. \end{aligned}$$

(12)

Differentiating Eq. (10) with respect to $\lambda $ and setting the result to zero yields

$$\begin{aligned} \frac{\partial }{\partial \lambda }L(n_A,\ldots ,n_I,\lambda ) = n_A^{\Theta }+n_B^{\Theta }+\cdots +{n_I^{\Theta }} - |{\mathbb {W}} |=0. \end{aligned}$$

(13)

For every $i\in \{A,B, \ldots ,I\}$, substituting $\left[ \frac{P(i)}{\lambda \Theta } \right] ^\frac{1}{\Theta +1}$ in Eq. (12) for $n_i$ in Eq. (13) yields

$$\begin{aligned} \lambda = \left( \frac{\left[ P(A)^\frac{\Theta }{\Theta +1} + P(B)^\frac{\Theta }{\Theta +1} +\cdots + P(I)^\frac{\Theta }{\Theta +1} \right] }{|{\mathbb {W}} |\times \Theta ^{\frac{\Theta }{\Theta +1}}}\right) ^{\frac{\Theta +1}{\Theta }}. \end{aligned}$$

(14)

Finally, $n_i$ is solved by substituting $\lambda $ obtained in Eq. (14) into Eq. (12). Since a non-integral $n_i$ does not fit the integer requirement as the number of vertical (or horizontal) red dotted lines illustrated in Fig. 4, in subsequent numerical experiments, the side length of area i is instead divided into $\lceil n_i \rceil $ pieces. This increases the number of states involved in MRG; in later experiments, $|{\mathbb {W}} |$ and $\#(\textrm{states})$ are used to denote the Lagrange multiplier constraint and the real number of states used in MRG, respectively. Varying $\lceil n_i \rceil $ reflects the importance of area i and the changing resolution as depicted in Fig. 5.

4 Experimental Results

To evaluate the superiority of MRG, Sect. 4.1 compares the relationships among the running time, allocation of computational resources ($|{\mathbb {W}} |$ and $\#(\textrm{states})$), the expected upper bound of the discretization error Expected_Error, and the lump sum of the future expected discounted costs of transaction and tracking errors^{Footnote 9} (denoted as Total Cost in the following experiments) for ODP and MRG. The investment performance for MRG and other related rebalancing methods discussed in investment textbooks such as Qian (2020) are compared in Sect. 4.2.

4.1 Comparison Between ODP and MRG

The investment portfolios in the following experiments are composed of strategic assets including stock and bond indexes described in Appendix A. The expected returns and covariance matrix of strategic assets are calibrated with historical trading records during the period from 1990 to 1999. The investment period is from 2000 to 2019 unless stated otherwise. The target ratios are determined by the mean–variance analysis stated in Eq. (1). To ensure “inclusive finance,” a goal widely mentioned among financial technology trends, our industrial cooperation partner (a commercial bank in Taiwan) plans to design a strategic asset allocation for individuals with limited assets. In practice, an initial set-up cost is required for investing in a strategic asset; transaction costs increase with the number of strategic assets in which one’s wealth is invested. Thus, it is inefficient to allocate excessively small amounts of wealth to specific assets. To reduce this cost, strategic assets with too-small weights, say $5\%$, are deleted as required by the cooperating bank. The remaining assets are substituted into Eq. (1) to find the target ratio that meets the $5\%$ constraint. The target portfolio is composed of assets SHCOMP, SENSEX, MXLA, and SPX with ratios of $28\%$, $22\%$, $25\%$, and $25\%$, respectively.

The computational time complexities for both ODP and MRG with different $\Delta $ are analyzed first as follows. Although it is difficult to theoretically analyze the time complexity for the modified policy iteration in Algorithm 1 due to the unknown number of iterations, the growth rate of the computation time T can be empirically determined in terms of $\#(\textrm{states})$, that is, the number of states. Specifically, let $T\in O\left( \#(\textrm{states})^k\right) $ for a positive constant k, which yields $T=c\times \left( \#(\textrm{states})\right) ^k$ for a constant c. Taking the logarithm on both sides of the above equation yields $\log T = \log c+k\log \left( \#(\textrm{states})\right) $.

Slope k can be interpreted as the growth rate of the running time with respect to $\#(\textrm{states})$. Term $\log c$ can be considered a measure for all factors except for $\#(\textrm{states})$ that influence computation time T. Figure 6 illustrates this linear growth relationship for MRG with different $\Delta $ and for ODP. The growth rate k and $\log c$ are estimated by applying OLS regression as shown in Table 1. Regardless of the $\Delta $ setting and the rebalancing method, growth rates are approximately 2.2, which implies that both ODP and MRG run in quadratic time. The $\log c$ of $\texttt {ODP}$ and $\texttt {MRG}$ are approximately $-11$. This suggests that factors other than $\#(\textrm{states})$, such as the number of iterations for executing modified policy iteration in Algorithm 1, have a minor impact on the execution time T.

Table 1 Linear regressions for logarithmic (#states) and runtime in Fig. 6

Full size table

The relation among $\Delta $, $|{\mathbb {W}} |$,^{Footnote 10} and Expected_Error are analyzed in Table 2. Clearly, increments in $|{\mathbb {W}} |$ increase the resolution of the portfolio weight space and hence reduce Expected_Error. The superiority of MRG is verified by its lower Expected_Error than that of ODP regardless of the changes in $\Delta $. The $O\left( \frac{1}{\root \Theta \of {\#(\textrm{states})}}\right) $ convergence rate of Expected_Error is confirmed by the adjusted $R^2\approx 1$ for linear regressions between Expected_Error and $\frac{1}{\root \Theta \of {\#(\textrm{states})}}$, as illustrated in Table 3.

Since the convergence rate of Expected_Error (i.e., the coefficient of $\frac{1}{\root \Theta \of {\#(\textrm{states})}}$) is the highest when $\Delta =2.5\%$, MRG in our latter experiments is based on $\Delta =2.5\%$.

Table 2 Relation between $|{\mathbb {W}} |$ and Expected_Error

Full size table

Table 3 Relation between $(\#\textrm{states})$ and Expected_Error

Full size table

The rebalancing performance of ODP and MRG is compared in Tables 4 and 5, respectively. $\#(\textrm{rebalances})$ denotes the number of rebalances during the trading period. TC and TE denote the transaction cost and tracking error defined in Eqs. (2) and (3), respectively. Increments in $\#(\textrm{rebalances})$ increase the chances to adjust portfolio weights to close target ratios; thus, tracking errors are reduced at the expense of increasing transaction costs. A good rebalancing algorithm minimizes the total cost—the lump sum of TC and TE—by optimizing its rebalancing action as in Eq. (4) with constrained computational resources (i.e., the number of grid points or resolution).

Table 4 Impact of changing resolution $\Delta $ on ODP Performance

Full size table

Table 5 Impact of changing resolution on MRG performance with $\Delta =0.025$

Full size table

As ODP allocates one state for each area, its resolution or $\#(\textrm{states})$ increases with decrements in the side length $\Delta $. However, as MRG allocates extra states to subareas as in Fig. 5, the resolution or $\#(\textrm{states})$ of MRG increases with increments in the Lagrange multiplier constraint $|{\mathbb {W}} |$ defined in Eq. (6). Increments in resolution decrease the discretization errors of the portfolio weights as in Fig. 4 and hence increase the accuracy of rebalancing policies; this reduces transaction costs, tracking errors, and thus total costs at the cost of increasing computational times. However, MRG could achieve better accuracy with fewer computational resources. For example, the scenario $(|{\mathbb {W}} |=500)$ in Table 5 uses 780 states—a 42-second runtime—to achieve a total cost of 0.012962, which is lower than that for the $\Delta $=0.015 scenario, 0.01318, in Table 4, which requires more computational resources: 1461 states or 206 s. In addition, increasing $|{\mathbb {W}} |$ to 1500 reduces the total cost to 0.012154, which is lower than the total cost for the finest $\Delta $=0.01 scenario (i.e., 0.01245); the former scenario, however, requires far fewer resources (1624 states and 221 s) than the latter (4897 states and 2820 s).

Since the upper bound of the discretization errors in Table 2 and the rebalancing performance in Table 4 and 5 all suggest that MRG outperforms ODP, subsequent experiments focus on comparisons among MRG and other rebalancing methods.

4.2 Grand Investment Performance Comparison

This section comprehensively compares the performance of MRG with popular traditional rebalancing methods such as the buy-and-hold strategy, periodic rebalancing, and threshold rebalancing under different $\alpha $ risk aversion coefficients and investment periods. Table 6 illustrates the target ratios generated by Eq. (1) and the corresponding mean (denoted by Mean) as well as the standard derivation (Std.) of the portfolio return under different $\alpha $ settings. Increments in $\alpha $ decrease the investment risk (proxied by Std.) at the expense of profitability (reflected by decreasing Mean).

Table 6 Target ratio given risk aversion

Full size table

Tables 7, 8, 9, 10 and 11 illustrate the impact of $\alpha $ on the performance of investment strategies. Twenty-year strategic asset price evolutions are simulated one million times via Monte Carlo simulation. All investment indicators in these tables are the averages when applying a rebalancing strategy on these one million simulations. Increments in $\alpha $ increase the likelihood of adjusting the portfolio weight and hence $\#(\textrm{rebalances})$ except for the buy-and-hold strategy, which does not adjust the portfolio. Increasing $\#(\textrm{rebalances})$ decreases tracking errors and hence the total costs at the expense of profitability as reflected by decreasing average returns. However, it also increases the Sharpe ratio^{Footnote 11} which measures the excess return for bearing a unit of risk.

In addition to MRG, Tables 7, 8, 9, 10 and 11 compare the investment performance of popular rebalancing strategies found in financial textbooks such as Qian (2020). Periodic (i) denotes periodic rebalancing, which adjusts the portfolio weight back to the target ratio every i months. Thus $\#(\textrm{rebalances})$ increases with increments in the rebalancing frequency (or decrements in i). Tolerance ($j\%$) denotes tolerance band rebalancing, which adjusts the portfolio weights back to the target ratio once one of the asset weights diverges from the target ratio by $j\%$. An increment of $j\%$ decreases the rebalancing likelihood and hence $\#(\textrm{rebalances})$. As a buy-and-hold strategy never adjusts the portfolio weight, $\#(\textrm{rebalances})$ is zero. Regardless of the changes to $\alpha $, the total costs of MRG are almost always smaller than those for other methods, suggesting that MRG strikes a better balance between transaction costs and tracking errors than other rebalancing methods. MRG also exhibits higher expected portfolio returns and Sharpe ratios than periodic and tolerance band rebalancing. Although the buy-and-hold strategy has the highest average return, the total cost and Sharpe ratio are the poorest, showing that it bears a far higher risk than the other methods.

Table 7 Comparison of rebalancing methods with $\alpha =2$

Full size table

Table 8 Comparison of Rebalancing Methods with $\alpha =3$

Full size table

Table 9 Comparison of Rebalancing Methods with $\alpha =4$

Full size table

Table 10 Comparison of Rebalancing Methods with $\alpha =5$

Full size table

Table 11 Comparison of Rebalancing Methods with $\alpha =6$

Full size table

Table 12 compares the Sharpe ratios of various rebalancing strategies for investing from 2000 to 2020. Sharpe ratios are listed for each two-year period. The MRG ratios are generally higher than those for other methods. In addition, the Sharpe ratios for all rebalancing strategies are generally similar in the short run. However, in the long term, MRG significantly outperforms the other strategies. This confirms the robustness of MRG to resist financial crises such as those that occurred during 2000–2020: the dot-com bubble, the financial tsunami of 2008, and the European debt crisis.

Table 12 Sharpe Ratios of Rebalancing Methods for Investing from 2000 to Year Listed in Second Row with $\alpha =4$

Full size table

5 Conclusion

This article improves the traditional uniform-resolution-based rebalancing strategies proposed by the multiresolution scheme illustrated in Fig. 5. To minimize the upper bound of the discretization error under constrained computational resources (i.e., $|{\mathbb {W}} |$ in Eq. (8)), grid nodes are allocated in a non-uniform fashion according to area importance in the portfolio weight space. Each area’s importance is estimated by the probability of reaching the area, and optimal node allocation is determined by a Lagrange multiplier. Experiments show that MRG outperforms ODP and popular rebalancing strategies such as the periodic, tolerance band, and buy-and-hold rebalancing strategies in that it efficiently strikes a balance between transaction costs and tracking errors to achieve higher Sharpe ratios.

Data Availibility

The market index historical data can be purchased from WRDS (https://wrds-www.wharton.upenn.edu/).

Notes

Sun et al. (2006) measure tracking error by how the portfolio weight diverges from the target ratio in terms of certainty equivalents (see Simon (1956)).
For example, the primitive mean–variance analysis model assumes that the returns of all assets follow a multivariate normal distribution.
Strategic asset allocation as introduced by Brennan et al. (1997) denotes the systematic distribution of a portfolio across various asset classes, such as stocks, bonds, real estate, and cash, diversified across different sectors or regions. As implied by its name, strategic asset allocation focuses on broad asset classes, typically adhering to a consistent portfolio structure over an extended investment timeframe, making it less susceptible to the curse of dimensionality.
This utility function is also used in trading analysis papers such as Almgren and Chriss (2001), Almgren (2003), Alfonsi et al. (2012), and Obizhaeva and Wang (2013).
To prevent tiny allocation to assets from significantly increasing management burdens, the weight for each asset is either at least $5\%$ or $0\%$ as required by the cooperating bank.
Empirical economists calibrate the risk aversion coefficient $\alpha $ to align with typical prevalent human risk preferences. We follow Li et al. (2020) to set $\alpha $ in our following experiments in Sect. 4.
This article sets $\gamma $ to $e^{-0.02/12}$, where the numerator denotes the U.S. 10-year T-note rate which was approximately $2\%$ in recent years. The denominator denotes the length of a time step set to one month (1/12 year).
Subscript n denotes the n-th round estimation result.
This is calculated by Eq. (4) by setting the initial weight to the optimal weight $\mathbf {w^*}$ defined in Eq. (1).
Note that $|{\mathbb {W}} |$ in Eq. (8) denotes the theoretical constraint of the state number. The number of divisions of each area’s side length is adjusted from a non-integer $n_i$ to an integer $\lceil n_i \rceil $; the real number of states allocated in MRG thus increases from $|{\mathbb {W}} |$ to $\#(\textrm{states})$.
Defined as the portfolio return minus the risk-free rate and then divided by the standard derivation of the return.

References

Alfonsi, A., Schied, A., & Slynko, A. (2012). Order book resilience, price manipulation, and the positive portfolio problem. SIAM Journal on Financial Mathematics, 3(1), 511–533. https://doi.org/10.1137/110822098
Article MathSciNet Google Scholar
Almahdi, S., & Yang, S. Y. (2017). An adaptive portfolio trading system: A risk-return portfolio optimization using recurrent reinforcement learning with expected maximum drawdown. Expert Systems with Applications, 87, 267–279. https://doi.org/10.1016/j.eswa.2017.06.023
Article Google Scholar
Almgren, R., & Chriss, N. (2001). Optimal execution of portfolio transactions. Journal of Risk, 3, 5–40. https://doi.org/10.21314/JOR.2001.041
Almgren, R. F. (2003). Optimal execution with nonlinear impact functions and trading-enhanced risk. Applied Mathematical Finance, 10(1), 1–18. https://doi.org/10.1080/135048602100056
Article Google Scholar
Amenc, N., Goltz, F., Lodh, A., et al. (2012). Diversifying the diversifiers and tracking the tracking error: Outperforming cap-weighted indices with limited risk of underperformance. The Journal of Portfolio Management, 38(3), 72–88. https://doi.org/10.3905/jpm.2012.38.3.072
Article Google Scholar
Bernoulli, D. (1954). Exposition of a new theory on the measurement of risk. Econometrica, 22(1), 23. https://doi.org/10.2307/1909829
Article MathSciNet Google Scholar
Black, F., & Litterman, R. (1990). Asset allocation: Combining investor views with market equilibrium. Goldman Sachs Fixed Income Research, 115(1), 7–18. https://doi.org/10.3905/jfi.1991.408013
Article Google Scholar
Bodnar, T., Ivasiuk, D., Parolya, N., et al. (2020). Mean-variance efficiency of optimal power and logarithmic utility portfolios. Mathematics and Financial Economics, 14, 675–698. https://doi.org/10.1007/s11579-020-00270-1
Article MathSciNet Google Scholar
Branger, N., Breuer, B., & Schlag, C. (2010). Discrete-time implementation of continuous-time portfolio strategies. The European Journal of Finance, 16(2), 137–152. https://doi.org/10.1080/13518470903075854
Article Google Scholar
Bregu, K. (2020). Overconfidence and (Over)trading: The effect of feedback on trading behavior. Journal of Behavioral and Experimental Economics, 88(101), 598. https://doi.org/10.1016/j.socec.2020.101598
Article Google Scholar
Brennan, M. J., Schwartz, E. S., & Lagnado, R. (1997). Strategic asset allocation. Journal of Economic Dynamics and Control, 21(8), 1377–1403. https://doi.org/10.1016/S0165-1889(97)00031-6
Article MathSciNet Google Scholar
Brito, P. (2008). Introduction to Dynamic Programming Applied to Economics. Universidade Técnica de Lisboa
Brown, D. B., & Smith, J. E. (2011). Dynamic portfolio optimization with transaction costs: heuristics and dual bounds. Management Science, 57(10), 1752–1770. https://doi.org/10.1287/mnsc.1110.1377
Article Google Scholar
Carroll, R., Conlon, T., Cotter, J., et al. (2017). Asset allocation with correlation: A composite trade-off. European Journal of Operational Research, 262(3), 1164–1180. https://doi.org/10.1016/j.ejor.2017.04.015
Article Google Scholar
Cong, L.W., Tang, K., Wang, J., et al. (2021). AlphaPortfolio: Direct construction through deep reinforcement learning and interpretable AI. https://doi.org/10.2139/ssrn.3554486
Dai, T. S., & Lyuu, Y. D. (2007). An exact subexponential-time lattice algorithm for Asian options. Acta Informatica, 44(1), 23–39. https://doi.org/10.1007/s00236-006-0033-9
Article MathSciNet Google Scholar
Dai, T. S., Huang, G. S., & Lyuu, Y. D. (2005). An efficient convergent lattice algorithm for European Asian options. Applied Mathematics and Computation, 169(2), 1458–1471. https://doi.org/10.1016/j.amc.2004.10.085
Article MathSciNet Google Scholar
De Prado, M. L. (2016). Building diversified portfolios that outperform out-of-sample. The Journal of Portfolio Management, 42(4), 59–69. https://doi.org/10.3905/jpm.2016.42.4.059
Article Google Scholar
Dichtl, H., Drobetz, W., & Wambach, M. (2014). Where is the value added of rebalancing? A systematic comparison of alternative rebalancing strategies. Financial Markets and Portfolio Management, 28(3), 209–231. https://doi.org/10.1007/s11408-014-0231-3
Article Google Scholar
Donohue, C., & Yip, K. (2003). Optimal portfolio rebalancing with transaction costs. The Journal of Portfolio Management, 29(4), 49–63. https://doi.org/10.3905/jpm.2003.319894
Article Google Scholar
Figlewski, S., & Gao, B. (1999). The adaptive mesh model: A new approach to efficient option pricing. Journal of Financial Economics, 53(3), 313–351. https://doi.org/10.1016/S0304-405X(99)00024-0
Article Google Scholar
Filos, A. (2019). Reinforcement Learning for Portfolio Management. arXiv preprint arXiv:1909.09571
Gallmeyer, M. F., Kaniel, R., & Tompaidis, S. (2006). Tax management strategies with multiple risky assets. Journal of Financial Economics, 80(2), 243–291. https://doi.org/10.1016/j.jfineco.2004.08.010
Article Google Scholar
Guan, M., & Liu, X.Y. (2022). Explainable deep reinforcement learning for portfolio management: an empirical approach. In: Proceedings of the Second ACM International Conference on AI in Finance. Association for Computing Machinery, New York, NY, USA, ICAIF ’21, pp 1–9, https://doi.org/10.1145/3490354.3494415
Holden, H., & Holden, L. (2013). Optimal rebalancing of portfolios with transaction costs. Stochastics, 85(3), 371–394. https://doi.org/10.1080/17442508.2011.651219
Article MathSciNet Google Scholar
Israelov, R., & Katz, M. (2011). To trade or not to trade? Informed trading with short-term signals for long-term investors. Financial Analysts Journal, 67(5), 23–36. https://doi.org/10.2469/faj.v67.n5.3
Article Google Scholar
Jain, P., & Jain, S. (2019). Can machine learning-based portfolios outperform traditional risk-based portfolios? The need to account for covariance misspecification. Risks, 7(3), 74. https://doi.org/10.3390/risks7030074
Article Google Scholar
Kinlaw, W., Kritzman, M., & Turkington, D. (2013). Liquidity and portfolio choice: A unified approach. The Journal of Portfolio Management, 39(2), 19–27. https://doi.org/10.3905/jpm.2013.39.2.019
Article Google Scholar
Kritzman, M., & Myrgren, S. (2009). Optimal rebalancing: A scalable solution. Journal of Investment Management, 7(1), 9–19.
Google Scholar
Leland, H.E. (1999). Optimal portfolio management with transactions costs and capital gains taxes. SSRN Scholarly Paper ID 206871, Social Science Research Network. https://doi.org/10.2139/ssrn.206871
Li, S., Liu, S., Zhou, Y., et al. (2020). Optimal portfolio selection of mean-variance utility with stochastic interest rate. Journal of Function Spaces, 2020, 1–10. https://doi.org/10.1155/2020/3153297
Article MathSciNet Google Scholar
Luenberger, D. G. (2013). Investment science (2nd ed.). Oxford University Press.
Markowitz, H. (1952). Portfolio selection. The Journal of Finance, 7(1), 77–91. https://doi.org/10.2307/2975974
Article Google Scholar
Marti, G., Nielsen, F., Bińkowski, M., et al. (2021). A Review of Two Decades of Correlations, Hierarchies, Networks and Clustering in Financial Markets. In: Nielsen F (ed) Progress in information geometry: Theory and applications. Signals and Communication Technology, Springer International Publishing pp. 245–274, https://doi.org/10.1007/978-3-030-65459-7_10
Muthuraman, K., & Kumar, S. (2006). Multidimensional portfolio optimization with proportional transaction costs. Mathematical Finance, 16(2), 301–335. https://doi.org/10.1111/j.1467-9965.2006.00273.x
Article MathSciNet Google Scholar
Muthuraman, K., & Zha, H. (2008). Simulation-based portfolio optimization for large portfolios with transaction costs. Mathematical Finance, 18(1), 115–134. https://doi.org/10.1111/j.1467-9965.2007.00324.x
Article MathSciNet Google Scholar
Obizhaeva, A. A., & Wang, J. (2013). Optimal trading strategy and supply/demand dynamics. Journal of Financial Markets, 16(1), 1–32. https://doi.org/10.1016/j.finmar.2012.09.001
Article Google Scholar
Petronio, F., Tamborini, L., Lando, T., et al. (2014). Portfolio selection in the BRICs stocks markets using Markov processes. International Journal of Mathematical Models and Methods in Applied Sciences, 8, 311–318.
Google Scholar
Puterman, M. L., & Shin, M. C. (1978). Modified policy iteration algorithms for discounted Markov decision problems. Management Science, 24(11), 1127–1137. https://doi.org/10.1287/mnsc.24.11.1127
Article MathSciNet Google Scholar
Qian, E.E. (2020). Portfolio rebalancing. Chapman and Hall, https://www.routledge.com/Portfolio-Rebalancing/Qian/p/book/9780367732837
Raffinot, T. (2017). Hierarchical clustering-based asset allocation. The Journal of Portfolio Management, 44(2), 89–99. https://doi.org/10.3905/jpm.2018.44.2.089
Article Google Scholar
Ramírez-Hassan, A., & Guerra-Urzola, R. (2020). Optimal portfolio choice: A minimum expected loss approach. Mathematics and Financial Economics, 14(1), 97–120. https://doi.org/10.1007/s11579-019-00246-w
Article MathSciNet Google Scholar
Roll, R. (1992). A mean/variance analysis of tracking error. Journal of Portfolio Management, 18(4), 13–22. https://doi.org/10.3905/jpm.1992.701922
Article Google Scholar
Simon, H. A. (1956). Dynamic programming under uncertainty with a quadratic criterion function. Econometrica, 24(1), 74–81. https://doi.org/10.2307/1905261
Article MathSciNet Google Scholar
Sun, W., Fan, A., Chen, L. W., et al. (2006). Optimal rebalancing for institutional portfolios. The Journal of Portfolio Management, 32(2), 33–43. https://doi.org/10.3905/jpm.2006.611801
Article Google Scholar
Tahar, I. B., Soner, H. M., & Touzi, N. (2007). The dynamic programming equation for the problem of optimal investment under capital gains taxes. SIAM Journal on Control and Optimization, 46(5), 1779–1801. https://doi.org/10.1137/050646044
Article MathSciNet Google Scholar
Yeh, J.J., Kuo, T.T., Chen, W., et al. (2014). Minimizing expected loss for risk-avoiding reinforcement learning. In: 2014 International conference on data science and advanced analytics (DSAA) pp 11–17. https://doi.org/10.1109/dsaa.2014.7058045
Yun, K. K., Yoon, S. W., & Won, D. (2021). Prediction of stock price direction using a hybrid GA-XGBoost algorithm with a three-stage feature engineering process. Expert Systems with Applications, 186(115), 716. https://doi.org/10.1016/j.eswa.2021.115716
Article Google Scholar

Download references

Acknowledgements

We thank Hsiang-En Fu, Ya-Hsuan Yu, Yu-Ting Fu, Tzu-Chien, Hsien for their assistance in this work.

Funding

Open Access funding enabled and organized by National Yang Ming Chiao Tung University. Financial support was received from the project of the Ministry of Science and Technology (in Taiwan) MOST-109-2410-H-009-009-MY3.

Author information

Authors and Affiliations

Department of Information Management and Finance, National Yang Ming Chiao Tung University, Hsinchu, Taiwan
Tian-Shyr Dai, Bo-Jen Chen & You-Jia Sun
Institute of Information and Decision Sciences, National Taipei University of Business, Taipei, Taiwan
Dong-Yuh Yang
Department of Information and Finance Management, National Taipei University of Technology, Taipei, Taiwan
Mu-En Wu
Risk and Insurance Research Center, National Chengchi University, Taipei, Taiwan
Tian-Shyr Dai

Authors

Tian-Shyr Dai
View author publications
You can also search for this author in PubMed Google Scholar
Bo-Jen Chen
View author publications
You can also search for this author in PubMed Google Scholar
You-Jia Sun
View author publications
You can also search for this author in PubMed Google Scholar
Dong-Yuh Yang
View author publications
You can also search for this author in PubMed Google Scholar
Mu-En Wu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the study conception and design.

Corresponding author

Correspondence to Tian-Shyr Dai.

Ethics declarations

Conflicts of interest

The authors have no competing interests to declare that are relevant to the content of this article.

Ethical Approval

All authors consent to the ethical responsibilities.

Consent to Participate

All authors consent to participate.

Consent for Publication

All authors consent to publication.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix A: Lists of Strategic Assets

The fifteen strategic assets used to form the investment portfolio include 11 stock indexes and 4 bond indexes. The stock indexes include SPX (Standard & Poor’s 500 index), SXXP (Europe 600 stock index), NKY (Nikkei 225 index), SHCOMP (Shanghai Composite Index), HSCEI (Hang Seng China Enterprises Index), TWSE (Taiwan Stock Exchange), KOSPI (Korea Stock Exchange Index), SENSEX (S &P India stock index), MXSO (MSCI All Country ASEAN Index), MXMU (MSCI Emerging Markets Europe Index), and MXLA (MSCI Emerging Markets Latin America Index). The bond indexes include G0Q0 (US Treasury Index), G0BC (BofA Merrill Lynch Global Broad Market Index), H0A0 (US High Yield Index), and IP00 (BBB & Lower Sovereign External Debt Index). These data can be purchased from WRDS (https://wrds-www.wharton.upenn.edu/). We used daily close prices of the aforementioned assets to calibrate model parameters $\mu $ and $\Lambda $ for backtesting.

Appendix B: Lipschitz Continuity

To prove the cost function J defined in Eq. (4) meets the Lipschitz continuity condition, we find the Lipschitz constant L that satisfies the inequality $\left| J({\textbf{w}})-J(\mathbf {w_0})\right| \le L \Vert \mathbf {w-w_0}\Vert _{2} $. Recall that J is defined as the current costs of transactions and the tracking error plus the future expected discounted costs, as defined in Eq. (4). It suffices to prove that the current cost function satisfies the Lipschitz continuity condition. By substituting the costs defined in Sect. 2.2, we have

$$\begin{aligned} \left| \text {Cost}({\textbf{w}})-\text {Cost}(\mathbf {w_0})\right|&= \left| {\textbf{c}}\times \Vert \mathbf {w-w_0}\Vert _{1} + \mathbf {\mu ^{T}(w-w_0)}-\frac{\alpha }{2}(\mathbf {w^{T}}\varvec{\Lambda } \textbf{w}-\mathbf {w^{T}_0}\varvec{\Lambda } \mathbf {w_0})\right| \\&\le \left| {\textbf{c}}\times \Vert \mathbf {w-w_0}\Vert _{1}\right| + \left| \mathbf {\mu ^{T}(w-w_0)}\right| + \left| \frac{\alpha }{2}(\mathbf {w^{T}}\varvec{\Lambda } \textbf{w}-\mathbf {w^{T}_0}\varvec{\Lambda } \mathbf {w_0})\right| . \end{aligned}$$

(B1)

We use the Cauchy–Schwarz inequality to derive the 1- and 2-norm inequality $\Vert {\textbf{a}}- {\textbf{b}}\Vert _{1} \le \sqrt{\Theta +1}\times \Vert {\textbf{a}}- {\textbf{b}}\Vert _{2}$. Substituting this inequality into the first term of Eq. (B1) yields

$$\begin{aligned} \left| {\textbf{c}}\times \Vert {\textbf{w}}- \mathbf {w_0}\Vert _{1}\right|&\le \sqrt{\Theta +1}\times {\textbf{c}}\times \Vert {\textbf{w}}- \mathbf {w_0}\Vert _{2}. \end{aligned}$$

(B2)

To derive the last two parts of Eq. (B1), we first let $f({\textbf{x}})=\mathbf {\mu ^{T}{\textbf{x}}}$ and derive the following inequality:

$$\begin{aligned} f({\textbf{y}})-f({\textbf{x}})&=\langle \nabla f(\mathbf {(1-c)x+cy}), (\mathbf {y-x})\rangle \nonumber \\&\le \Vert \nabla f(\mathbf {(1-c)x+cy})\Vert _{2}\times \Vert \mathbf {y-x}\Vert _{2}, \end{aligned}$$

(B3)

where the mean value theorem ensures the existence of $c\in [0,1]$ and Cauchy inequality ensures the inequality.

Substituting Eq. (B3) into the second term of Eq. (B1) yields

$$\begin{aligned} \left| \mathbf {\mu ^{T}(w-w_0)}\right|&= \left| f({\textbf{w}}) - f(\mathbf {w_0})\right| \nonumber \\&\le \Vert \nabla f(\mathbf {(1-c)w_0+cw})\Vert _{2}\times \Vert \mathbf {w-w_0}\Vert _{2}. \end{aligned}$$

(B4)

Similarly, the third part of Eq. (B1) is proved by taking advantage of Eq. (B3) to yield

$$\begin{aligned} \left| \frac{\alpha }{2}(\mathbf {w^{T}}\varvec{\Lambda } \textbf{w}-\mathbf {w^{T}_0}\varvec{\Lambda } \mathbf {w_0})\right|&\le \frac{\alpha }{2}\Vert \nabla (\mathbf {w^{T}}\varvec{\Lambda } \textbf{w}-\mathbf {w^{T}_0}\varvec{\Lambda } \mathbf {w_0})\Vert _{2} \times \Vert \mathbf {w-w_0}\Vert _{2}. \end{aligned}$$

(B5)

Combining Eqs. (B2), (B4), and (B5) shows that the cost function satisfies the Lipschitz continuity condition:

$$\begin{aligned} \left| \text {Cost}({\textbf{w}})-\text {Cost}(\mathbf {w_0})\right|&\le \left( \sqrt{\Theta +1}\times {\textbf{c}}+\Vert \nabla \mathbf {\mu ^{T}(w-w_0)}\Vert _{2}+\frac{\alpha }{2}\Vert \nabla (\mathbf {w^{T}}\varvec{\Lambda } \textbf{w}-\mathbf {w^{T}_0}\varvec{\Lambda } \mathbf {w_0})\Vert _{2}\right) \\&\times \Vert \mathbf {w-w_0}\Vert _{2}, \end{aligned}$$

(B6)

where the Lipschitz constant L is

$$\begin{aligned} \left( \sqrt{\Theta +1}\times {\textbf{c}}+\Vert \nabla \mathbf {\mu ^{T}(w-w_0)}\Vert _{2}+\frac{\alpha }{2}\Vert \nabla (\mathbf {w^{T}}\varvec{\Lambda } \textbf{w}-\mathbf {w^{T}_0}\varvec{\Lambda } \mathbf {w_0})\Vert _{2}\right) . \end{aligned}$$

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Dai, TS., Chen, BJ., Sun, YJ. et al. Constructing Optimal Portfolio Rebalancing Strategies with a Two-Stage Multiresolution-Grid Model. Comput Econ (2024). https://doi.org/10.1007/s10614-024-10555-y

Download citation

Accepted: 12 January 2024
Published: 16 February 2024
DOI: https://doi.org/10.1007/s10614-024-10555-y

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Constructing Optimal Portfolio Rebalancing Strategies with a Two-Stage Multiresolution-Grid Model

Abstract

Similar content being viewed by others

Optimal portfolio selection with volatility information for a high frequency rebalancing algorithm

Heuristics for Portfolio Selection

Portfolio Optimization Via Online Gradient Descent and Risk Control

1 Introduction

2 Preliminaries

2.1 Target Ratio Construction

2.2 Costs of Tracking Errors and Transactions

2.3 Bellman Equation for Minimizing Costs

2.4 Modified Policy Iteration

3 Construction of Two-Stage Multiresolution Grid Algorithm

3.1 Area Probabilities and Discretization Errors

3.2 Lagrange Multiplier Method

4 Experimental Results

4.1 Comparison Between ODP and MRG

4.2 Grand Investment Performance Comparison

5 Conclusion

Data Availibility

Notes

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Conflicts of interest

Ethical Approval

Consent to Participate

Consent for Publication

Additional information

Publisher's Note

Appendices

Appendix A: Lists of Strategic Assets

Appendix B: Lipschitz Continuity

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation