Skip to main content
Log in

A theoretical foundation of portfolio resampling

  • Published:
Theory and Decision Aims and scope Submit manuscript

Abstract

A portfolio-resampling procedure invented by Richard and Robert Michaud is a subject of highly controversial discussion and big scientific dispute. It has been evaluated in many empirical studies and Monte Carlo experiments. Apart from the contradictory findings, the Michaud approach still lacks a theoretical foundation. I prove that portfolio resampling has a strong foundation in the classic theory of rational behavior. Every noise trader could do better by applying the Michaud procedure. By contrast, a signal trader who has enough prediction power and risk-management skills should refrain from portfolio resampling. The key note is that in most simulation studies, investors are considered as noise traders. This explains why portfolio resampling performs well in simulation studies, but could be mediocre in real life.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1

Similar content being viewed by others

Notes

  1. The Michaud approach is a US patented procedure with patent number 6,003,018.

  2. In the following, I use both terms “investor” and “trader” synonymously.

  3. Here it is implicitly assumed that the search costs for the information \(\mathcal {I}\) are negligible.

  4. Here \(\bar{w}\) denotes the theoretical expectation of \(\tilde{w}\) but not some corresponding approximation or estimate.

  5. Here the mean portfolio \(\bar{w}_t\) is a function of time, since the distribution of \(\tilde{w}_t\) might depend on \(t\in \mathbb {Z}\,\).

  6. In Sect. 4.2 I discuss the apparent link between the random-walk and the efficient-market hypothesis.

  7. In most publications, \(\Sigma \) typically denotes the covariance matrix \(Var (R)\) but not the uncentered second-moment matrix of \(R\). For short-term asset returns, the difference between \(Var (R)\) and \(E (RR')\) is negligible.

  8. Here \(\Sigma \) refers to the covariance matrix of \(R\) (Michaud 1998).

  9. Hence, \(\mathcal {C}\) belongs to the \(N\)-dimensional Euclidean simplex.

  10. This point has been suggested by Christoph Memmel in a personal communication.

  11. In the random-walk model (see Example 4), \(\theta \) is the parameter of \(F\), i.e., the joint cumulative distribution function of the asset returns.

  12. For example, it is typically assumed that the random-walk model is true, \(\Sigma \) is known, and \(\mathcal {C}=\mathbb {R}^N\). Then the desired portfolio corresponds to \(w^*=\Sigma ^{-1}\mu /\alpha \), which is a linear function of \(\mu \,\).

  13. I would like to thank Winfried Pohlmeier for this hint.

  14. For a precise definition of \(\lambda _{t,T}\) see Back (2010) and Frahm (2013).

  15. More details on this subject can be found in Frahm (2013).

  16. This means \(R\) is stochastically independent of \(\tilde{w}_t\sim \tilde{w}\) for \(t=1,2,\ldots ,n\,\).

  17. Here I do not distinguish between theories where a rational individual act as if he maximizes the expected utility based on his subjective probabilities and other theories implying that he actually does.

  18. This is also reflected by Markowitz’ two-stage portfolio optimization approach discussed in Sect. 2.

References

  • Arrow, K.J. (1971). “The theory of risk aversion”, in Yrjo Jahnssonin Saatio, ed., Aspects of the Theory of Risk Bearing, Markham, 1971, pp. 90–109.

  • Back, K. (2010). Martingale pricing. Annual Review of Financial Economics, 2, 235–250.

    Article  Google Scholar 

  • Bade, A., Frahm, G., & Jaekel, U. (2008). A general approach to Bayesian portfolio optimization. Mathematical Methods of Operations Research, 70, 337–356.

    Article  Google Scholar 

  • Becker, F., Gürtler, M., & Hibbeln, M. (2013). Markowitz versus Michaud: portfolio optimization strategies reconsidered. The European Journal of Finance,. doi:10.1080/1351847X.2013.830138.

  • Black, F. (1986). Noise. Journal of Finance, 41, 529–543.

    Article  Google Scholar 

  • Bradley, R. (2007). A unified Bayesian decision theory. Theory and decision, 63, 233–263.

    Article  Google Scholar 

  • Breiman, L. (1996). Bagging predictors. Machine Learning, 24, 123–140.

    Google Scholar 

  • Campbell, J. Y., Lo, A. W., & MacKinlay, A. C. (1997). The Econometrics of Financial Markets. Princeton University Press.

  • Choquet, G. (1953). Theory of capacities. Annales de l’Institut Fourier, 5, 131–295.

    Article  Google Scholar 

  • Cochrane, J. H. (2005). Asset Pricing. Princeton University Press.

  • DeMiguel, V., Garlappi, L., Nogales, F. J., & Uppal, R. (2009). A generalized approach to portfolio optimization: Improving performance by constraining portfolio norms. Management Science, 55, 798–812.

    Article  Google Scholar 

  • Fama, E. F. (1970). Efficient capital markets: A review of theory and empirical work. Journal of Finance, 25, 383–417.

    Article  Google Scholar 

  • Fernandes, J. L., & Barros, J. R. (2012). Haas Ornelas, and O.A. Martínez Cusicanqui, “Combining equilibrium, resampling, and analyst’s views in portfolio optimization”. Journal of Banking and Finance, 36, 1354–1361.

    Article  Google Scholar 

  • Ferson, W. (2007). Market efficiency and forecasting”, in “Forecasting Expected Returns in the Financial Markets. Academic Press.

  • Fletcher, J., & Hillier, J. (2001). An examination of resampled portfolio efficiency. Financial Analysts Journal, 57, 66–74.

    Article  Google Scholar 

  • Frahm, G. (2011). The determinants of the risk functions of estimators for expected asset returns. Technical Report, University of Cologne.

  • Frahm, G. (2013). “The fundamental theorem of asset pricing for liquid financial markets”, Working paper AP 2013–1, Helmut Schmidt University.

  • Frahm, G., & Memmel, C. (2010). Dominating estimators for minimum variance portfolios. Journal of Econometrics, 159, 289–302.

    Article  Google Scholar 

  • Frahm, G., Memmel, C., Wickern, T., & Wiechers, C. (2012). Multiple tests for the performance of different investment strategies. Advances in Statistical Analysis, 96, 343–383.

    Article  Google Scholar 

  • Hansen, L. P. (1982). Large sample properties of generalized method of moments estimators. Econometrica, 50, 1029–1054.

    Article  Google Scholar 

  • Harvey, C. R., Liechty, J. C., & Liechty, M. W. (2008). Bayes vs. resampling: a rematch. Journal of Investment Management, 6, 1–17.

    Google Scholar 

  • Herold, U., & Maurer, R. (2006). Portfolio choice and estimation risk. A comparison of Bayesian to heuristic approaches. ASTIN Bulletin, 36, 135–160.

    Article  Google Scholar 

  • Jarrow, R. (2012). “The Third Fundamental Theorem of Asset Pricing”, Annals of Financial Economics, 7. DOI:10.1142/S2010495212500078.

  • Jarrow, R. A., & Larsson, M. (2012). The meaning of market efficiency. Mathematical Finance, 22, 1–30.

    Article  Google Scholar 

  • Jobson, J. D., & Korkie, B. (1980). Estimation for Markowitz efficient portfolios. Journal of the American Statistical Association, 75, 544–554.

    Article  Google Scholar 

  • Kan, R., & Zhou, G. (2007). Optimal portfolio choice with parameter uncertainty. Journal of Financial and Quantitative Analysis, 42, 621–656.

    Article  Google Scholar 

  • Knight, F. H. (1921). Risk Uncertainty and Profit, Hart, Schaffner and Marx.

  • Kohli, J. (2005). “An Empirical Analysis of Resampled Efficiency”, Master’s thesis, Worcester Polytechnic Institute.

  • Kovaleski, D. (2001). Northfield to utilize New Frontier’s portfolio optimization process. Pensions & Investments, 29, 39.

    Google Scholar 

  • Kroll, Y., Levy, H., & Markowitz, H. M. (1984). Mean-variance versus direct utility maximization. Journal of Finance, 39, 47–61.

    Article  Google Scholar 

  • LeRoy, S. F. (1973). Risk aversion and the martingale property of stock prices. International Economic Review, 14, 436–446.

    Article  Google Scholar 

  • Lo, A.W., Mamaysky, H., & Wang, J. (2002). “Foundations of technical analysis: computational algorithms, statistical inference and empirical implementation”, in E. Acar and S. Satchell, eds., Advanced Trading Rules, Butterworth-Heinemann, chapter 2, pp. 42–111.

  • Lucas, R. E. (1978). Asset prices in an exchange economy. Econometrica, 46, 1429–1445.

    Article  Google Scholar 

  • Malkiel, B. (2003). The Efficient Market Hypothesis and its critics. Journal of Economic Perspectives, 17, 59–82.

    Article  Google Scholar 

  • Markowitz, H. M. (1952). Portfolio selection. Journal of Finance, 7, 77–91.

    Google Scholar 

  • Markowitz, H. M. (2012). The “great confusion” concerning MPT. AESTIMATIO, The IEB International Journal of Finance, 4, 8–27.

    Google Scholar 

  • Markowitz, H. M., & Usmen, N. (2003). Resampled frontiers versus diffuse Bayes. Journal of Investment Management, 1, 1–17.

    Google Scholar 

  • Michaud, R. O. (1998). Efficient Asset Management: A Practical Guide to Stock Portfolio Optimization and Asset Allocation. McGraw-Hill.

  • Michaud, R. O., & Michaud, R. (2003). An examination of resampled portfolio efficiency”: comment and response. Financial Analysts Journal, 59, 15–16.

    Article  Google Scholar 

  • Michaud, R. O., & Michaud, R. (2008). Defense of Markowitz-Usmen. New Frontier Advisors: Technical Report.

  • Mossin, J. (1968). Optimal multiperiod portfolio policies. The Journal of Business, 41, 215–229.

    Article  Google Scholar 

  • Pratt, J. W. (1964). Risk aversion in the small and in the large. Econometrica, 32, 122–136.

    Article  Google Scholar 

  • Pulley, L. B. (1981). A general mean-variance approximation to expected utility for short holding periods. Journal of Financial and Quantitative Analysis, 16, 361–373.

    Article  Google Scholar 

  • Ramsey, F. P. (1926). Truth and probability, in Philosophical Papers. Cambridge University Press.

  • Ross, S. A. (2005). Neoclassical Finance. Princeton University Press.

  • Savage, L. J. (1954). The Foundations of Statistics. Wiley.

  • Scherer, B. (2002). Portfolio resampling: review and critique. Financial Analysts Journal, 58, 98–109.

    Article  Google Scholar 

  • Scherer, B. (2004). Resampled efficiency and portfolio choice. Financial Markets and Portfolio Management, 18, 382–398.

    Article  Google Scholar 

  • Scherer, B. (2006). A note on the out-of-sample performance of resampled efficiency. Journal of Asset Management, 7, 170–178.

    Article  Google Scholar 

  • Scherer, B. (2007). Can robust portfolio optimisation help to build better portfolios? Journal of Asset Management, 7, 374–387.

    Article  Google Scholar 

  • Timmermann, A., & Granger, C. W. J. (2004). Efficient market hypothesis and forecasting. International Journal of Forecasting, 20, 15–27.

    Article  Google Scholar 

  • von Neumann, J., & Morgenstern, O. (1944). Theory of Games and Economic Behavior. Princeton University Press.

  • Wolf, M. (2007). Resampling vs. shrinkage for benchmarked managers, Wilmott Magazine, 2007, January, 76–81.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Gabriel Frahm.

Additional information

Many thanks belong to Uwe Jaekel, Bob Korkie, Christoph Memmel, Gert Mittring, Winfried Pohlmeier, Rainer Schüssler, and Niklas Wagner for their fruitful comments and suggestions. Moreover, I thank all participants of the workshop on portfolio optimization in Konstanz (2012) for their contributions.

Proofs

Proofs

In the following, it is implicitly assumed that the given expectations, variances, and covariances exist and are finite. Moreover, it is assumed that \(\Sigma \) is positive definite so that the inverse \(\Sigma ^{-1}\) exists.

Proposition 1

Let \(f\) be an increasing and strictly concave function from \(\mathbb {R}\) to \(\mathbb {R}\,\). Further, let \(g\) be a concave function from \(\mathbb {R}^N\) to \(\mathbb {R}\,\). Then \(f\circ g\) is strictly concave, too.

Proof

Since \(f\) is strictly concave, it holds that

$$\begin{aligned} \pi f\big (g(x_1)\big ) + (1-\pi ) f\big (g(x_2)\big ) < f\big (\pi g(x_1) + (1-\pi ) g(x_2)\big ) \end{aligned}$$

for every \(\pi \in ]0,1[\,\). Further, since \(g\) is concave, we have that

$$\begin{aligned} \pi g(x_1) + (1-\pi ) g(x_2) \le g\big (\pi x_1+(1-\pi )x_2\big ). \end{aligned}$$

Since \(f\) is increasing, it turns out that

$$\begin{aligned} \pi f\big (g(x_1)\big ) + (1-\pi ) f\big (g(x_2)\big ) < f\big (g(\pi x_1+(1-\pi )x_2)\big ) \end{aligned}$$

for every \(\pi \in ]0,1[\,\), which means that \(f\circ g\) is strictly concave.

1.1 Theorem 1

We have that

$$\begin{aligned} E \Bigl (u\bigl (W\,(1+r+\tilde{w}'R)\bigr )\Bigr ) = \int \int u\bigl (W\,(1+r+w'r)\bigr )\,p(w,r)\,\mathrm{d}r\,\mathrm{d}w \end{aligned}$$

and, since \(\tilde{w}\) is independent of \(R\,\), it follows that \(p(w,r)=p(w)p(r)\,\). This means

$$\begin{aligned} E \Bigl (u\bigl (W\,(1+r+\tilde{w}'R)\bigr )\Bigr )&= \int \int u\bigl (W\,(1+r+w'r)\bigr )\,p(w)p(r)\mathrm{d}r\mathrm{d}w \\&= \int \left[ \int u\bigl (W\,(1+r+w'r)\bigr )\,p(w)\,\mathrm{d}w\right] p(r)\,\mathrm{d}r. \end{aligned}$$

Since \(u\) is increasing and strictly concave in \(W\,(1+r+w'r)\), which is linear and thus concave in \(w\), Proposition 1 implies that \(u\big (W\,(1+r+w'r)\big )\) is strictly concave in \(w\,\). Further, since \(\tilde{w}\) is mixed, its probability distribution is not degenerated and thus

$$\begin{aligned} \int u\bigl (W\,(1+r+w'r)\bigr )\,p(w)\,\mathrm{d}w < u\bigl (W(1+r+\bar{w}'r)\bigr ) \end{aligned}$$

by Jensen’s inequality. Hence, it follows that

$$\begin{aligned} E \Big (u\bigl (W\,(1+r+\tilde{w}'R)\bigr )\Big )&< \int u\bigl (W(1+r+\bar{w}'r)\bigr )\,p(r)\,\mathrm{d}r\\&= E \Bigl (u\bigl (W\,(1+r+\bar{w}'R)\bigr )\Bigr ). \end{aligned}$$

Since \(\mathcal {C}\) is convex and \(\tilde{w}\in \mathcal {C}\), we have that \(\bar{w}=E (\tilde{w})\in \mathcal {C}\,\).

1.2 Theorem 2

The out-of-sample performance of \(w^*=\Sigma ^{-1}\mu /\alpha \) amounts to

$$\begin{aligned} \varphi _\alpha (w^*) = \frac{\mu '\Sigma ^{-1}\mu }{2\alpha }. \end{aligned}$$

Further, we have that

$$\begin{aligned} (\tilde{w}-w^*)'\Sigma \,(\tilde{w}-w^*)&= \tilde{w}'\Sigma \,\tilde{w}-2\tilde{w}'\Sigma \,w^*+w^{*\prime }\Sigma \,w^*\\&= \tilde{w}'\Sigma \,\tilde{w}-\frac{2\tilde{w}'\mu }{\alpha }+\frac{\mu '\Sigma ^{-1}\mu }{\alpha ^2}, \end{aligned}$$

so that

$$\begin{aligned} \frac{\alpha }{2}\,(\tilde{w}-w^*)'\Sigma \,(\tilde{w}-w^*) = \frac{\alpha }{2}\,\tilde{w}'\Sigma \,\tilde{w} - \tilde{w}'\mu + \frac{\mu '\Sigma ^{-1}\mu }{2\alpha } = \varphi _\alpha (w^*) - \varphi _\alpha (\tilde{w}). \end{aligned}$$

Hence,

$$\begin{aligned} \frac{\alpha }{2}\,E \Bigl ((\tilde{w}-w^*)'\Sigma \,(\tilde{w}-w^*)\Bigr ) = E \Bigl (\varphi _\alpha (w^*)-\varphi _\alpha (\tilde{w})\Bigr ) = \mathcal {L}_\alpha (\tilde{w}). \end{aligned}$$

The decomposition of \(\mathcal {L}_\alpha (\tilde{w})\) into \(\mathcal {B}_\alpha (\tilde{w})\) and \(\mathcal {V}_\alpha (\tilde{w})\) follows immediately from

$$\begin{aligned} \mathcal {L}_\alpha (\tilde{w})&= \frac{\alpha }{2}\,E \Bigl (\bigl [(\tilde{w}-\bar{w}) +(\bar{w}-w^*)\bigr ]'\Sigma \,\bigl [(\tilde{w}-\bar{w}) +(\bar{w}-w^*)\bigr ]\Bigr )\\&= \frac{\alpha }{2}\,E \Bigl ((\tilde{w}-\bar{w})'\Sigma \,(\tilde{w}-\bar{w})\Bigr ) +\frac{\alpha }{2}\,(\bar{w}-w^*)'\Sigma \,(\bar{w}-w^*) = \mathcal {V}_\alpha (\tilde{w}) + \mathcal {B}_\alpha (\tilde{w}). \end{aligned}$$

Since \(\Sigma \) is positive definite we have that

$$\begin{aligned} \mathcal {V}_\alpha (\tilde{w}) = \frac{\alpha }{2}\,E \Bigl ((\tilde{w}-\bar{w})'\Sigma \,(\tilde{w}-\bar{w})\Bigr ) \ge 0 \end{aligned}$$

and from

$$\begin{aligned} \mathcal {B}_\alpha (\tilde{w}) = \frac{\alpha }{2}\,(\bar{w}-w^*)'\Sigma \,(\bar{w}-w^*) = \mathcal {L}_\alpha (\bar{w}) \end{aligned}$$

it follows that \(\mathcal {L}_\alpha (\tilde{w})\ge \mathcal {L}_\alpha (\bar{w})\,\). Moreover, since \(\tilde{w}\in \mathcal {C}\) and \(\mathcal {C}\) is a convex set, it holds that \(\bar{w}=E (\tilde{w})\in \mathcal {C}\). If \(\tilde{w}\) is a mixed strategy, Jensen’s inequality implies that \(\mathcal {V}_\alpha (\tilde{w})>0\,\), i.e., the inequality is strict. Vice versa, if the inequality is strict, it must hold that \(\mathcal {V}_\alpha (\tilde{w})>0\,\). This is only possible if \(\tilde{w}\) is mixed.

1.3 Theorem 3

First of all, we have that

$$\begin{aligned} E \big (\phi _\alpha (\tilde{w})\big ) = E (\tilde{w}'R) - \frac{\alpha }{2}\,E \Big ((\tilde{w}'R)^2\Big ), \end{aligned}$$

where

$$\begin{aligned} E (\tilde{w}'R) = \sum _{i=1}^NE (\tilde{w}_iR_i) = \sum _{i=1}^N\bar{w}_i\mu _i + \sum _{i=1}^NCov (\tilde{w}_i,R_i) = \bar{w}'\mu + \sum _{i=1}^NCov (\tilde{w}_i,R_i). \end{aligned}$$

Moreover, it follows that

$$\begin{aligned} E \Big ((\tilde{w}'R)^2\Big ) = E \big (\tilde{w}'RR'\tilde{w}\big ) = E \big (\tilde{w}'\Sigma \,\tilde{w}\big ) + E \Big (\tilde{w}'\big (RR'-\Sigma \big )\tilde{w}\Big ), \end{aligned}$$

where

$$\begin{aligned} E \Big (\tilde{w}'\big (RR'-\Sigma \big )\tilde{w}\Big ) = \sum _{i,j=1}^N E \Big (\tilde{w}_i\tilde{w}_j\big (R_iR_j-E (R_iR_j)\big )\Big ) = \sum _{i,j=1}^N Cov \big (\tilde{w}_i\tilde{w}_j,R_iR_j\big ). \end{aligned}$$

This means

$$\begin{aligned} E \big (\phi _\alpha (\tilde{w})\big ) = \bar{w}'\mu + \sum _{i=1}^NCov (\tilde{w}_i,R_i) - \frac{\alpha }{2}\left( E \big (\tilde{w}'\Sigma \,\tilde{w}\big )+\sum _{i,j=1}^NCov \big (\tilde{w}_i\tilde{w}_j,R_iR_j\big )\right) . \end{aligned}$$

1.4 Theorem 4

Since \(\tilde{w}\) and its copies \(\tilde{w}_1,\tilde{w}_2,\ldots ,\tilde{w}_n\) are irrelevant, it follows that

$$\begin{aligned} p(w_t,r) = p(w_t)p(r) = p(w)p(r) = p(w,r) \end{aligned}$$

and thus

$$\begin{aligned} E \Big (u\big (W\,(1+r+\tilde{w}'_tR)\big )\Big )&= \int \int u\big (W\,(1+r+w'_tr)\big )\,p(w_t,r)\,\mathrm{d}r\mathrm{d}w_t \\&= \int \int u\big (W\,(1+r+w'r)\big )\,p(w,r)\,\mathrm{d}r\mathrm{d}w \\&= E \Big (u\big (W\,(1+r+\tilde{w}'R)\big )\Big ),\qquad t = 1,2,\ldots ,n, \end{aligned}$$

i.e.,

$$\begin{aligned} E \Big (u\big (W\,(1+r+\tilde{w}'R)\big )\Big ) = E \left( \frac{1}{n}\sum _{t=1}^{n} u\big (W\,(1+r+\tilde{w}'_tR)\big )\right) . \end{aligned}$$

Since \(u\big (W\,(1+r+x)\big )\) is strictly concave in \(x\), we have that

$$\begin{aligned} \frac{1}{n}\sum _{t=1}^{n} u\big (W\,(1+r+\tilde{w}'_tR)\big )&\le u\left( W\bigg (1+r+\frac{1}{n}\sum _{t=1}^{n}\tilde{w}'_tR\bigg )\right) \\&= u\Big (W\,(1+r+\hat{\bar{w}}'R)\Big ) \end{aligned}$$

with probability one, but since \(\tilde{w}'_1R,\tilde{w}'_2R,\ldots ,\tilde{w}'_nR\) are not almost surely identical, it follows that

$$\begin{aligned} \frac{1}{n}\sum _{t=1}^{n} u\big (W\,(1+r+\tilde{w}'_tR)\big ) < u\Big (W\,(1+r+\hat{\bar{w}}'R)\Big ) \end{aligned}$$

with positive probability. This means

$$\begin{aligned} E \Big (u\big (W\,(1+r+\tilde{w}'R)\big )\Big ) < E \Big (u\big (W\,(1+r+\hat{\bar{w}}'R)\big )\Big ). \end{aligned}$$

Finally, since \(\mathcal {C}\) is convex and \(\tilde{w}_t\in \mathcal {C}\) for \(t=1,2,\ldots ,n\), we have that \(\hat{\bar{w}}=\frac{1}{n}\sum _{t=1}^{n}\tilde{w}_t\in \mathcal {C}\).

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Frahm, G. A theoretical foundation of portfolio resampling. Theory Decis 79, 107–132 (2015). https://doi.org/10.1007/s11238-014-9453-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11238-014-9453-0

Keywords

JEL Classification

Navigation