Skip to main content
Log in

Feature selection for portfolio optimization

  • S.I.: APMOD 2014
  • Published:
Annals of Operations Research Aims and scope Submit manuscript

Abstract

Most portfolio selection rules based on the sample mean and covariance matrix perform poorly out-of-sample. Moreover, there is a growing body of evidence that such optimization rules are not able to beat simple rules of thumb, such as 1/N. Parameter uncertainty has been identified as one major reason for these findings. A strand of literature addresses this problem by improving the parameter estimation and/or by relying on more robust portfolio selection methods. Independent of the chosen portfolio selection rule, we propose using feature selection first in order to reduce the asset menu. While most of the diversification benefits are preserved, the parameter estimation problem is alleviated. We conduct out-of-sample back-tests to show that in most cases different well-established portfolio selection rules applied on the reduced asset universe are able to improve alpha relative to different prominent factor models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. See Kritzman (1993) who compares factor analysis and cross-sectional regression for that purpose.

  2. For the rest of the paper, when referring to the Tu and Zhou (2011) strategy, we mean the optimal combination of 1/N with Kan and Zhou (2007).

  3. In the following we use the term “feature selection” as synonym for “hierarchical clustering”.

  4. See http://mba.tuck.dartmouth.edu/pages/faculty/ken.french/data_library.html.

  5. Given the focus of this paper, we want to point out that the proposed investigation is not intended as a horse race between the different portfolio selection rules.

  6. For example, finding the optimal universe of the least correlated 15 out of 50 assets would require approximately \(2.25 \times 10^{12}\) permutations.

  7. The extreme reduction of dimensionality in this exhibition is used for the illustration purpose only.

  8. The structured estimator is based on the constant correlation matrix (set equal to the average sample correlation), which corresponds to the default value in the package.

  9. Given that the clusters are quite stable over time and the assets within a cluster are highly correlated, in order to reduce excessive trading we propose to apply feature selection on an annual basis.

  10. The most time consuming operation is shrinkage. For the 49 industry portfolios the computational time for all results is less than 10 min.

  11. When considering only out-of-sample returns of the same time-interval, we found that intermediate estimations windows of 10–20 years perform best. Results are available upon request.

References

  • Best, M. J., & Grauer, R. R. (1991). On the Sensitivity of mean–variance-efficient portfolios to changes in asset means: Some analytical and computational results. Review of Financial Studies, 4(2), 315–342.

    Article  Google Scholar 

  • Black, F., & Litterman, R. (1992). Global portfolio optimization. Financial Analysts Journal, 48(5), 28–43.

    Article  Google Scholar 

  • Carhart, M. M. (1997). On persistence in mutual fund performance. The Journal of Finance, 52(1), 57–82.

    Article  Google Scholar 

  • Chopra, V. K., & Ziemba, W. T. (1993). The effect of errors in means, variances, and covariances on optimal portfolio choice. The Journal of Portfolio Management, 19(2), 6–11.

    Article  Google Scholar 

  • Clarke, R. G., de Silva, H., & Thorley, S. (2006). Minimum-variance portfolios in the US equity market. The Journal of Portfolio Management, 33(1), 10–24.

    Article  Google Scholar 

  • DeMiguel, V., Garlappi, L., Nogales, F. J., & Uppal, R. (2009a). A generalized approach to portfolio optimization: Improving performance by constraining portfolio norms. Management Science, 55(5), 798–812.

    Article  Google Scholar 

  • DeMiguel, V., Garlappi, L., & Uppal, R. (2009b). Optimal versus naive diversification: How inefficient is the 1/N portfolio strategy? Review of Financial Studies, 22(5), 1915–1953.

    Article  Google Scholar 

  • DeMiguel, V., & Nogales, F. J. (2009). Portfolio selection with robust estimation. Operations Research, 57(3), 560–577.

    Article  Google Scholar 

  • Duchin, R., & Levy, H. (2009). Markowitz versus the talmudic portfolio diversification strategies. The Journal of Portfolio Management, 35(2), 71–74.

    Article  Google Scholar 

  • Fama, E. F., & French, K. R. (1993). Common risk factors in the returns on stocks and bonds. Journal of Financial Economics, 33(1), 3–56.

    Article  Google Scholar 

  • Frazzini, A., & Pedersen, L. H. (2014). Betting against beta. Journal of Financial Economics, 111(1), 1–25.

    Article  Google Scholar 

  • Hartigan, J. A. (1981). Consistency of single linkage for high-density clusters. Journal of the American Statistical Association, 76(374), 388–394.

    Article  Google Scholar 

  • Haugen, R. A., & Baker, N. L. (1991). The efficient market inefficiency of capitalization-weighted stock portfolios. The Journal of Portfolio Management, 17(3), 35–40.

    Article  Google Scholar 

  • Jagannathan, R., & Ma, T. (2003). Risk reduction in large portfolios: Why imposing the wrong constraints helps. The Journal of Finance, 58(4), 1651–1684.

    Article  Google Scholar 

  • Jorion, P. (1986). Bayes-Stein estimation for portfolio analysis. The Journal of Financial and Quantitative Analysis, 21(3), 279–292.

    Article  Google Scholar 

  • Kan, R., & Zhou, G. (2007). Optimal portfolio choice with parameter uncertainty. Journal of Financial and Quantitative Analysis, 42(03), 621–656.

    Article  Google Scholar 

  • Kritzman, M. (1993). What practitioners need to know\(\ldots \) about factor methods. Financial Analysts Journal, 49(1), 12–15.

    Article  Google Scholar 

  • Kritzman, M., Page, S., & Turkington, D. (2010). In defense of optimization: The fallacy of 1/ N. Financial Analysts Journal, 66(2), 31–39.

    Article  Google Scholar 

  • Ledoit, O., & Wolf, M. (2003). Improved estimation of the covariance matrix of stock returns with an application to portfolio selection. Journal of Empirical Finance, 10(5), 603–621.

    Article  Google Scholar 

  • Ledoit, O., & Wolf, M. (2004a). A well-conditioned estimator for large-dimensional covariance matrices. Journal of Multivariate Analysis, 88(2), 365–411.

    Article  Google Scholar 

  • Ledoit, O., & Wolf, M. (2004b). Honey, I shrunk the sample covariance matrix. The Journal of Portfolio Management, 30(4), 110–119.

    Article  Google Scholar 

  • Ledoit, O., & Wolf, M. (2012). Nonlinear shrinkage estimation of large-dimensional covariance matrices. The Annals of Statistics, 40(2), 1024–1060.

    Article  Google Scholar 

  • Levy, H., & Levy, M. (2014). The benefits of differential variance-based constraints in portfolio optimization. European Journal of Operational Research, 234(2), 372–381.

    Article  Google Scholar 

  • Lisi, F., & Corazza, M. (2008). Clustering financial data for mutual fund management. In C. Perna & M. Sibillo (Eds.), Mathematical and Statistical Methods in Insurance and Finance (pp. 157–164). Milan: Springer.

    Chapter  Google Scholar 

  • Markowitz, H. (1952). Portfolio selection. The Journal of Finance, 7(1), 77–91.

    Google Scholar 

  • Merton, R. (1980). On estimating the expected return on the market: An exploratory investigation. Journal of Financial Economics, 8, 323–361.

    Article  Google Scholar 

  • Nanda, S., Mahanty, B., & Tiwari, M. (2010). Clustering Indian stock market data for portfolio management. Expert Systems with Applications, 37(12), 8793–8798.

    Article  Google Scholar 

  • Scherer, B. (2011). A note on the returns from minimum variance investing. Journal of Empirical Finance, 18(4), 652–660.

    Article  Google Scholar 

  • Tan, P.-N., Steinbach, M., & Kumar, V. (2006). Introduction to data mining. London: Pearson.

    Google Scholar 

  • Tobin, J. (1958). Liquidity preference as behavior towards risk. The Review of Economic Studies, 25(2), 65–82.

    Article  Google Scholar 

  • Tola, V., Lillo, F., Gallegati, M., & Mantegna, R. N. (2008). Cluster analysis for portfolio optimization. Journal of Economic Dynamics and Control, 32(1), 235–258.

    Article  Google Scholar 

  • Tu, J., & Zhou, G. (2011). Markowitz meets Talmud: A combination of sophisticated and naive diversification strategies. Journal of Financial Economics, 99(1), 204–215.

    Article  Google Scholar 

  • Xu, R., & Wunsch, D. (2005). Survey of clustering algorithms. IEEE Transactions on Neural Networks, 16(3), 645–678.

    Article  Google Scholar 

Download references

Acknowledgments

We thank Michael Hanke, Kourosh Marjani Rasmussen, the Guest Editors Nalan Gulpinar, Xuan V. Doan and Arne K. Strauss, and two anonymous referees for helpful comments. Special thanks also to conference participants of the APMOD 2014 at the University of Warwick and seminar participants at the University of Oxford and the Technical University of Denmark.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alex Weissensteiner.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Bjerring, T.T., Ross, O. & Weissensteiner, A. Feature selection for portfolio optimization. Ann Oper Res 256, 21–40 (2017). https://doi.org/10.1007/s10479-016-2155-y

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10479-016-2155-y

Keywords

Navigation