Skip to main content
Log in

A hybrid two-stage financial stock forecasting algorithm based on clustering and ensemble learning

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

This paper investigates the problem of the stock closing price forecasting for the stock market. Based on existing two-stage fusion models in the literature, two new prediction models based on clustering have been proposed, where k-means clustering method is adopted to cluster several common technical indicators. In addition, ensemble learning has also been applied to improve the prediction accuracy. Finally, a hybrid prediction model, which combines both the k-means clustering and ensemble learning, has been proposed. The experimental results on a number of Chinese stocks demonstrate that the hybrid prediction model obtains the best predicting accuracy of the stock price. The k-means clustering on the stock technical indicators can further enhance the prediction accuracy of the ensemble learning.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

References

  1. Fama EF (1970) Efficient capital markets: A review of theory and empirical work. Journal of Finance 25(2):383–417

  2. Burton G, Malkiel (2003) The efficient market hypothesis and its critics. Journal of Economic Perspectives 17(1):59–82

    Google Scholar 

  3. Timmermann A, Clive WJ (2004) Granger. Efficient market hypothesis and forecasting. International Journal of Forecasting 20(1):15–27

    Google Scholar 

  4. Ozdemir ZA (2008) Efficient market hypothesis: evidence from a small open-economy. Applied Economics 40(4–6):633–641

    Google Scholar 

  5. Mlambo C, Biekpe N (2007) The efficient market hypothesis: evidence from ten African stock markets. Invest Anal J 36(66):5–17

    Google Scholar 

  6. Fama EF, MacBeth JD (1973) Risk, return, and equilibrium: empirical tests. J Polit Econ 81(3):607–636

    Google Scholar 

  7. Kara Y (2011) Melek Acar Boyacioglu, and Ömer Kaan Baykan. Predicting direction of stock price index movement using artificial neural networks and support vector machines: the sample of the Istanbul stock exchange. Expert Syst Appl 38(5):5311–5319

    Google Scholar 

  8. Patel J, Shah S, Thakkar P, Kotecha K (2015) Predicting stock market index using fusion of machine learning techniques. Expert Systems with Application 42(4):2162–2172

    Google Scholar 

  9. Amit Sirohi, Pradeep Mahato, and Vahida Attar. Multiple kernel learning for stock price direction prediction. Adv Eng Technol Res(ICAETR), 01 2015

  10. Long W, Song L, Tian Y (2018) A new graphic kernel method of stock price trend prediction based on financial news semantic and structural similarity. Expert Syst Appl 118:411–424

    Google Scholar 

  11. Wang Z, Hu J, Wu Y (2018) A bimodel algorithm with data-divider to predict stock index. Math Probl Eng (PT.3):3967525.1–3967525.14

  12. Jeon S, Hong B, Chang V (2018) Pattern graph tracking-based stock price prediction using big data. Future Generation Computer Systems 80(MAR.):171–187

    Google Scholar 

  13. Feng Z, Hao-min Z, Yang Z, Yang L (2019) EMD2FNN: a strategy combining empirical mode decomposition and factorization machine based neural network for stock market trend prediction. Expert Syst Appl 115:136–151

    Google Scholar 

  14. Yauheniya Shynkevich, T. M. Mcginnity, Sonya Coleman, and Ammar Belatreche. Predicting stock price movements based on different categories of news articles. In 2015 IEEE Symposium Computat Intelligence Financial Eng Econ (IEEE CIFEr’15), 2015

  15. Debashish Das, Ali Safa Sadiq, Seyedali Mirjalili and A. Noraziah. Hybrid Clustering-GWO-NARX neural network technique in predicting stock price. J Phys: Conf Series, page 012018, 2017

  16. Xiu Li Shao, Doudou Ma, Yiwei Liu, and Yin Quan. Short-term forecast of stock price of multi-branch LSTM based on k-means. In Int Conf Syst Inform. IEEE, 2018

  17. Weng B, Lu L, Wang X, Megahed FM, Martinez W (2018) Predicting short-term stock prices using ensemble methods and online data sources. Expert Syst Appl 112:258–273

    Google Scholar 

  18. Zhang Guoying and Chen Ping. Forecast of yearly stock returns based on AdaBoost integration algorithm. In IEEE 2017 IEEE Int Conf Smart Cloud (SmartCloud) - New York, NY (2017.11.3–2017.11.5), pages 263–267, 2017

  19. Wang C, Bai X (2018) Boosting learning algorithm for stock price forecasting. IOP Conf Series: Mater Sci Eng 322:052053

    Google Scholar 

  20. Weiling Chen, Chai Kiat Yeo, Chiew Tong Lau, and Bu Sung Lee. Leveraging social media news to predict stock index movement using RNN-Boost. Data & Knowledge Engineering, pages S0169023X17305839–, 2018

  21. Nonita Sharma and Akanksha Juneja. Combining of random forest estimates using LSBoost for stock market index prediction. In Proc.2nd Int Conf Convergence Technol (I2CT), pages 1199–1202, 04 2017

  22. Wang H, Lu S, Zhao J (2019) Aggregating multiple types of complex data in stock market prediction: a model-independent framework. Knowledge Based Systems 164:193–204

    Google Scholar 

  23. Zhou X, Pan Z, Hu G, Tang S, Cheng Z (2018) Stock market prediction on high-frequency data using generative adversarial nets. Math Probl Eng 2018:1–11

    Google Scholar 

  24. Che Yu Lee and Von Wun Soo. Predict stock price with financial news based on recurrent convolutional neural networks. In 2017 Conf Technol Appl Artifificial Intelligence (TAAI), 2017

  25. Ziniu Hu, Weiqing Liu, Jiang Bian, Xuanzhe Liu, and Tie-Yan Liu. Listening to chaotic whispers: A deep learning framework for news-oriented stock trend prediction. In Proc.11th ACM International Conference on Web Search and Data Mining (WSDM’18), 12 2017

  26. Luo T (2018) Research on decision-making of complex venture capital based on financial big data platform. Complexity 2018:1–12, 12

    Google Scholar 

  27. Hyun Sik Sim, Hae In Kim, and Jae Joon Ahn. Is deep learning for image recognition applicable to stock market prediction? Complexity, 2019:1–10, 2019

  28. Kim HY, Won CH (2018) Forecasting the volatility of stock price index: A hybrid model integrating LSTM with multiple GARCH-type models. Expert Systems with Applications 103:25–37

    Google Scholar 

  29. Wang W, Shi Y, Luo R (2019) Sparse representation based approach to prediction for economic time series. IEEE Access 7:20614–20618

    Google Scholar 

  30. Liu G, Wang X (2019) A numerical-based attention method for stock market prediction with dual information. IEEE Access 7:7357–7367

    Google Scholar 

  31. Peter J. Rousseuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Computat Appl Mathematics, 20, 1987

Download references

Acknowledgements

This work has been supported by National Natural Science Foundation of China under Grants 61772543, U1435222, 61625202, and 61272056.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ying Xu.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xu, Y., Yang, C., Peng, S. et al. A hybrid two-stage financial stock forecasting algorithm based on clustering and ensemble learning. Appl Intell 50, 3852–3867 (2020). https://doi.org/10.1007/s10489-020-01766-5

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-020-01766-5

Keywords

Navigation