Abstract
This paper investigates the problem of the stock closing price forecasting for the stock market. Based on existing two-stage fusion models in the literature, two new prediction models based on clustering have been proposed, where k-means clustering method is adopted to cluster several common technical indicators. In addition, ensemble learning has also been applied to improve the prediction accuracy. Finally, a hybrid prediction model, which combines both the k-means clustering and ensemble learning, has been proposed. The experimental results on a number of Chinese stocks demonstrate that the hybrid prediction model obtains the best predicting accuracy of the stock price. The k-means clustering on the stock technical indicators can further enhance the prediction accuracy of the ensemble learning.
Similar content being viewed by others
References
Fama EF (1970) Efficient capital markets: A review of theory and empirical work. Journal of Finance 25(2):383–417
Burton G, Malkiel (2003) The efficient market hypothesis and its critics. Journal of Economic Perspectives 17(1):59–82
Timmermann A, Clive WJ (2004) Granger. Efficient market hypothesis and forecasting. International Journal of Forecasting 20(1):15–27
Ozdemir ZA (2008) Efficient market hypothesis: evidence from a small open-economy. Applied Economics 40(4–6):633–641
Mlambo C, Biekpe N (2007) The efficient market hypothesis: evidence from ten African stock markets. Invest Anal J 36(66):5–17
Fama EF, MacBeth JD (1973) Risk, return, and equilibrium: empirical tests. J Polit Econ 81(3):607–636
Kara Y (2011) Melek Acar Boyacioglu, and Ömer Kaan Baykan. Predicting direction of stock price index movement using artificial neural networks and support vector machines: the sample of the Istanbul stock exchange. Expert Syst Appl 38(5):5311–5319
Patel J, Shah S, Thakkar P, Kotecha K (2015) Predicting stock market index using fusion of machine learning techniques. Expert Systems with Application 42(4):2162–2172
Amit Sirohi, Pradeep Mahato, and Vahida Attar. Multiple kernel learning for stock price direction prediction. Adv Eng Technol Res(ICAETR), 01 2015
Long W, Song L, Tian Y (2018) A new graphic kernel method of stock price trend prediction based on financial news semantic and structural similarity. Expert Syst Appl 118:411–424
Wang Z, Hu J, Wu Y (2018) A bimodel algorithm with data-divider to predict stock index. Math Probl Eng (PT.3):3967525.1–3967525.14
Jeon S, Hong B, Chang V (2018) Pattern graph tracking-based stock price prediction using big data. Future Generation Computer Systems 80(MAR.):171–187
Feng Z, Hao-min Z, Yang Z, Yang L (2019) EMD2FNN: a strategy combining empirical mode decomposition and factorization machine based neural network for stock market trend prediction. Expert Syst Appl 115:136–151
Yauheniya Shynkevich, T. M. Mcginnity, Sonya Coleman, and Ammar Belatreche. Predicting stock price movements based on different categories of news articles. In 2015 IEEE Symposium Computat Intelligence Financial Eng Econ (IEEE CIFEr’15), 2015
Debashish Das, Ali Safa Sadiq, Seyedali Mirjalili and A. Noraziah. Hybrid Clustering-GWO-NARX neural network technique in predicting stock price. J Phys: Conf Series, page 012018, 2017
Xiu Li Shao, Doudou Ma, Yiwei Liu, and Yin Quan. Short-term forecast of stock price of multi-branch LSTM based on k-means. In Int Conf Syst Inform. IEEE, 2018
Weng B, Lu L, Wang X, Megahed FM, Martinez W (2018) Predicting short-term stock prices using ensemble methods and online data sources. Expert Syst Appl 112:258–273
Zhang Guoying and Chen Ping. Forecast of yearly stock returns based on AdaBoost integration algorithm. In IEEE 2017 IEEE Int Conf Smart Cloud (SmartCloud) - New York, NY (2017.11.3–2017.11.5), pages 263–267, 2017
Wang C, Bai X (2018) Boosting learning algorithm for stock price forecasting. IOP Conf Series: Mater Sci Eng 322:052053
Weiling Chen, Chai Kiat Yeo, Chiew Tong Lau, and Bu Sung Lee. Leveraging social media news to predict stock index movement using RNN-Boost. Data & Knowledge Engineering, pages S0169023X17305839–, 2018
Nonita Sharma and Akanksha Juneja. Combining of random forest estimates using LSBoost for stock market index prediction. In Proc.2nd Int Conf Convergence Technol (I2CT), pages 1199–1202, 04 2017
Wang H, Lu S, Zhao J (2019) Aggregating multiple types of complex data in stock market prediction: a model-independent framework. Knowledge Based Systems 164:193–204
Zhou X, Pan Z, Hu G, Tang S, Cheng Z (2018) Stock market prediction on high-frequency data using generative adversarial nets. Math Probl Eng 2018:1–11
Che Yu Lee and Von Wun Soo. Predict stock price with financial news based on recurrent convolutional neural networks. In 2017 Conf Technol Appl Artifificial Intelligence (TAAI), 2017
Ziniu Hu, Weiqing Liu, Jiang Bian, Xuanzhe Liu, and Tie-Yan Liu. Listening to chaotic whispers: A deep learning framework for news-oriented stock trend prediction. In Proc.11th ACM International Conference on Web Search and Data Mining (WSDM’18), 12 2017
Luo T (2018) Research on decision-making of complex venture capital based on financial big data platform. Complexity 2018:1–12, 12
Hyun Sik Sim, Hae In Kim, and Jae Joon Ahn. Is deep learning for image recognition applicable to stock market prediction? Complexity, 2019:1–10, 2019
Kim HY, Won CH (2018) Forecasting the volatility of stock price index: A hybrid model integrating LSTM with multiple GARCH-type models. Expert Systems with Applications 103:25–37
Wang W, Shi Y, Luo R (2019) Sparse representation based approach to prediction for economic time series. IEEE Access 7:20614–20618
Liu G, Wang X (2019) A numerical-based attention method for stock market prediction with dual information. IEEE Access 7:7357–7367
Peter J. Rousseuw. Silhouettes: a graphical aid to the interpretation and validation of cluster analysis. J Computat Appl Mathematics, 20, 1987
Acknowledgements
This work has been supported by National Natural Science Foundation of China under Grants 61772543, U1435222, 61625202, and 61272056.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Xu, Y., Yang, C., Peng, S. et al. A hybrid two-stage financial stock forecasting algorithm based on clustering and ensemble learning. Appl Intell 50, 3852–3867 (2020). https://doi.org/10.1007/s10489-020-01766-5
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-020-01766-5