Abstract
The Stock Market Prediction (SMP) has been a fascinating and challenging problem. The involvement of noisy, non-linear, and sparse features and poor feature selection degrades the prediction accuracies. The improved feature quality with an enhanced feature selection mechanism can increase the accuracy of prediction. This research article focuses on the Exploratory Data Analysis (EDA) of the Nifty50 index data of the National Stock Exchange (NSE). It proposes an enhanced hybrid feature engineering mechanism to extract the most relevant features that significantly impact SMP accuracy. This work employs Ensemble Regression Models viz. Random Forest Regression (RFR) and Extreme Gradient Boost Regression (XGBR) to predict the market trend. From the results we conclude that the proposed models achieve an improved R- squared values of 0.97 using RFR and 0.98 using XGBR, respectively.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Binkowski, M., Marti, G., Donnat, P.: Autoregressive convolutional neural networks for asynchronous time series. In: 35th International Conference on Machine Learning ICML 2018, vol. 2, pp. 933–945 (2018)
Upadhyay, A., Bandyopadhyay, G., Dutta, A.: Forecasting Stock Performance in Indian Market using Multinomial Logistic Regression (2019)
Avenue, N.: Biological Brain-Inspired Genetic Ccomplementry Learning for Stock Market and Bank Failure Prediction, vol. 23 (2007)
Asyraf, A.S., Abdul-Rahman, S., Mutalib, S.: Mining textual terms for stock market prediction analysis using financial news. Commun. Comput. Inf. Sci. 788, 293–305 (2007)
Zhang, L., Zhang, L.L., Teng, W., Chen, Y: Based on information fusion technique with data mining in the application of finance Early-Warning. Procedia Comput. Sci. 17, 695–703 (2013)
Gururaj, V., Shriya, V.R., Ashwini, K.: Stock market prediction using linear regression and support vector machines. Int. J. Appl. Eng. Res. 14, 1931–1934 (2019)
Bhuriya, D., Kaushal, G., Sharma, A., Singh, U.: Stock market predication using a linear regression. In: Proceedings of the International Conference on Electronics, Communication and Aerospace Technology ICECA 2017. 2017-January, pp. 510–513 (2017)
Kamley, S., Jaloree, S., Thakur, R.S.: Multiple regression: a data mining approach for predicting the stock market trends based on open, close and high price of the month. Int. J. Comput. Sci. Eng. Inf. Technol. Res. 3, 173–180 (2013)
Yuan, J., Luo, Y.: Test on the Validity of futures Market's high frequency volume and price on forecast. In: International Conference on Management of e-Commerce and e-Government, pp. 28–32 (2014)
Dutta, A.: Prediction of stock performance in the indian stock market using logistic regression. Int. J. Bus. Inf. 7, 105–136 (2012)
Siew, H.L., Nordin, M.J.: Regression techniques for the prediction of stock price trend. ICSSBE 2012 - Proceedings, International Conference on Statistics in Science, Business and Engineering: “Empowering Decision Making with Statistical Sciences”, pp. 99–103 (2012). https://doi.org/10.1109/ICSSBE.2012.6396535
Meesad, P., Rasel, R.I.: Predicting stock market price using support vector regression. In: 2013 International Conference Informatics, Electronics and Vision, ICIEV 2013. (2013). https://doi.org/10.1109/ICIEV.2013.6572570
Asghar, M.Z., Rahman, F., Kundi, F.M., Ahmad, S.: Development of stock market trend prediction system using multiple regression. Comput. Math. Organ. Theory 25(3), 271–301 (2019). https://doi.org/10.1007/s10588-019-09292-7
Seni, G., Elder, J.F.: Ensemble Methods in Data Mining: Improving Accuracy Through Combining Predictions (2010)
Nti, I.K., Adekoya, A.F., Weyori, B.A.: A comprehensive evaluation of ensemble learning for stock-market prediction. J. Big Data 7(1), 1–40 (2020). https://doi.org/10.1186/s40537-020-00299-5
Pavlov, Y.L.: Random forests. Random For, pp. 1–122 (2019) https://doi.org/10.1201/9780367816377-11
Khanday, A.M.U.D., Rabani, S.T., Khan, Q.R, Rouf, N., Mohi Ud Din, M.: Machine learning based approaches for detecting COVID-19 using clinical text data. Int. J. Inf. Technol. 12, 1–9 (2020)
Chen, T., Guestrin, C.: XGBoost: A scalable tree boosting system. In: Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 13–17-August, pp. 785–794 (2016). https://doi.org/10.1145/2939672.2939785
XGBoost Algorithm: Long May She Reign! | by Vishal Morde | Towards Data Science. https://towardsdatascience.com/https-medium-com-vishalmorde-xgboost-algorithm-long-she-may-rein-edd9f99be63d. Accessed on 07 Feb 2021
Ivanovski, Z., Ivanovski, Z., Narasanov, Z.: The regression analysis of stock returns at MSE. J. Mod. Account. Audit. 12(4) (2016). https://doi.org/10.17265/1548-6583/2016.04.003
Bhardwaj, A., Narayan, Y., Vanraj, Pawan, Dutta, M.: Sentiment analysis for indian stock market prediction using sensex and nifty. Procedia Comput. Sci. 70, 85–91 (2015). https://doi.org/10.1016/j.procs.2015.10.043
Khanthavit, A.: Investors’ Moods and the Stock Market (2018)
Kaleem, A.: Impact of political events on stock market returns :empirical evidence from Pakistan.(2014). https://doi.org/10.1108/JEAS-03-2013-0011
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Rouf, N., Malik, M.B., Arif, T. (2021). Predicting the Stock Market Trend: An Ensemble Approach Using Impactful Exploratory Data Analysis. In: Bhattacharya, M., Kharb, L., Chahal, D. (eds) Information, Communication and Computing Technology. ICICCT 2021. Communications in Computer and Information Science, vol 1417. Springer, Cham. https://doi.org/10.1007/978-3-030-88378-2_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-88378-2_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-88377-5
Online ISBN: 978-3-030-88378-2
eBook Packages: Computer ScienceComputer Science (R0)