Abstract
Over the past decade, extensive research on stock market prediction using machine learning models has been conducted. In this framework, different approaches for data standardisation methods have been used for financial time series analysis and to assess the impact of data standardisation on the final prediction outcome. The paper uses the feature-level optimal rolling-window batch data standardisation method to improve the machine learning model's predictive power significantly. Along with the standardisation method, the paper explores the performance of the automated feature interactions learner (Deep Cross Networks) effect on a plethora of technical indicators aiming at predicting the movements of the NIFTY50 index in India, as these predicted changes are reflected in options contracts. Feature-level optimal rolling window data standardisation can identify the optimal window of time such that the correlation between features and the response variable is maximized, with most features correlating at 0.7. In the experiment, 48% of important technical indicators negatively correlated with the response variable. The Deep Cross Network regression model combined with the optimal rolling window batch data standardisation method outperformed all other model configurations at weekly and monthly data frequency. It achieved a directional hit rate of 69.52% (weekly) and 79.17% (monthly) and root mean square error of 2.82 (weekly) and 5.01 (monthly), generating a profit 5.5 times (weekly) and 2.85 times (monthly) greater than the benchmark buy-and-hold strategy providing opposing evidence to the sub-martingale model.
Similar content being viewed by others
Notes
An Indian stock index with 50 of the largest Indian companies was launched in 1996 and accounts for 13 sectors of the Indian economy (Wikipedia 2021).
The model is retrained periodically using all the data available until that point in time (Mehtab et al., 2021).
The predicted response variable can also be used to build a one-week maturity NIFTY50 futures strategy.
References
Benediktsson, J., & Other Contributors. (2017). TA-Lib python technical analysis library. https://mrjbq7.github.io/ta-lib/doc_index.html. Accessed 28 Nov 2021.
Beutel, A., Covington, P., Jain, S., Xu, C., Li, J., Gatto, V., & Chi, E. H. (2018). Latent Cross: Making Use of Context in Recurrent Recommender Systems. WSDM’18:Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, 46–54. https://doi.org/10.1145/3159652.3159727
Caldera, H. A., & Lavanya, W. P. A. (2020). Combinatorial Impact of Technical Indicators on Price Prediction in Colombo Stock Market. 2020 20th International Conference on Advances in ICT for Emerging Regions (ICTer), 256–261. https://doi.org/10.1109/ICTer51097.2020.9325500
Cheng, H., Koc, L., Harmsen, J., Shaked, T., Chandra, T., Aradhye, H., Anderson, G., Corrado, G., Chai, W., Ispir, M., Anil, R., Haque, Z., Hong, L., Jain, V., Liu, X., & Shah, H. (2016). Wide & Deep Learning for Recommender Systems. DLRS 2016:Proceedings of the 1st Workshop on Deep Learning for Recommender Systems, 7–10. https://doi.org/10.1145/2988450.2988454
Dadhich, M., Doshi, R., Rao, S. S., & Sharma, R. (2022). Estimating and Predicting Models Using Stochastic Time Series ARIMA Modeling in Emergent Economy. In R. Kumar, C. W. Ahn, T. K. Sharma, O. P. Verma, & A. Agarwal (Eds.), Soft Computing: Theories and Applications. Lecture Notes in Networks and Systems, vol 425 (pp. 295–305). Springer Nature Singapore. https://doi.org/10.1007/978-981-19-0707-4_28
Fama, E. F. (1970). Efficient Capital Markets: A Review of Theory and Empirical Work. The Journal of Finance, 25(2), 383. https://doi.org/10.2307/2325486
Fama, E. F. (1976). Efficient Capital Markets: Reply. The Journal of Finance, 31(1), 143–145. https://doi.org/10.2307/2326404
Guo, H., Tang, R., Ye, Y., Li, Z., & He, X. (2017). DeepFM: A Factorization-Machine based Neural Network for CTR Prediction. IJCAI’17:Proceedings of the 26th International Joint Conference on Artificial Intelligence, 1725–1731. http://arxiv.org/abs/1703.04247
Jariwala, S. (2020). NSEpy Documentation. Accessed 2021–10–17. https://nsepy.xyz/
Kumari, B., & Swarnkar, T. (2020). Importance of Data Standardization Methods on Stock Indices Prediction Accuracy. In B. Pati, C. R. Panigrahi, R. Buyya, & K.-C. Li (Eds.), Advanced Computing and Intelligent Engineering. Advances in Intelligent Systems and Computing, vol 1082 (pp. 309–318). Singapore: Springer. https://doi.org/10.1007/978-981-15-1081-6_26
Lim, K.-P., & Brooks, R. (2011). The Evolution Of Stock Market Efficiency Over Time: A Survey Of The Empirical Literature. Journal of Economic Surveys, 25(1), 69–108. https://doi.org/10.1111/j.1467-6419.2009.00611.x
Liu, L., Ma, F., Zeng, Q., & Zhang, Y. (2020). Forecasting the aggregate stock market volatility in a data-rich world. Applied Economics, 52(32), 3448–3463. https://doi.org/10.1080/00036846.2020.1713291
Lo, A. W. (2004). The Adaptive Markets Hypothesis. The Journal of Portfolio Management, 30(5), 15–29. https://jpm.pm-research.com/content/30/5/15
López Padial, D., & Contributors, O. (2018). Python technical analysis library. https://technical-analysis-library-in-python.readthedocs.io/en/latest/index.html. Accessed 28 Nov 2021.
Mehtab, S., Sen, J., & Dutta, A. (2021). Stock Price Prediction Using Machine Learning and LSTM-Based Deep Learning Models. In S. M. Thampi, S. Piramuthu, K.-C. Li, S. Berretti, M. Wozniak, & D. Singh (Eds.), Communications in Computer and Information Science (Vol. 1366, pp. 88–106). Singapore: Springer. https://doi.org/10.1007/978-981-16-0419-5_8
Mishra, S. (2016). Technical Analysis and Risk Premium in Indian Equity Market: A Multiple Regression Analysis. IUP Journal of Applied Economics, 15(1), 51–68.
Nagula, P. K., & Alexakis, C. (2022). A new hybrid machine learning model for predicting the bitcoin (BTC-USD) price. Journal of Behavioral and Experimental Finance, 36, 100741. https://doi.org/10.1016/j.jbef.2022.100741
O’Hara, M. (1995). Market Microstructure Theory (1st Edition). Cambridge, MA: Blackwell Publishers Inc.
Polanco-Martínez, J. M. (2019). Dynamic relationship analysis between NAFTA stock markets using nonlinear, nonparametric, non-stationary methods. Nonlinear Dynamics, 97(1), 369–389. https://doi.org/10.1007/s11071-019-04974-y
Uma, G., Devi, B., Sundar, D., & Alli, P. (2013). An Effective Time Series Analysis for Stock Trend Prediction Using ARIMA Model for Nifty Midcap-50. International Journal of Data Mining & Knowledge Management Process, 3(1), 65–78. https://doi.org/10.5121/ijdkp.2013.3106
Van den Poel, D., Chesterman, C., Koppen, M., & Ballings, M. (2016). Equity price direction prediction for day trading: Ensemble classification using technical analysis indicators with interaction effects. IEEE Congress on Evolutionary Computation (CEC), 2016, 3455–3462. https://doi.org/10.1109/CEC.2016.7744227
Wang, R., Fu, B., Fu, G., & Wang, M. (2017). Deep & Cross Network for Ad Click Predictions. ADKDD’17:Proceedings of the ADKDD’17, 1–7. https://doi.org/10.1145/3124749.3124754
Wikipedia. (2021). NIFTY50 - Wikipedia. Accessed November 12, 2021. https://en.wikipedia.org/wiki/NIFTY_50
Yaohao, P., & Albuquerque, P. H. M. (2019). Non-Linear Interactions and Exchange Rate Prediction: Empirical Evidence Using Support Vector Regression. Applied Mathematical Finance, 26(1), 69–100. https://doi.org/10.1080/1350486X.2019.1593866
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Nagula, P.K., Alexakis, C. A Novel Machine Learning Approach for Predicting the NIFTY50 Index in India. Int Adv Econ Res 28, 155–170 (2022). https://doi.org/10.1007/s11294-022-09861-8
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11294-022-09861-8
Keywords
- Efficient Market
- Machine Learning
- Technical Indicators Interactions
- Deep Cross Networks
- Rolling Window Data Standardisation
- NIFTY50