1 Introduction

People are always looking for ways to invest their capital, and the stock market is one of the main venues for doing so. However, stock markets are exposed to various risks, so investors need to forecast stock prices, which depend on several psychological, economic and other factors. Accordingly, numerous methods have been developed to predict stock prices, aiming to forecast index values or the prices of individual stocks (Lah et al. 2019). These methods require different considerations depending on the quality and quantity of the available data. Technical analysis, fundamental analysis and statistical methods are all used for stock price prediction. One of the main hypotheses that should be considered, and preferably tested, is the efficient market hypothesis (EMH) (Malkiel 1989, 2003). The EMH states that information has a strong impact on stock prices and that prices adjust themselves according to this information (Greco et al. 2019). An efficient market assures investors that they all have access to the same information (Naseer and Bin Tariq 2015). It rests on the assumption that no system can beat the market: if such a system became widely known, everybody would use it, which would negate its potential profitability.

Time series analysis is a principal method used for predicting share prices. It deals with analyzing a series of data gathered over time. Time series are common in fields such as economics, finance and healthcare (Bisgaard and Kulahci 2011). The method tries to forecast the future by assuming that previously observed patterns form the foundation for extracting future behavior (Shin 2017). Heuristic algorithms are another set of methods used for prediction. They are often employed as an alternative to exact optimization methods and usually aim to find a good feasible solution without any assurance of optimality (Kaveh and Ghazaan 2018). Heuristic algorithms are applicable to decision problems with complex structures whose characteristics take a long time to identify. A further class is the metaheuristic algorithms: higher-level strategies applied on top of heuristic algorithms, which allow heuristics to be used on a large number of problems. A metaheuristic does not depend on the characteristics of a particular model and is compatible with many models and solution representations (Osman and Kelly 1996; Talbi 2009). In cases where the set of solutions is too large to be sampled completely, metaheuristics examine a subset of these solutions. Since metaheuristics are usually developed based on a limited set of assumptions, they can be applied to a variety of problems (Blum and Roli 2003).

Compared with exact methods, there is no guarantee that metaheuristics will find the global optimum of an optimization problem (Blum and Roli 2003; Khosravanian et al. 2018). Metaheuristic algorithms are applied to solve difficult and complicated problems in affordable time, and they usually find acceptable rather than optimal solutions for such problems (Talbi 2009). Gogna and Tayal (2013), Abdel-Basset et al. (2018) and Wong and Ming (2019) are examples of studies reviewing the applications of metaheuristic algorithms in different fields.

Another method is the artificial neural network (ANN), which is inspired by the functioning of the human brain. ANN is a subset of artificial intelligence (AI) and is applicable in contexts such as pattern recognition, classification and regression. Because most financial data are nonlinear and asymmetric, an ANN can capture the underlying relationships well.

This paper aims to predict stock prices using an ANN. The developed ANN is trained with metaheuristic algorithms, namely social spider optimization (SSO) and the bat algorithm (BA). A group of technical indicators is used as input variables. A genetic algorithm (GA) is employed for feature selection, i.e. choosing the most relevant indicators. Several loss functions are used as error assessment criteria.

To evaluate the performance of the proposed hybrid algorithms, the obtained results are compared with those of ARIMA as a time series model for stock price prediction. This evaluation and comparison are carried out on five major international indices: S&P500, DAX, FTSE100, NASDAQ and DJI. The paper is structured as follows: Section 2 reviews the available literature. Section 3 describes the ANN structure and the proposed algorithms. In Sect. 4, ARIMA is used for time series forecasting. Sections 5 and 6 present the experimental process and the results. Finally, Sect. 7 concludes the paper. Additional results are given in Appendices A and B.

2 Literature review

The stock market is a place where investors can buy or sell parts of companies' assets in the form of shares (Preethi and Santhi 2012). The market can be seen as a pulse of the economic activities of a country, and it can offer high profits to investors seeking to grow their capital and wealth. Stock markets are characterized by nonlinearity, discontinuity and volatile multifaceted elements, because many factors affect them, such as general economic conditions, political actions and brokers' expectations (Hadavandi et al. 2010). Considering the amount of fluctuation in this market, a rapid decision-making process is required; it is therefore very important that transactions be completed in the shortest possible time (Barakat et al. 2016). Obtaining maximum profit is the ultimate goal of investors. As a result, many researchers have sought market forecasting capabilities in a variety of ways (Prasanna and Ezhilmaran 2013). According to previous studies, the ANN appears to be a reasonably well-validated method for stock price prediction (Idris et al. 2015). The three most popular ANNs for stock prediction are the recurrent neural network (RNN) (Saad et al. 1998), the radial basis function (RBF) network (Han et al. 2001) and the multilayer perceptron (MLP). There are many methods for training an ANN, and some are better than others at capturing linear and nonlinear relationships. An ANN uses two threshold (activation) functions to capture linear and nonlinear characteristics. The number of neurons per layer is very important for predictability: with too many, the network becomes overly complicated and may fail to find the fittest solution, while with too few, it may be unable to capture nonlinear relationships and find a global solution. Researchers have therefore tried to discover methods that combine high speed with high accuracy and low error; for this reason, metaheuristic algorithms are used. These methods serve to optimize the network and to find the best number of input variables and hidden neurons. In forecasting stock prices, stock returns, exchange rates, inflation and imports, ANN models work better than traditional statistical models (Yim and Mitchell 2002).

Gupta and Wang (2010) used feed-forward neural networks to forecast and trade the futures index prices of the Standard and Poor's 500 (S&P 500). The effect of training the network with the most recent data, together with gradually subsampled past index data, was studied in this research. They also studied the effect of past NASDAQ 100 data on the prediction of the future S&P 500. A daily trading strategy was used to buy or sell according to the predicted prices and hence to calculate the directional efficiency and the rate of return for different periods. They obtained significantly higher returns compared to earlier work. Numerous exchange-traded funds (ETFs) attempt to replicate the performance of the S&P 500 by holding the same stocks in the same proportions as the index, thereby giving the same percentage returns as the S&P 500.

Zhu and Wang (2010) proposed an intelligent trading system using support vector regression optimized by genetic algorithms (SVR-GA) and a multilayer perceptron optimized with GA (MLP-GA). Experimental results showed that both approaches outperformed conventional trading systems without prediction, as well as a recent fuzzy trading system, in terms of final equity and maximum drawdown for the Hong Kong Hang Seng stock index.

He et al. (2013) conducted research on the principles and theories of financial markets, studying and practicing basic technical analysis methodologies for the stock market with the help of feature selection algorithms. They used data of the Shanghai Stock Exchange Composite Index (SSECI) from 24 March 1997 to 23 August 2006 to compute twelve technical indicators for later research. The twelve chosen technical indicators were calculated, and the results were taken as the input of the feature selection algorithms. Three kinds of feature selection algorithms were studied: principal component analysis (PCA), the genetic algorithm (GA) and sequential forward selection (SFS). According to the results and analysis, PCA was the most reliable but might be time-consuming if the input has very high dimensionality. The genetic algorithm performed better in such situations, since it takes advantage of randomness. SFS could generate locally optimal solutions, but with a risk of the "nesting problem".

Dong et al. (2013) first reproduced the one-step-ahead prediction system of Phua et al. for stock price prediction. They then made some modifications and successfully outperformed the original prediction system in terms of MSE, hit rate and absolute error. They also explored a more difficult multi-step prediction problem: they first reproduced a multi-step prediction system using a simple recursive algorithm and then proposed an error-constraint algorithm to obtain better weights and biases, as well as smaller accumulated errors. The results outperformed the simple recursive algorithm by observation.

Zheng et al. (2013) explored the application to stock prediction of a wavelet neural network (WNN), whose hidden layer comprised neurons with adjustable wavelets as activation functions. They discussed some basic rationales behind technical analysis, based on which the inputs of the prediction system were carefully selected. The system was tested on the Istanbul Stock Exchange National 100 Index and compared with traditional neural networks. The results showed that the WNN could achieve very good prediction accuracy.

Fang et al. (2014) improved stock market prediction based on genetic algorithms (GA) and wavelet neural networks (WNN) and reported significantly better accuracies compared to existing approaches to stock market prediction, including the hierarchical GA (HGA) WNN. Specifically, they added information such as trading volume as inputs and they used the Morlet wavelet function instead of Morlet–Gaussian wavelet function in their prediction model. They also employed a smaller number of hidden nodes in WNN compared to other research work. The prediction system was tested using Shenzhen Composite Index data.

Lim et al. (2016) used delayed neural network models to predict public housing prices in Singapore. The delayed neural networks are used to estimate the trend of the resale price index (RPI) of Singapore housing from the Singapore Housing Development Board (HDB), with nine independent economic and demographic variables. The results show that the delayed neural network model is able to produce a good fit and predictions.

Göçken et al. (2016) predicted the Turkish stock price index using technical indicators and hybrid ANNs based on GA and harmony search (HS). The results showed that the error of the hybrid metaheuristic algorithms is lower than that of the plain ANN. Comparing the hybrid ANN-HS and ANN-GA models, they found that the error of ANN-HS is lower than that of ANN-GA.

To address the problem of features with similar contributions, the feature-weighted SVM (FWSVM) and feature-weighted K-nearest neighbor (FWKNN) were proposed to forecast stock market indices by assigning different weights to different features (Chen and Hao 2017). The model was tested on two stock markets, and the comparison showed that FWSVM and FWKNN perform better than the non-weighted models.

Ghasemiyeh et al. (2017) optimized an artificial neural network with metaheuristic algorithms. In their research, cuckoo search, improved cuckoo search, enhanced cuckoo search, the genetic algorithm and particle swarm optimization (PSO) were examined. Testing these hybrid algorithms with 28 input variables, the results showed that PSO outperformed the other algorithms in their study.

Goli et al. (2018) used various metaheuristic algorithms to improve demand prediction in the dairy industry. Their study employed two well-known metaheuristic algorithms, GA and PSO, together with two more recent ones, invasive weed optimization (IWO) and the cultural algorithm (CA), for feature selection and demand forecasting. According to the results, PSO showed the best performance in feature selection, while IWO significantly improved the prediction error.

Sin and Wang (2017) explored the relationship between the features of Bitcoin and the next-day change in its price using an artificial neural network ensemble approach called the genetic algorithm-based selective neural network ensemble, constructed using a multilayer perceptron as the base model for each neural network in the ensemble. To assess its practicality and effectiveness in real-world application, the ensemble was used to predict the next-day direction of the Bitcoin price given approximately 200 features of the cryptocurrency over a span of 2 years. Over a span of 50 days, a trading strategy based on the ensemble was compared through back-testing against a "previous day trend following" trading strategy. The ensemble-based strategy generated almost 85% returns, outperforming the trend-following strategy, which produced approximately 38% returns, and a strategy following the single best MLP model in the ensemble, which generated approximately 53% returns.

Chong et al. (2017) applied three methods, PCA, the restricted Boltzmann machine (RBM) and the autoencoder, for feature extraction in a deep learning network, with loss functions including root-mean-squared error (RMSE), mean absolute error (MAE), mutual information (MI) and normalized mean squared error (NMSE), to predict future market trends in South Korea. Sezer et al. (2017) employed GA in a stock trading system based on a deep neural network (DNN) to generate buy–sell–hold signals; GA was used for feature selection and to generate the buy–sell points in the system. Later, Dixon (2018) used a long short-term memory (LSTM) network to forecast short-term price movements.

Zhang et al. (2018) designed a system for predicting stock price trends that could forecast stock price movement and its increase or decrease interval over predetermined periods. They trained a random forest model on historical data from the Chinese market to categorize multiple stock clips into four major groups according to their closing prices. The results indicate improved prediction of market volatility, together with merits such as precision and return per trade.

Baek and Kim (2018) proposed a framework, entitled ModAugNet, consisting of two LSTM-based modules: one for overfitting prevention and one for prediction. The framework was tested on two Korean stock datasets, and the obtained results show an improvement across different error measures.

Ahmed et al. (2019) used ant colony optimization (ACO) to forecast stock prices on the Nigerian stock exchange. They compared ACO with three other techniques, the price momentum oscillator, the stochastic oscillator and the moving average, and concluded that ACO achieves higher accuracy and lower error than the other methods. Ghanbari and Arian (2019) used support vector regression (SVR) and the butterfly optimization algorithm (BOA) for stock market forecasting. They presented a new BOA-SVR model and compared it with 11 metaheuristic algorithms on NASDAQ data. The results indicated that the model improves the results by optimizing the SVR parameters; moreover, it performed very well, with higher accuracy and lower time consumption than the other models. Chandana (2019) used a novel approach based on least squares support vector regression (LSSVR) and machine learning, designing an expert system for stock price prediction intended to strengthen the forecast by improving accuracy. The system was successful because it required fewer computations and simpler calculations. Rajesh et al. (2019) used ensemble learning techniques for stock trend prediction, concentrating on the stock price change percentage. They predicted the S&P500 and its future trend with ensemble learning, considering two forecasting tools: ensemble learning and heat maps. The evidence suggests that the support vector machine (SVM), random forest and K-nearest neighbor classifiers give more promising results than other methods. The accuracy of the forecast model is above 51%, which represents a 23% increase in prediction accuracy.

Pal and Kar (2019) used a hybrid approach to forecast stock price time series, employing data discretization based on fuzzistics, where a cumulative probability distribution approach (CPDA) is used to obtain the intervals for the linguistic values. First-order fuzzy rule generation and reduction of the rule sets by rough set theory were performed. Thereafter, the time series forecast is computed by defuzzification using the reduced rule base and its historical evidence. The proposed approach was applied to the closing prices of three stock index time series (BSE, NYSE, and TAIEX) as experimental data sets, and the results show that the method is more effective than its counterparts.

Liu and Wang (2019), in order to address the profit bias in model evaluation, proposed a new effective metric, the mean profit rate (MPR). The effectiveness of the metric was measured by the correlation between the metric value and the profit of the model. Experiments on daily data of five stock indices from four countries showed that MPR outperformed the classification metrics in correlating with profit. In view of these findings, they suggested that MPR is a more effective metric than the classification metrics for stock trend prediction.

Lv et al. (2019) assessed different types of machine learning algorithms with respect to trading cost, comparing traditional algorithms and advanced DNN models on data for different index component stocks over the period 2010–2017. The traditional machine learning algorithms were random forest, naïve Bayes, logistic regression, classification and regression tree (CART), SVM and extreme gradient boosting, while the DNN architectures included the deep belief network (DBN), multilayer perceptron (MLP), RNN, stacked autoencoders (SAE), LSTM and the gated recurrent unit (GRU). Their results indicated that which algorithm is superior depends on transaction cost: ignoring transaction cost, traditional machine learning algorithms perform better on many directional assessment indices, whereas DNN models perform better once transaction cost is taken into account.

Zaman (2019) examined the efficiency of Bangladesh's largest stock markets by conducting parametric and nonparametric tests on DSE and CSE data from 2013 to 2017. The results showed that the two stock exchanges are not even weak-form efficient.

Zhou et al. (2020) investigated the power of SVM in predicting the direction of stock price changes. They used five different data sources, including technical indices, stock posts, transaction data records, news and the Baidu index, and concluded that there are different ideal data sources for forecasting active and inactive stocks. They also found that more active stocks can be predicted with higher accuracy over different periods of time.

Sahoo and Mohanty (2020) proposed a combination of ANN and the gray wolf optimization (GWO) technique and compared the hybrid ANN-GWO with a plain ANN. They compared these models on a dataset from the Bombay stock exchange covering 2004 to 2018. The performance of ANN-GWO and ANN was evaluated according to different error measures, and the comparison shows that the hybrid method yields better results than the ANN model.

Kumar et al. (2020) reviewed and organized the published papers on stock market prediction using computational intelligence. The papers are organized according to the datasets used, input variables, pre-processing methods, techniques used for feature selection, forecasting methods and the performance metrics used to evaluate the models.

From the papers reviewed above, it can be inferred that stock market prediction remains an active research topic. It also appears that hybrid methods are the prevalent approach in different studies. Given the acceptance of ANN-based methods, the focus here is to enhance the performance of the ANN through metaheuristics. Limitations of the previous methods are summarized in Table 1 (Obthong et al. 2020).

Table 1 Limitations of the previous methods

3 Hybrid metaheuristic ANN for stock price prediction

3.1 Technical indicators

An ANN consists of three layers, the first of which is the input layer. Here, some important technical indices are used as the input variables of the network. Indicators are mathematical functions based on specific formulas for analyzing stock prices or market indices, often with graphical tools. Investors and managers can use them to analyze the stock market. Choosing the best and most relevant technical indicators is a controversial issue; to deal with this challenge, GA is used for feature selection. The considered technical indicators are listed in Table 2; a sketch of two representative indicator computations follows the table.

Table 2 Important technical indicators
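As an illustration of how such indicators are computed, the sketch below implements two widely used ones (RSI and MACD) in Python, assuming `close` is a pandas Series of daily closing prices; the exact formulas and parameter choices used in this study may differ from these common defaults.

```python
# Illustrative computation of two indicators of the kind listed in Table 2,
# assuming `close` is a pandas Series of daily closing prices.
import pandas as pd

def rsi(close: pd.Series, period: int = 14) -> pd.Series:
    """Relative Strength Index over a simple rolling window."""
    delta = close.diff()
    gain = delta.clip(lower=0).rolling(period).mean()
    loss = (-delta.clip(upper=0)).rolling(period).mean()
    rs = gain / loss
    return 100 - 100 / (1 + rs)

def macd(close: pd.Series, fast: int = 12, slow: int = 26, signal: int = 9):
    """MACD line and its signal line from exponential moving averages."""
    ema_fast = close.ewm(span=fast, adjust=False).mean()
    ema_slow = close.ewm(span=slow, adjust=False).mean()
    macd_line = ema_fast - ema_slow
    signal_line = macd_line.ewm(span=signal, adjust=False).mean()
    return macd_line, signal_line
```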

3.2 Artificial neural network (ANN)

Today, ANNs are used for many different problems. Well-known applications include function approximation, classification and clustering, data storage and retrieval, and optimization (Versace et al. 2004). ANNs can be used for a variety of tasks, including time series forecasting. Because stock price data are not normally distributed and exhibit characteristics such as skewness, kurtosis, fat tails and nonlinearity, an ANN can be used to capture these qualities. As mentioned earlier, a typical ANN includes three layers: (1) input, (2) hidden and (3) output.

The number of neurons in each layer is important, because changing it makes the network react differently. Thus, GA is applied for choosing the important variables. GA is used for feature selection for several reasons: (1) conceptual simplicity; (2) it searches a wide area of the solution space instead of examining a single point; (3) it supports multi-objective optimization; (4) it is a stochastic process and robust to local minima/maxima; and (5) it is easily parallelized (Oreski et al. 2012). In this way, the calculation speed is increased and the network is also prevented from getting trapped in local minima or maxima.

A neural network is based on learning, meaning that it iteratively tries to reduce its error through trial and error. The network has three phases: (1) training, (2) validation and (3) testing. This study includes two main parts. The first involves calculating the technical indicators and selecting the most informative ones using GA. The second involves forecasting the closing price using different hybrid ANN models and comparing their prediction errors. Two metaheuristic algorithms, SSO and BA, are used, since they have produced successful results in various fields such as stock price and interest rate prediction; moreover, they have useful properties, including their approximate and usually non-deterministic nature, and they are flexible and not problem-specific (Beheshti and Shamsuddin 2013). Stock price data from 2013 to 2018 are split into two sections, training and testing, and are then analyzed with the artificial intelligence algorithms to forecast the next day's closing stock price. Following Göçken et al. (2016), 70% of the observations are used for training and the remaining 30% for testing and validation. Models are compared based on eight prediction error criteria. Different algorithms can be used for training an ANN, e.g. gradient descent backpropagation (Mozer et al. 1995), Levenberg–Marquardt (LM) backpropagation (Hao and Wilamowski 2011), BFGS quasi-Newton backpropagation (Fahad et al. 2018) and Bayesian regularization backpropagation (Burden and Winkler 2008).

In this study, the number of hidden-layer neurons of the plain neural network is determined by trial and error and is not fixed. Owing to a feature of the MATLAB software used, the number of hidden layers is fixed at 1, which can be considered a limitation. To this end, 1–32 neurons are examined in the hidden layer, and the number of neurons giving the highest accuracy is chosen for the ANN model. Error backpropagation is used for training the ANN, with the LM algorithm as the minimization algorithm (Haddad and Haghighat Monfared 2012). The number of training epochs is one thousand, increased to 2000 to improve the results, and the initial learning rate is set to 0.01 and decreased to 0.001 to improve the accuracy of the results. The ANN has two threshold functions for capturing the linear and nonlinear characteristics of the model. The activation function of the hidden layer is the tangent sigmoid (tanh), a mathematically shifted version of the sigmoid that combines features of both functions, while the threshold function of the output layer is the pure linear function. We used the tanh function for several reasons: (1) the range of our normalized values lies within [−1, 1]; (2) it almost always works better than the sigmoid function; and (3) it is able to learn and perform more complex, nonlinear tasks. Its hidden-layer outputs have a mean of 0 or very close to it, which helps center the data and makes learning for the next layer much easier.
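The following is a minimal sketch of this trial-and-error search over 1–32 hidden neurons, using scikit-learn's MLPRegressor with a tanh hidden layer and a linear output. Note that scikit-learn offers L-BFGS and Adam rather than the Levenberg–Marquardt optimizer used in MATLAB, so this is analogous to, not identical with, the setup described above.

```python
# A sketch of the hidden-layer-size search; assumes NumPy arrays for the data.
import numpy as np
from sklearn.neural_network import MLPRegressor
from sklearn.metrics import mean_squared_error

def fit_best_ann(X_train, y_train, X_val, y_val, max_neurons=32):
    """Try 1..max_neurons tanh neurons in one hidden layer; keep the lowest-MSE net."""
    best_net, best_mse = None, np.inf
    for n in range(1, max_neurons + 1):
        net = MLPRegressor(hidden_layer_sizes=(n,), activation="tanh",
                           solver="lbfgs", max_iter=2000, random_state=0)
        net.fit(X_train, y_train)
        mse = mean_squared_error(y_val, net.predict(X_val))
        if mse < best_mse:
            best_net, best_mse = net, mse
    return best_net, best_mse
```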

The architecture of the proposed neural network is represented in Fig. 1.

Fig. 1
figure 1

The structure of the desired artificial neural network (Ghasemiyeh 2017)

Here, input variables are illustrated with 20 technical variables. These variables are normalized to be used as input variables using Eq. (1).

$$ \widetilde{{S_{i} }} = \frac{{S_{i} - S_{min} }}{{S_{max} - S_{min} }},\quad i = 1, \ldots ,N $$
(1)

where \({S}_{i}\) is the ith observation and \({S}_{min}\) and \({S}_{max}\) are the minimum and maximum of the series. The goal of normalization is to bring the values of the dataset to a common scale without distorting differences in their ranges; it generally speeds up learning and leads to faster convergence. Figure 2 represents the research methodology.
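A one-line realization of Eq. (1), assuming `S` is a NumPy array of raw values for one indicator:

```python
# Min-max normalization of Eq. (1); `S` is a NumPy array of raw indicator values.
import numpy as np

def min_max_normalize(S: np.ndarray) -> np.ndarray:
    """Rescale a series to a common [0, 1] scale, preserving relative differences."""
    return (S - S.min()) / (S.max() - S.min())
```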

Fig. 2
figure 2

Research methodology

3.3 GA-ANN forecasting model

To select the input variables, GA is used. GA is a stochastic search algorithm inspired by natural evolution (Kuo and Han 2011; Saber et al. 2013). Generally, GA seeks an approximately optimal solution by coding and decoding a population of solutions and reproducing them through crossover and mutation, its main operators. In this study, inputs are coded using binary variables. The chromosomes are defined to contain 26 bits. Of these, 21 bits are associated with the existence (bit value 1) or nonexistence (bit value 0) of input variables (technical indicators), and 5 additional bits encode the number of neurons in the hidden layer (\(2^{5} = 32\) possible values). The population size of the GA is 20 (Davallou and Azizi 2017; Kai and Wenhua 1997). The initial population is formed stochastically. The technical indicators and the hidden-layer size are passed to the GA, which uses the ANN as its fitness function and returns the resulting MSE as output. The fittest individual is the one with the lowest MSE. To increase the training speed, the number of epochs is set to 100. As mentioned, 70% of the data are employed for training and 30% for testing and validation. Table 3 lists the parameters of the genetic algorithm; a decoding sketch follows the table.

Table 3 Parameters of GA
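The sketch below shows how such a 26-bit chromosome can be generated and decoded; the exact bit layout (feature bits first, size bits last) is our assumption for illustration.

```python
# Decoding a GA chromosome as described above: 21 feature bits plus 5 bits
# encoding the hidden-layer size (1-32). The bit ordering is assumed.
import random

N_FEATURES, N_SIZE_BITS = 21, 5

def random_chromosome():
    """A random 26-bit individual."""
    return [random.randint(0, 1) for _ in range(N_FEATURES + N_SIZE_BITS)]

def decode(chrom):
    """Return the indices of the selected indicators and the hidden-layer size."""
    selected = [i for i, bit in enumerate(chrom[:N_FEATURES]) if bit == 1]
    # Interpret the last 5 bits as a binary number in 0..31, shifted to 1..32.
    n_hidden = int("".join(map(str, chrom[N_FEATURES:])), 2) + 1
    return selected, n_hidden
```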

Figure 3 illustrates the proposed GA-ANN algorithm.

Fig. 3
figure 3

Considered GA flow chart for training ANN (Liu and Wang 2019)

Following Göçken et al. (2016), roulette wheel selection is used for choosing parents, and the crossover rate is set to 80% with one-point crossover. A mutation rate of 20% with binary (bit-flip) mutation is used. Selecting the best chromosomes among parents and children, a new generation is formed, and the algorithm repeats until a termination condition is satisfied. The two termination conditions used are (1) the best individual remaining unchanged for 100 generations, and (2) reaching the maximum generation limit, i.e. 2000 generations. Parameters such as the mutation rate, crossover rate and population size have been set following Göçken et al. (2016). Although different problems have different properties (e.g. scalability or dimension dependence), there are common recommendations for these parameter ranges in the literature: for example, a population size between 20 and 50, a crossover rate between 80 and 95%, and a mutation rate between 0.5 and 1% (Hassanat et al. 2019).

The GA pseudo-code (i.e. the steps and how the parameters are obtained) is illustrated in Table 4; a code sketch of the main loop follows the table.

Table 4 GA-ANN pseudo-code
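Below is a minimal sketch of the loop in Table 4, assuming `fitness(chromosome)` trains the ANN on the decoded chromosome and returns its MSE; the 100-generation stagnation criterion is omitted for brevity.

```python
# A sketch of the GA loop: roulette-wheel selection, one-point crossover
# (rate 0.8) and bit-flip mutation (rate 0.2), with elitist replacement.
import random

def roulette(pop, fits):
    # Lower MSE is better, so invert the error for fitness-proportional sampling.
    weights = [1.0 / (1e-9 + f) for f in fits]
    return random.choices(pop, weights=weights, k=2)

def crossover(p1, p2, rate=0.8):
    if random.random() < rate:                      # one-point crossover
        cut = random.randint(1, len(p1) - 1)
        return p1[:cut] + p2[cut:], p2[:cut] + p1[cut:]
    return p1[:], p2[:]

def mutate(chrom, rate=0.2):
    return [1 - b if random.random() < rate else b for b in chrom]

def run_ga(fitness, pop_size=20, chrom_len=26, generations=2000):
    pop = [[random.randint(0, 1) for _ in range(chrom_len)] for _ in range(pop_size)]
    fits = [fitness(c) for c in pop]
    for _ in range(generations):
        children = []
        while len(children) < pop_size:
            c1, c2 = crossover(*roulette(pop, fits))
            children += [mutate(c1), mutate(c2)]
        child_fits = [fitness(c) for c in children]
        # Elitist replacement: keep the best individuals among parents and children.
        ranked = sorted(zip(pop + children, fits + child_fits), key=lambda t: t[1])
        pop = [c for c, _ in ranked[:pop_size]]
        fits = [f for _, f in ranked[:pop_size]]
    return pop[0], fits[0]   # best chromosome and its MSE
```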

3.4 Bat algorithm (BA)

Inspired by the echolocation behavior of microbats, the bat algorithm (BA) was proposed as a metaheuristic optimization algorithm (Iglesiasa et al. 2020). Mirjalili et al. (2014) demonstrated the superiority of BA over some other algorithms, such as GA and PSO. The echolocation of microbats is simulated through several parameters, such as velocity, location, frequency and loudness (Gàlvez and Iglesias 2016): every virtual bat flies stochastically with velocity \(\upsilon_i\) at location (solution) \(x_i\), with a frequency between \(f_{min}\) and \(f_{max}\), varying wavelength \(\lambda\) and loudness \(A_0\). While searching for prey, depending on the proximity of the target, the frequency and loudness are changed and the pulse emission rate r is adjusted (Yang 2010). Exploration is strengthened by a local random walk, and selection of the best solutions continues until the termination criteria are reached (Nawi et al. 2014). A frequency-tuning technique is used to control the dynamic behavior of the swarm of bats, and tuning the algorithm parameters balances exploration and exploitation (Yang 2010).

The loudness may change in different ways; it can be assumed that it decreases from a large positive value \(A_0\) to a fixed minimum value \(A_{min}\). BA starts with a random population of bats, and the following formulas are then used at each step to update the location of each bat:

$$ \upsilon_{i}^{new} = \upsilon_{i}^{old} + \left( {x_{i} - x_{best} } \right) \times f_{i} $$
(2)
$$ x_{i}^{new} = x_{i}^{old} + \upsilon_{i}^{new} $$
(3)
$$ f_{i} = f_{min} + \varphi_{1} \times \left( {f_{max} - f_{min} } \right) $$
(4)

where \({x}_{best}\) is the position of the best bat, \({\varphi }_{1}\) is a random value in [0, 1], and \({f}_{max}\) and \({f}_{min}\) are the maximum and minimum frequency values, here assumed to be 1 and 0, respectively. The initial frequency of each bat is drawn from the range \([{f}_{min}, {f}_{max}]\). \({f}_{i}\) is applied to control the velocity and the bats' movement range (Nawi et al. 2014).

Afterwards, in the local search, each bat uses a random walk to create a new candidate solution. To this end, each bat draws a random number \(\beta \). If \(\beta \) is greater than the pulse emission rate, the new solution is generated by the local random walk of Eq. (5); otherwise it is generated by Eqs. (6)–(8) (Tsai et al. 2014; Chou and Nguyen 2018).

$$ x_{i}^{new} = x_{i}^{old} + e\overline{A}^{old} $$
(5)

where e is a random value in [−1, 1] and \(\overline{A}^{old}\) denotes the mean loudness of all bats. To improve the generated solution in the case where \(\beta \) is not greater than the pulse emission rate, a modification method is presented.

The main objective of this modification is to increase the diversity of the bat population through mutation and crossover, which helps enhance the search efficiency. Thus, for each bat \({x}_{i}\), three bats \(\left({x}_{k1}, {x}_{k2}, {x}_{k3}\right)\) are selected randomly such that \(i\ne k1\ne k2\ne k3\). Using the mutation and crossover operators, the two improved solutions below are produced:

$$ X_{opt1} = X_{k1} + a_{1} \left( {X_{k2} - X_{k3} } \right) $$
(6)
$$ X_{opt1} = \left[ {X_{opt1,1} , X_{opt1,2} , \ldots , X_{opt1,n} } \right] $$
(7)

n is the dimension of this problem.

$$ X_{opt2} = \left\{ {\begin{array}{*{20}c} {x_{best,i} } & {\quad {\text{if}}\,a_{2} < a_{3} } \\ {x_{i} } & {\quad {\text{otherwise}}} \\ \end{array} } \right. $$
(8)
$$ X_{best} = [X_{best,1} , X_{best,2} , \ldots , X_{best,n} ] $$
(9)

where \({a}_{1}, {a}_{2}\) and \({a}_{3}\) are random numbers in the [0, 1] interval. The best among \({X}_{opt1}, {X}_{opt2}\) and \({X}_{i}\) replaces \({X}_{i}\). If \(\beta <{A}_{i}\) and the new solution has a better fitness value, it is accepted. Upon accepting the new solution, the loudness and the pulse emission rate are updated as follows:

$$ A_{i}^{new} = a \cdot A_{i}^{old} $$
(10)
$$ r_{i}^{new} = r_{i}^{0} \cdot \left[ {1 - exp\left( { - \gamma *t} \right)} \right] $$
(11)

Here, \(a\) and \(\gamma \) are constants, \({r}_{i}^{0}\) is the initial pulse emission rate, and t is the iteration number. In this study, the BA described above is used to optimize the weight matrix of the ANN. In BAT-ANN, the initial population of bats first forms the initial weight matrix, which is then passed to the ANN to start the training phase (Hafezi et al. 2015). BA then identifies the best solution based on the neural network results. A local search is performed to discover new solutions, and the replacement of newly accepted solutions for the best known solution is repeated until the termination criteria are satisfied (Yang 2010). Finally, the optimal values of the weight matrix are determined. Figure 2 shows the flowchart of BAT-ANN.
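The following is a minimal sketch of the core BA updates (Eqs. (2)–(5), (10)–(11)), minimizing a generic objective in place of the ANN training error; the parameter values (30 bats, initial loudness 1.0, initial pulse rate 0.5, a = γ = 0.9) are illustrative assumptions.

```python
# A sketch of the bat algorithm; `objective` maps a 1-D NumPy array to a float.
import numpy as np

def bat_algorithm(objective, dim, n_bats=30, iters=1000,
                  f_min=0.0, f_max=1.0, a=0.9, gamma=0.9):
    rng = np.random.default_rng(0)
    x = rng.uniform(-1, 1, (n_bats, dim))       # positions (candidate solutions)
    v = np.zeros((n_bats, dim))                 # velocities
    A = np.ones(n_bats)                         # loudness A_i
    r0 = np.full(n_bats, 0.5)                   # initial pulse emission rates
    r = r0.copy()
    fit = np.apply_along_axis(objective, 1, x)
    best = x[int(fit.argmin())].copy()
    for t in range(1, iters + 1):
        for i in range(n_bats):
            f_i = f_min + rng.random() * (f_max - f_min)    # Eq. (4)
            v[i] += (x[i] - best) * f_i                     # Eq. (2)
            cand = x[i] + v[i]                              # Eq. (3)
            if rng.random() > r[i]:
                # Local random walk scaled by the mean loudness, cf. Eq. (5).
                cand = best + rng.uniform(-1, 1, dim) * A.mean()
            f_cand = objective(cand)
            if rng.random() < A[i] and f_cand < fit[i]:     # accept new solution
                x[i], fit[i] = cand, f_cand
                A[i] *= a                                   # Eq. (10)
                r[i] = r0[i] * (1 - np.exp(-gamma * t))     # Eq. (11)
        best = x[int(fit.argmin())].copy()
    return best, float(fit.min())
```

For instance, `bat_algorithm(lambda z: float((z ** 2).sum()), dim=10)` minimizes a sphere function; in BAT-ANN the objective would unpack the vector into the ANN weight matrix and return the training MSE.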

It should be noted that the calculation method is adapted from Yang (2010), Golmaryami et al. (2015), and Jantan et al. (2017).

Table 5 summarizes the notation used for parameters of BA.

Table 5 Bat algorithm parameters

As in the GA-ANN algorithm, parameters such as the pulse rate and velocity have been set based on previous research, such as Golmaryami et al. (2015) and Hafezi et al. (2015).

The process steps of the bat algorithm are shown in Table 6.

Table 6 BA pseudo-code

3.5 Social spider optimization (SSO)

Social spider optimization (SSO) belongs to the family of metaheuristic, evolutionary and swarm intelligence algorithms and models the lifestyle of social spiders, male and female (Mirjalili et al. 2015). Depending on their gender, spiders perform different tasks, such as mating, preying, web design and social interaction (Luque-Chang et al. 2018). A problem may have several candidate solutions that must be sought in a search space; in this algorithm, the communal web can be regarded as the search space, and the spiders' positions play the role of solutions (Evangeline and Abirami 2019). The web and its vibrations are very important for the spiders: through vibrations transmitted along the thin strings of the web, they sense when prey is trapped and exchange details about mating (Reddy et al. 2019). Two things determine a vibration: weight and distance. Spiders update their weights according to a fitness value and execute operations such as mating accordingly. As in the genetic algorithm, which is based on the superiority of better individuals, an offspring with a better weight replaces a weaker one; otherwise the population does not change. At the end of all iterations, the best spider with the best fitness is taken as the optimal solution (Yeh 2012). In training the ANN with the SSO algorithm, the best spider plays the role of the optimal solution. Here, to determine the fitness value of a spider, the minimization of MSE is considered as the objective function.

Like other metaheuristic algorithms, SSO has several steps and parameters.

3.5.1 Initialization

Like any other swarm intelligence and evolutionary algorithm, the SSO algorithm begins by assigning initial values to the population and the spider locations. The population comprises two kinds of individuals: female \({f}_{i}\) and male \({m}_{i}\) spiders. The number of female spiders \({N}_{f}\), which typically lies in the range of 65–90% of the population, is selected randomly by Eq. (12), and the number of male spiders \({N}_{m}\) is then determined by Eq. (13):

$$ N_{f} = floor\left[ {\left( {0.9 - rand\left( {0,1} \right) \cdot 0.25} \right) \cdot N} \right] $$
(12)
$$ N_{m} = N - N_{f} $$
(13)

In the SSO algorithm, the position of each \({f}_{i}\) is important. Therefore, lower and upper bounds are considered, between which \({f}_{i}\) is generated randomly. Denoting the lower and upper bound parameters by \({P}^{low}\) and \({P}^{high}\), the initial positions are given by:

$$ f_{i,j}^{0} = P_{j}^{low} + rand\left( {0,1} \right) \cdot \left( {P_{j}^{high} - P_{j}^{low} } \right) $$
(14)

where \(i = 1,2, \ldots ,N_{f}\) and \(j = 1,2, \ldots ,n\). Each \(m_{i}\) is also randomly created as:

$$ m_{i,j}^{0} = P_{j}^{low} + rand\left( {0,1} \right) \cdot \left( {P_{j}^{high} - P_{j}^{low} } \right) $$
(15)

where \(i\, = \,1,2, \ldots ,N_{m}\) and \(j\, = \,1,2, \ldots ,n\).

3.5.2 Fitness assignation

It should be noted that the weight of each spider is very important, as it affects the improvement of the solutions, the optimization of the network and, ultimately, the achievement of the main goal. In the presented model, a weight \({W}_{i}\) is assigned to the ith spider (irrespective of gender), indicating its quality within the population S. The weight of each spider is calculated as follows:

$$ w_{i} = \frac{{J\left( {s_{i} } \right) - worst_{s} }}{{best_{s} - worst_{s} }} $$
(16)

where \(J\left({s}_{i}\right)\) is the fitness value of spider \({s}_{i}\). Equation (17) defines the values of \({best}_{s}\) and \({worst}_{s}\) as:

$$ \begin{aligned} best_{s} & = max_{{k \in \left[ {1,2, \ldots ,N} \right]}} \left( {J\left( {s_{k} } \right)} \right) \\ worst_{s} & = min_{{k \in \left[ {1,2, \ldots ,N} \right]}} \left( {J\left( {s_{k} } \right)} \right) \\ \end{aligned} $$
(17)

3.5.3 Vibration modeling

The communal web is vital for the spiders because of what it makes possible, for example communication between spiders and sensing their distances from one another. The magnitude of a vibration carries meaning: a stronger vibration means a closer distance, and vice versa. To model the exchange of information between members i and j of the colony, the vibration is defined mathematically as follows:

$$ Vib_{ij} = w_{j} e^{{ - d_{ij}^{2} }} $$
(18)

where \({d}_{ij}\) is the Euclidean distance between members i and j of the colony. Spiders use these vibrations to perceive distance and to transfer information from member i to member j. Three types of vibration are considered, denoted \(\mathrm{Vib}{c}_{i}\), \(\mathrm{Vib}{b}_{i}\) and \(\mathrm{Vib}{f}_{i}\).

The individual i (\(s_i\)) receives the vibration \(\mathrm{Vib}{c}_{i}\) from the member c (\(s_c\)) that is nearest to i and has a higher weight than i (\(w_c > w_i\)).

$$ {\text{Vib}}c_{i} = w_{c} e^{{ - d_{i,c}^{2} }} $$
(19)

The individual i receives the vibration \(\mathrm{Vib}{b}_{i}\) from the member b (\(s_b\)), which has the best weight (best fitness value) in the whole population S.

$$ {\text{Vib}}b_{i} = w_{b} e^{{ - d_{i,b}^{2} }} $$
(20)

Finally, the vibration that member i receives from the nearest female individual \({s}_{f}\) is defined by \(\mathrm{Vib}{f}_{i}\) as:

$$ {\text{Vib}}f_{i} = w_{f} e^{{ - d_{i,f}^{2} }} $$
(21)

3.5.4 Female cooperative operator

Female spiders move by attraction or repulsion, governed by several random criteria; in this article a female spider is denoted \({f}_{i}\), irrespective of the movement type. A random number \({r}_{m}\) is generated uniformly in the range [0, 1]. When \({r}_{m}\) is smaller than a predetermined threshold PF, an attraction movement is performed; otherwise a repulsion movement is performed, as shown in Eq. (22).

$$ f_{i}^{t + 1} = \left\{ {\begin{array}{*{20}l} {f_{i}^{t} + \alpha \cdot {\text{Vib}}c_{i} \cdot \left( {s_{c} - f_{i}^{t} } \right) + \beta \cdot {\text{Vib}}b_{i} \cdot \left( {s_{b} - f_{i}^{t} } \right) + \delta \cdot \left( {rand - 0.5} \right)} & {\quad {\text{with probability }}pf} \\ {f_{i}^{t} - \alpha \cdot {\text{Vib}}c_{i} \cdot \left( {s_{c} - f_{i}^{t} } \right) - \beta \cdot {\text{Vib}}b_{i} \cdot \left( {s_{b} - f_{i}^{t} } \right) + \delta \cdot \left( {rand - 0.5} \right)} & {\quad {\text{with probability }}1 - pf} \\ \end{array} } \right. $$
(22)

where \(\alpha , \beta , \delta\) and rand are random numbers in [0, 1], t is the iteration number, and the individuals \({s}_{c}\) and \({s}_{b}\) denote, respectively, the nearest spider with a higher weight than \({f}_{i}^{t}\) and the best spider in the communal web.

3.5.5 Male cooperative operator

According to their weights, the male spiders fall into two groups: those with weights above the median weight of the male population (dominant, D) and those with weights below it (non-dominant, ND). The median male weight is denoted \({w}_{{N}_{f}+m}\). The position of \({m}_{i}\) is updated as:

$$ m_{i}^{t + 1} = \left\{ {\begin{array}{*{20}l} {m_{i}^{t} + \alpha \cdot {\text{Vib}}f_{i} \cdot \left( {s_{f} - m_{i}^{t} } \right) + \delta \cdot \left( {rand - 0.5} \right)} & {\quad {\text{if }}w_{{N_{f} + i}} > w_{{N_{f} + m}} } \\ {m_{i}^{t} + \alpha \cdot \left( {\frac{{\mathop \sum \nolimits_{h = 1}^{{N_{m} }} m_{h}^{t} \cdot w_{{N_{f} + h}} }}{{\mathop \sum \nolimits_{h = 1}^{{N_{m} }} w_{{N_{f} + h}} }} - m_{i}^{t} } \right)} & {\quad {\text{otherwise}}} \\ \end{array} } \right. $$
(23)

3.5.6 Mating operator

Mating takes place within a specific range between dominant males (D) and female spiders \({f}_{i}\). The mating radius is given by:

$$ r = \frac{{\mathop \sum \nolimits_{j = 1}^{n} \left( {P_{j}^{high} - P_{j}^{low} } \right)}}{2*n} $$
(24)

A spider's weight is directly related to its chance of producing offspring: the heavier the spider, the more likely it is to reproduce, and vice versa. Table 7 lists the parameters of the SSO algorithm.

Table 7 SSO Algorithm Parameters

The calculation method used in this paper is adapted from (Luque-Chang et al. 2018; Saravanan et al. 2019; Gülmez and Kulluk 2019).

The steps of the social spider algorithm are as follows (a code sketch follows the list):

  1. Consider N as the total size of the colony population; define the numbers of male (Nm) and female (Nf) spiders in the entire population S.

  2. Initialize the male and female members randomly and calculate the mating radius.

  3. Calculate the weight of every spider in S.

  4. Move the female spiders according to the female cooperative operator.

  5. Move the male spiders according to the male cooperative operator.

  6. Perform the mating operation.

  7. If the stop criterion is met, the process ends; otherwise, go back to step 3.
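Below is a highly simplified sketch of this loop: the mating step is omitted and all males simply move toward the best spider, so it should be read as an illustration of the mechanics (Eqs. (12), (14)–(16), (18), (22)) rather than a faithful implementation.

```python
# A simplified SSO sketch; `objective` maps a 1-D NumPy array to a float (MSE).
import numpy as np

def vib(w_j, d):                                   # Eq. (18): vibration strength
    return w_j * np.exp(-d ** 2)

def sso(objective, dim, N=50, iters=500, pf=0.7, lo=-1.0, hi=1.0):
    rng = np.random.default_rng(0)
    Nf = int(np.floor((0.9 - rng.random() * 0.25) * N))   # Eq. (12): females
    S = rng.uniform(lo, hi, (N, dim))                     # Eqs. (14)-(15)
    for _ in range(iters):
        J = np.apply_along_axis(objective, 1, S)
        # Eq. (16), adapted for minimization: lower error -> larger weight.
        w = (J.max() - J) / (J.max() - J.min() + 1e-12)
        b = int(J.argmin())                               # best spider s_b
        new_S = S.copy()
        for i in range(Nf):                               # female operator, Eq. (22)
            d = np.linalg.norm(S - S[i], axis=1)
            heavier = np.where(w > w[i])[0]
            c = int(heavier[d[heavier].argmin()]) if heavier.size else b
            pull = (rng.random() * vib(w[c], d[c]) * (S[c] - S[i])
                    + rng.random() * vib(w[b], d[b]) * (S[b] - S[i]))
            noise = rng.random() * (rng.random(dim) - 0.5)
            new_S[i] = S[i] + pull + noise if rng.random() < pf else S[i] - pull + noise
        for i in range(Nf, N):                            # male operator, simplified:
            d = np.linalg.norm(S - S[i], axis=1)          # move toward the best spider
            new_S[i] = S[i] + rng.random() * vib(w[b], d[b]) * (S[b] - S[i])
        S = np.clip(new_S, lo, hi)                        # keep positions in bounds
    J = np.apply_along_axis(objective, 1, S)
    return S[int(J.argmin())], float(J.min())
```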

4 Time series forecasting (ARMA and ARIMA)

A time series arises when an event or sequence is observed over successive time intervals (Hamilton 1994). Events can be examined at different frequencies, such as yearly, monthly, weekly, daily, hourly, or even by minutes or seconds. Using past data of a single variable to predict its future values is univariate time series forecasting, while predicting more than one series simultaneously is called multivariate time series forecasting (Granger et al. 1974; Reinsel 2003).

In the autoregressive integrated moving average (ARIMA) model, subsequent values of the variable are assumed to be a linear function of past observations and random errors (white noise), so the same equation can be used to predict future values (Zhang 2003). ARIMA can be used to model time series that are not stationary and do not show an obvious pattern.

An ARIMA model is identified by three components: (p, d, q) (Sowell 1992).

First of all, the time series should be made stationary, because the term "autoregressive" in ARIMA conveys that the model uses its own lags as predictors in a linear regression. We must also check whether the predictors are independent of each other, since correlation among them can affect the model.

Many methods exist for making a time series stationary; differencing is the most common (Clements and Hendry 2000). That is, the previous value is subtracted from the current value. Owing to the complexity of some series, more than one difference is sometimes needed. Thus, the value of d is the minimum number of differences required to make the series stationary. If the time series is already stationary, then d = 0 and no differencing is needed.

When particular lagged values of Yt are used as predictor variables, the model is called an autoregressive model, AR(p). Lags arise when the results of one time period affect the following periods.

The "p" value indicates the order. For instance, a first-order autoregressive process is written AR(1); its output variable at time t depends on the value of the previous time period (t − 1). The same holds for second- or third-order AR processes, which depend on data from two or three periods back.

An AR model is one where \({Y}_{t}\) is related only to its own lags, and it is written as (Tseng et al. 2001; Akaike 1998):

$$ Y_{t} = \alpha + \beta_{1} Y_{t - 1} + \beta_{2} Y_{t - 2} + \cdots + \beta_{p} Y_{t - p} + \varepsilon_{t} $$
(25)

where \(({Y}_{t-1}, {Y}_{t-2}, \ldots , {Y}_{t-p})\) are the past values of the series (lags), \({(\beta }_{1}, {\beta }_{2}, \ldots , {\beta }_{p})\) are the lag coefficients estimated by the model, and \(\alpha \) is the intercept term, also estimated by the model.

Similarly, a moving average model of order q, MA(q), is one where \({Y}_{t}\) depends only on lagged forecast errors (Said and Dickey 1984):

$$ Y_{t} = \alpha + \varepsilon_{t} + \emptyset_{1} \varepsilon_{t - 1} + \emptyset_{2} \varepsilon_{t - 2} + \cdots + \emptyset_{q} \varepsilon_{t - q} $$
(26)

where the errors \({\varepsilon }_{t}\) and \({\varepsilon }_{t-1}\) come from autoregressive models of the respective lags, as in Eqs. (27)–(28):

$$ Y_{t} = \beta_{1} Y_{t - 1} + \beta_{2} Y_{t - 2} + \cdots + \beta_{0} Y_{0} + \varepsilon_{t} $$
(27)
$$ Y_{t - 1} = \beta_{1} Y_{t - 2} + \beta_{2} Y_{t - 3} + \cdots + \beta_{0} Y_{t - n} + \varepsilon_{t - 1} $$
(28)

Both are obtained from the respective autoregressive models.

By combining AR and MA with at least one differencing, an ARIMA model is produced (Pai and Lin 2005), so the equation becomes:

$$ Y_{t} = \alpha + \beta_{1} Y_{t - 1} + \beta_{2} Y_{t - 2} + \cdots + \beta_{p} Y_{t - p} + \varepsilon_{t} + \emptyset_{1} \varepsilon_{t - 1} + \emptyset_{2} \varepsilon_{t - 2} + \cdots + \emptyset_{q} \varepsilon_{t - q} $$
(29)

The following diagram shows the flowchart of ARIMA model (Fig. 4).

Fig. 4
figure 4

ARIMA flowchart (Ma et al. 2018)

Additional explanations and more details are as follows (a code sketch follows the list):

  • Step 1 Check stationarity: if a time series has a trend or seasonality component, it must be made stationary before we can use ARIMA to forecast.

  • Step 2 Difference: if the time series is not stationary, it needs to be stationarized through differencing. Take the first difference, and then check for stationarity. Take as many differences as it takes. Make sure you check seasonal differencing as well.

  • Step 3 Filter out a validation sample: this will be used to validate how accurate our model is. Use a train/test split to achieve this.

  • Step 4 Select AR and MA terms: use the ACF and PACF to decide whether to include an AR term(s), MA term(s), or both.

  • Step 5 Build the model: build the model and set the number of periods to forecast to N (depends on your needs).

  • Step 6 Validate model: compare the predicted values to the actuals in the validation sample.
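A minimal sketch of these steps using the statsmodels library is shown below, assuming `close` is a pandas Series of daily closing prices; the variable names and hold-out length are illustrative.

```python
# A sketch of Steps 1-6: ADF stationarity check, hold-out split, ARIMA fit
# (d=1 differencing handled via `order`) and validation against actuals.
import pandas as pd
from statsmodels.tsa.stattools import adfuller
from statsmodels.tsa.arima.model import ARIMA

def fit_arima(close: pd.Series, order=(4, 1, 3), n_forecast=30):
    # Step 1: ADF test; a p-value above 0.05 suggests a unit root (non-stationary).
    p_value = adfuller(close.dropna())[1]
    print(f"ADF p-value on levels: {p_value:.4f}")
    # Step 3: hold out the last n_forecast points as a validation sample.
    train, valid = close[:-n_forecast], close[-n_forecast:]
    # Steps 2, 4-5: build the model; d=1 in `order` applies one difference.
    model = ARIMA(train, order=order).fit()
    # Step 6: compare the predicted values to the actuals in the hold-out.
    forecast = model.forecast(steps=n_forecast)
    mse = ((forecast.values - valid.values) ** 2).mean()
    return model, forecast, mse
```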

5 Experimental results and findings

The main goal of this study is to forecast stock prices by hybridizing an ANN with GA for feature selection and with two metaheuristic algorithms, BA and SSO, for improving the network. The five major indices DAX, S&P500, FTSE100, DJI and NASDAQ are studied in this research. The time interval considered is from 4 July 2018 to 4 July 2020, about 2 years. Important technical indicators such as RSI and MACD are employed as input variables and are then reduced to an optimal subset. In total, 20 technical indicators are used to predict the stock price, of which 19 variables serve as inputs and 1 variable is the output (target), determining the next day's price.

The first step, as described in Sect. 3, is data normalization. Data are normalized to [−1, 1] to be ready as input variables. Table 8 gives a general description of the indices, the timeframe and the number of observations used in this study.

Table 8 Statistical description of data

5.1 Artificial neural network (ANN)

As mentioned before, the ANN includes three layers. The features of the ANN used in this study are defined in Sect. 3.2. In summary, the input layer has 20 nodes, the output layer has 1, and the hidden-layer size varies by trial and error. The hidden layer uses the tangent sigmoid as its activation function and the output layer uses the simple linear function. The data set is divided into two sections: (1) training (70%) and (2) validation (30%). The LM algorithm is used for training, and the mean squared error (MSE) is adopted as the loss function. Information about the architecture, training and testing for each index is presented in Table 9.

Table 9 Training, validation and testing (T.V.T) error and network architecture

More information about training, validation and testing for the DJI index, for instance, is represented in Table 10 and Fig. 5. Other indices are presented in "Appendix A".

Table 10 The DJI index details (T.V.T)
Fig. 5
figure 5

Actual vs. output (testing) for DJI

5.2 GA-ANN algorithm

The GA is used for choosing the fittest set of input variables and the hidden-layer size for the ANN. The related parameters, including the population size, the number of generations, and the mutation and crossover rates, are described in Sect. 3.3. Using GA, the training, validation and testing errors, along with the network architecture, are determined as reported in Table 11.

Table 11 T.V.T error and network architecture after using GA

Accordingly, using GA the number of input variables can be decreased to 8, while R-squared is increased. The best individual corresponds to the best technical indicators the network could recognize. As is evident, a different number of input variables is selected for each index, owing to the difference in the importance and role of each technical indicator in the final price or target output (index). Details about the selected technical indicators are represented in the appendix (Table A5).

5.3 Bat algorithm (BA)

In this section, the parameters are optimized and the network is improved using the bat algorithm. The obtained results are illustrated in Table 12.

Table 12 Bat-ANN optimum parameters and error

5.4 SSO (social spider optimization) algorithm

In this part, the global best fitness and global best solution are checked after 1000 iterations, and the error is thus improved using SSO. First, the parameters are set to predetermined values and the network then optimizes them for minimum error. Table 13 indicates the optimal error and parameters.

Table 13 SSO-ANN optimum parameters and error

\(\alpha , \beta , \delta\) are random numbers in [0, 1]. The classical SSO requires the random selection of the parameters \(\alpha , \beta , \delta\) (Eqs. (22) and (23)) to control the movement of the spiders, which can affect the exploration–exploitation balance and lead the algorithm to premature convergence. Other details, including the ANN structure (i.e. the number of neurons in the input, hidden and output layers), the estimation error and the average optimal solutions, are also available.

According to this table, it can easily be seen that the error is considerably lower than that of the ANN and GA-ANN networks.

6 Time series forecasting (ARIMA)

Financial time series are usually not stationary; they have characteristics such as skewness and kurtosis with fat tails. Before anything else, it is necessary to check the stationarity of the series. In this research, the augmented Dickey–Fuller (ADF) test is used for this purpose. First, the stationarity of each index is checked separately. The correlogram of the DJI is shown in Fig. 6, and Table 14 shows the unit root test without differencing for the DJI.

Fig. 6
figure 6

Correlogram of closing price (DJI)

From Table 14, the t-statistic, i.e. −2.001110, is greater than the critical values at the various significance levels (1%, 5% and 10%). Thus, the series has a unit root and does not appear stationary. This problem is addressed by differencing the series.

Table 14 Unit root test without differencing (DJI)

After differencing, the series is stationary (Fig. 7); more details are given in Table 15.

Table 15 ADF test after differencing
Fig. 7
figure 7

Correlogram of closing price after differencing (DJI)

Now the series can be forecasted using ARIMA. Using EViews 10, the order of the ARIMA model is estimated. Table 16 shows the best model estimate, and the models used for criteria selection are summarized in Table 17. Figure 8 illustrates the Akaike information criterion, while the ARIMA forecasting summary is given in Table 18.

Table 16 ARIMA forecasting
Table 17 The models used to select criteria
Fig. 8
figure 8

Akaike information criteria (top 20 models)

Table 18 ARIMA forecasting summary

As is clear, the best selected ARIMA model is (4, 1, 3), with an AIC value of −2.695. The above process is carried out for all the indices, and the results are presented in "Appendix B".
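The order selection above was performed in EViews; the sketch below shows an analogous AIC-based grid search in Python with statsmodels, as an assumed reconstruction rather than the authors' exact procedure. `train` is a pandas Series of closing prices.

```python
# Rank ARIMA(p, 1, q) candidates by AIC, mirroring the "top 20 models" of Fig. 8.
import itertools
from statsmodels.tsa.arima.model import ARIMA

def select_order(train, max_p=4, max_q=4):
    results = []
    for p, q in itertools.product(range(max_p + 1), range(max_q + 1)):
        try:
            fit = ARIMA(train, order=(p, 1, q)).fit()
            results.append(((p, 1, q), fit.aic))
        except Exception:
            continue                    # some candidate orders may fail to converge
    results.sort(key=lambda t: t[1])    # lowest AIC first
    return results[:20]
```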

7 Comparing results

In this part, some similar studies are reviewed and the obtained results are compared with them, as illustrated in Table 19.

Table 19 Comparative study

It can be seen that the lowest loss function values and the highest R-squared are obtained using social spider optimization (SSO) and the bat algorithm (BA); these algorithms performed well.

8 Conclusions

Today, the speed of decision making has increased, and the stock market has experienced many fluctuations and much volatility. Different factors intensify the severity of these fluctuations, among them major economic, political and social changes. Moreover, with the coronavirus outbreak in late 2019, great fluctuations are expected in the stock market. Thus, using improved and well-equipped methodologies to confront these fluctuations is a necessity. One of the main tools that can help investors is artificial intelligence (AI), which has many applications, such as pattern recognition, regression and classification.

In the current study, the application of a standard ANN in forecasting stock prices is compared with a hybrid metaheuristic-based ANN. To forecast stock prices, a data set is employed to train and test an ANN. Then, a hybrid ANN is developed: a genetic algorithm is used for feature selection, and the bat algorithm and social spider optimization are then used separately to optimize the ANN parameters.

In this paper, five major indices, including DJI and DAX, are forecasted using an ANN, a subfield of AI. We used 20 main technical indicators as input variables. Many methods are used today to optimize such networks; one of them is evolutionary algorithms. We used GA as an evolutionary algorithm for feature selection and observed that it reduced the number of input variables significantly, so the speed of calculation, the accuracy of the network and the coefficient of determination increased. In addition, two recent metaheuristic algorithms, the social spider algorithm and the bat algorithm, were used to improve the results. The main advantages of using metaheuristic algorithms are as follows:

  • Speed up calculations

  • Reduce model complexity

  • Increase the network accuracy

  • Ease of using models

  • High robustness

  • Intelligent.

    On the other hand, they have some limitations:

  • In GA, there is no guarantee that the best and most related technical indicators have been selected.

  • We have tried to overcome the local optima trap but it is still possible.

Compared with the previous methods, SSO and BA produced the lowest errors, respectively, and could predict the stock price better. Although the error of the social spider algorithm was lower, this does not necessarily mean that the algorithm is better: given differences in computation time, computational complexity, required parameters and so on, we cannot say with certainty which one is superior. If error is taken as the measure of superiority, however, the social spider algorithm performed better. We also used a time series model, ARIMA, for stock price prediction. Because of the nonlinearity and asymmetry of stock price data, the ANN predicted the stock price better than the ARIMA time series model. The experiments show that hybrid models explain the data better, with lower error. Therefore, the main recommendation is to use different new metaheuristic algorithms to train the network.