Skip to main content
Log in

Local and global characteristics-based kernel hybridization to increase optimal support vector machine performance for stock market prediction

  • Regular Paper
  • Published:
Knowledge and Information Systems Aims and scope Submit manuscript

Abstract

In this paper, a novel multi-kernel support vector machine (MKSVM) combining global and local characteristics of the input data is proposed. Along with, a parameter tuning approach is developed using the fruit fly optimization (FFO), which is applied to stock market movement direction prediction problem. At first, factor analysis is used for identifying reduced key features called as factor scores from the raw stock index data which when applied to the model contributes to improvement in prediction performance. Subsequently, a hybrid kernel method combining local and global characteristics of input data is proposed, where polynomial is used for global kernel and radial basis function is utilized for local kernel. Additionally, FFO-based parameter tuning scheme is proposed to enhance the prediction performance further. Lastly, the evolving MKSVM with best feature subset and optimal parameters is used to predict stock market movement direction based upon historical data series. For evaluation and illustration purposes, three significant stock databases, NYSE, DJI and S&P 500 are used as testing targets. The effectiveness of this proposed approach is proved by three different stock market datasets, which demonstrate that the proposed approach outperforms the MKSVM with default parameter, MKSVM with PSO, MKSVM with GA and other methods. In addition, our findings reveal that the optimization strategy proposed here may be used as a promising choice forecasting tool for better generalization ability higher forecasting accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16

Similar content being viewed by others

References

  1. Guo Y, Han S, Shen C, Li Y, Yin X, Bai Y (2018) An adaptive SVR for high-frequency stock price forecasting. IEEE Access 6:11397–11404

    Article  Google Scholar 

  2. Wang W (2018) A big data framework for stock price forecasting using fuzzy time series. Multimed Tools Appl 77(8):10123–10134

    Article  Google Scholar 

  3. Nahil A, Lyhyaoui A (2018) Short-term stock price forecasting using kernel principal component analysis and support vector machines: the case of Casablanca stock exchange. Procedia Comput Sci 127:61–169

    Article  Google Scholar 

  4. Chen Y, Hao Y (2017) A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction. Expert Syst Appl 80:340–355

    Article  Google Scholar 

  5. Zahedi J, Rounaghi MM (2015) Application of artificial neural network models and principal component analysis method in predicting stock prices on Tehran Stock Exchange. Phys A 438:178–187

    Article  MathSciNet  Google Scholar 

  6. Goldberg DE (1989) Genetic algorithms in search, optimization and machine learning. Addison-Wesley, New York

    MATH  Google Scholar 

  7. Gowthul Alam MM, Baulkani S (2017) Reformulated query-based document retrieval using optimised kernel fuzzy clustering algorithm. Int J Bus Intell Data Min 12(3):299–318

    Article  Google Scholar 

  8. Gowthul Alam MM, Baulkani S (2016) A hybrid approach for web document clustering using K-means and artificial bee colony algorithm. Int J Intell Eng Syst 9(4):11–20

    Google Scholar 

  9. Kennedy J (2007) The particle swarm as collaborative sampling of the search space. Adv Complex Syst 10:191–213

    Article  MATH  Google Scholar 

  10. Wu J, Jin L, Liu M (2010) A hybrid support vector regression approach for rainfall forecasting using particle swarm optimization and projection pursuit technology. Int J Comput Intell Appl 9:87–104

    Article  MATH  Google Scholar 

  11. Sundararaj V, Muthukumar S, Kumar RS (2018) An optimal cluster formation based energy efficient dynamic scheduling hybrid MAC protocol for heavy traffic load in wireless sensor network. Comput Secur 77:277–288

    Article  Google Scholar 

  12. Bäck T, Hammel U, Schwefel HP (1997) Evolutionary computation: comments on the history and current state. IEEE Trans Evol Comput 1:3–17

    Article  Google Scholar 

  13. Koziel S (1999) Evolutionary algorithms, homomorphous mappings, and constrained parameter optimization. Evol Comput 7:19–44

    Article  Google Scholar 

  14. Pan WT (2012) A new fruit fly optimization algorithm: taking the financial distress model as an example. Knowl Syst 26:69–74

    Article  Google Scholar 

  15. Lin F-L, Yang S-Y, Marsh T, Chen Y-F (2018) Stock and bond return relations and stock market uncertainty: evidence from wavelet analysis. Int Rev Econ Finance 55:285–294

    Article  Google Scholar 

  16. Wang H-F, Kuo CY (2005) Factor analysis in data mining. Comput Math Appl 48:1765–1778

    Article  MATH  Google Scholar 

  17. Hsia T-C, Hsu Y-L, Jen H-L (2009) A factor analysis based selection process for predicting successful university color guard club members. Expert Syst Appl 36:2719–2726

    Article  Google Scholar 

  18. Kim H, Soibelman L, Grobler F (2008) Factor selection for delay analysis using knowledge discovery in databases. Autom Constr 17:550–560

    Article  Google Scholar 

  19. Anish CM, Majhi B, Tonde HS (2014) A novel hybrid nonlinear adaptive model for prediction of stock indices. In: Proceedings of international conference on communication and computing, ICC-2014-Bangalore, pp 18–25

  20. Chen AS, Leung MT, Daouk H (2003) Application of neural networks to an emerging financial market: forecasting and trading the Taiwan Stock Index. Comput Oper Res 30(6):901–923

    Article  MATH  Google Scholar 

  21. Diler AI (2003) Predicting direction of ISE national-100 index with back propagation trained neural network. J Istanb Stock Exch 7(25–26):65–81

    Google Scholar 

  22. Altay E, Satman MH (2005) Stock market forecasting: artificial neural networks and linear regression comparison in an emerging market. J Financ Manag Anal 18(2):18–33

    Google Scholar 

  23. Cao Q, Leggio KB, Schniederjans MJA (2005) A comparison between Fama and French’s model and artificial neural networks in predicting the Chinese stock market. Comput Oper Res 32:2499–2512

    Article  MATH  Google Scholar 

  24. Vapnik V (2000) The nature of statistical learning theory. Springer, Berlin, pp 863–884

    Book  MATH  Google Scholar 

  25. Lins ID, Moura MDC, Zio E, Droguett EL (2012) A particle swarm-optimized support vector machine for reliability prediction. Qual Reliab Eng Int 28(2):141–158

    Article  Google Scholar 

  26. Hu Y, Wu C, Liu H (2011) Prediction of passenger flow on the highway based on the least square support vector machine. Transport 26(2):197–203

    Article  Google Scholar 

  27. Kim K (2003) Financial time series forecasting using support vector machines. Neuro Comput 55:307–319

    Google Scholar 

  28. Huang W, Nakamori Y, Wang SY (2005) Forecasting stock market movement direction with support vector machine. Comput Oper Res 32:2513–2522

    Article  MATH  Google Scholar 

  29. Nahil A, Lyhyaoui A (2018) Short-term stock price forecasting using kernel principal component analysis and support vector machines: the case of Casablanca stock exchange. Procedia Comput Sci 127(2018):161–169

    Article  Google Scholar 

  30. Hsu SH, Hsieh JJPA, Chih TC, Hsu KC (2009) A two-stage architecture for stock price forecasting by integrating self-organizing map and support vector regression. Expert Syst Appl 36(4):7947–7951

    Article  Google Scholar 

  31. Li H, Guo S, Zhao H, Su C, Wang B (2012) Annual electric load forecasting by a least squares support vector machine with a fruit fly optimization algorithm. Energies 5(11):4430–4445

    Article  Google Scholar 

  32. Chen P-W, Lin W-Y, Huang T-H, Pan W-T (2013) Using fruit fly optimization algorithm optimized grey model neural network to perform satisfaction analysis for e-business service. Appl Math Inf Sci 7(2):459–465

    Article  Google Scholar 

  33. Li H-Z, Guo S, Li C-J, Sun J-Q (2013) A hybrid annual power load forecasting model based on generalized regression neural network with fruit fly optimization algorithm. Knowl Syst 37:378–387

    Article  Google Scholar 

  34. Shan D, Cao G, Dong H (2013) LGMS-FOA: an improved fruit fly optimization algorithm for solving optimization problems. Math Probl Eng 9

  35. Wang H-F, Kuo CY (2005) Factor analysis in data mining. Comput Math Appl 48:1765–1778

    Article  MATH  Google Scholar 

  36. Sharma S, Kumar A (2006) Cluster analysis and factor analysis (chapter 18). Sage Publication, Thousand Oaks

    Google Scholar 

  37. Chen Y-S, Cheng C-H, Chiu C-L, Huang S-T (2016) A study of ANFIS-based multi-factor time series models for forecasting stock index. Appl Intell 45(2):277–292

    Article  Google Scholar 

  38. Jenkins G, Doney (2008) Principal component and factor analysis. In: Modeling methods for marine science, pp 81–117

  39. Pan WT (2012) A new fruit fly optimization algorithm: taking the financial distress model as an example. Knowl Syst 26:69–74

    Article  Google Scholar 

  40. Kennedy J, Eberhart R (2001) Swarm intelligence. Morgan Kaufmann publishers Inc, San Francisco

    Google Scholar 

  41. Chen HL, Yang B, Liu J, Liu D-Y (2011) A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis. Expert Syst Appl 38(7):9014–9022

    Article  Google Scholar 

  42. Yuan X, Tan Q, Lei X, Yuan Y, Wu X (2017) Wind power prediction using hybrid autoregressive fractionally integrated moving average and least square support vector machine. Energy 129:122–137

    Article  Google Scholar 

  43. Maldonado S, Pérez J, Bravo C (2017) Cost-based feature selection for support vector machines: an application in credit scoring. Eur J Oper Res 261(2):656–665

    Article  MathSciNet  MATH  Google Scholar 

  44. Mehmet G, Ethem A (2010) Cost-conscious multiple kernel learning. Pattern Recognit Lett 31:959–965

    Article  Google Scholar 

  45. Yuan SF, Chu FL (2007) Fault diagnosis based on support vector machines with parameter optimization by artificial immunization algorithm. Mech Syst Signal Process 21:1318–1330

    Article  Google Scholar 

  46. Smits GF, Jordaan EM (2002) Improved SVM regression using mixtures of kernels. In: Proceedings of the 2002 international joint conference on neural networks, vol 3. IEEE, Honolulu, Hi, USA, pp 2785–2790

  47. Chen L, Chen CLP, Lu M (2011) A multiple-kernel fuzzy c-means algorithm for image segmentation. IEEE Trans Syst Man Cybern 41(5):1263–1274

    Article  Google Scholar 

  48. Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22(4):679–688

    Article  Google Scholar 

  49. Chen Y, Hao Y (2017) A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction. Expert Syst Appl 80:340–355

    Article  Google Scholar 

  50. Javedani Sadaei H, Lee MH (2014) Multilayer stock forecasting model using fuzzy time series. Sci World J 2014:3

    Article  Google Scholar 

  51. Cheng C-H, Chen T-L, Teoh HJ, Chiang CH (2008) Fuzzy time-series based on adaptive expectation model for TAIEX forecasting. Exp Syst Appl 34(2):1126–1132

    Article  Google Scholar 

  52. Chen SM, Chen CD (2011) Handling forecasting problems based on high-order fuzzy logical relationships. Expert Syst Appl 38(4):3857–3864

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to M. M. Gowthul Alam.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Gowthul Alam, M.M., Baulkani, S. Local and global characteristics-based kernel hybridization to increase optimal support vector machine performance for stock market prediction. Knowl Inf Syst 60, 971–1000 (2019). https://doi.org/10.1007/s10115-018-1263-1

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10115-018-1263-1

Keywords

Navigation