Skip to main content
Log in

Multimodal deep learning for finance: integrating and forecasting international stock markets

  • Published:
The Journal of Supercomputing Aims and scope Submit manuscript

Abstract

In today’s increasingly international economy, return and volatility spillover effects across international equity markets are major macroeconomic drivers of stock dynamics. Thus, information regarding foreign markets is one of the most important factors in forecasting domestic stock prices. However, the cross-correlation between domestic and foreign markets is highly complex. Hence, it is extremely difficult to explicitly express this cross-correlation with a dynamical equation. In this study, we develop stock return prediction models that can jointly consider international markets, using multimodal deep learning. Our contributions are threefold: (1) we visualize the transfer information between South Korea and US stock markets by using scatter plots; (2) we incorporate the information into the stock prediction models with the help of multimodal deep learning; (3) we conclusively demonstrate that the early and intermediate fusion models achieve a significant performance boost in comparison with the late fusion and single-modality models. Our study indicates that jointly considering international stock markets can improve the prediction accuracy and deep neural networks are highly effective for such tasks.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Similar content being viewed by others

References

  1. Eun C, Shim S (1989) International transmission of stock market movements. J Financ Quant Anal 24:241–256

    Article  Google Scholar 

  2. Bekaert G, Hodrick RJ, Zhang X (2009) International stock return comovements. J Financ 64:2591–626

    Article  Google Scholar 

  3. Armano G, Marchesi M, Murru A (2005) A hybrid genetic-neural architecture for stock indexes forecasting. Inf Sci 170:3–33

    Article  MathSciNet  Google Scholar 

  4. Kim K, Han I (2000) Genetic algorithms approach to feature discretization in artificial neural networks for the prediction of stock price index. Expert Syst Appl 19:125–132

    Article  Google Scholar 

  5. Wang J-Z, Wang J-J, Zhang Z-G, Guo S-P (2011) Forecasting stock indices with back propagation neural network. Expert Syst Appl 38(11):14346–14355

    Google Scholar 

  6. Saadaoui F, Rabbouch H (2014) A wavelet-based multiscale vector-ANN model to predict comovement of econophysical systems. Expert Syst Appl 41(13):6017–6028

    Article  Google Scholar 

  7. Frank ZX, Cambria E, Welsch RE (2018) Natural language based financial forecasting: a survey. Artif Intell Rev 50:49–73

    Article  Google Scholar 

  8. Atsalakis GS, Valavanis KP (2009) Surveying stock market forecasting techniques-part ii: soft computing methods. Expert Syst Appl 36(3):5932–5941

    Article  Google Scholar 

  9. Cavalcante RC, Brasileiro RC, Souza VLF, Nobrega JP, Oliveira ALI (2016) Computational intelligence and financial markets: a survey and future directions. Expert Syst Appl 55:194–211

    Article  Google Scholar 

  10. Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35(8):1798–828

    Article  Google Scholar 

  11. Fischer T, Krauss C (2018) Deep learning with long short-term memory networks for financial market predictions. Eur J Oper Res 270(2):654–669

    Article  MathSciNet  MATH  Google Scholar 

  12. Heaton JB, Polson NG, Witte JH (2017) Deep learning for finance: deep portfolios. Appl Stoch Models Bus Ind 33:3–12

    Article  MathSciNet  MATH  Google Scholar 

  13. Lee SI, Yoo SJ (2018) Threshold-based portfolio: the role of the threshold and its applications. J Supercomput. https://doi.org/10.1007/s11227-018-2577-1

    Article  Google Scholar 

  14. Nakagawa K, Uchida T, Aoshima T (2018) Deep factor model. ECML PKDD 2018 Workshops

  15. Nakagawa K, Ito T, Abe M, Izumi K (2019) Deep recurrent factor model: interpretable non-linear and time-varying multi-factor Model. In AAAI-19

  16. Cochrane JH (2011) Presidential address: discount rates. J Financ 66(4):1047–1108

    Article  Google Scholar 

  17. Harvey CR, Liu Y, Zhu H (2015) ... and the cross-section of expected returns. Rev Financ Stud. 29(1): 5–68

  18. McLean RD, Pontiff J (2016) Does academic research destroy stock return predictability? J Financ 71(1):5–32

    Article  Google Scholar 

  19. Hou K, Xue C, Zhang L (2018) Replicating anomalies. Rev Financ Stud. hhy131. https://doi.org/10.1093/rfs/hhy131

  20. Feng G, Giglio S, Xiu D (2019) Taming the factor zoo: a test of new factors. Technical report. National Bureau of Economic Research

  21. Xing FZ, Cambria E, Malandri L, Vercellis C (2018) Discovering bayesian market views for intelligent asset allocation. ECML PKDD 2018 Workshops

  22. Bao W, Yue J, Rao Y (2017) A deep learning framework for financial time series using stacked autoencoders and long short-term memory. PLoS One 12(7):e0180944

    Article  Google Scholar 

  23. Campbell JY, Hamao Y (1992) Predictable stock returns in the United States and Japan: a study of long-term capital market integration. J Financ 47(1):43–69

    Article  Google Scholar 

  24. Karolyi GA, Stulz RM (1996) Why do markets move together? an investigation of U.S.-Japan stock return comovement. J Financ 51:951–986

    Article  Google Scholar 

  25. Taylor MP, Tonks I (1989) The internationalization of stock markets and the abolition of U.K. exchange control. Rev Econ Stat 71:332–336

    Article  Google Scholar 

  26. Jeon BN, Chiang T (1991) A system of stock prices in world stock exchanges: common stochastic trends for 1975–1990? J Econ Bus 43:329–338

    Article  Google Scholar 

  27. Kasa K (1992) Common stochastic trends in international stock markets. J Monet Econ 29:95–124

    Article  Google Scholar 

  28. Bachman D, Choi J, Jeon BN, Kopecky K (1996) Common factors in international stock prices: evidence from a cointegration study. Int Rev Financ Anal 5(1):9–53

    Article  Google Scholar 

  29. Booth GG, Martikainen T, Tse Y (1997) Price and volatility spillovers in Scandinavian stock markets. J Bank Financ 21(6):811–823

    Article  Google Scholar 

  30. Syriopoulos T (2004) International portfolio diversification to central European stock markets. Appl Financ Econ 14:1253–1268

    Article  Google Scholar 

  31. Ngiam J, Khosla A, Kim M, Nam J, Lee H, Ng AY (2011) Multimodal deep learning. In: Proceedings of the 28th International Conference on Machine Learning, pp 689–696

  32. Zheng Y (2015) Methodologies for cross-domain data fusion: an overview. IEEE Trans Big Data 1(1):16–34

    Article  Google Scholar 

  33. Srivastava N, Salakhutdinov R (2012) Multimodal learning with deep Boltzmann machines. Advances in Neural Information Processing Systems, pp 2222–2230

  34. Na SH, Sohn SY (2011) Forecasting changes in Korea composite stock price index (KOSPI) using association rules. Expert Syst Appl 38:9046–9049

    Article  Google Scholar 

  35. Jeon BN, Jang BS (2004) The linkage between the US and Korean stock markets: the case of NASDAQ, KOSDAQ, and the semiconductor stocks. Res Int Bus Financ 18:319–340

    Article  Google Scholar 

  36. Lee SJ (2006) Volatility spillover among six Asian countries and US. Unpublished Paper. Financial Supervisory, South Korea

  37. Xu C, Tao D, Xu C (2013) A survey on multi-view learning. Neural Comput Appl 23:2031–2038

    Article  Google Scholar 

  38. Shutova E, Kiela D, Maillard J (2016) Black holes and white rabbits: metaphor identification with visual features. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp 160–170

  39. Morvant E, Habrard A, Ayache S (2014) Majority vote of diverse classifiers for late fusion. In: Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR). Springer, pp 153–162

  40. Glodek M, Tschechne S, Layher G, Schels M, Brosch T, Scherer S, Kächele M, Schmidt M, Neumann H, Palm G (2011) Multiple classifier systems for the classification of audio-visual emotional states. In: Affective Computing and Intelligent Interaction. Springer, pp 359–368

  41. Ramirez GA, Baltrusaitis T, Morency L-P (2011) Modeling latent discriminative dynamic of multi-dimensional affective signals. In: Affective Computing and Intelligent Interaction. Springer, pp 396–406

  42. Bergstra J, Bengio Y (2011) Algorithms for hyper-parameter optimization. In: Proceedings of International Conference on Neural Information Processing Systems, pp 2546–2554

  43. Werbos PJ (1974) Beyond regression: new tools for prediction and analysis in the behavioral Sciences. Ph.D. thesis, Harvard University

  44. Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323(6088):533–536

    Article  MATH  Google Scholar 

  45. Maas AL, Hannun AY, Ng AY (2013) Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the 30th International Conference on Machine Learning. vol 28(6)

  46. Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: International Conference on Artificial Intelligence & Statistics, pp 249–256

  47. Kingma DP, Adam J Ba (2014) A method for stochastic optimization. In: International Conference on Learning Representations

  48. Tieleman T, Hinton G (2012) Lecture 6.5-rmsprop: divide the gradient by a running average of its recent magnitude. COURSERA Neural Netw Mach Learn 4(2):26–31

    Google Scholar 

  49. Srivastava N, Hinton GE, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overftting. J Mach Learn Res 15:1929–1958

    MathSciNet  MATH  Google Scholar 

  50. Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning. vol 37, pp 448–456

Download references

Acknowledgements

This work was supported by the ICT R&D program of MSIP/IITP [2017-0-00302, Development of Self Evolutionary AI Investing Technology].

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Seong Joon Yoo.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lee, S.I., Yoo, S.J. Multimodal deep learning for finance: integrating and forecasting international stock markets. J Supercomput 76, 8294–8312 (2020). https://doi.org/10.1007/s11227-019-03101-3

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11227-019-03101-3

Keywords

Navigation