Abstract
In today’s increasingly international economy, return and volatility spillover effects across international equity markets are major macroeconomic drivers of stock dynamics. Thus, information regarding foreign markets is one of the most important factors in forecasting domestic stock prices. However, the cross-correlation between domestic and foreign markets is highly complex. Hence, it is extremely difficult to explicitly express this cross-correlation with a dynamical equation. In this study, we develop stock return prediction models that can jointly consider international markets, using multimodal deep learning. Our contributions are threefold: (1) we visualize the transfer information between South Korea and US stock markets by using scatter plots; (2) we incorporate the information into the stock prediction models with the help of multimodal deep learning; (3) we conclusively demonstrate that the early and intermediate fusion models achieve a significant performance boost in comparison with the late fusion and single-modality models. Our study indicates that jointly considering international stock markets can improve the prediction accuracy and deep neural networks are highly effective for such tasks.
Similar content being viewed by others
References
Eun C, Shim S (1989) International transmission of stock market movements. J Financ Quant Anal 24:241–256
Bekaert G, Hodrick RJ, Zhang X (2009) International stock return comovements. J Financ 64:2591–626
Armano G, Marchesi M, Murru A (2005) A hybrid genetic-neural architecture for stock indexes forecasting. Inf Sci 170:3–33
Kim K, Han I (2000) Genetic algorithms approach to feature discretization in artificial neural networks for the prediction of stock price index. Expert Syst Appl 19:125–132
Wang J-Z, Wang J-J, Zhang Z-G, Guo S-P (2011) Forecasting stock indices with back propagation neural network. Expert Syst Appl 38(11):14346–14355
Saadaoui F, Rabbouch H (2014) A wavelet-based multiscale vector-ANN model to predict comovement of econophysical systems. Expert Syst Appl 41(13):6017–6028
Frank ZX, Cambria E, Welsch RE (2018) Natural language based financial forecasting: a survey. Artif Intell Rev 50:49–73
Atsalakis GS, Valavanis KP (2009) Surveying stock market forecasting techniques-part ii: soft computing methods. Expert Syst Appl 36(3):5932–5941
Cavalcante RC, Brasileiro RC, Souza VLF, Nobrega JP, Oliveira ALI (2016) Computational intelligence and financial markets: a survey and future directions. Expert Syst Appl 55:194–211
Bengio Y, Courville A, Vincent P (2013) Representation learning: a review and new perspectives. IEEE Trans Pattern Anal Mach Intell 35(8):1798–828
Fischer T, Krauss C (2018) Deep learning with long short-term memory networks for financial market predictions. Eur J Oper Res 270(2):654–669
Heaton JB, Polson NG, Witte JH (2017) Deep learning for finance: deep portfolios. Appl Stoch Models Bus Ind 33:3–12
Lee SI, Yoo SJ (2018) Threshold-based portfolio: the role of the threshold and its applications. J Supercomput. https://doi.org/10.1007/s11227-018-2577-1
Nakagawa K, Uchida T, Aoshima T (2018) Deep factor model. ECML PKDD 2018 Workshops
Nakagawa K, Ito T, Abe M, Izumi K (2019) Deep recurrent factor model: interpretable non-linear and time-varying multi-factor Model. In AAAI-19
Cochrane JH (2011) Presidential address: discount rates. J Financ 66(4):1047–1108
Harvey CR, Liu Y, Zhu H (2015) ... and the cross-section of expected returns. Rev Financ Stud. 29(1): 5–68
McLean RD, Pontiff J (2016) Does academic research destroy stock return predictability? J Financ 71(1):5–32
Hou K, Xue C, Zhang L (2018) Replicating anomalies. Rev Financ Stud. hhy131. https://doi.org/10.1093/rfs/hhy131
Feng G, Giglio S, Xiu D (2019) Taming the factor zoo: a test of new factors. Technical report. National Bureau of Economic Research
Xing FZ, Cambria E, Malandri L, Vercellis C (2018) Discovering bayesian market views for intelligent asset allocation. ECML PKDD 2018 Workshops
Bao W, Yue J, Rao Y (2017) A deep learning framework for financial time series using stacked autoencoders and long short-term memory. PLoS One 12(7):e0180944
Campbell JY, Hamao Y (1992) Predictable stock returns in the United States and Japan: a study of long-term capital market integration. J Financ 47(1):43–69
Karolyi GA, Stulz RM (1996) Why do markets move together? an investigation of U.S.-Japan stock return comovement. J Financ 51:951–986
Taylor MP, Tonks I (1989) The internationalization of stock markets and the abolition of U.K. exchange control. Rev Econ Stat 71:332–336
Jeon BN, Chiang T (1991) A system of stock prices in world stock exchanges: common stochastic trends for 1975–1990? J Econ Bus 43:329–338
Kasa K (1992) Common stochastic trends in international stock markets. J Monet Econ 29:95–124
Bachman D, Choi J, Jeon BN, Kopecky K (1996) Common factors in international stock prices: evidence from a cointegration study. Int Rev Financ Anal 5(1):9–53
Booth GG, Martikainen T, Tse Y (1997) Price and volatility spillovers in Scandinavian stock markets. J Bank Financ 21(6):811–823
Syriopoulos T (2004) International portfolio diversification to central European stock markets. Appl Financ Econ 14:1253–1268
Ngiam J, Khosla A, Kim M, Nam J, Lee H, Ng AY (2011) Multimodal deep learning. In: Proceedings of the 28th International Conference on Machine Learning, pp 689–696
Zheng Y (2015) Methodologies for cross-domain data fusion: an overview. IEEE Trans Big Data 1(1):16–34
Srivastava N, Salakhutdinov R (2012) Multimodal learning with deep Boltzmann machines. Advances in Neural Information Processing Systems, pp 2222–2230
Na SH, Sohn SY (2011) Forecasting changes in Korea composite stock price index (KOSPI) using association rules. Expert Syst Appl 38:9046–9049
Jeon BN, Jang BS (2004) The linkage between the US and Korean stock markets: the case of NASDAQ, KOSDAQ, and the semiconductor stocks. Res Int Bus Financ 18:319–340
Lee SJ (2006) Volatility spillover among six Asian countries and US. Unpublished Paper. Financial Supervisory, South Korea
Xu C, Tao D, Xu C (2013) A survey on multi-view learning. Neural Comput Appl 23:2031–2038
Shutova E, Kiela D, Maillard J (2016) Black holes and white rabbits: metaphor identification with visual features. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pp 160–170
Morvant E, Habrard A, Ayache S (2014) Majority vote of diverse classifiers for late fusion. In: Joint IAPR International Workshops on Statistical Techniques in Pattern Recognition (SPR) and Structural and Syntactic Pattern Recognition (SSPR). Springer, pp 153–162
Glodek M, Tschechne S, Layher G, Schels M, Brosch T, Scherer S, Kächele M, Schmidt M, Neumann H, Palm G (2011) Multiple classifier systems for the classification of audio-visual emotional states. In: Affective Computing and Intelligent Interaction. Springer, pp 359–368
Ramirez GA, Baltrusaitis T, Morency L-P (2011) Modeling latent discriminative dynamic of multi-dimensional affective signals. In: Affective Computing and Intelligent Interaction. Springer, pp 396–406
Bergstra J, Bengio Y (2011) Algorithms for hyper-parameter optimization. In: Proceedings of International Conference on Neural Information Processing Systems, pp 2546–2554
Werbos PJ (1974) Beyond regression: new tools for prediction and analysis in the behavioral Sciences. Ph.D. thesis, Harvard University
Rumelhart DE, Hinton GE, Williams RJ (1986) Learning representations by back-propagating errors. Nature 323(6088):533–536
Maas AL, Hannun AY, Ng AY (2013) Rectifier nonlinearities improve neural network acoustic models. In: Proceedings of the 30th International Conference on Machine Learning. vol 28(6)
Glorot X, Bengio Y (2010) Understanding the difficulty of training deep feedforward neural networks. In: International Conference on Artificial Intelligence & Statistics, pp 249–256
Kingma DP, Adam J Ba (2014) A method for stochastic optimization. In: International Conference on Learning Representations
Tieleman T, Hinton G (2012) Lecture 6.5-rmsprop: divide the gradient by a running average of its recent magnitude. COURSERA Neural Netw Mach Learn 4(2):26–31
Srivastava N, Hinton GE, Krizhevsky A, Sutskever I, Salakhutdinov R (2014) Dropout: a simple way to prevent neural networks from overftting. J Mach Learn Res 15:1929–1958
Ioffe S, Szegedy C. Batch normalization: accelerating deep network training by reducing internal covariate shift. Proceedings of the 32nd International Conference on Machine Learning. vol 37, pp 448–456
Acknowledgements
This work was supported by the ICT R&D program of MSIP/IITP [2017-0-00302, Development of Self Evolutionary AI Investing Technology].
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Lee, S.I., Yoo, S.J. Multimodal deep learning for finance: integrating and forecasting international stock markets. J Supercomput 76, 8294–8312 (2020). https://doi.org/10.1007/s11227-019-03101-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11227-019-03101-3