Skip to main content
Log in

Pattern Recognition in Microtrading Behaviors Preceding Stock Price Jumps: A Study Based on Mutual Information for Multivariate Time Series

  • Published:
Computational Economics Aims and scope Submit manuscript

Abstract

In this study, we propose a new framework to analyze the stock-specific mictrotrading patterns preceding stock price jumps, which should be useful for financial regulation or investment decisions. Using high-frequency trading data, the key step of our framework is to extract a set of core features to distinguish the prejump trading patterns of various stocks taking into account of the temporal information within the feature trajectories. We adopt 10 liquidity measures and 30 technical indicators to generate a high-dimensional candidate feature trajectory set and use a combination of the time-series-based mutual information and the minimum-Redundancy Maximum-Relevancy technique to perform the feature selection. A clustering analysis is then adopted to identify the outlier stocks with idiosyncratic prejump trading patterns. In the end, an application case is conducted based on the level-2 data of 189 constituent stocks of the China Security Index 300 to illustrate the viability of our proposed methodology. Comparison results show that the features we selected has higher capacity to identify the similarity among trading trajectories than those without considering temporal feature information, which provides more reliable features in detecting the outlier trading patterns.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data Availability

The data that support the findings of this study are provided by Shanghai Wind Information Co., Ltd. (http://www.wind.com.cn).

Code availability

The code is available upon request.

Notes

  1. In Lee and Mykland (2008), the numerator of the second term of $C_n$ was incorrect. There should be \(\log(4 \pi)\) instead of \(\log\pi\). We use the corrected formula. It has also been noted in some studies, for example, in Gilder, Shackleton and Taylor (2014). We thank Ping-Chen Tsai for pointing out this issue.

References

  • Bȩdowska-Sójka, B. (2016). Liquidity dynamics around jumps: The evidence from the Warsaw stock exchange. Emerging Markets Finance and Trade, 52(12), 2740–2755.

    Article  Google Scholar 

  • Bȩdowska-Sójka, B., Kliber, A. (2021). Information content of liquidity and volatility measures. Physica A: Statistical Mechanics and its Applications, 563, 125436.

    Article  Google Scholar 

  • Berndt, D., Clifford, J. (1994). Using dynamic time warping to find patterns in times series. In AAAI-94 workshop on knowledge discovery in databases (pp. 359–370).

  • Bollerslev, T., & Todorov, V. (2011). Tails, fears, and risk premia. The Journal of Finance, 66(6), 2165–2211.

    Article  Google Scholar 

  • Boudt, K., & Pertitjean, M. (2014). Intraday liquidity dynamics and news releases around price jumps: Evidence from the DJIA stocks. Journal of Financial Markets, 17, 121–149.

    Article  Google Scholar 

  • Brennan, M. J., Huh, S. W., & Subrahmanyam, A. (2018). High-frequency measures of informed trading and corporate announcements. Review of Financial Studies, 31(6), 2326–2376.

    Article  Google Scholar 

  • Caporin, M., Kolokolov, A., & Renò, R. (2017). Systemic co-jumps. Journal of Financial Economics, 126(3), 563–591.

    Article  Google Scholar 

  • Chen, Y. T., Lai, W. N., & Sun, E. W. (2019). Jump detection and noise separation by a singular wavelet method for predictive analytics of high-frequency data. Computational Economics, 54(2), 809–844.

    Article  Google Scholar 

  • Das, A. K., Sengupta, S., & Bhattacharyya, S. (2018). A group incremental feature selection for classification using rough set theory based genetic algorithm. Applied Soft Computing, 65(65), 400–411.

    Article  Google Scholar 

  • Dobrushin, R. L. (1958). A simplified method of experimentally evaluating the entropy of a stationary sequence. Theory of Probability & Its Applications, 3(4), 428–430.

    Article  Google Scholar 

  • Duffie, D., & Pan, J. (2001). Analytical value-at-risk with jumps and credit risk. Finance and Stochastics, 5(6), 155–180.

    Article  Google Scholar 

  • Fang, L., Zhao, H., Wang, P., Yu, M., Yan, J., Cheng, W., & Chen, P. (2015). Feature selection method based on mutual information and class separability for dimension reduction in multidimensional time series for clinical data. Biomedical Signal Processing and Control, 21, 82–89.

    Article  Google Scholar 

  • Gao, Y., Han, X., Li, Y., & Xiong, X. (2019). Overnight momentum, informational shocks, and late informed trading in China. International Review of Financial Analysis, 66, 101394.

    Article  Google Scholar 

  • Gonzalez-Lopez, J., Ventura, S., & Cano, A. (2020). Distributed multi-label feature selection using individual mutual information measures. Knowledge-Based Systems, 188, 105052.

    Article  Google Scholar 

  • Groth, S. S., Siering, M., & Gomber, P. (2014). How to enable automated trading engines to cope with news-related liquidity shocks? Extracting signals from unstructured data. Decision Support Systems, 62, 32–42.

    Article  Google Scholar 

  • Han, M., Ren, W., & Liu, X. (2015). Joint mutual information-based input variable selection for multivariate time series modeling. Engineering Applications of Artificial Intelligence, 37, 250–257.

    Article  Google Scholar 

  • He, G., Zhao, W., Xia, X., Peng, R., & Wu, X. (2019). An ensemble of shapelet-based classifiers on inter-class and intra-class imbalanced multivariate time series at the early stage. Soft Computing, 23(15), 6097–6114.

    Article  Google Scholar 

  • Ircio, J., Lojo, A., Mori, U., & Lozano, J. A. (2020). Mutual information based feature subset selection in multivariate time series classification. Pattern Recognition, 108, 107525.

    Article  Google Scholar 

  • Jang, J., & Kang, J. (2019). Probability of price crashes, rational speculative bubbles, and the cross-section of stock returns. Journal of Financial Economics, 132(1), 222–247.

    Article  Google Scholar 

  • Jarrow, R., & Rosenfeld, E. (1984). Jump risks and the intertemporal capital asset pricing model. Journal of Business, 57(3), 337–351.

    Article  Google Scholar 

  • Jiang, G. J., Lo, I., & Verdelhan, A. (2011). Information shocks, liquidity shocks, jumps, and price discovery: Evidence from the U.S. treasury market. Journal of Financial and Quantitative Analysis, 46(2), 527–551.

    Article  Google Scholar 

  • Joulin A, Lefevre A, Grunberg D, Bouchaud JP (2008) Stock price jumps: news and volume play a minor role. arXiv:0803.1769v1

  • Jovic, A., & Jovic, F. (2017). Classification of cardiac arrhythmias based on alphabet entropy of heart rate variability time series. Biomedical Signal Processing and Control, 31, 217–230.

    Article  Google Scholar 

  • Kapadia, N., & Zekhnini, M. (2019). Do idiosyncratic jumps matter? Journal of Financial Economics, 131(3), 666–692.

    Article  Google Scholar 

  • Kong, A., Zhu, H., & Azencott, R. (2021). Predicting intraday jumps in stock prices using liquidity measures and technical indicators. Journal of Forecasting, 40(3), 416–438.

    Article  Google Scholar 

  • Kozachenko, L. F., & Leonenko, N. N. (1987). Sample estimate of the entropy of a random vector. Problemy Peredachi Informatsii, 23(2), 9–16.

    Google Scholar 

  • Kraskov, A., Stögbauer, H., & Grassberger, P. (2004). Estimating mutual information. Physical Review E-Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics, 69, 066138.

    Article  Google Scholar 

  • Lahaye, J., Laurent, S., & Neely, C. (2011). Jumps, Cojumps and Maro announcements. Journal of Applied Econometrics, 26(6), 893–921.

    Article  Google Scholar 

  • Lakhal, F. (2008). Stock market liquidity and information asymmetry around voluntary earnings disclosures New evidence from France. International Journal of Managerial Finance, 4(1), 60–75.

    Article  Google Scholar 

  • Lee, S. S., & Mykland, P. A. (2008). Jumps in financial markets: A new nonparametric test and jump dynamics. Review of Financial Studies, 21(6), 2535–2563.

    Article  Google Scholar 

  • Leong, C. K. (2016). Credit risk scoring with Bayesian network models. Computational Economics, 47(3), 423–446.

    Article  Google Scholar 

  • Li, S., Ning, K., & Zhang, T. (2021). Sentiment-aware jump forecasting. Knowledge-Based Systems, 228, 107292.

    Article  Google Scholar 

  • Liu, L., Patton, A. J., & Sheppard, K. (2015). Does anything beat 5-minute RV? A comparison of realized measures across multiple asset classes. Journal of Econometrics, 187(1), 293–311.

    Article  Google Scholar 

  • Mäkinen, M., Kanniainen, J., Gabbouj, M., & Iosifidis, A. (2019). Forecasting of jump arrivals in stock prices: New attention-based network architecture using limit order book data. Quantitative Finance, 19(12), 2033–2050.

    Article  Google Scholar 

  • Merton, R. (1976). Option pricing when underlying stock returns are discontinuous. Journal of Financial Economics, 3(1–2), 125–144.

    Article  Google Scholar 

  • Meyer, P. E., Schretter, C., & Bontempi, G. (2008). Information-theoretic feature selection in microarray data using variable complementarity. IEEE Journal of Selected Topics in Signal Processing, 2(3), 261–274.

    Article  Google Scholar 

  • Ni, X., Peng, Q., Yin, S., & Zhang, T. (2020). Attention! Distracted institutional investors and stock price crash. Journal of Corporate Finance, 64(64), 101701.

    Article  Google Scholar 

  • Podolskij, M., & Ziggel, D. (2010). New tests for jumps in semimartingale models. Statistical Inference for Stochastic Processes, 13(2), 15–41.

    Article  Google Scholar 

  • Prodromou, T., & Westerholm, P. J. (2022). Are high frequency traders responsible for extreme price movements? Economic Analysis and Policy, 73, 94–111.

    Article  Google Scholar 

  • Ranaldo, A. (2008). Intraday market dynamics around public information arrivals intraday market dynamics around public information arrivals. In S. Market, G. N. Gregoriou, & F. S. Lhabitant (Eds.), Implications for market microstructure and asset pricing, liquidity (pp. 199–226). Hoboken: Wiley.

    Google Scholar 

  • Riordan, R., Storkenmaier, A., Wagener, M., & Zhang, S. S. (2013). Public information arrival: Price discovery and liquidity in electronic limit order markets. Journal of Banking and Finance, 37(4), 1148–1159.

    Article  Google Scholar 

  • Ross, B. C. (2014). Mutual information between discrete and continuous data sets. PLoS ONE, 9(2), e87357.

    Article  Google Scholar 

  • Saikhu, A., Arifin, A. Z., & Fatichah, C. (2019). Correlation and symmetrical uncertainty-based feature selection for multivariate time series classification. International Journal of Intelligent Engineering and Systems, 12(3), 129–137.

    Article  Google Scholar 

  • Shannon, C. E. (2001). A mathematical theory of communication. ACM SIGMOBILE Mobile Computing and Communication Review, 5(1), 3–55.

    Article  Google Scholar 

  • Song, X. F., Zhang, Y., Gong, D. W., Sun, X. Y. (2021). Feature selection using bare-bones particle swarm optimization with mutual information. Pattern Recognition, 112, 107804.

    Article  Google Scholar 

  • Steuer, R., Kurths, J., Daub, C. O., Weise, J., & Selbig, J. (2002). The mutual information: Detecting and evaluating dependencies between variables. Bioinformatics, 18(Suppl. 2), S231-240.

    Article  Google Scholar 

  • Su, Z., Bao, H., Li, Q., Xu, B., & Cui, X. (2022). The prediction of price gap anomaly in Chinese stock market: Evidence from the dependent functional logit model. Finance Research Letters, 47, 102702.

    Article  Google Scholar 

  • Sun, B., & Gao, Y. (2020). Market liquidity and macro announcement around intraday jumps: Evidence from Chinese stock index futures markets. Physica A: Statistical Mechanics and its Applications, 541, 123308.

    Article  Google Scholar 

  • Van Ness, B., Van Ness, R., & Yildiz, S. (2021). Private information in trades, R2, and large stock price movements. Journal of Banking and Finance, 131, 106194.

    Article  Google Scholar 

  • Vergara, J. R., & Estévez, P. A. (2014). A review of feature selection methods based on mutual information. Neural Computing and Applications, 24(1), 175–186.

    Article  Google Scholar 

  • Wan, D., Wei, X., & Yang, X. (2017). Liquidity dynamics around intraday price jumps in Chinese stock market. Journal of Systems Science and Complexity, 30(2), 434–463.

    Article  Google Scholar 

  • Wen, F., Xu, L., Ouyang, G., & Kou, G. (2019). Retail investor attention and stock price crash risk: Evidence from China. International Review of Financial Analysis, 65(65), 101376.

    Article  Google Scholar 

  • Xu, Y., Xuan, Y., & Zheng, G. (2021). Internet searching and stock price crash risk: Evidence from a quasi-natural experiment. Journal of Financial Economics, 141(1), 255–275.

    Article  Google Scholar 

  • Yoon, H., Yang, K., & Shahabi, C. (2005). Feature subset selection and feature ranking for multivariate time series. IEEE Transactions on Knowledge and Data Engineering, 17(9), 1186–1198.

    Article  Google Scholar 

  • Zhou, C., Wu, C., & Wang, Y. (2018). Dynamic portfolio allocation with time-varying jump risk. Journal of Empirical Finance, 50, 113–124.

    Article  Google Scholar 

  • Zhou, H., Wang, X., & Zhu, R. (2022). Feature selection based on mutual information with correlation coefficient. Applied Intelligence, 52(5), 5457–5474.

    Article  Google Scholar 

  • Zhu, Y., Wu, Z., Zhang, H., & Yu, J. (2017). Media sentiment, institutional investors and probability of stock price crash: Evidence from Chinese stock markets. Accounting and Finance, 57(5), 1635–1670.

    Article  Google Scholar 

Download references

Funding

This work was supported by China Postdoctoral Science Foundation (2021M701670), Jiangsu Planned Projects for Postdoctoral Research Funds (2021K357C), National Natural Science Foundation of China (71720107001, U1811462, 72071103, 72001104), Fundamental Research Funds for the Central Universities(0118/14370107),and Humanities and Social Science Fund of the Ministry of Education (17YJA790101).

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Xindan Li.

Ethics declarations

Conflict of interest

The authors declared that they have no conflict of interest to this work.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kong, A., Azencott, R., Zhu, H. et al. Pattern Recognition in Microtrading Behaviors Preceding Stock Price Jumps: A Study Based on Mutual Information for Multivariate Time Series. Comput Econ 63, 1401–1429 (2024). https://doi.org/10.1007/s10614-023-10367-6

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10614-023-10367-6

Keywords

Navigation