Predicting status of Chinese listed companies based on features selected by penalized regression

  • Rui Ma
  • Honghao Zhao
  • Ligang Zhou


China’s companies have attracted much attention due to the development of stock market in China. The listing status of listed Chinese companies becomes an important indicator which implies the potential risk of a stock. Thus predicting the status of listed Chinese companies is obviously crucial for stockholders and investors when they make further decisions. According to the four possible listing statuses for Chinese companies, researchers formulate the above issue as a classification problem which is typical in data mining area. Plenty of classification techniques have been implemented to predict the status of the listing Chinese companies based on their financial factors. Usually, there are more than 150 financial factors for each of the listed companies, and feature selection is needed before the implementation of classification methods. In the literature, researcher used t-test with variance inflation factor (VIF) analysis to select relevant factors. However, such method can not be applied in the high dimensional case. In this paper, we apply the idea of penalized regression to select the interested factors based on a logistic regression model, and then apply popular classification methods to predict the companies’ statuses. Our results show that the proposed method can find more representative factors and improves the prediction accuracy of the classification methods.


Classification data mining feature selection penalized regression 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.



The authors would like to thank the anonymous reviewers for their valuable comments and suggestions that helped improve the quality of this manuscript. Honghao Zhao’s research is supported by the Macau University of Science and Technology and the Macau Foundation. Rui Ma’s research is funded by Premier-Discipline Enhancement Scheme supported by Zhuhai Government and Premier Key-Discipline Enhancement Scheme supported by Guangdong Government Funds.


  1. [1]
    Altman, E.I. (1968). Financial ratios, discriminant analysis and the prediction of corporate bankruptcy. The Journal of Finance, 23 (4): 589–609.CrossRefGoogle Scholar
  2. [2]
    Ding, Y., Song, X. & Zen, Y. (2008). Forecasting financial condition of Chinese listed companies basedonsupport vector machine. Expert Systems with Applications, 34 (4): 3081–3089.CrossRefGoogle Scholar
  3. [3]
    Fiedman, J., Hastie, T., Hofling, H. & Tibshirani, R. (2007). Path wise coordinate optimization. Annals of Applied Statistics, 1 (2): 302–332.MathSciNetCrossRefzbMATHGoogle Scholar
  4. [4]
    Fiedman, J., Hastie, T., Hofling, H. & Tibshirani, R. (2010). Regularization paths for generalized linear models via coordinate descent. Journal of Statistical Software, 33: 1–22.Google Scholar
  5. [5]
    Gavin, C.C. & Talbot, L.C. (2006). Gene selection in cancer classification using sparse logistic regression with Bayesian regularization. Bioinformatics, 22 (19): 2348–2355.CrossRefGoogle Scholar
  6. [6]
    Geng, R.B., Bose, I. & Chen, X. (2014). Prediction offinancial distress: anempirical study of listed chinese companies using data mining. European Journal of Operations Research, 241 (1): 236–247.CrossRefGoogle Scholar
  7. [7]
    Huang, C., Dai, C. & Guo, M. (2015). A hybrid approach using two-level DEA for financial failure prediction and integrated SE-DEA and GCA for indicators selection. Applied Mathematics and Computation, 251: 431–441.MathSciNetCrossRefzbMATHGoogle Scholar
  8. [8]
    Li, S.J. & Wang, S. (2014). A financial early warning logit model and its efficiency verification approach. Knowledge based Systems, 70: 78–87.CrossRefGoogle Scholar
  9. [9]
    Li, Z.Y. Crook J. & Andreeva, G. (2014). Chinese companies distress prediction: an application of data envelopment analysis. Journal of the Operational Research Society, 65 (3): 466–479.CrossRefGoogle Scholar
  10. [10]
    Liang, Y., Liu, C., Luan, X.Z., Leung, K.S., Chan, T.M., Xu, Z.B. & Zhang, H. (2013). Sparse logistic regression with a L1=2 penalty for gene selection in cancer classification. BMC Bioinformatics, 14: 198.CrossRefGoogle Scholar
  11. [11]
    Shevade, S.K. & Keerthi, S.S. (2003). A simple and efficient algorithm for gene selection using sparse logistic regression. Bioinformatics, 19 (17): 2246–2253.CrossRefGoogle Scholar
  12. [12]
    Shumway, T. (2001). Forecasting bankruptcy more accurately: a simple hazard model. Journal of Business, 74 (1): 101–124.MathSciNetCrossRefGoogle Scholar
  13. [13]
    Sun, J., Shang, Z.M. & Li, H. (2014). Im balanceriented SVM methods for financial distress prediction: a comparative study among the new SBSVM-ensemble method and traditional methods. Journal of the Operational Research Society, 65 (12): 1905–1919.CrossRefGoogle Scholar
  14. [14]
    Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society, Series B, 58 (1): 267–288.MathSciNetzbMATHGoogle Scholar
  15. [15]
    Xiao, Z., Yang, X.L., Pang, Y. & Dang, X. (2012). The prediction for listed companies financial distress by using multiple prediction methods with rough set and dempster-shafer evidence theory. Knowledge based Systems, 6: 196–206.CrossRefGoogle Scholar
  16. [16]
    Zhang, L., Altman, E.I. & Yen, J. (2010). Corporate financial distress diagnosis model and application in credit rating for listing firms in China. Frontiers of Computer Science in China, 4 (2): 220–236.CrossRefGoogle Scholar
  17. [17]
    Zhou, L.G, Tam, K.P. & Fujita. (2016). Predictiing the listing status of Chinese Listed companies with multi-class classification models. Information Sciences, 328: 222–236.CrossRefGoogle Scholar
  18. [18]
    Zou, H. & Hastie, T. (2005). Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B, 67 (2): 301–320.MathSciNetCrossRefzbMATHGoogle Scholar

Copyright information

© Systems Engineering Society of China and Springer-Verlag Berlin Heidelberg 2017

Authors and Affiliations

  1. 1.School of BusinessMacau University of Science and TechnologyMacauChina

Personalised recommendations