Abstract
Credit default prediction (CDP) modeling is a fundamental and critical issue for financial institutions. However, the previous studies indicate that the classifier’s performances in CDP analysis differ using different performance criterions on different databases under different circumstances. The performance assessment exercise under a set of criteria remains understudied in nature, on the one hand, and the real–scenario is not taken into account in that a single/very limited number of measure only are used, on the other hand. These problems affect the ability to make a consistent conclusion. Therefore, the aim of this study is to address this methodological issue by applying support vector machine (SVM)-based CDP algorithm by means of a set of representative performance criterions, with enclosing some novel performance measures, its performance compare with the results gained by statistical and intelligent approaches using six different types of databases from the credit prediction domains. Experimental results show that SVM model is marginally superior to CART with DA, being more robust than its other counterparts. In consequence, this study recommends that the supremacy of a classifier is linked to the way in which evaluations are measured.
Similar content being viewed by others
References
Akkoc, S. 2012. An Empirical Comparison of Conventional Techniques, Neural Networks and the Three Stage Hybrid Adaptive Neuro Fuzzy Inference System (ANFIS) Model for Credit Scoring Analysis: the Case of Turkish Credit Card Data. European Journal of Operational Research 222: 168–178. doi:10.1016/j.ejor.2012.04.009.
Ala’raj, M., and M.F. Abbod. 2016. Classifiers Consensus System Approach for Credit Scoring. Applied Soft Computing 144: 89–105. doi:10.1016/j.knosys.2016.04.013.
Arisawa, M., and J. Watada. 1994. Enhanced Learning in Neural Networks and its Application to Financial Statement Analysis. Paper presented at IEEE International Conference on Neural Networks.
Biggerstaff, B.J. 2000. Comparing Diagnostic Tests: A Simple Graphic Using Likelihood Ratios. Statistics in Medicine 19: 649–663.
Blakeley, D., and E. Oddone. 1995. Noninvasive Carotid Artery Testing. Annals of Internal Medicine 122: 360–367.
Chen, M.Y. 2011. Bankruptcy Prediction in Firms with Statistical and Intelligent Techniques and a Comparison of Evolutionary Computation Approaches. Computers and Mathematics with Applications 62: 4514–4524. doi:10.1016/j.camwa.2011.10.030.
Cherkassky, V., and Y. Ma. 2004. Practical Selection of SVM Parameters and Noise Estimation for SVM Regression. Neural Networks 17 (1): 113–126.
Ferri, C., J.H. Orallo, and R. Modroiu. 2009. An Experimental Comparison of Performance Measures for Classification. Pattern Recognition Letters 30: 27–38. doi:10.1016/j.patrec.2008.08.010.
He, H., and E. García. 2009. Learning from Imbalanced Data. IEEE Transactions on Knowledge and Data Engineering 21: 1263–1284. doi:10.1109/TKDE.2008.239.
Hu, Y.C., and J. Ansell. 2007. Measuring Retail Company Performance Using Credit Scoring Techniques. European Journal of Operational Research 183: 1595–1606. doi:10.1016/j.ejor.2006.09.101.
Jones, S., D. Johnstone, and R. Wilson. 2015. An Empirical Evaluation of the Performance of Binary Classifiers in the Prediction of Credit Ratings Changes. Journal of Banking & Finance 56: 72–85. doi:10.1016/j.jbankfin.2015.02.006.
Khashei, M., A.Z. Hamadani, and M. Bijari. 2012. A Novel Hybrid Classification Model of Artificial Neural Networks and Multiple Linear Regression Models. Expert Systems with Applications 39: 2606–2620. doi:10.1016/j.eswa.2011.08.116.
Lee, T.S., and I.F. Chen. 2005. A Two-Stage Hybrid Credit Scoring Model Using Artificial Neural Networks and Multivariate Adaptive Regression Splines. Expert Systems with Applications 28: 743–752. doi:10.1016/j.eswa.2004.12.031.
Lichman, M. 2013. uci machine learning repository. http://archive.ics.uci.edu/ml/.
Musa, A.B. 2013. Comparative Study on Classification Performance Between Support Vector Machine and Logistic Regression. International Journal of Machine Learning and Cybernetics 4: 13–24. doi:10.1007/s13042-012-0068-x.
Nanda, S., and P. Pendharkar. 2001. Linear Models for Minimizing Misclassification Costs in Bankruptcy Prediction. International Journal of Intelligent Systems in Accounting, Finance & Management 10: 155–168. doi:10.1002/isaf.203.
Rodríguez, L.C., E.P. Castaño, and C.R. Samblás. 2016. Quality Performance Metrics in Multivariate Classification Methods for Qualitative Analysis. Trends in Analytical Chemistry 80: 612–624. doi:10.1016/j.trac.2016.04.021.
Schaefer, S.M., and I.A. Strebulaev. 2008. Structural Models of Credit Risk are Useful: Evidence from Hedge Ratios on Corporate Bonds. Journal of Financial Economics 90: 1–19. doi:10.1016/j.jfineco.2007.10.006.
Sokolova, M., and G. Lapalme. 2009. A Systematic Analysis of Performance Measures for Classification Tasks. Information Processing and Management 45: 427–437. doi:10.1016/j.ipm.2009.03.002.
Sun, J., and H. Li. 2012. Financial Distress Prediction Using Support Vector Machines: Ensemble Vs Individual. Applied Soft Computing 12: 2254–2265. doi:10.1016/j.asoc.2012.03.028.
Tinoco, M.H., and N. Wilson. 2013. Financial Distress and Bankruptcy Prediction among Listed Companies Using Accounting, Market and Macroeconomic Variables. International Review of Financial Analysis 30: 394–419. doi:10.1016/j.irfa.2013.02.013.
Van Gestel, T., B. Baesens, J.A.K. Suykens, D. Van den Poel, D. Baestaens, and M. Willekens. 2006. Bayesian Kernel Based Classification for Financial Distress Detection. European Journal of Operational Research 172: 979–1003.
Vapnik, V.N. 1995. The Nature of Statistical Learning Theory. New York: Springer-Verlag.
Vapnik, V.N. 1998. Statistical Learning Theory. New York: Wiley.
Vidakovic, B. 2011. Statistics for Bioengineering Sciences: With MATLAB and WinBUGS Support. New York: Springer. doi:10.1007/978-1-4614-0394-4_3.
West, D. 2000. Neural Network Credit Scoring Models. Computers & Operations Research 27: 1131–1152.
Youden, W. 1950. Index for Rating Diagnostic Tests. Cancer 3: 32–35.
Zhong, H., C. Miao, Z. Shen, and Y. Feng. 2014. Comparing the Learning Effectiveness of BP, ELM, I-ELM, and SVM for Corporate Credit Ratings. Neurocomputing 128: 285–295. doi:10.1016/j.neucom.2013.02.054.
Zhou, L., K.K. Lai, and Yu. Lean. 2010. Least Squares Support Vector Machines Ensemble Models for Credit Scoring. Expert Systems with Applications 37: 127–133. doi:10.1016/j.eswa.2009.05.024.
Acknowledgments
We are very grateful to the editor-in-chief, Igor Loncarski; managing editors; and anonymous reviewers for their substantial contribution because with their assistance the manuscript has been significantly improved. The research is supported by the National Natural Science Foundation of China (71171031 and 71471027), National Social Science Foundation of China (16BTJ017), Social Science Foundation of Liaoning province of China (L16BJY016), Credit Risks Rating System and Loan Pricing Project of Small Enterprises for Bank of Dalian (2012–2001) and Credit Risks Evaluation and Loan Pricing For Petty Loan Funded for the Head Office of Postal Savings Bank of China (2009–2007). We thank the organizations mentioned above.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that there is no conflict of interests regarding the publication of the paper.
Additional information
Fahmida E. Moula, Chi Guotai and Mohammad Zoynul Abedin have contributed equally to this work.
Rights and permissions
About this article
Cite this article
Moula, F.E., Guotai, C. & Abedin, M.Z. Credit default prediction modeling: an application of support vector machine. Risk Manag 19, 158–187 (2017). https://doi.org/10.1057/s41283-017-0016-x
Published:
Issue Date:
DOI: https://doi.org/10.1057/s41283-017-0016-x