FRFE: Fast Recursive Feature Elimination for Credit Scoring
Credit scoring is one of the most important issues in financial decision-making. The use of data mining techniques to build models for credit scoring has been a hot topic in recent years. Classification problems often have a large number of features, but not all of them are useful for classification. Irrelevant and redundant features in credit data may even reduce the classification accuracy. Feature selection is a process of selecting a subset of relevant features, which can decrease the dimensionality, reduce the running time, and improve the accuracy of classifiers. Random forest (RF) is a powerful classification tool which is currently an active research area and successfully solves classification problems in many domains. In this study, we constructed a fast credit scoring model based on parallel Random forests and Recursive Feature Elimination (FRFE) . Two public UCI data sets, Australia and German credit have been used to test our method. The experimental results of the real world data showed that the proposed method results in a higher prediction rate than a baseline method for some certain datasets and also shows comparable and sometimes better performance than the feature selection methods widely used in credit scoring.
KeywordsCredit risk Credit scoring Feature selection Random forests RFE Machine learning
- 2.Davoodabadi, Z., Moeini, A.: Building customers’ credit scoring models with combination of feature selection and decision tree algorithms 4(2), 97–103 (2015)Google Scholar
- 13.Ghatge, A.R., Halkarnikar, P.P.: Ensemble neural network strategy for predicting credit default evaluation 2(7), 223–225 (2013)Google Scholar
- 15.Ghodselahi, A.: A hybrid support vector machine ensemble model for credit scoring. Int. J. Comput. Appl. 17(5), 1–5 (2011)Google Scholar
- 19.Wang, Y., Wang, S., Lai, K.: A new fuzzy support vector machine to evaluate credit risk. Comput. J. IEEE Trans. Fuzzy Syst. 13(6), 25–29 (2005)Google Scholar
- 21.Ling, Y., Cao, Q.Y., Zhang, H.: Application of the PSO-SVM model for credit scoring. In: Proceedings of the 2011 7th International Conference on Computational Intelligent and Security, CIS 2011, pp. 47–51 (2011)Google Scholar