Skip to main content

Principal Component Analysis and ReliefF Cascaded with Decision Tree for Credit Scoring

  • Conference paper
  • First Online:

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 985))

Abstract

The objective of this paper is to propose a credit scoring approval model using a feature selection technique performed by Principal Component Analysis (PCA) and ReliefF algorithm followed by a decision tree classifier. As a reference classifier, we have chosen Support Vector Machine (SVM). The performance of our proposed model has been tested using the German credit dataset. The experimental results of the proposed signal processing cascade for the credit scoring lead to the best accuracy of 91.67%, while classifiers without feature selection show the best accuracy of only 75.35%. On the other side, using the same combination of feature selection (PCA and ReliefF) but cascaded with SVM classifier, one has obtained an accuracy of only 85.15%. The experimental results confirm the accuracy of the proposed model, and at the same time they show the importance of feature selection and its optimization for credit scoring decision systems.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Abdelmoula, A.K.: Bank credit risk analysis with k-nearest- neighbor classifier: case of Tunisian banks. Account. Manag. Inf. Syst. 14(1), 79–106 (2015)

    Google Scholar 

  2. Agre, G., Dzhondzhorov, A.: A weighted feature selection method for instance-based classification. In: 17th International Conference on Artificial Intelligence : Methodology, Systems and Applications (AIMSA), pp. 14–25. Springer, Switzerland (2016)

    Chapter  Google Scholar 

  3. Anaei, S.M., Moradi, M.: A new method based on clustering and feature selection for credit scoring of banking customers. Int. J. Mod. Trends Eng. Res. 3(2), 123–128 (2016)

    Google Scholar 

  4. Antunes, F., Ribeiro, B., Pereira, F.: Probabilistic modeling and visualization for bankruptcy prediction. Appl. Soft Comput. J. 60, 831–843 (2017)

    Article  Google Scholar 

  5. Beretta, L., Santaniello, A.: Implementing ReliefF filters to extract meaningful features from genetic lifetime datasets. J. Biomed. Inform. 44(2), 361–369 (2011)

    Article  Google Scholar 

  6. Browne, D., Prestwich, S.: Credit scoring : feature selection on machine learning algorithms (2016)

    Google Scholar 

  7. Damrongsakmethee, T., Neagoe, V.: Data Mining and machine learning for financial analysis. Indian J. Sci. Technol. 10(39), 1–7 (2017)

    Article  Google Scholar 

  8. Gepp, A., Kumar, K.: Predicting financial distress: a comparison of survival analysis and decision tree techniques. In: Eleventh International Multi-Conference on Information Processing-2015 (IMCIP-2015), pp. 396–404. Elsevier (2015)

    Google Scholar 

  9. Go, W., Lee, T., Kim, I., Lee, K.: Feature selection practice for unsupervised learning of credit card fraud detection. J. Theor. Appl. Inf. Technol. 96(2), 408–417 (2018)

    Google Scholar 

  10. Gupta, A.: Classification of complex UCI datasets using machine learning and evolutionary algorithms. Int. J. Sci. Technol. Res. 4(5), 85–94 (2015)

    Google Scholar 

  11. Ha, V., Nguyen, H.: Credit scoring with a feature selection approach based deep learning. In: MATEC web of Conferences (MIMT), pp. 1–5 (2016)

    Article  Google Scholar 

  12. Ilgun, E., Mekic, E., Mekic, E.: Application of Ann in Australian credit card apporval. Int. Multidiscip. J. 69(2), 334–342 (2014)

    Google Scholar 

  13. Louzada, F., Ara, A., Fernandes, G.B.: Surveys in operations research and management science classification methods applied to credit scoring: systematic review and overall comparison. Surv. Oper. Res. Manag. Sci. 21(2), 117–134 (2016)

    MathSciNet  Google Scholar 

  14. Neagoe, V., Ciotec, A., Cucu, S.: Deep convolutional neural networks versus multilayer perceptron for financial prediction. In: International Conference on Communications (COMM-2018), pp. 201–206. IEEE, Bucharest (2018)

    Google Scholar 

  15. Pendey, S., Benkatesh, N.: Analysis of German credit data using microsoft azure machine learning. J. Emerg. Technol. Innov. Res. 5(2), 302–305 (2018)

    Google Scholar 

  16. Sang, H., Nam, N., Nhan, N.: A novel credit scoring prediction model based on feature selection approach and parallel random forest. Indian J. Sci. Technol. 9(20), 1–6 (2016)

    Google Scholar 

  17. Shukla, A., Mishra, A., Gwalior, M.: Design of credit approval system using artificial neural network: a case study. Int. J. Eng. Res. Comput. Sci. Eng. 4(1), 1–6 (2017)

    Article  Google Scholar 

  18. Steinberg, D.: CART : Classification And Regression Trees, Chap. 10 (2009)

    Google Scholar 

  19. Urbanowicz, R.J., Olson, R.S., Schmitt, P., Meeker, M.: Benchmarking relief-based feature selection methods for bioinformatics data mining. J. Biomed. Inform. 4(2), 1–21 (2018)

    Article  Google Scholar 

  20. Van, S., Ha, N., Bao, H.: A hybrid feature selection method for credit scoring. EAI Endorsed Trans. Context. Syst. Appl. 4(11), 1–6 (2017)

    Google Scholar 

  21. UCI. https://archive.ics.uci.edu/ml/datasets/statlog+(german+credit+data). Accessed 10 Sept 2018

  22. ReliefF algorithm. https://uk.mathworks.com/help/stats/relieff.html#responsive_offcanvas. Accessed 10 Feb 2019

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Thitimanan Damrongsakmethee .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2019 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Damrongsakmethee, T., Neagoe, VE. (2019). Principal Component Analysis and ReliefF Cascaded with Decision Tree for Credit Scoring. In: Silhavy, R. (eds) Artificial Intelligence Methods in Intelligent Algorithms. CSOC 2019. Advances in Intelligent Systems and Computing, vol 985. Springer, Cham. https://doi.org/10.1007/978-3-030-19810-7_9

Download citation

Publish with us

Policies and ethics