Skip to main content

An Empirical Study on Credit Risk Assessment Using Ensemble Classifiers

  • Conference paper
  • First Online:
Advances in Data Science and Computing Technologies (ADSC 2022)

Abstract

Credit risk assessment is the major concerning issue for every financial organization. A simple change in evaluation procedure can have a large influence on the financial market. The impact of machine learning techniques in credit risk assessment is highly influential. The intention of this research is to give insight into ensemble strategies for credit risk assessment and to evaluate with other standalone methods. The paper compares four tree-based classifiers namely—Decision Tree (DT), Random Forest (RF), AdaBoost and XGBoost using three different train-test splits for two publicly available German and Australian datasets. As per the findings, bagging based Random Forest classifier outperforms all the other classifiers for both datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    Staglog (German Credit Data) Data Set—https://archive.ics.uci.edu/ml/datasets/statlog+(german+credit+data).

  2. 2.

    Staglog (Australian Credit Data) Data Set—https://archive.ics.uci.edu/ml/datasets/statlog+(australian+credit+approval).

References

  1. Balin BJ (2008) Basel I, Basel II, and emerging markets: a nontechnical analysis

    Google Scholar 

  2. Levy A, Baha R (2021) Credit risk assessment: a comparison of the performances of the linear discriminant analysis and the logistic regression. Int J Entrep Small Bus 42:169–186

    Google Scholar 

  3. Chang Y-C, Chang K-H, Chu H-H, Tong L-I (2016) Establishing decision tree-based short-term default credit risk assessment models. Commun Stat—Theor Methods 45:6803–6815

    Article  MathSciNet  MATH  Google Scholar 

  4. Satchidananda SS, Simha JB (2006) Comparing decision trees with logistic regression for credit risk analysis. International Institute of Information Technology, Bangalore, India

    Google Scholar 

  5. Roy AG, Urolagin S (2019) Credit risk assessment using decision tree and support vector machine based data analytics. In: Creative business and social innovations for a sustainable future. Springer, pp 79–84

    Google Scholar 

  6. Wang Y, Duan D (2021) Research on risk assessment of clients before loan based on decision tree algorithm. J Phys: Conf Ser. IOP Publishing 012056

    Google Scholar 

  7. Quinlan JR (1986) Induction of decision trees. Mach Learn 1:81–106

    Article  Google Scholar 

  8. Ayodele OE (2021) Development of credit risk prediction model using support vector machine technique. Ph.d. Thesis, Federal University of Technology Akure

    Google Scholar 

  9. Cortes C, Vapnik V (1995) Support-vector networks. Mach Learn 20:273–297

    Article  MATH  Google Scholar 

  10. DanÄ—nas P (2009) Support vector machines and their application in credit risk evaluation process

    Google Scholar 

  11. Ghodselahi A (2011) A hybrid support vector machine ensemble model for credit scoring. Int J Comput Appl 17:1–5

    Google Scholar 

  12. Danenas P, Garsva G (2015) Selection of support vector machines based classifiers for credit risk domain. Expert Syst Appl 42:3194–3204

    Article  Google Scholar 

  13. Bahrammirzaee A (2010) A comparative survey of artificial intelligence applications in finance: artificial neural networks, expert system and hybrid intelligent systems. Neural Comput Appl 19:1165–1195

    Article  Google Scholar 

  14. Bhattacharjee B, Sridhar A, Shafi M (2017) An artificial neural network-based ensemble model for credit risk assessment and deployment as a graphical user interface. Int J Data Min, Model Manage 9:122–141

    Google Scholar 

  15. Aithal V, Jathanna RD (2019) Credit risk assessment using machine learning techniques. Int J Innovative Technol Exploring Eng 9:3482–3486

    Article  Google Scholar 

  16. Tang L, Cai F, Ouyang Y (2019) Applying a nonparametric random forest algorithm to assess the credit risk of the energy industry in China. Technol Forecast Soc Chang 144:563–572

    Article  Google Scholar 

  17. Ye X, Dong L, Ma D (2018) Loan evaluation in P2P lending based on random forest optimized by genetic algorithm with profit score. Electron Commer Res Appl 32:23–36

    Article  Google Scholar 

  18. Tian Z, Xiao J, Feng H, Wei Y (2020) Credit risk assessment based on gradient boosting decision tree. Procedia Comput Sci 174:150–160

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Arijit Bhattacharya .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Bhattacharya, A., Parui, S.K., Biswas, S.K., Mandal, A. (2023). An Empirical Study on Credit Risk Assessment Using Ensemble Classifiers. In: Chakraborty, B., Biswas, A., Chakrabarti, A. (eds) Advances in Data Science and Computing Technologies. ADSC 2022. Lecture Notes in Electrical Engineering, vol 1056. Springer, Singapore. https://doi.org/10.1007/978-981-99-3656-4_16

Download citation

Publish with us

Policies and ethics