A Comparative Study of Machine Learning Algorithms for Enhanced Credit Default Prediction

Uddin, Mohammad Salah; Rahman, Md. Anishur

doi:10.1007/978-981-99-8438-1_15


Part of the book series: Algorithms for Intelligent Systems (AIS)

Included in the following conference series:

International Conference on Engineering, Applied Sciences and System Modeling


Abstract

In today’s competitive financial arena, accurate credit default prediction is essential to the stability and profitability of banks. This study presents a comparative analysis of machine learning algorithms for forecasting the likelihood of credit default. Six algorithms are compared: Support Vector Machine (SVM), K-Nearest Neighbor (K-NN), Logistic Regression, Decision Tree (DT), Gaussian Naive Bayes, and Random Forest (RF). All models were trained and evaluated on an investment dataset obtained from a private bank located in Dhaka, Bangladesh. The results indicate that the Random Forest (RF) and Decision Tree (DT) models achieved higher accuracy than the other methods, at 92% and 94%, respectively. The study also highlights the importance of feature selection and prediction boosting for optimizing credit default prediction rates.
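The six-way comparison described in the abstract could be set up roughly as follows. This is a minimal sketch, not the authors' pipeline: it assumes scikit-learn, uses synthetic data from `make_classification` as a stand-in for the bank dataset (which is not public), and the function name `compare_models`, the train/test split, the scaling step, and all hyperparameters are illustrative assumptions. The accuracies it prints say nothing about the figures reported in the chapter.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier


def compare_models(random_state=0):
    """Fit six classifiers on synthetic data and return test accuracy per model."""
    # Synthetic, imbalanced stand-in data (assumption: about 20% "default" cases).
    X, y = make_classification(
        n_samples=1000,
        n_features=12,
        n_informative=6,
        weights=[0.8, 0.2],
        random_state=random_state,
    )
    # Hold out 25% of the rows for evaluation, preserving the class ratio.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, stratify=y, random_state=random_state
    )

    # Distance- and margin-based models are wrapped with feature scaling.
    models = {
        "SVM": make_pipeline(StandardScaler(), SVC()),
        "K-NN": make_pipeline(StandardScaler(), KNeighborsClassifier()),
        "Logistic Regression": make_pipeline(
            StandardScaler(), LogisticRegression(max_iter=1000)
        ),
        "Decision Tree": DecisionTreeClassifier(random_state=random_state),
        "Gaussian Naive Bayes": GaussianNB(),
        "Random Forest": RandomForestClassifier(random_state=random_state),
    }

    scores = {}
    for name, model in models.items():
        model.fit(X_train, y_train)
        scores[name] = accuracy_score(y_test, model.predict(X_test))
    return scores


if __name__ == "__main__":
    # Print models from highest to lowest test accuracy.
    for name, acc in sorted(
        compare_models().items(), key=lambda kv: kv[1], reverse=True
    ):
        print(f"{name}: {acc:.3f}")
```

On imbalanced default data, accuracy alone can flatter a model that mostly predicts the majority class, so a comparison of this kind is often supplemented with precision, recall, or ROC AUC.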


Author information

Authors and Affiliations

Computer Science and Engineering Department, East West University, Dhaka, Bangladesh
Mohammad Salah Uddin & Md. Anishur Rahman

Corresponding author

Correspondence to Mohammad Salah Uddin.

Editor information

Editors and Affiliations

Faculty of Innovation and Technology, Taylor’s University, Subang Jaya, Selangor, Malaysia
David Asirvatham
University of Southeast Norway, Notodden, Norway
Francisco M. Gonzalez-Longatt
Gdansk University of Technology, Gdańsk, Poland
Przemyslaw Falkowski-Gilski
Professor of Computer Engineering, Papua New Guinea University of Technology, Lae, Papua New Guinea
R. Kanthavel

About this paper

Cite this paper

Uddin, M.S., Rahman, M.A. (2024). A Comparative Study of Machine Learning Algorithms for Enhanced Credit Default Prediction. In: Asirvatham, D., Gonzalez-Longatt, F.M., Falkowski-Gilski, P., Kanthavel, R. (eds) Evolutionary Artificial Intelligence. ICEASSM 2017. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-99-8438-1_15

DOI: https://doi.org/10.1007/978-981-99-8438-1_15
Published: 14 March 2024
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8437-4
Online ISBN: 978-981-99-8438-1
eBook Packages: Intelligent Technologies and Robotics (R0)
