Abstract
In today’s competitive financial arena, accurate credit default prediction is very important for sustaining the stability and profitability of banks. This research study presents a comparative analysis of various machine learning algorithms, which are used for forecasting the likelihood of credit default. Six diverse algorithms—Support Vector Machine (SVM), K-Nearest Neighbor (K-NN), Logistic Regression, Decision Tree (DT), Gaussian Naive Bayes, and Random Forest (RF)—are used to construct the predictive comparison. All the models were trained and evaluated by using an investment dataset obtained from a private bank located in Dhaka, Bangladesh. The results of the study indicate that the Random Forest (RF) and Decision Tree (DT) models have achieved higher accuracy in predicting the outcomes when compared to other machine learning methods, with an accuracy of 92 and 94%, respectively. This study also highlights the importance of feature selection and prediction boosting in order to optimize the credit default prediction rates.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ozili PK, Outa E (2017) Bank loan loss provisions research: a review. Borsa Istanbul Rev 17(3):144–163
Malczyk A (2011) Good debt, bad debt: your money. Pers Financ 2011(367):9–10
Hanson J (2006) Good debt, bad debt: Knowing the difference can save your financial life. Jon Hanson
Löeffler G, Posch PN (2011) Credit risk modeling using Excel and VBA. Wiley
Ghorbani R, Kordestani G, Haghighat H, Ghaemi MH, Azizmohammadlou H (2021) Developing a model for evaluating the effectiveness of risk management in the banking industry. Financ Res J 22(4):496–520
Madaan M, Kumar A, Keshri C, Jain R, Nagrath P (2021) Loan default prediction using decision trees and random forest: a comparative study. In: IOP conference series: materials science and engineering, vol 1022, no 1, p 012042
Coşer A, Maer-matei MM, Albu C (2019) Predictive models for loan default risk assessment. Econ Comput Econ Cybern Stud Res 53(2)
Li Z, Li K, Yao X, Wen Q (2019) Predicting prepayment and default risks of unsecured consumer loans in online lending. Emerg Mark Financ Trade 55(1):118–132
Anand M, Velu A, Whig P (2022) Prediction of loan behaviour with machine learning models for secure banking. J Comput Sci Eng (JCSE) 3(1):1–13
Zanin L (2020) Combining multiple probability predictions in the presence of class imbalance to discriminate between potential bad and good borrowers in the peer-to-peer lending market. J Behav Exp Financ 25:100272
Conklin JD (2002) Applied logistic regression
Steinbach M, Tan PN (2009) kNN: k-nearest neighbors. The top ten algorithms in data mining, pp 151–162
Ma Y, Guo G (2014) Support vector machines applications, vol 649. Springer
Leung KM et al (2007) Naive bayesian classifier. Polytechnic University Department of Computer Science/Finance and Risk Engineering, vol 2007, pp 123–156
Izza Y, Ignatiev A, Marques-Silva J (2020) On explaining decision trees. arXiv preprint arXiv:2010.11034
Schonlau M, Zou RY (2020) The random forest algorithm for statistical learning. Stata J 20(1):3–29
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Uddin, M.S., Rahman, M.A. (2024). A Comparative Study of Machine Learning Algorithms for Enhanced Credit Default Prediction. In: Asirvatham, D., Gonzalez-Longatt, F.M., Falkowski-Gilski, P., Kanthavel, R. (eds) Evolutionary Artificial Intelligence. ICEASSM 2017. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-99-8438-1_15
Download citation
DOI: https://doi.org/10.1007/978-981-99-8438-1_15
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-8437-4
Online ISBN: 978-981-99-8438-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)