Skip to main content

Probability of Default Modeling: A Machine Learning Approach

  • Chapter
  • First Online:
Mathematical and Statistical Methods for Actuarial Sciences and Finance

Abstract

Default prediction through probability of default modeling has attracted lots of research interests in the past literature and recent studies have shown that Artificial Intelligence (AI) methods achieved better performance than traditional statistical methods. This paper empirically investigates the results of applying different machine learning techniques through the overall estimation process to reduce the running time, maximize—in the first stage—the predictive power and contribute of each variable to the estimation of PDs. In the second stage, we have identified the best multivariate combination of drivers by comparing the results of a set of supervised machine learning algorithm. In the last development stage, we have applied an unsupervised machine learning to calibrate parameters and ranked the customers within an ordinal n-class scale obtained through the application of an unsupervised learning classification technique. Finally, we have verified the calibration goodness through classical calibration test (e.g. binomial tests). The study has been done on big data sample with more than 800,000 Retail customers of a European Bank under ECB Supervision, with 10 years of historical information and more than 600 variables to be analyzed for each customer.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 249.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Zhao, Z., Xu, S., Kang, B.H., Kabir, M.J., Liu, Y., Wasinger, R.: Investigation and improvement of multi-layer perceptron neural networks for credit scoring. Expert Syst. Appl. 48, 3508–3516 (2015)

    Article  Google Scholar 

  2. McLachlan, G.: Discriminant Analysis and Statistical Pattern Recognition. Wiley Interscience, London (2004)

    MATH  Google Scholar 

  3. Steinbach, M., Tan, P.N.: kNN: k-nearest neighbors. In: Wu, X., Kumar, V. (eds.) The Top Ten Algorithms in Data Mining, pp. 151–162. Chapman & Hall/CRC, Boca Raton (2009)

    Chapter  Google Scholar 

  4. Antunes, F., Ribeiroa, B., Pereira, F.: Probabilistic modeling and visualization for bankruptcy prediction. Appl. Soft Comput. 60, 831–843 (2017)

    Article  Google Scholar 

  5. Dwyer, D.W., Stein, R.M.: Inferring the default rate in a population by comparing two incomplete default databases. J. Bank. Financ. 30, 797–810 (2006)

    Article  Google Scholar 

  6. Han, L., Fraser, S., Storey, D.J.: Are good or bad borrowers discouraged from applying for loans? J. Bank. Financ. 33, 415–424 (2009)

    Article  Google Scholar 

  7. Yu, L., Yue, W., Wang, S., Lai, K.K.: Support vector machine based multiagent ensemble learning for credit risk evaluation. Expert Syst. Appl. 37, 1351–1360 (2010)

    Article  Google Scholar 

  8. Steinberg, D.: CART: classification and regression trees. In: Wu, X., Kumar, V. (eds.) The Top Ten Algorithms in Data Mining, pp. 180–201. Chapman & Hall/CRC, Boca Raton (2009)

    Google Scholar 

  9. Fonseca, P., Lopes, H.: Calibration of Machine Learning Classifiers for Probability of Default Modelling. James Finance, Crowd Process Inc. (2017)

    Google Scholar 

  10. Khandani, A.E., Kim, J., Lo, A.W.: Consumer credit-risk models via machine-learning algorithms. J. Bank. Financ. 34, 2767–2787 (2010)

    Article  Google Scholar 

  11. Lessmann, S., Baesens, B., Seow, H.V., Thomas, L.C.: Benchmarking state-of-the-art classification algorithms for credit scoring: an update of research. Eur. J. Oper. Res. 247(1), 124–136 (2015)

    Article  Google Scholar 

  12. Nanni, L., Lumini, A.: An experimental comparison of ensemble of classifiers for bankruptcy prediction and credit scoring. Expert Syst. Appl. 36, 3028–3033 (2009)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Giuliana Caivano .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer International Publishing AG, part of Springer Nature

About this chapter

Check for updates. Verify currency and authenticity via CrossMark

Cite this chapter

Bonini, S., Caivano, G. (2018). Probability of Default Modeling: A Machine Learning Approach. In: Corazza, M., Durbán, M., Grané, A., Perna, C., Sibillo, M. (eds) Mathematical and Statistical Methods for Actuarial Sciences and Finance. Springer, Cham. https://doi.org/10.1007/978-3-319-89824-7_32

Download citation

Publish with us

Policies and ethics