Skip to main content

An Unsupervised Learning Approach Towards Credit Risk Modelling Using DFT Features and Gaussian Mixture Models

  • Conference paper
  • First Online:
Advances in Intelligent Computing and Communication

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 430))

  • 310 Accesses

Abstract

One of the most important problems in the present times is to estimate the risk in lending financial resources with respect to its returns. Credit risk can harm a lender by increasing collection costs and causing cash flow inconsistency. Lenders use credit risk modelling to assess the amount of credit risk associated with lending credit to borrowers. Financial statement analysis, default likelihood and machine learning are the options for credit risk analysis models. And, solving this kind of problem using machine learning techniques is known as credit risk modelling. In this process, we fit data having loads of features related to the financial conditions of a person into the model to classify a lender as a defaulter or non-defaulter. In this study, we used an unsupervised machine learning technique for this task. First, we applied two feature selection methods, viz. using the Pearson’s correlation coefficient and chi-square test, to select certain features which are less informative for the task. Feature selection is one of the pre-processing standards in designing advanced solutions because it does not only alleviate dataset dimensions but also improves a model’s performance measures. We also applied the fast Fourier transform (FFT) algorithm to get the discrete Fourier transform (DFT) of all the selected features, as supplementary and artificial feature vectors to the model. To deal with class imbalance, we used a specific variant of synthetic minority oversampling technique (SMOTE), i.e. Gaussian-SMOTE. Finally, we applied Gaussian mixture model (GMM) at distinct values for its parameters like the ‘covariance type’. With this and the optimally selected parameters, our methodology was able to achieve a classification accuracy of 85.48%.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 189.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 249.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Arora N, Kaur PD (2020) A Bolasso based consistent feature selection enabled random forest classification algorithm: an application to credit risk assessment. Appl Soft Comput 86:105936

    Google Scholar 

  2. Wang Y et al (2020) A comparative assessment of credit risk model based on machine learning—a case study of bank loan data. Procedia Comput Sci 174:141–149

    Article  Google Scholar 

  3. Shen F et al (2021) A new deep learning ensemble credit risk evaluation model with an improved synthetic minority oversampling technique. Appl Soft Comput 98:106852

    Google Scholar 

  4. Jadwal PK, Jain S, Agarwal B (2020) Financial credit risk evaluation model using machine learning-based approach. World Rev Entrepreneurship Manag Sustain Dev 16(6):576–589

    Article  Google Scholar 

  5. Munkhdalai L et al (2021) A partially interpretable adaptive Softmax regression for credit scoring. Appl Sci 11(7):3227

    Article  Google Scholar 

  6. Suhaimi NAD, Abas H (2020) A systematic literature review on supervised machine learning algorithms. PERINTIS eJ 10(1):1–24

    Google Scholar 

  7. Bhatore S, Mohan RYR (2020) Machine learning techniques for credit risk evaluation: a systematic literature review. J Bank Financ Technol 4(1):111–138

    Article  Google Scholar 

  8. https://archive.ics.uci.edu/ml/datasets/Statlog+%28Australian+Credit+Approval%29

  9. https://archive.ics.uci.edu/ml/datasets/Japanese+Credit+Screening

  10. https://archive.ics.uci.edu/ml/datasets/default+of+credit+card+clients

  11. Chawla NV et al (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357

    Article  Google Scholar 

  12. Lee H, Kim J, Kim S (2017) Gaussian-based SMOTE algorithm for solving skewed class distributions. Int J Fuzzy Logic Intell Syst 17(4):229–234

    Article  MathSciNet  Google Scholar 

Download references

Funding Acknowledgement

This research was supported by the IMPRESS Grants of Indian Council of Social Science and Research, Government of India.

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Pandit, A.K., Vashishtha, A., Sumbria, S., Mahajan, S. (2022). An Unsupervised Learning Approach Towards Credit Risk Modelling Using DFT Features and Gaussian Mixture Models. In: Mohanty, M.N., Das, S. (eds) Advances in Intelligent Computing and Communication. Lecture Notes in Networks and Systems, vol 430. Springer, Singapore. https://doi.org/10.1007/978-981-19-0825-5_1

Download citation

Publish with us

Policies and ethics