Skip to main content
Log in

CDGAT: a graph attention network method for credit card defaulters prediction

  • Published:
Applied Intelligence Aims and scope Submit manuscript

Abstract

Recognizing potential defaulters is a crucial problem for financial institutions. Therefore, many credit scoring methods have been proposed in the past to address this issue. However, these methods rarely consider the interaction among customers such as bank transfer and remittance. With rapid growth in the number of customers adopting online banking services, such interaction information plays a significant role in assessing their credit score. In this paper, we propose a novel scalable credit scoring approach called CDGAT (Graph attention network for credit card defaulters) for predicting potential credit card defaulters. In CDGAT, a customer’s credit score is calculated based on transaction embedding and neighborhood embedding. To obtain the neighborhood embedding, CDGAT first utilizes the Amount-bias Sampling (AbS) strategy to extract a subgraph for each customer. Next, CDGAT directly aggregates neighbors’ features according to their influence weights. The experimental results on the dataset from Industrial and Commercial Bank of China (Macau) Limited (ICBC (Macau)) show that CDGAT significantly outperforms the baseline methods. Furthermore, experimental results reveal that the proposed method is also superior to several state-of-the-art Graph Convolutional Neural Network models in terms of scalability and performance.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Code Availability

The program developed during the current study are not publicly available.

References

  1. Berg T, Burg V, Gombović A, Puri M (2019) On the rise of fintechs: credit scoring using digital footprints. Rev Financ Stud 33(7):2845–2897

    Article  Google Scholar 

  2. Bellotti T, Crook J (2013) Forecasting and stress testing credit card default using dynamic models. Int J Forecast 29(4):563–574

    Article  Google Scholar 

  3. Butaru F, Chen Q, Clark B, Das S, Lo AW, Siddique A (2016) Risk and risk management in the credit card industry. J B Financ 72:218–239

    Article  Google Scholar 

  4. Björkegren D, Grissen D (2017) Behavior revealed in mobile phone usage predicts loan repayment. arXiv:1712:05840. 1–28

  5. Bhattacharyya S, Jha S, Tharakunnel KK, Westland JC (2011) Data mining for credit card fraud: a comparative study. Decis Support Syst 50(3):602–613

    Article  Google Scholar 

  6. Babaev D, Savchenko M, Tuzhilin A, Umerenkov D (2019) ET-RNN: applying deep learning to credit loan applications. In: Teredesai A, Kumar V, Li Y, Rosales R, Terzi E, Karypis G (eds) Proceedings of the 25th ACM SIGKDD International conference on knowledge discovery and data mining. ACM, Anchorage, pp 2183–2190

  7. Chung J, Gülçehre C, KyungHyun C, Bengio Y (2014) Empirical evaluation of gated recurrent neural networks on sequence modeling. arXiv:1412.3555, 1–9

  8. Cao B, Mao M, Viidu S, Philip SYu (2017) Collective fraud detection capturing inter-transaction dependency. In: Anandakrishnan A, Kumar S, Statnikov AR, Faruquie TA, Xu D (eds) Proceedings of the 23th KDD workshop on anomaly detection. PMLR, Halifax, pp 66–75

  9. Defferrard M, Bresson X, Vandergheynst P (2016) Convolutional neural networks on graphs with fast localized spectral filtering. In: Lee DD, Sugiyama M, von Luxburg U, Guyon I, Garnett R (eds) Proceedings of the 29th advances in neural information processing systems. Curran Associates Inc, Barcelona, pp 3837–3845

  10. Hand DJ, Henley WE (1997) Statistical classification methods in consumer credit scoring: a review. J R Stat Soc Ser A Stat Soc 160(3):523–541

    Article  Google Scholar 

  11. Hu D (2019) An introductory survey on attention mechanisms in NLP problems. In: Bi Y, Bhatia R, Kapoor S (eds) Proceedings of the 5th intelligent systems conference. Springer, London, pp 432–448

  12. Hamilton WL, Ying Z, Leskovec J (2017) Inductive representation learning on large graphs. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Proceedings of the 30th advances in neural information processing systems. Curran Associates Inc, Long Beach, pp 1024–1034

  13. Ke G, Qi M, Finley T, Wang T, Chen W, Ma W, Ye Q, Liu T-Y (2017) Lightgbm: a highly efficient gradient boosting decision tree. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Proceedings of the 30th advances in neural information processing systems. Curran Associates Inc, Long Beach, pp 3146–3154

  14. Kvamme H, Sellereite N, Aas K, Sjursen S (2018) Predicting mortgage default using convolutional neural networks. Expert Syst Appl 102:207–217

    Article  Google Scholar 

  15. Kipf TN, Welling M (2017) Semi-supervised classification with graph convolutional networks. In: Bengio Y, LeCun Y, Ranzato MA, Larochelle H, Vinyals O, Sainath T (eds) Proceedings of the 5th international conference on learning representations. OpenReview, Toulon, pp 1–14

  16. LeCun Y, Bengio Y, Hinton GE (2015) Deep learning. Nat 521(7553):436–444

    Article  Google Scholar 

  17. Lessmann S, Baesens B, Seow H-V, Thomas LC (2015) Benchmarking state-of-the-art classification algorithms for credit scoring: an update of research. Eur J Oper Res 247(1):124–136

    Article  MATH  Google Scholar 

  18. Lin T-Y, Goyal P, Girshick RB, He K, Dollár P (2020) Focal loss for dense object detection. IEEE Trans Pattern Anal Mach Intell 42(2):318–327

    Article  Google Scholar 

  19. Masmoudi K, Abid L, Masmoudi A (2019) Credit risk modeling using bayesian network with a latent variable. Syst Expert Appl 127:157–166

    Article  Google Scholar 

  20. Maldonado S, Bravo C, López J, Pérez J (2017) Integrated framework for profit-based feature selection and SVM classification in credit scoring. Decis Support Syst 104:113–121

    Article  Google Scholar 

  21. Niu K, Zhang Z, Liu Y, Li R (2020) Resampling ensemble model based on data distribution for imbalanced credit risk evaluation in P2P lending. Inf Sci 536:120–134

    Article  MathSciNet  Google Scholar 

  22. Onay C, Ozturk E (2018) A review of credit scoring research in the age of big data. J Financial Regul Compliance 6(3):382–405

    Article  Google Scholar 

  23. Plawiak P, Abdar M, Plawiak J, Makarenkov V, Rajendra Acharya U (2020) DGHNL: a new deep genetic hierarchical network of learners for prediction of credit scoring. Inf Sci 516:401–418

    Article  Google Scholar 

  24. Sun J, Lang J, Fujita H, Li H (2018) Imbalanced enterprise credit evaluation with DTE-SBD: decision tree ensemble based on SMOTE and bagging with differentiated sampling rates. Inf Sci 425:76–91

    Article  MathSciNet  Google Scholar 

  25. Tobback E, Martens D (2019) Retail credit scoring using fine-grained payment data. J R Stat Soc Ser A Stat Soc 182(4):1227–1246

    Article  MathSciNet  Google Scholar 

  26. Velickovic P, Cucurull G, Casanova A, Romero A, Lió P., Bengio Y (2018) Graph attention networks. In: Bengio Y, LeCun Y, Sainath T, Murray I, Ranzato MA, Vinyals O (eds) Proceedings of the 6th international conference on learning representations. OpenReview, Vancouver, pp 1–12

  27. Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN , Kaiser L, Polosukhin I (2017) Attention is all you need. In: Guyon I, von Luxburg U, Bengio S, Wallach HM, Fergus R, Vishwanathan SVN, Garnett R (eds) Proceedings of the 30th advances in neural information processing systems. Curran Associates Inc, Long Beach, pp 5998–6008

  28. Whitrow C, Hand DJ, Juszczak P, Weston DJ, Adams NM (2009) Transaction aggregation as a strategy for credit card fraud detection. Data Min Knowl Discov 18(1):30–55

    Article  MathSciNet  Google Scholar 

  29. Wang C, Han D, Liu Q, Luo S (2019) A deep learning approach for credit scoring of peer-to-peer lending using attention mechanism LSTM. IEEE Access 7:2161–2168

    Article  Google Scholar 

  30. Wan’an L, Hong F, Min X (2021) Multi-grained and multi-layered gradient boosting decision tree for credit scoring. Appl Intell. https://doi.org/10.1007/s10489-021-02715-6

  31. Wu F, Souza AH Jr, Zhang T, Fifty C, Tao Y u, Weinberger K. (2019) Simplifying graph convolutional networks. In: Chaudhuri K, Salakhutdinov R (eds) Proceedings of the 36th international conference on machine learning, Long Beach, California, USA, volume 97 of Proceedings of Machine Learning Research, pp 6861–6871. PMLR

  32. Zonghan W u, Pan S, Chen F, Long G, Zhang C, Philip SYu (2019) A comprehensive survey on graph neural networks. arXiv:1901.00596, 1–22

  33. Wang M, Lingfan Yu, Da Z, Gan Q, Gai Y u, Ye Z, Li M, Zhou J, Qi H, Ma C, Huang Z, Guo Q, Zhang H, Lin H, Zhao J, Li J, Smola AJ, Zhang Z (2019) Deep graph library: towards efficient and scalable deep learning on graphs. arXiv:1909.01315, 1–18

  34. Keyulu X u, Weihua H u, Leskovec J, Jegelka S (2019) How powerful are graph neural networks? In: Sainath T, Rush A, Levine S, Livescu K, Mohamed S (eds) Proceedings of the 7th international conference on learning representations. OpenReview, New Orleans, pp 1–17

  35. Zhou J, Cui G, Zhang Z, Yang C, Liu Z, Sun M (2018) Graph neural networks: a review of methods and applications. arXiv:1812.08434, 1–22

Download references

Funding

This research was funded by the University of Macau (File no. MYRG2019-00136-FST).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yain-Whar Si.

Ethics declarations

Conflicts of interest/Competing interests

The authors declare that there are no conflicts of interest.

Additional information

Availability of data and material

The data are not publicly available due to them containing information that could compromise the customers’ privacy/consent.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, J., Zhao, X., Yuan, H. et al. CDGAT: a graph attention network method for credit card defaulters prediction. Appl Intell 53, 11538–11552 (2023). https://doi.org/10.1007/s10489-022-03996-1

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10489-022-03996-1

Keywords

Navigation