Abstract
Behavioral credit scoring models are a specific kind of credit scoring models, where time-evolving data about delinquency pattern, outstanding amounts, and account activity, is used. These data have a dynamic nature as they evolve over time in accordance with the economic environment. On the other hand, scoring models are usually static, implicitly assuming that the relationship between the performance characteristics and the subsequent performance of a customer will be the same under the current situation as it was when the information on which the scorecard was built was collected, no matter what economic changes have occurred in that period. In this study we investigate how this assumption affects the predictive power of behavioral scoring models, using a large data set from Greece, where consumer credit has been heavily affected by the economic crisis that hit the country since 2009.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Just to name a few constraints the European Consumer Credit Directive 2008/48/EC (http://ec.europa.eu/consumers/financial_services/consumer_credit_directive/index_en.htm) stipulates among other that an applicant has the right to be comprehensively informed about the reasons of a rejection; The Basel II Accord (http://www.bis.org/publ/bcbsca.htm) imposes specific requirements for risk evaluation that have to be accredited.
- 2.
We shall note here that there is an expanding research in credit scoring –and especially behavioral scoring- to support decisions in areas such as marketing, through the use of propensity scores (Bijak 2011; Thomas 2003; Thomas et al. 2005): there are response models (will the consumer respond to marketing offers), usage models (will the consumer use a credit line) and attrition models (will a customer continue with the lender). A recent trend is also profit scoring, that is the use of scorecards to maximize profit (Andreeva et al. 2007; Crook et al. 2007; Finlay 2010). Additional areas in which scoring models find application are collection scoring (dividing insolvent customers into groups, separating those who require decisive actions from those who don’t need to be attended to immediately) and fraud detection (ranking applicants according to the relative likelihood that their application may be fraudulent) (Phua et al. 2010).
- 3.
A customer may be an existing one who has already a credit history or a new one applying for first time. This distinction is important in the context of behavioral score.
- 4.
- 5.
We shall note here that these data reflect the traditional and well-established academic and financial industry perspective. However, there is an increasing fad (eg FICO score with use of alternative data, see Andriotis (2015) and similar approaches from Equifax and TransUnion) for using alternative data (mainly from sources such as utilities & telecommunication bill payments, but also from data from social networks, rentals etc). The aim is to outreach those with very thin data or serious delinquencies, sometimes mentioned as “credit invisibles” [see http://www.perc.net (Turner et al. 2015; 2009)].
- 6.
- 7.
Concept drift is a wider phenomenon than population drift; it can refer eg to situations where the classes of the classification problem change over time (Kelly and Hand 1999).
- 8.
IMF: Compilation Guide on Financial Soundness Indicators, https://www.imf.org/external/np/sta/fsi/eng/2004/guide/index.htm
References
Allen L, Saunders A (2002) A survey of cyclical effects in credit risk measurement models (SSRN Scholarly Paper No. ID 315561). Social Science Research Network, Rochester, NY
Anderson R (2007) The credit scoring toolkit: theory and practice for retail credit risk management and decision automation. Oxford University Press, Oxford
Andreeva G, Ansell J, Crook J (2007) Modelling profitability using survival combination scores. Eur J Oper Res 183:1537–1549. doi:10.1016/j.ejor.2006.10.064
Andriotis A (2015) FICO announces new credit score based on alternative data. Wall Street J. http://www.wsj.com/articles/fico-announces-new-credit-score-based-on-alternative-data-1427989748
Baesens B, Gestel TV, Stepanova M, Van den Poel D, Vanthienen J (2005) Neural network survival analysis for personal loan data. J Oper Res Soc 56:1089–1098. doi:10.1057/palgrave.jors.2601990
Baesens B, Van Gestel T, Viaene S, Stepanova M, Suykens J, Vanthienen J (2003) Benchmarking state-of-the-art classification algorithms for credit scoring. J Oper Res Soc 54:627–635
Bellotti T, Crook J (2014) Retail credit stress testing using a discrete hazard model with macroeconomic factors. J Oper Res Soc 65:340–350. doi:10.1057/jors.2013.91
Bellotti T, Crook J (2008a) Credit scoring with macroeconomic variables using survival analysis. J Oper Res Soc 60:1699–1707
Bellotti T, Crook J (2008b) Support vector machines for credit scoring and discovery of significant features. Expert Syst Appl 36:3302–3308
Besanko D, Thakor AV (1987) Competitive equilibrium in the credit market under asymmetric information. J Econ Theory 42:167–182. doi:10.1016/0022-0531(87)90108-6
Bijak K (2011) Kalman filtering as a performance monitoring technique for a propensity scorecard. J Oper Res Soc 62:29–37. doi:10.1057/jors.2009.183
Bijak K, Thomas LC (2012) Does segmentation always improve model performance in credit scoring? Expert Syst Appl 39:2433–2442. doi:10.1016/j.eswa.2011.08.093
Breeden J, Thomas L, McDonald J III (2007) Stress testing retail load portfolios with dual-time dynamics. J Risk Model Valid 2:1–19
Chang S-Y, Yeh T-Y (2012) An artificial immune classifier for credit scoring analysis. Appl Soft Comput 12:611–618. doi:10.1016/j.asoc.2011.11.002
Chen Y (2012) Research on multi-classification of credit rating of small and medium-sized enterprises in growth enterprises board based on fuzzy ordinal regression support vector machine. Int J Econ Finance 4:248–252. doi:10.5539/ijef.v4n3p248
Clémençon S, Vayatis N (2010) Overlaying classifiers: a practical approach to optimal scoring. Constr Approx 32:619–648. doi:10.1007/s00365-010-9084-9
Crook J, Banasik J (2005) Explaining aggregate consumer delinquency behaviour over time. Working Paper Series-University of Edinburgh Management School 5.
Crook J, Bellotti T (2010) Time varying and dynamic models for default risk in consumer loans. J R Stat Soc A 173:283–305. doi:10.1111/j.1467-985X.2009.00617.x
Crook JN, Edelman DB, Thomas LC (2007) Recent developments in consumer credit risk assessment. Eur J Oper Res 183:1447–1465
De Jongh P, De Jongh E, Pienaar M, Gordon-Grant H, Oberholzer M, Santana L (2015) The impact of pre-selected variance inflation factor thresholds on the stability and predictive power of logistic regression models in credit scoring. ORiON 31:17. doi:10.5784/31-1-162
Durand D (1941) Credit-rating formulae. In: Durand D (Ed) Risk elements in consumer installment financing. NBER, pp. 83–91.
Elton EJ, Gruber MJ, Blake CR (1996) Survivor bias and mutual fund performance. Rev Financ Stud 9:1097–1120
Fawcett T (2006) An introduction to ROC analysis. Pattern Recogn Lett 27:861–874
Finlay S (2010) Credit scoring for profitability objectives. Eur J Oper Res 202:528–537. doi:10.1016/j.ejor.2009.05.025
Gama J, Žliobait\.e I, Bifet A, Pechenizkiy M, Bouchachia A (2014) A survey on concept drift adaptation. ACM Comput Surv 46, 44:1–44:37. doi:10.1145/2523813
Gao J, Fan W, Han J, Yu PS (2007) A general framework for mining concept-drifting data streams with skewed distributions. In: Proceedings of SDM’07
Giambona F (2012) A discrete-time hazard model for loans: some evidence from Italian banking system. Am J Appl Sci 9:1337–1346
Guyon I, Elisseeff A (2003) An introduction to variable and feature selection. J Mach Learn Res 3:1157–1182
Hajek P, Michalak K (2013) Feature selection in corporate credit rating prediction. Knowl-Based Syst 51:72–84. doi:10.1016/j.knosys.2013.07.008
Hand DJ (2009) Measuring classifier performance: a coherent alternative to the area under the ROC curve. Mach Learn 77:103–123
Hand DJ (2006) Classifier technology and the illusion of progress. Statist Sci 21:1–14. doi:10.1214/088342306000000060
Hand DJ, Anagnostopoulos C (2013) When is the area under the receiver operating characteristic curve an appropriate measure of classifier performance? Pattern Recogn Lett 34:492–495. doi:10.1016/j.patrec.2012.12.004
Hand DJ, Henley WE (1997) Statistical classification methods in consumer credit scoring: a review. J R Statist Soc A 160:523–541
Harrell F (2015) Regression modeling strategies: with applications to linear models, logistic and ordinal regression, and survival analysis. Springer, Heidelberg
Hofer V, Krempl G (2013) Drift mining in data: a framework for addressing drift in classification. Comput Stat Data Anal 57:377–391. doi:10.1016/j.csda.2012.07.007
Holte RC (1993) Very simple classification rules perform well on most commonly used datasets. Mach Learn 11:63–90. doi:10.1023/A:1022631118932
Hsieh H-I, Lee T-P, Lee T-S (2010) Data mining in building behavioral scoring models. In: 2010 international conference on computational intelligence and software engineering (CiSE). IEEE, pp 1–4
Im J, Apley DW, Qi C, Shan X (2012) A time-dependent proportional hazards survival model for credit risk analysis. J Oper Res Soc 63:306–321. doi:10.1057/jors.2011.34
Ingolfsson S, Elvarsson BT (2010) Cyclical adjustment of point-in-time PD. J Oper Res Soc 61:374–380. doi:10.1057/jors.2009.136
Kelly MG, Hand DJ (1999) Credit scoring with uncertain class definitions. IMA J Manag Math 10:331–345. doi:10.1093/imaman/10.4.331
Kelly, M.G., Hand, D.J., Adams, N.M., 1999. The impact of changing populations on classifier performance. In: Proceedings of the 5th ACM SIGKDD international conference on knowledge discovery and data mining, pp 367–371
Kennedy K, Mac Namee B, Delany SJ, O’Sullivan M, Watson N (2013) A window of opportunity: assessing behavioural scoring. Expert Syst Appl 40:1372–1380. doi:10.1016/j.eswa.2012.08.052
Kraft H, Kroisandt G, Müller M (2002) Assessing the discriminatory power of credit scores. Discussion papers. Interdisciplinary Research Project 373: quantification and simulation of economic processes
Liu Y, Schumann M (2005) Data mining feature selection for credit scoring models. J Oper Res Soc 56:1099–1108. doi:10.1057/palgrave.jors.2601976
Thomas L (2007) Measuring the discrimination quality of suites of scorecards : ROCs Ginis, bounds and segmentation. Presented at the Credit Scoring and Credit Control X, Edinburgh
Thomas L, Jung KM (2013) When to rebuild or when to recalibrate scorecards. Presented at the Credit Scoring and Credit Control XIII, Edinburgh
Maldonado S, Weber R, Basak J (2011) Simultaneous feature selection and classification using kernel-penalized support vector machines. Inf Sci 181:115–128. doi:10.1016/j.ins.2010.08.047
Marrs GR, Hickey RJ, Black MM (2010) The impact of latency on online classification learning with concept drift. In: Bi Y, Williams M-A (eds) Knowledge science, engineering and management. Lecture notes in computer science. Springer, Berlin, Heidelberg, pp 459–469
Mays E (2005) Handbook of credit scoring. Publishers Group Uk, [S.l.]
Medema L, Koning RH, Lensink R (2009) A practical approach to validating a PD model. J Bank Financ 33:701–708
Turner M, Varghese R, Walker P (2015) Research consensus confirms benefits of alternative data (Technical Report). PERC
Niklis D, Doumpos M, Zopounidis C (2014) Combining market and accounting-based models for credit scoring using a classification scheme based on support vector machines. Appl Math Comput 234:69–81. doi:10.1016/j.amc.2014.02.028
O’brien RM (2007) A caution regarding rules of thumb for variance inflation factors. Qual Quant 41:673–690
Overstreet GA, Bradley EL, Kemp RS (1992) The flat-maximum effect and generic linear scoring models: a test. IMA J Manag Math 4:97–109. doi:10.1093/imaman/4.1.97
Pavlidis NG, Tasoulis DK, Adams NM, Hand DJ (2012) Adaptive consumer credit classification. J Oper Res Soc 63:1645–1654. doi:10.1057/jors.2012.15
Phua C, Lee V, Smith K, Gayler R (2010) A comprehensive survey of data mining-based fraud detection research. arXiv preprint arXiv:1009.6119.
Rezac M, Rezac F (2011) How to measure the quality of credit scoring models. Czech J Econ Finance (Finance a uver) 61:486–507
Rona-Tas A, Hiss S (2008) Consumer and corporate credit ratings and the subprime crisis in the US with some lessons for Germany. SCHUFA, Wiesbaden
Saha A, Siddiqi N (2011) Survival analysis workflow: assessing the impact of macro-economic shocks on credit portfolios and predicting the time of default
Sarlija N, Bensic M, Zekic-Susac M (2009) Comparison procedure of predicting the time to default in behavioural scoring. Expert Syst Appl 36:8778–8788. doi:10.1016/j.eswa.2008.11.042
Shi J, Zhang S, Qiu L (2013) Credit scoring by feature-weighted support vector machines. J Zhejiang Univ Sci C 14:197–204
Shmueli G (2010) To explain or to predict? Stat Sci 25:289–310. doi:10.1214/10-STS330
Shtatland ES, Cain E, Barton MB (2001) The perils of stepwise logistic regression and how to escape them using information criteria and the output delivery system. In: 26th Annual SAS users group international conference, Long Beach, California
Siami M, Gholamian MR, Basiri J (2013) An application of locally linear model tree algorithm with combination of feature selection in credit scoring. Int J Syst Sci 1–10. doi:10.1080/00207721.2013.767395
Siarka P (2012) Quality measures of scoring models. J Risk Manag Financ Instit 5:432–446
Siddiqi N (2005) Credit risk scorecards: developing and implementing intelligent credit scoring (Wiley and SAS Business Series), 1st ed. Wiley
Song X, Ding Y, Huang J, Ge Y (2010) Feature selection for support vector machine in financial crisis prediction: a case study in China. Expert Syst 27:299–310. doi:10.1111/j.1468-0394.2010.00546.x
Sousa MR, Gama J, Brandão E (2013) Introducing time-changing economics into credit scoring (No. 513), FEP Working Papers. Universidade do Porto, Faculdade de Economia do Porto
Sousa MR, Gama J, Gonçalves MJS (2014) A two-stage model for dealing with temporal degradation of credit scoring. arXiv:1406.7775 [q-fin]
Lessmann S, Thomas LC, Seow H-V, Baesens B (2013). Benchmarking state-of-the-art classification algorithms for credit scoring: A ten-year update. Presented at the Credit Scoring and Credit Control XIII
Stepanova M, Thomas L (2002) Survival analysis methods for personal loan data. Oper Res:277–289
Stiglitz JE, Weiss A (1981) Credit rationing in markets with imperfect information. Am Econ Rev 71:393–410
Tang B, Qiu SB (2012) Multi-class support vector machine for credit scoring. Appl Mech Mater 235. doi:10.4028/www.scientific.net/AMM.235.419
Thomas LC (2003) Consumer credit modelling: context and current issues, Workshop Paper Presented on the Banff Credit Risk conference
Thomas LC, Edelman DB, Crook JN (2002) Credit scoring & its applications (monographs on mathematical modeling and computation), 1st ed. Society for Industrial and Applied Mathematics
Thomas LC, Ho J, Scherer WT (2001) Time will tell: behavioural scoring and the dynamics of consumer credit assessment. IMA J Manag Math 12:89
Thomas LC, Malik M (2010) Comparison of credit risk models for portfolios of retail loans based on behavioural scores. In: Rausch D, Scheule H (Eds) Model risk in financial crises. Risk Books, pp 209–232.
Thomas LC, Oliver RW, Hand DJ (2005) A survey of the issues in consumer credit modelling research. J Oper Res Soc 56:1006–1015. doi:10.1057/palgrave.jors.2602018
Tsai C-F (2009) Feature selection in bankruptcy prediction. Knowl-Based Syst 22:120–127. doi:10.1016/j.knosys.2008.08.002
Tsai C-F, Chen M-L (2010) Credit rating by hybrid machine learning techniques. Appl Soft Comput 10:374–380. doi:10.1016/j.asoc.2009.08.003
Tsymbal A (2004) The problem of concept drift: definitions and related work. Computer Science Department, Trinity College Dublin
Turner MA, Walker PD, Dusek K (2009) New to credit from alternative data. Information Policy Institute/Political and Economic Research Council
Van Gestel T, Baesens B, Martens D (2010) From linear to non-linear kernel based classifiers for bankruptcy prediction. Neurocomput Int J 73:2955–2970. doi:10.1016/j.neucom.2010.07.002
Verikas A, Kalsyte Z, Bacauskiene M, Gelzinis A (2010) Hybrid and ensemble-based soft computing techniques in bankruptcy prediction: a survey. Soft Comput 14:995–1010. doi:10.1007/s00500-009-0490-5
Waad B, Ghazi BM, Mohamed L (2013) A three-stage feature selection using quadratic programming for credit scoring. Appl Artif Intell 27:721–742. doi:10.1080/08839514.2013.823327
Wang J, Hedar A-R, Wang S, Ma J (2012) Rough set and scatter search metaheuristic based feature selection for credit scoring. Expert Syst Appl 39:6123–6128. doi:10.1016/j.eswa.2011.11.011
Widmer G, Kubat M (1996) Learning in the presence of concept drift and hidden contexts. Mach Learn 23:69–101
Yao P (2009) Fuzzy rough set and information entropy based feature selection for credit scoring. In: 6th international conference on fuzzy systems and knowledge discovery, 2009. FSKD’09. Presented at the 6th International Conference on Fuzzy Systems and Knowledge Discovery, 2009. FSKD’09, pp. 247–251. doi:10.1109/FSKD.2009.713
Yu L, Wang S, Lai KK (2008a) Credit risk assessment with a multistage neural network ensemble learning approach. Expert Syst Appl 34:1434–1444. doi:10.1016/j.eswa.2007.01.009
Yu L, Wang S, Lai KK, Zhou L (2008b) Bio-inspired credit risk analysis: computational intelligence with support vector machines. Springer, Heidelberg
Žliobaitė I (2009) Learning under concept drift: an overview (Technical Report). Faculty of Mathematics and Informatics, Vilnius University, Vilnius, Lithuania
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendices
Appendix 1: Scoring Parameters
Appendix 2: Variables
Rights and permissions
Copyright information
© 2017 Springer International Publishing Switzerland
About this paper
Cite this paper
Nikolaidis, D., Doumpos, M., Zopounidis, C. (2017). Exploring Population Drift on Consumer Credit Behavioral Scoring. In: Grigoroudis, E., Doumpos, M. (eds) Operational Research in Business and Economics. Springer Proceedings in Business and Economics. Springer, Cham. https://doi.org/10.1007/978-3-319-33003-7_7
Download citation
DOI: https://doi.org/10.1007/978-3-319-33003-7_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-33001-3
Online ISBN: 978-3-319-33003-7
eBook Packages: Business and ManagementBusiness and Management (R0)