Abstract
Survival analysis can be applied to build models for time to default on debt. In this paper, we report an application of survival analysis to model default on a large data set of credit card accounts. We explore the hypothesis that probability of default (PD) is affected by general conditions in the economy over time. These macroeconomic variables (MVs) cannot readily be included in logistic regression models. However, survival analysis provides a framework for their inclusion as time-varying covariates. Various MVs, such as interest rate and unemployment rate, are included in the analysis. We show that inclusion of these indicators improves model fit and affects PD yielding a modest improvement in predictions of default on an independent test set.
Similar content being viewed by others
References
Andersen PK (1992). Repeated assessment of risk factors in survival analysis. Statist Meth Med Res 1: 297–315.
Andreeva G (2006). European generic scoring models using survival analysis. J Opl Res Soc 57: 1180–1187.
Andreeva G, Ansell J and Crook J (2007). Modelling profitability using survival combination scores. Eur J Opl Res 183: 1537–1549.
Banasik J, Crook JN and Thomas LC (1999). Not if but when will borrowers default. J Opl Res Soc 50: 1185–1190.
Baesens B, van Gestel T, Viaene S, Stepanova M, Suykens J and Vanthienen J (2003). Benchmarking state-of-the-art classification algorithms for credit scoring. J Opl Res Soc 54: 1082–1088.
Basel Committee on Banking Supervision (BCBS) (2006). Basel II: International Convergence of Capital Measurement and Capital Standards at www.bis.org/publ/bcbsca.htm.
Brambor T, Clark WR and Golder M (2005). Understanding interaction models: Improving empirical analyses. Pol Anal 14: 63–82.
Burez J and Van den Poel D (2008). Handling class imbalance in customer churn prediction. Working paper 2008/517 Faculteit Economie en Bedrijfskunde, Universiteit Gent.
Chawla NV, Japkowicz N and Kolcz A (2004). Editorial: Special issue on learning from imbalanced data sets. SIGKDD Explor 6(1): 1–6.
Chen L-S, Yen M-F, Wu H-M, Liao C-S, Liou D-M, Kuo H-S and Chen TH-H (2005). Predictive survival model with time-dependent prognostic factors: Development of computer-aided SAS Macro program. J Eval Clin Pract 11: 181–193.
Collett D (1994). Modelling Survival Data in Medical Research. Chapman & Hall: London.
Crook J (2001). The demand for household debt in the USA: Evidence from the 1995 Survey of Consumer Finance. Appl Fin Econ 11: 83–91.
Crook J and Banasik J (2005). Explaining aggregate consumer credit delinquency behaviour over time. University of Edinburgh Credit Research Centre Working Paper, 05/03.
Crook J and Hochguertel S (2007). Household debt and credit constraints: Comparative microevidence from four OECD countries. University of Edinburgh Credit Research Centre Working Paper 05/02.
Deaton A (1992). Understanding Consumption. Oxford University Press: Oxford.
Efron B and Tibshirani RJ (1993). An Introduction to the Bootstrap. Chapman & Hall: New York.
Friedman M (1955). A Theory of the Consumption Function. Princeton University Press: Princeton.
Hand DJ (2005). Good practice in retail credit scorecard assessment. J Opl Res Soc 56: 1109–1117.
Hosmer DW Jr, and Lemeshow S (1999). Applied Survival Analysis: Regression Modelling of Time to Event Data. Wiley: New York.
Scheurmann E and Matthews C (2005). Neural network classifiers in Arrears Management. In: Duch W. et al (ed). ICANN 2005, Lecture Notes in Computer Science Vol. 3697. Springer-Verlag: Berlin, pp. 325–330.
Stepanova M and Thomas LC (2001). PHAB scores: Proportional hazards analysis behavioural scores. J Opl Res Soc 52: 1007–1016.
Stepanova M and Thomas LC (2002). Survival analysis methods for personal loan data. Opns Res 50: 277–289.
Tang L, Thomas LC, Thomas S and Bozzetto J-F (2007). It's the economy stupid: Modelling financial product purchases. Int J Bank Market 25(1): 22–38.
Thomas LC, Edelman DB and Crook JN (2002). Credit Scoring and its Applications. SIAM Monographs on Mathematical Modeling and Computation. SIAM: Philadelphia, USA.
Weiss G (2004). Mining with rarity: A unifying framework. SIGKDD Explor 6(1): 7–19.
Weiss G and Provost F (2003). Learning when training data are costly: The effect of class distribution on tree induction. J Artif Intell Res 19: 315–354.
Witten IH and Frank E (2005). Data Mining, 2nd edn. Morgan Kaufmann: San Francisco, USA.
Acknowledgements
We thank Professors David Hand, Lyn Thomas and other members of the Quantitative Financial Risk Management Centre for discussion of survival analysis and credit scoring during the course of this research. We are grateful for funding through an EPSRC grant EP/D505380/1. We also thank our anonymous referees for their useful comments that allowed us to make significant improvements to this paper.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Bellotti, T., Crook, J. Credit scoring with macroeconomic variables using survival analysis. J Oper Res Soc 60, 1699–1707 (2009). https://doi.org/10.1057/jors.2008.130
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1057/jors.2008.130