Mining Rare Events Data for Assessing Customer Attrition Risk

Au, Tom; Chin, Meei-Ling Ivy; Ma, Guangqin

doi:10.1007/978-3-642-00405-6_8

Tom Au⁴,
Meei-Ling Ivy Chin⁴ &
Guangqin Ma⁴

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 31))

Included in the following conference series:

International Conference on Information Systems, Technology and Management

1346 Accesses

Abstract

Customer attrition refers to the phenomenon whereby a customer leaves a service provider. As competition intensifies, preventing customers from leaving is a major challenge to many businesses such as telecom service providers. Research has shown that retaining existing customers is more profitable than acquiring new customers due primarily to savings on acquisition costs, the higher volume of service consumption, and customer referrals. For a large enterprise, its customer base consists of tens of millions service subscribers, more often the events, such as switching to competitors or canceling services are large in absolute number, but rare in percentage, far less than 5%. Based on a simple random sample, popular statistical procedures, such as logistic regression, tree-based method and neural network, can sharply underestimate the probability of rare events, and often result a null model (no significant predictors). To improve efficiency and accuracy for event probability estimation, a case-based data collection technique is then considered. A case-based sample is formed by taking all available events and a small, but representative fraction of nonevents from a dataset of interest. In this article we showed a consistent prior correction method for events probability estimation and demonstrated the performance of the above data collection techniques in predicting customer attrition with actual telecommunications data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

King, G., Zeng, L.: Logistic Regression in Rare Events Data. Society for Political Methodology, 137–163 (February 2001)
Google Scholar
Prentice, R.L.: A Case-cohort Design for Epidemiologic Cohort Studies and Disease Prevention Trials. Biometrika 73, 1–11 (1986)
Article MathSciNet MATH Google Scholar
Jacob, R.: Why Some Customers Are More Equal Than Others. Fortune, 200–201 (September 19, 1994)
Google Scholar
Walker, O.C., Boyd, H.W., Larreche, J.C.: Marketing Strategy: Planning and Implementation, 3rd edn., Irwin, Boston (1999)
Google Scholar
Li, S.: Applications of Demographic Techniques in Modeling Customer Retention. In: Rao, K.V., Wicks, J.W. (eds.) Applied Demography, pp. 183–197. Bowling Green State University, Bowling Green (1994)
Google Scholar
Li, S.: Survival Analysis. Marketing Research, 17–23 (Fall, 1995)
Google Scholar
Breslow, N.E.: Statistics in Epidemiology: The case-Control Study. Journal of the American Statistical Association 91, 14–28 (1996)
Article MathSciNet MATH Google Scholar
Hanley, J.A., McNeil, B.J.: The Meaning and Use of the Area under a ROC Curve. Radiology 143, 29–36 (1982)
Article Google Scholar
Ma, G., Hall, W.J.: Confidence Bands for ROC Curves. Medical Decision Making 13, 191–197 (1993)
Article Google Scholar
Au, T., Li, S., Ma, G.: Applications Applying and Evaluating Models to Predict Customer Attrition Using Data Mining Techniques. J. of Cmparative International Management 6, 10–22 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

AT&T Labs, Inc.-Research, USA
Tom Au, Meei-Ling Ivy Chin & Guangqin Ma

Authors

Tom Au
View author publications
You can also search for this author in PubMed Google Scholar
Meei-Ling Ivy Chin
View author publications
You can also search for this author in PubMed Google Scholar
Guangqin Ma
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dept. of Computer Science, Georgia State University, 34 Peachtree Street, P.O. Box, Atlanta, GA, USA
Sushil K. Prasad
Institute of Management Technology, Ghaziabad, India
Susmi Routray & Reema Khurana &
Department of Computer and Information Science and Technology, University of Florida, USA
Sartaj Sahni

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Au, T., Chin, ML.I., Ma, G. (2009). Mining Rare Events Data for Assessing Customer Attrition Risk. In: Prasad, S.K., Routray, S., Khurana, R., Sahni, S. (eds) Information Systems, Technology and Management. ICISTM 2009. Communications in Computer and Information Science, vol 31. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00405-6_8

Download citation

DOI: https://doi.org/10.1007/978-3-642-00405-6_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00404-9
Online ISBN: 978-3-642-00405-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics