Abstract
Supervised classification embraces theories and algorithms for disclosing patterns within large, heterogeneous data streams. Several empirical experiments in various domains including medical diagnosis, drug design, document and image classification as well as text recognition have proven its effectiveness to solve complex forecasting and identification tasks. This paper considers applications of classification within the scope of customer relationship management (CRM). Representative operational planning tasks are reviewed to describe the potential and limitations of classification analysis. To that end, a survey of the relevant literature is given to summarize the body of knowledge in each field and identify similarities across applications. The discussion provides a general understanding of technical and managerial challenges encountered in typical CRM applications and indicates promising areas for future research.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
Literature
Akbani, R.; Kwek, S.; Japkowicz, N.: Applying Support Vector Machines to Imbalanced Datasets. In: Boulicaut, J.-F.; Esposito, F.; Giannotti, F.; Pedreschi, D. (eds.): Machine Learning — Proc. of the 15th European Conference on Machine Learning. Springer, Berlin 2004, 39–50.
Allwein, E.L.; Schapire, R.E.; Singer, Y.: Reducing multi-class to binary: A unifying approach for margin classifiers. In: Journal of Machine Learning Research 1 (2000), 113–141.
Baesens, B.; Verstraeten, G.; Van den Poel, D.; Egmont-Petersen, M.; Van Kenhove, P.; Vanthienen, J.: Bayesian network classifiers for identifying the slope of the customer lifecycle of long-life customers. In: European Journal of Operational Research 156 (2004), 508–523.
Baesens, B.; Viaene, S.; Van den Poel, D.; Vanthienen, J.; Dedene, G.: Bayesian neural network learning for repeat purchase modelling in direct marketing. In: European Journal of Operational Research 138 (2002), 191–211.
Banslaben, J.: Predictive Modelling. In: Nash, E.L. (ed.): The Direct Marketing Handbook. McGraw-Hill, New York 1992, 620–636.
Barakat, N.H.; Bradley, A.P.: Rule extraction from support vector machines: A sequential covering approach. In: IEEE Transactions on Knowledge and Data Engineering 19 (2007), 729–741.
Bauer, C.L.: A direct mail customer purchase model. In: Journal of Direct Marketing 2 (1988), 16–24.
Bennett, K.P.; Wu, S.; Auslender, L.: On Support Vector Decision Trees for Database Marketing. In: Proc. of the Intern. Joint Conf. on Neural Networks. IEEE Press, Piscataway 1999, 904–909.
Berger, P.; Magliozzi, T.: The effect of sample size and proportion of buyers in the sample on the performance of list segmentation equations generated by regression analysis. In: Journal of Direct Marketing 6 (1992), 13–22.
Bhattacharyya, S.: Direct marketing performance modeling using genetic algorithms. In: INFORMS Journal on Computing 11 (1999), 248–257.
Bishop, C.M.: Neural Networks for Pattern Recognition. Oxford University Press, Oxford 1995.
Bitran, G.R.; Mondschein, S.V.: Mailing decisions in the catalog sales industry. In: Management Science 59 (1996), 1364–1381.
Bolton, R.J.; Hand, D.J.: Statistical fraud detection: A review. In: Statistical Science 17 (2002), 235–255.
Bradley, A.P.: The use of the area under the ROC curve in the evaluation of machine learning algorithms. In: Pattern Recognition 30 (1997), 1145–1159.
Breiman, L.: Random forests. In: Machine Learning 45 (2001), 5–32.
Buckinx, W.; Van den Poel, D.: Customer base analysis: Partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting. In: European Journal of Operational Research 164 (2005), 252–268.
Buckinx, W.; Verstraeten, G.; Van den Poel, D.: Predicting customer loyalty using the internal transactional database. In: Expert Systems with Applications 32 (2007), 125–134.
Bucklin, R.E.; Lattin, J.M.; Ansari, A.; Gupta, S.; Bell, D.; Coupey, E.; Little, J.D.C.; Mela, C.; Montgomery, A.; Steckel, J.: Choice and the internet: From clickstream to research stream. In: Marketing Letters 13 (2002), 245–258.
Bult, J.R.; Wansbeek, T.: Optimal selection for direct mail. In: Marketing Science 14 (1995), 378–394.
Burez, J.; Van den Poel, D.: CRM at a pay-TV company: Using analytical models to reduce customer attrition by targeted marketing for subscription services. In: Expert Systems with Applications 32 (2007), 277–288.
Burge, P.; Shawe-Taylor, J.; Cooke, C.; Moreau, Y.; Preneel, B.; Stoermann, C.: Fraud Detection and Management in Mobile Telecommunications Networks. In: IEE (ed.): ECOS97 — Proc. of the 2nd European Convention on Security and Detection. IEE, London 1997, 91–96.
Chan, P.K.; Fan, W.; Prodromidis, A.L.; Stolfo, S.J.: Distributed data mining in credit card fraud detection. In: IEEE Intelligent Systems 14 (1999), 67–74.
Chan, P.K.; Stolfo, S.J.: Toward Scalable Learning with Nonuniform Class and Cost Distributions: A Case Study in Credit Card Fraud Detection. In: Agrawal, R.; Stolorz, P.E.; Piatetsky-Shapiro, G. (eds.): KDD’98 — Proc. of the 4th Intern. Conf. on Knowledge Discovery and Data Mining. AAAI Press, Menlo Park 1998, 164–168.
Coenen, F.; Swinnen, G.; Vanhoof, K.; Wets, G.: The improvement of response modeling: Combining rule-induction and case-based reasoning. In: Expert Systems with Applications 18 (2000), 307–313.
Coussement, K.; Van den Poel, D.: Churn prediction in subscription services: An application of support vector machines while comparing two parameter-selection techniques. In: Expert Systems with Applications 34 (2008), 313–327.
Crone, S.F.; Lessmann, S.; Stahlbock, R.: The impact of preprocessing on data mining: An evaluation of classifier sensitivity in direct marketing. In: European Journal of Operational Research 173 (2006), 781–800.
Cui, D.; Curry, D.: Predictions in marketing using the support vector machine. In: Marketing Science 24 (2005), 595–615.
Cui, G.; Wong, M.L.: Implementing neural networks for decision support in direct marketing. In: International Journal of Market Research 46 (2004), 235–254.
Cui, G.; Wong, M.L.; Lui, H.-K.: Machine learning for direct marketing response models: Bayesian networks with evolutionary programming. In: Management Sciences 52 (2006), 597–612.
Deichmann, J.; Eshghi, A.; Haughton, D.; Sayek, S.; Teebagy, N.: Application of multiple adaptive regression splines (MARS) in direct response modeling. In: Journal of Interactive Marketing 16 (2002), 15–27.
Duda, R.O.; Hart, P.E.; Stork, D.G.: Pattern Classification. Wiley, New York 2001.
Eiben, A.E.; Euverman, T.J.; Kowalczyk, W.; Peelen, E.; Slisser, F.; Wesseling, J.A.M.: Comparing Adaptive and Traditional Techniques for Direct Marketing. In: Zimmermann, H.-J. (ed.): EUFIT’96 — Proc. of the 4th European Congress on Intelligent Techniques and Soft Computing. Verlag Mainz, Aachen 1996, 434–437.
Emmanouilides, C.; Hammond, K.: Internet usage: Predictors of active users and frequency use. In: Journal of Interactive Marketing 14 (2000), 17–32.
Estevez, P.A.; Held, C.M.; Perez, C.A.: Subscription fraud prevention in telecommunications using fuzzy rules and neural networks. In: Expert Systems with Applications 31 (2006), 337–344.
Fawcett, T.: An introduction to ROC analysis. In: Pattern Recognition Letters 27 (2006), 861–874.
Fawcett, T.; Provost, F.: Adaptive fraud detection. In: Data Mining and Knowledge Discovery 1 (1997), 291–316.
Fayyad, U.; Piatetsky-Shapiro, G.; Smyth, P.: From data mining to knowledge discovery in databases: An overview. In: AI Magazine 17 (1996), 37–54.
Ferreira, J.B.; Marley Vellasco: Data Mining Techniques on the Evaluation of Wireless Churn. In: Verleysen, M. (ed.): Trends in Neurocomputing — Proc. of the 12th European Symposium on Artificial Neural Networks. Elsevier, Amsterdam 2004, 483–488.
Freund, Y.; Schapire, R.E.: A decision-theoretic generalization of on-line learning and an application to boosting. In: Journal of Computer and System Science 55 (1997), 119–139.
Friedman, J.H.: Recent advances in predictive (machine) learning. In: Journal of Classification 23 (2006), 175–197.
Friedman, N.; Geiger, D.; Goldszmidt, M.: Bayesian network classifiers. In: Machine Learning 29 (1997), 131–163.
Hand, D.J.: Construction and Assessment of Classification Rules. John Wiley, Chichester 1997.
Hanley, J.A.; McNeil, B.J.: The meaning and use of the area under the receiver operating characteristic (ROC) curve. In: Radiology 143 (1982), 29–36.
Hastie, T.; Tibshirani, R.; Friedman, J.: The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Springer, New York 2002.
Haughton, D.; Oulabi, S.: Direct marketing modeling with CART and CHAID. In: Journal of Direct Marketing 11 (1997), 42–52.
Haykin, S.S.: Neural Networks: A Comprehensive Foundation. Prentice Hall, Upper Saddle River 1999.
He, H.; Wang, J.; Graco, W.; Hawkins, S.: Application of neural networks to detection of medical fraud. In: Expert Systems with Applications 13 (1997), 329–336.
Heilman, C.M.; Kaefer, F.; Ramenofsky, S.D.: Determining the appropriate amount of data for classifying consumers for direct marketing purposes. In: Journal of Interactive Marketing 17 (2003), 5–28.
Hoath, P.: Telecoms fraud, the gory details. In: Computer Fraud & Security 1998 (1998), 10–14.
Hung, S.-Y.; Yen, D.C.; Wang, H.-Y.: Applying data mining to telecom churn management. In: Expert Systems with Applications 31 (2006), 515–524.
Hur, Y.; Lim, S.: Customer Churning Prediction Using Support Vector Machines in Online Auto Insurance Service. In: Wang, J.; Liao, X.; Yi, Z. (eds.): Advances in Neural Networks — Proc. of the 2nd Intern. Symposium on Neural Networks. Springer, Berlin 2005, 928–933.
Hwang, H.; Jung, T.; Suh, E.: An LTV model and customer segmentation based on customer value: A case study on the wireless telecommunication industry. In: Expert Systems with Applications 26 (2004), 181–188.
Jain, A.K.; Duin, R.P.W.; Mao, J.: Statistical pattern recognition: A review In: IEEE Transactions on Pattern Analysis and Machine Intelligence 22 (2000), 4–37.
Jain, D.; Singh, S.S.: Customer lifetime value research in marketing: A review and future directions. In: Journal of Interactive Marketing 16 (2002), 34–46.
Japkowicz, N.; Stephen, S.: The class imbalance problem: A systematic study. In: Intelligent Data Analysis 6 (2002), 429–450.
Kim, H.-C.; Pang, S.; Je, H.-M.; Kim, D.; Yang Bang, S.: Constructing support vector machine ensemble. In: Pattern Recognition 36 (2003), 2757–2767.
Kim, S.; Shin, K.-S.; Park, K.: An Application of Support Vector Machines for Customer Churn Analysis: Credit Card Case. In: Wang, L.; Chen, K.; Ong, Y.S. (eds.): Advances in Natural Computation — Proc. of the 1st Intern. Conf. on Advances in Natural Computation. Springer, Berlin 2005, 636–647.
Kim, Y.; Street, W.N.: An intelligent system for customer targeting: A data mining approach. In: Decision Support Systems 37 (2003), 215–228.
Kim, Y.S.; Street, W.N.; Russell, G.J.; Menczer, F.: Customer targeting: A neural network approach guided by genetic algorithms. In: Management Science 51 (2005), 264–276.
Kirkos, E.; Spathis, C.; Manolopoulos, Y.: Data mining techniques for the detection of fraudulent financial statements. In: Expert Systems with Applications 32 (2007), 995–1003.
Kohavi, R.: A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection. In: Mellish, C.S. (ed.): IJCAI’95 — Proc. of the 14th Intern. Joint Conf. on Artificial Intelligence. Morgan Kaufmann, San Fransisco 1995, 1137–1143.
Kohavi, R.; John, G.H.: Wrappers for feature subset selection. In: Artificial Intelligence 97 (1997), 273–324.
Kubat, M.; Holte, R.C.; Matwin, S.: Machine learning for the detection of oil spills in satellite radar images. In: Machine Learning 30 (1998), 195–215.
Lariviere, B.; Van den Poel, D.: Investigating the role of product features in preventing customer churn, by using survival analysis and choice modeling: The case of financial services. In: Expert Systems with Applications 27 (2004), 277–285.
Lariviere, B.; Van den Poel, D.: Predicting customer retention and profitability by using random forests and regression forests techniques. In: Expert Systems with Applications 29 (2005), 472–484.
Lewis, M.: The influence of loyalty programs and short-term promotions on customer retention. In: Journal of Marketing Research 41 (2004), 281–292.
Ling, C.X.; Li, C.: Data Mining for Direct Marketing: Problems and Solutions. In: Agrawal, R.; Stolorz, P. (eds.): KDD’98 — Proc. of the 4th Intern. Conf. on Knowledge Discovery and Data Mining. AAAI Press, Menlo Park 1998, 73–79.
Lo, V.S.Y.: The true lift model: A novel data mining approach to response modeling in database marketing. In: ACM SIGKDD Explorations Newsletter 4 (2002), 78–86.
Madeira, S.; Sousa, J.M.: Comparison of Target Selection Methods in Direct Marketing. In: Lieven, K. (ed.): EUNITE’02 — Proc. of the European Symposium on Intelligent Technologies, Hybrid Systems and their implementation on Smart Adaptive Systems Elite Foundation, Aachen 2002, 333–338.
Magidson, J.: Improved statistical techniques for response modeling: Progression beyond regression. In: Journal of Direct Marketing 2 (1988), 6–18.
Martens, D.; Baesens, B.; van Gestel, T.; Vanthienen, J.: Comprehensible credit scoring models using rule extraction from support vector machines. In: European Journal of Operational Research 183 (2007), 1466–1476.
McDonald, W.J.: Direct Marketing. McGraw-Hill, Singapore 1998.
Metz, C.E.: Basic principles of ROC analysis. In: Seminars in Nuclear Medicine 8 (1978), 283–298.
Mozer, M.C.; Dodier, R.; Colagrosso, M.D.; Guerra-Salcedo, C.; Wolniewicz, R.: Prodding the ROC Curve: Constrained Optimization of Classifier Performance. In: Dietterich, T.G.; Becker, S.; Ghahramani, Z. (eds.): Advances in Neural Information Processing Systems 14. MIT Press, Cambridge 2002, 1409–1415.
Mozer, M.C.; Wolniewicz, R.; Grimes, D.B.; Johnson, E.; Kaushansky, H.: Predicting subscriber dissatisfaction and improving retention in the wireless telecommunications industry. In: IEEE Transactions on Neural Networks 11 (2000), 690–696.
Nash, E.L.: The Direct Marketing Handbook. McGraw-Hill, New York 1992.
Navia-Vázquez, A.; Parrado-Hernándeza, E.: Support vector machine interpretation. In: Neurocomputing 69 (2006), 1754–1759.
O’Brien, T.V.: Neural nets for direct marketers. In: Marketing Research 6 (1994), 47.
Pan, J.; Yang, Q.; Yang, Y.; Li, L.; Li, F.T.; Li, G.W.: Cost-sensitive preprocessing for mining customer relationship management databases. In: IEEE Intelligent Systems 22 (2007), 46–51.
Phua, C.; Alahakoon, D.; Lee, V.: Minority report in fraud detection: Classification of skewed data. In: ACM SIGKDD Explorations Newsletter 6 (2004), 50–59.
Piatetsky-Shapiro, G.; Masand, B.: Estimating Campaign Benefits and Modeling Lift. In: Chaudhuri, S.; Madigan, D. (eds.): KDD’99 — Proc. of the 5th Intern. Conf. on Knowledge Discovery and Data Mining. ACM Press 1999, 185–193.
Provost, F.; Fawcett, T.: Analysis and Visualization of Classifier Performance: Comparison Under Imprecise Class and Cost Distributions. In: Heckerman, D.; Mannila, H.; Pregibon, D.; Uthurusamy, R. (eds.): KDD’97 — Proc. of the 3rd Intern. Conf. on Knowledge Discovery and Data Mining. AAAI Press, Menlo Park 1997, 43–48.
Provost, F.; Fawcett, T.: Robust classification for imprecise environments. In: Machine Learning 42 (2001), 203–231.
Provost, F.; Fawcett, T.; Kohavi, R.: The Case Against Accuracy Estimation for Comparing Induction Algorithms. In: Shavlik, J.W. (ed.): Machine Learning — Proc. of the 15th Intern. Conf. on Machine Learning. Morgan Kaufmann, San Francisco 1998, 445–453.
Quah, J.T.S.; Sriganesh, M.: Real-time credit card fraud detection using computational intelligence. In: Expert Systems with Applications (doi:10.1016/j.eswa.2007.08.093) (2007).
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Mateo 1993.
Ratner, B.: Finding the best variables for direct marketing models. In: Journal of Targeting, Measurement & Analysis for Marketing 9 (2001), 270–296.
Reinartz, W.; Kumar, V.: On the profitability of long-life customers in a noncontractual setting: An empirical investigation and implications for marketing. In: Journal of Marketing 64 (2000), 17–35.
Reinartz, W.J.; Kumar, V.: The impact of customer relationship characteristics on profitable lifetime duration. In: Journal of Marketing 67 (2003), 77–99.
Rosset, S.; Murad, U.; Neumann, E.; Idan, Y.; Pinkas, G.: Discovery of Fraud Rules for Telecommunications—Challenges and Solutions. In: Chaudhuri, S.; Madigan, D. (eds.): Proc. of the 5th Intern. Conf. on Knowledge Discovery and Data Mining. ACM, New York 1999, 409–413.
Rosset, S.; Neumann, E.; Eick, U.; Vatnik, N.; Idan, I.: Evaluation of Prediction Models for Marketing Campaigns. In: Provost, F.; Srikant, R. (eds.): KDD’01 — Proc. of the 7th Intern. Conf. on Knowledge Discovery and Data Mining. ACM Press, New York 2001, 456–461.
Rossi, P.E.; McCulloch, R.E.; Allenby, G.M.: The value of purchase history data in target marketing. In: Marketing Science 15 (1996), 321.
Schölkopf, B.; Platt, J.C.; Shawe-Taylor, J.; Smola, A.J.; Williamson, R.C.: Estimating the support of a high-dimensional distribution. In: Neural Computation 13 (2001), 1443–1471.
Shawe-Taylor, J.; Howker, K.; Gosset, P.; Hyland, M.; Verrelst, H.; Moreau, Y.; Stoermann, C.; Burge, P.: Novel Techniques for Profiling and Fraud Detection in Mobile Telecommunications. In: Lisboa, P.J.G.; B. Edisbury; Vellido, A. (eds.): Business Applications of Neural Networks. World Scientific, Singapore 2000, 113–139.
Shin, H.; Cho, S.: Response modeling with support vector machines. In: Expert Systems with Applications 30 (2006), 746–760.
Smith, K.A.; Willis, R.J.; Brooks, M.: An analysis of customer retention and insurance claim patterns using data mining: A case study. In: Journal of the Operational Research Society 51 (2000), 532–541.
Thrasher, R.P.: CART: A recent advance in tree-structured list segmentation methodology. In: Journal of Direct Marketing 5 (1991), 35–47.
Van den Poel, D.; Buckinx, W.: Predicting online-purchasing behaviour. In: European Journal of Operational Research 166 (2005), 557–575.
Van den Poel, D.; Prinzie, A.: Constrained optimization of data-mining problems to improve model performance: A direct-marketing application. In: Expert Systems with Applications (doi:10.1016/j.eswa.2005.04.017) (2008).
Vapnik, V.; Kotz, S.: Estimation of Dependences Based on Empirical Data. Springer, New York 2006.
Vapnik, V.N.: The Nature of Statistical Learning Theory. Springer, New York 1995.
Viaene, S.; Ayuso, M.; Guillen, M.; Van Gheel, D.; Dedene, G.: Strategies for detecting fraudulent claims in the automobile insurance industry. In: European Journal of Operational Research 176 (2007), 565–583.
Viaene, S.; Baesens, B.; Van den Poel, D.; Dedene, G.; Vanthienen, J.: Wrapped input selection using multilayer perceptrons for repeat-purchase modeling in direct marketing. In: International Journal of Intelligent Systems in Accounting, Finance & Management 10 (2001), 115–126.
Viaene, S.; Baesens, B.; Van Gestel, T.; Suykens, J.A.K.; Van den Poel, D.; Vanthienen, J.; De Moor, B.; Dedene, G.: Knowledge discovery in a direct marketing case using least squares support vector machines. In: International Journal of Intelligent Systems 16 (2001), 1023–1036.
Viaene, S.; Dedene, G.: Cost-sensitive learning and decision making revisited. In: European Journal of Operational Research 166 (2004), 212–220.
Viaene, S.; Derrig, R.A.; Baesens, B.; Dedene, G.: A comparison of state-of-the-art classification techniques for expert automobile insurance claim fraud detection. In: Journal of Risk & Insurance 69 (2002), 373–421.
Wei, C.P.; Chiu, I.T.: Turning telecommunications call details to churn prediction: A data mining approach. In: Expert Systems with Applications 23 (2002), 103–112.
Wheeler, R.; Aitken, S.: Multiple algorithms for fraud detection. In: Knowledge-Based Systems 13 (2000), 93–99.
Xing, D.; Girolami, M.: Employing latent Dirichlet allocation for fraud detection in telecommunications. In: Pattern Recognition Letters 28 (2007), 1727–1734.
Yan, L.; Wolniewicz, R.H.; Dodier, R.: Predicting customer behavior in telecommunications. In: IEEE Intelligent Systems 19 (2004), 50–58.
Yang, W.-S.; Hwang, S.-Y.: A process-mining framework for the detection of healthcare fraud and abuse. In: Expert Systems with Applications 31 (2006), 56–68.
Yu, E.; Cho, S.: Constructing response model using ensemble based on feature subset selection. In: Expert Systems with Applications 30 (2006), 352–360.
Zahavi, J.; Levin, N.: Applying neural computing to target marketing. In: Journal of Direct Marketing 11 (1999), 76–93.
Zahavi, J.; Levin, N.: Issues and problems in applying neural computing to target marketing. In: Journal of Direct Marketing 11 (1999), 63–75.
Zhao, Y.; Li, B.; Li, X.; Liu, W.; Ren, S.: Customer Churn Prediction Using Improved One-Class Support Vector Machine. In: Li, X.; Wang, S.; Dong, Z.Y. (eds.): Advanced Data Mining and Applications — Proc. of the 1st Intern. Conf. on Advanced Data Mining and Applications. Springer, Berlin 2005, 300–306.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2008 Betriebswirtschaftlicher Verlag Dr. Th. Gabler | GWV Fachverlage GmbH, Wiesbaden
About this chapter
Cite this chapter
Lessmann, S., Voß, S. (2008). Supervised Classification for Decision Support in Customer Relationship Management. In: Bortfeldt, A., Homberger, J., Kopfer, H., Pankratz, G., Strangmeier, R. (eds) Intelligent Decision Support. Gabler. https://doi.org/10.1007/978-3-8349-9777-7_14
Download citation
DOI: https://doi.org/10.1007/978-3-8349-9777-7_14
Publisher Name: Gabler
Print ISBN: 978-3-8349-0930-5
Online ISBN: 978-3-8349-9777-7
eBook Packages: Business and EconomicsBusiness and Management (R0)