Evolving Systems, Volume 4, Issue 1, pp 27–42

Detecting and Visualizing the Change in Classification of Customer Profiles based on Transactional Data

  • Edward Apeh
  • Bogdan Gabrys
Original Paper


Customer transactions tend to change over time as customer behaviour patterns change. Classifier models, however, are often designed to perform prediction on data that is assumed to be static. The performance of these models therefore deteriorates over time when they predict in the context of evolving data. Robust adaptive classification models are thus needed to detect and adjust to the kinds of change that are common in transactional data. This paper presents an investigation into using change mining to monitor the adaptive classification of customers based on their transactions through moving time windows. The classification performance of two-class decision tree ensembles, built using a data binning process based on the number of items purchased, was monitored over varying time windows of 3, 6, 9 and 12 months. The changing class values of the customer profiles were analysed and described. Results from our experiments show that the proposed approach can be used for learning and adapting to changing customer profiles.
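The idea of monitoring class values through moving time windows can be illustrated with a minimal sketch. It is not the authors' implementation: the decision tree ensembles are replaced here by a fixed threshold on the number of items purchased, and the function names, the `(customer_id, date, n_items)` tuple layout, the class labels and the threshold value are all assumptions made for illustration.

```python
from collections import defaultdict
from datetime import date


def month_index(d):
    """Map a date to a running month count so windows are easy to slice."""
    return d.year * 12 + (d.month - 1)


def window_labels(transactions, start, width, threshold):
    """Assign a two-class label to each customer active in one window.

    transactions: list of (customer_id, date, n_items) tuples (assumed layout).
    start, width: window covers months [start, start + width).
    threshold: hypothetical cut point standing in for the binning step.
    """
    totals = defaultdict(int)
    for customer, day, n_items in transactions:
        if start <= month_index(day) < start + width:
            totals[customer] += n_items
    return {c: ("high" if t >= threshold else "low") for c, t in totals.items()}


def class_changes(transactions, width, step=1, threshold=10):
    """Slide the window and report customers whose class label changes.

    Only windows that fit entirely inside the observed months are used.
    Returns (window_start, customer_id, old_label, new_label) tuples.
    """
    months = [month_index(day) for _, day, _ in transactions]
    first, last = min(months), max(months)
    changes = []
    previous = None
    for start in range(first, last - width + 2, step):
        current = window_labels(transactions, start, width, threshold)
        if previous is not None:
            for customer, label in current.items():
                old = previous.get(customer)
                # A change is recorded only for customers seen in both windows.
                if old is not None and old != label:
                    changes.append((start, customer, old, label))
        previous = current
    return changes
```

Calling `class_changes` with `width` set to 3, 6, 9 or 12 mirrors the window sizes mentioned in the abstract; in a fuller version, the threshold labelling would be replaced by a classifier retrained on each window.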


Adaptive systems; Classification algorithms; Change detection algorithms; Data mining; Decision support systems; Industry applications



This research work was jointly funded by Great Western Research and Screwfix Limited.



Copyright information

© Springer-Verlag 2012

Authors and Affiliations

  1. Smart Technology Research Centre, School of Design, Engineering and Computing, Bournemouth University, Dorset, UK
