Abstract
Data mining and in particular forecasting tools and techniques are being increasingly exploited by businesses to predict customer behavior and to formulate effective marketing programs. Conventionally, customer segmentation approaches are utilized when dealing with a large population of customers. Inspired by this idea, a new methodology is proposed in this study to perform segment-level customer behavior forecasting. To keep the dynamic nature of customer behavior, customer behavior is represented as a time series. Therefore, customer behavior forecasting is changed into a time series forecasting problem. The proposed methodology contains two main components i.e. clustering and forecasting. In the clustering phase, time series are clustered using time series clustering algorithms, and then, in the forecasting phase, the behavior of each segment is predicted via time series forecasting techniques. The main objective is to predict future behavior at segment level. The forecasting component also consists of a combined method exploiting the concept of forecast fusion. The combined method employs a pool of forecasters both from traditional time series forecasting and computational intelligence methods. To test the usefulness of the proposed method, a case study is carried out using the data of customers’ point of sale (POS) in a bank. The results of the experiments demonstrate that the combined method outperforms all other individual forecasters in terms of symmetric mean absolute percentage error (SMAPE). The proposed methodology can be correspondingly applied in other areas and applications of time series forecasting.
Similar content being viewed by others
References
Abirami M, Pattabiraman V (2016) Data mining approach for intelligent customer behavior analysis for a retail store. In: Vijayakumar V, Neelanarayanan V (eds) Proceedings of the 3rd international symposium on big data and cloud computing challenges (ISBCC-16’), 2016. Springer, Cham, pp 283–291
Aghabozorgi S, Seyed Shirkhorshidi A, Ying Wah T (2015) Time-series clustering – A decade review. Information Systems 53:16–38. https://doi.org/10.1016/j.is.2015.04.007
Akhondzadeh-Noughabi E, Albadvi A (2015) Mining the dominant patterns of customer shifts between segments by using top-k and distinguishing sequential rules. Manag Decis 53:1976–2003
Anantasech P, Ratanamahatana CA Enhanced weighted dynamic time warping for time series classification. In: Third international congress on information and communication technology, 2019. Springer, New York, pp 655–664
Andrawis RR, Atiya AF, El-Shishiny H (2011) Forecast combinations of computational intelligence and linear models for the NN5 time series forecasting competition. Int J Forecast 27:672–688
Ballestar MT, Grau-Carles P, Sainz J (2018) Customer segmentation in e-commerce: applications to the cashback business model. J Bus Res 88:407–414. https://doi.org/10.1016/j.jbusres.2017.11.047
Batista GE, Keogh EJ, Tataw OM, De Souza VM (2014) CID: an efficient complexity-invariant distance for time series. Data Min Knowl Disc 28:634–669
Böttcher M, Spott M, Nauck D, Kruse R (2009) Mining changing customer segments in dynamic markets. Expert Syst Appl 36:155–164. https://doi.org/10.1016/j.eswa.2007.09.006
Box GE, Jenkins GM, Reinsel GC, Ljung GM (2015) Time series analysis: forecasting and control. Wiley, New Jersey
Brockwell PJ, Davis RA, Calder MV (2002) Introduction to time series and forecasting. Springer New York. https://doi.org/10.1007/b97391
Čater T, Čater B (2010) Product and relationship quality influence on customer commitment and loyalty in B2B manufacturing relationships. Ind Mark Manag 39:1321–1333
Cen Z, Wang J (2018) Forecasting neural network model with novel CID learning rate and EEMD algorithms on energy market. Neurocomputing 317:168–178. https://doi.org/10.1016/j.neucom.2018.08.021
Chan CCH, Hwang Y-R, Wu H-C (2016) Marketing segmentation using the particle swarm optimization algorithm: a case study. J Ambient Intell Hum Comput 7:855–863. https://doi.org/10.1007/s12652-016-0389-9
Chen Y, Hao Y (2017) A feature weighted support vector machine and K-nearest neighbor algorithm for stock market indices prediction. Expert Syst Appl 80:340–355. https://doi.org/10.1016/j.eswa.2017.02.044
Cheng D, Zhu Q, Huang J, Wu Q, Yang L (2019) A novel cluster validity index based on local cores. IEEE Trans Neural Netw Learn Syst 30:985–999. https://doi.org/10.1109/TNNLS.2018.2853710
Chiang W-Y (2018) Applying data mining for online CRM marketing strategy: an empirical case of coffee shop industry in Taiwan. Br Food J 120:665–675
Chouakria AD, Nagabhushan PN (2007) Adaptive dissimilarity index for measuring time series proximity. Adv Data Anal Classif 1:5–21
Crone SF, Hibon M, Nikolopoulos K (2011) Advances in forecasting with neural networks? Empirical evidence from the NN3 competition on time series prediction. Int J Forecast 27:635–660
Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30
Desgraupes B (2013) Clustering indices. Univ Paris Ouest-Lab Modal’X 1:34
Dimitriadis S, Kyrezis N, Chalaris M (2018) A comparison of two multivariate analysis methods for segmenting users of alternative payment means. Int J Bank Market 36:322–335
Doğan O, Ayçi̇n E, Bulut ZA (2018) Customer segmentation by using RFM model and clustering methods: a case study in retail industry. Int J Contemp Econ Admin Sc 8:1–19
Duan Y, Cao G, Edwards JS (2018) Understanding the impact of business analytics on innovation. Eur J Oper Res. https://doi.org/10.1016/j.ejor.2018.06.021
Dursun A, Caber M (2016) Using data mining techniques for profiling profitable hotel customers: an application of RFM analysis. Tour Manag Perspect 18:153–160. https://doi.org/10.1016/j.tmp.2016.03.001
Grover V, Chiang RH, Liang T-P, Zhang D (2018) Creating strategic business value from big data analytics: a research framework. J Manag Inf Syst 35:388–423
Han J, Kamber M, Pei J (2011) Data mining: concepts and techniques: concepts and techniques. Elsevier, Waltham
Heldt R, Silveira CS, Luce FB (2019) Predicting customer value per product: From RFM to RFM/P. J Bus Res. https://doi.org/10.1016/j.jbusres.2019.05.001
Hosseini M, Shabani M (2015) New approach to customer segmentation based on changes in customer value. J Market Anal 3:110–121. https://doi.org/10.1057/jma.2015.10
Hu Y-H, Huang TC-K, Kao Y-H (2013) Knowledge discovery of weighted RFM sequential patterns from customer sequence databases. J Syst Softw 86:779–788. https://doi.org/10.1016/j.jss.2012.11.016
Hu Y-H, Yeh T-W (2014) Discovering valuable frequent patterns based on RFM analysis without customer identification information. Knowl-Based Syst 61:76–88
Hughes A (2011) Strategic database marketing: the masterplan for starting and managing a profitable, customer-based marketing program, 4th edn. McGraw-Hill, New York
Khajvand M, Tarokh MJ (2011) Estimating customer future value of different customer segments based on adapted RFM model in retail banking context. Proc Comput Sci 3:1327–1332
Khashei M, Bijari M (2011) A novel hybridization of artificial neural networks and ARIMA models for time series forecasting. Appl Soft Comput 11:2664–2675
Khobzi H, Akhondzadeh-Noughabi E, Minaei-Bidgoli B (2014) A new application of RFM clustering for guild segmentation to mine the pattern of using banks’ e-payment services. J Global Market 27:178–190. https://doi.org/10.1080/08911762.2013.878428
Kourentzes N, Petropoulos F (2016) Forecasting with multivariate temporal aggregation: the case of promotional modelling. Int J Prod Econ 181:145–153
Kumar V, Reinartz W (2018) Customer relationship management: concept, strategy, and tools, 3 edn. Springer, Berlin. https://doi.org/10.1007/978-3-662-55381-7
Kumar V, Shah D (2004) Building and sustaining profitable customer loyalty for the 21st century. J Retail 80:317–329
Le DD, Gross G, Berizzi A (2015) Probabilistic modeling of multisite wind farm production for scenario-based applications. IEEE Trans Sustain Energy 6:748–758
Lemmens A, Croux C, Stremersch S (2012) Dynamics in the international market segmentation of new product growth. Int J Res Mark 29:81–92. https://doi.org/10.1016/j.ijresmar.2011.06.003
Lessmann S, Haupt J, Coussement K, Bock KWD (2019) Targeting customers for profit: An ensemble learning framework to support marketing decision-making. Inf Sci. https://doi.org/10.1016/j.ins.2019.05.027
Liu J, Liao X, Huang W, Liao X (2019) Market segmentation: a multiple criteria approach combining preference analysis and segmentation decision. Omega 83:1–13
Martínez F, Frías MP, Pérez-Godoy MD, Rivera AJ (2018) Dealing with seasonality by narrowing the training set in time series forecasting with kNN. Expert Syst Appl 103:38–48
Martínez F, Frías MP, Pérez MD, Rivera AJ (2017) A methodology for applying k-nearest neighbor to time series forecasting. Artif Intell Rev 1–19
Montero P, Vilar JA (2014) Tsclust: an R package for time series clustering. J Stat Softw 62:1–43
Mueen A, Chavoshi N, Abu-El-Rub N, Hamooni H, Minnich A, MacCarthy J (2018) Speeding up dynamic time warping distance for sparse time series data. Knowl Inf Syst 54:237–263
Murray PW, Agard B, Barajas MA (2018) Forecast of individual customer’s demand from a large and noisy dataset. Comput Ind Eng 118:33–43
Murtagh F, Legendre P (2014) Ward’s hierarchical agglomerative clustering method: which algorithms implement Ward’s criterion? J Classif 31:274–295. https://doi.org/10.1007/s00357-014-9161-z
Ngai EWT, Xiu L, Chau DCK (2009) Application of data mining techniques in customer relationship management: a literature review and classification. Expert Syst Appl 36:2592–2602. https://doi.org/10.1016/j.eswa.2008.02.021
Panigrahi S, Behera HS (2017) A hybrid ETS–ANN model for time series forecasting. Eng Appl Artif Intell 66:49–59
Paparrizos J, Gravano L (2017) Fast and accurate time-series clustering. ACM Trans Database Syst 42:1–49. https://doi.org/10.1145/3044711
Parvaneh A, Abbasimehr H, Tarokh MJ (2012) Integrating AHP and data mining for effective retailer segmentation based on retailer lifetime value. Journal of Optimization in Industrial Engineering 5:25–31
Parvaneh A, Tarokh M, Abbasimehr H (2014) Combining data mining and group decision making in retailer segmentation based on LRFMP variables. Int J Ind Eng Prod Res 25:197–206
Petitjean F, Ketterlin A, Gançarski P (2011) A global averaging method for dynamic time warping, with applications to clustering. Pattern Recognit 44:678–693
Ramon-Gonen R, Gelbard R (2017) Cluster evolution analysis: identification and detection of similar clusters and migration patterns. Expert Syst Appl 83:363–378. https://doi.org/10.1016/j.eswa.2017.04.007
Ramos P, Santos N, Rebelo R (2015) Performance of state space and ARIMA models for consumer retail sales forecasting. Rob Comput Integr Manuf 34:151–163
Saas A, Guitart A, Periáñez Á (2016) Discovering playing patterns: time series clustering of free-to-play game data. In: 2016 IEEE conference on computational intelligence and games (CIG), Santorini, Greece, 20–23 Sept 2016. IEEE, New York, pp 1–8. https://doi.org/10.1109/CIG.2016.7860442
Sarstedt M, Mooi E (2019) Cluster analysis. A concise guide to market research. Springer texts in business and economics. Springer, Berlin, pp 301–354
Serhat P, Altan K, Erhan EP (2017) LRFMP model for customer segmentation in the grocery retail industry: a case study. Mark Intell Plan 35:544–559. https://doi.org/10.1108/MIP-11-2016-0210
Song HS, Kyeong Kim J, Kim SH (2001) Mining the change of customer behavior in an internet shopping mall. Expert Syst Appl 21:157–168
Song M, Zhao X, Ou EH (2017) Statistics-based CRM approach via time series segmenting RFM on large scale data. Knowl-Based Syst 132:21–29. https://doi.org/10.1016/j.knosys.2017.05.027
Svetunkov I (2017) Statistical models underlying functions of'smooth'package for R. In: Working paper of Department of Management Science, Lancaster University, pp 1–52
Svetunkov I, Petropoulos F (2017) Old dog, new tricks: a modelling view of simple moving averages. Int J Prod Res 56:6034–6047. https://doi.org/10.1080/00207543.2017.1380326
Yan W (2012) Toward automatic time-series forecasting using neural networks. IEEE Trans Neural Netw Learn Syst 23:1028–1039. https://doi.org/10.1109/TNNLS.2012.2198074
Yildirim P, Birant D, Alpyildiz T (2018) Data mining and machine learning in textile industry. Wires Data Min Knowl Discov 8:e1228. https://doi.org/10.1002/widm.1228
You Z, Si Y-W, Zhang D, Zeng X, Leung SCH, Li T (2015) A decision-making framework for precision marketing. Expert Syst Appl 42:3357–3367. https://doi.org/10.1016/j.eswa.2014.12.022
Yu B, Song X, Guan F, Yang Z, Yao B (2016) k-Nearest neighbor model for multiple-time-step prediction of short-term traffic condition. J Transp Eng 142:04016018
Acknowledgments
The authors would like to thank Aram Bahrini for providing language help during the writing of this paper.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Abbasimehr, H., Shabani, M. A new framework for predicting customer behavior in terms of RFM by considering the temporal aspect based on time series techniques. J Ambient Intell Human Comput 12, 515–531 (2021). https://doi.org/10.1007/s12652-020-02015-w
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12652-020-02015-w