Abstract
Since periodic events are very common everywhere, periodic pattern mining is increasingly more important in today’s data mining domain. However, there is currently no uniform definition of periodic patterns, and all of these definitions are incapable of discovering seasonally prevalent events. In this paper, we first define the periodic event based on the coefficient of variation of the event’s periods in event sequence. Then, in order to discover seasonally prevalent events, we propose a new concept of periodic cluster patterns and design an efficient algorithm named the PCPM(Periodic Cluster Pattern Miner) to mine periodic cluster patterns in event sequence datasets. To illustrate the application of periodic cluster patterns, we propose a new method employed periodic cluster pattern prediction for next basket recommendation, and the method is named PCPP(Periodic Cluster Pattern Predictor). Experiments show that the PCPM is effective for periodic cluster pattern mining and that PCPP has performances close to those of the baseline methods on four real-world transaction datasets. Furthermore, we believe that periodic cluster patterns, as a new concept, will have a wider application in other domains, such as time series prediction, meteorological forecasting, etc.
Similar content being viewed by others
Notes
The source code of our PCPM algorithm is provided at https://github.com/gshchen-10/PCPM
Provided by Guidotti et al.[35] at https://github.com/riccotti/CustomerTemporalRegularities/tree/master/datasets
References
Zhang R, Huang Y, Pu M, Zhang J, Ling H (2020) Object discovery from a single unlabeled image by mining frequent itemsets with multi-scale features. IEEE Transactions on Image Processing. PP(99) 1–1
Soleimani G, Abessi M (2020) DLCSS: A new similarity measure for time series data mining. Eng Appl Artif Intell 92:103664
Zhou H, Hirasawa K (2019) Evolving temporal association rules in recommender system. Neural Comput & Applic 2019(31):2605–2619
Mabu AM, Prasad R, Yadav R, Jauro SS (2018) A Review of Data Mining Methods in Bioinformatics. 2018 Recent Advances on Engineering, Technology and Computational Sciences (RAETCS)
Agrawal R, Imieliński T, Swami A (1993) Mining association rules between sets of items in large databases. ACM SIGMOD Rec 22(2):207–216
Islam MA, Rafi MR, Azad AA, Ovi JA (2021) Weighted frequent sequential pattern mining. Appl Intell. https://doi.org/10.1007/s10489-021-02290-w
Van T, Le B (2021) Mining sequential rules with itemset constraints. Appl Intell. https://doi.org/10.1007/s10489-020-02153-w
Kiran RU, Venkatesh JN, Toyoda M, Kitsuregawa M, Reddy PK (2017) Discovering partial periodic-frequent patterns in a transactional database. J Syst Softw 125(Mar.):170–182
Sethi KK, Ramesh D (2020) High average-utility itemset mining with multiple minimum utility threshold: a generalized approach. Eng Appl Artif Intell 96:103933
Nguyen L, Vo B, Le NT, Snasel V, Zelinka I (2020) Fast and scalable algorithms for mining subgraphs in a single large graph. Eng Appl Artif Intell 90(Apr.):103539.1–103539.12
Kanaan M, Cazabet R, Kheddouci H (2020) Temporal Pattern Mining for Ecommerce Dataset. In: Transactions on large-scale data- and knowledge-centered systems XLVI, Springer Berlin Heidelberg, Berlin, Heidelberg, 2020, pp. 67–90. https://doi.org/10.1007/978-3-662-62386-2_3
Chanda AK, Saha S, Nishi MA, Samiullah M, Ahmed CF (2015) An efficient approach to mine flexible periodic patterns in time series databases. Eng Appl Artif Intell. (2015) 44:46–63
Yuan Q, Shang J, Cao X, Zhang C, Geng X, Han J (2017) Detecting multiple periods and periodic patterns in event time sequences. CIKM 2017:617–626
Kostrzewa J (2015) Time series forecasting using clustering with periodic pattern. IJCCI (NCTA) 2015:85–92
Tanbeer SK, Ahmed CF, Jeong B, Lee Y (2009) Discovering Periodic-Frequent patterns in transactional databases. PAKDD 2009:242–253
Fournier-Viger P, Lin CW, Duong QH, Dam TL, Voznak M (2016) PFPM: Discovering periodic frequent patterns with novel periodicity measures. In: Proceedings of the 2nd Czech-China scientific conference, 2016, 2017
Rana S, Mondal MNI (2021) An approach for seasonally periodic frequent pattern mining in retail supermarket. SSRN Electronic Journal 2021;(1)
Li Z, Wang J, Han J (2015) ePeriodicity:, Mining event periodicity from incomplete observations. TKDE (2015) 27(5):1219– 1232
Ester M, Kriegel H, Sander J, Xu X (1996) A Density-Based algorithm for discovering clusters in large spatial databases with noise. KDD 1996:226–231
Cumby CM, Fano AE, Ghani R, Krema M (2004) Predicting customer shopping lists from point-of-sale purchase data. In: Proceedings of the Tenth ACM SIGKDD international conference on knowledge discovery and data mining, pp 402–409. https://doi.org/10.1145/1014052.1014098
Yang KJ, Hong TP, Chen YM, Lan GC (2013) Projection-based partial periodic pattern mining for event sequences. Expert Syst Appl 40(10):4232–4240
Amphawan K, Lenca P, Surarerks A (2009) Mining Top-K Periodic-Frequent pattern from transactional databases without support threshold. IAIT, pp 18–29
Kiran RU, Reddy PK (2011) An alternative interestingness measure for mining Periodic-Frequent patterns. DASFAA 1:183–192
Rashid MM, Karim MR, Jeong BS, Choi HJ (2012) Efficient mining regularly frequent patterns in transactional databases. DASFAA 1:258–271
Kiran RU, Kitsuregawa M, Reddy PK (2016) Efficient discovery of periodic-frequent patterns in very large databases. J Syst Softw 112:110–121
Fournier-Viger P, Li Z, Lin JC, Kiran RU, Fujita H (2019) Efficient algorithms to identify periodic patterns in multiple sequences. Inf Sci 489:205–226. https://doi.org/10.1016/j.ins.2019.03.050
Fournier-Viger P, Yang P, Kiran RU, Ventura S, Luna JM (2021) Mining local periodic patterns in a discrete sequence. Inf Sci 544:519–548
Fournier-Viger P, Wang Y, Yang P, Lin CW, Kiran RU (2021) TSPIN: mining top-k stable periodic patterns. Applied Intelligence(439)
Nofong VM (2018) Fast and memory efficient mining of periodic frequent patterns, vol 2018
Ma S, Hellerstein JL (2001) Mining partially periodic event patterns with unknown periods. ICDE, pp 205–214
Akther S, Karim MR, Samiullah M, Ahmed CF (2018) Mining non-redundant closed flexible periodic patterns. Eng Appl Artif Intell 69:1–23
Kiran RU, Saideep C, Zettsu K, Toyoda M, Kitsuregawa M, Reddy PK (2019) Discovering partial periodic spatial patterns in spatiotemporal databases. IEEE BigData, pp 233–238
Afriyie MK, Nofong VM, Wondoh J, Abdel-Fatao H (2020) Mining Non-redundant Periodic Frequent Patterns. ACIIDS 1:321–331
Huang JW, Jaysawal BP, Wang CC (2021) Mining full, inner and tail periodic patterns with perfect, imperfect and asynchronous periodicity simultaneously. Data Min Knowl Disc 35:1225–1257. https://doi.org/10.1007/s10618-021-00753-9
Guidotti R, Gabrielli L, Monreale A, Pedreschi D, Giannotti F (2018) Discovering temporal regularities in retail customers’ shopping behavior. Epj Data Science 7(1):6
Guidotti R, Rossetti G, Pappalardo L, Giannotti F, Pedreschi D (2019) Personalized market basket prediction with temporal annotated recurring sequences. IEEE Trans Knowl Data Eng 31(11):2151–2163. https://doi.org/10.1109/TKDE.2018.2872587
Lee DD, Seung HS (2000) Algorithms for non-negative matrix factorization. In: Advances in neural information processing systems 13, papers from neural information processing systems (NIPS), vol 2000, pp 556–562
Wang P, Guo J, Lan Y, Xu J, Wan S, Cheng X (2015) Learning hierarchical representation model for next basket recommendation. In: Proceedings of the 38th international ACM SIGIR conference on research and development in information retrieval, pp 403–412. https://doi.org/10.1145/2766462.2767694
Hu HJ, He XN, Gao JY, Zhang ZL (2020) Modeling Personalized Item Frequency Information for Next-basket Recommendation. SIGIR 2020:1071–1080
Faggioli G, Polato M, Aiolli F (2020) Recency aware collaborative filtering for next basket recommendation. In: UMAP’20, July. https://doi.org/10.1145/3340631.3394850, vol 14-17. Genoa, Italy, pp 80–87
Acknowledgements
We truly thank the reviewers for valuable and suggestion comments. This work was supported in part by the National Natural Science Foundation of China under Grant 61672261 and Grant 61802056, and in part by the Industrial Technology Research and Development Project of Jilin Development and Reform Commission under Grant 2019C053-9.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
We declare that we do not have any commercial or associative interest that represents a conflict of interest in connection with the work submitted.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Chen, G., Li, Z. Discovering periodic cluster patterns in event sequence databases. Appl Intell 52, 15387–15404 (2022). https://doi.org/10.1007/s10489-022-03186-z
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10489-022-03186-z