Mining frequent and top-K High Utility Time Interval-based Events with Duration patterns

Abstract

Traditional frequent sequential pattern mining only considers the time point-based item or event in the patterns. However, in many application, the events may span over multiple time points and the relations among events are also important. Time interval-based pattern mining is proposed to mine the interesting patterns of events that span over some time periods and also by considering the relations among events. Previous works of time interval-based pattern mining focused on the relations between events without considering the duration of each event. However, the same event with different time duration may cause different results. In this work, we propose two algorithms, SARA and SARS, for mining frequent time interval-based events with duration, TIED, patterns. TIED patterns not only keep the relations between two events but also reveal the time periods when each event happens and ends. For the performance evaluation, we propose a naive algorithm and modify a previous algorithm along with the implementation of SARA and SARS. The experimental results show that SARA and SARS are more efficient in terms of execution time and memory usage than other two algorithms. Moreover, we extend this work by considering utility value or importance of event in each time stamp. Therefore, we propose another new High Utility Time Interval-based Events with Duration, HU-TIED, pattern. HU-TIED incorporates the concept of utility pattern mining and TIED pattern mining. We design an algorithm, LMSpan, to mine top-K HU-TIED patterns. For the performance evaluation, we design a baseline algorithm, GenerateNCheck to compare with LMSpan. LMSpan takes less time and memory and generates less candidates than GenerateNCheck.

This is a preview of subscription content, log in to check access.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25

References

  1. 1.

    Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: Proceedings of the 20th international conference on very large data base, pp 487–499

  2. 2.

    Agrawal R, Srikant R (1995) Mining sequential patterns. In: Proceedings of the 12nd IEEE international conference on data engineering, pp 3–14

  3. 3.

    Allen J (1983) Maintaining knowledge about temporal intervals. Commun ACM 26(11):832–843

    Article  Google Scholar 

  4. 4.

    Batal I, Fradkin D, Harrison J, Moerchen F, Hauskrecht M (2012) Mining recent temporal patterns for event detection in multivariate time series data. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 280–288

  5. 5.

    Bergen B, Chang N (2005) Embodied construction grammar in simulation-based language understanding. In: Construction grammars: cognitive grounding and theoretical extensions, vol 3. John Benjamins, Amsterdam, pp 147–190

    Google Scholar 

  6. 6.

    Chen KY, Jaysawal BP, Huang JW, Wu YB (2014) Mining frequent time interval-based event with duration patterns from temporal database. In: 2014 international conference on data science and advanced analytics (DSAA), pp 548–554

  7. 7.

    Chen YC, Chen CC, Peng WC, Lee WC (2014) Mining correlation patterns among appliances in smart home environment. In: Proceedings of the 18th Pacific-Asia conference on knowledge discovery and data mining, pp 222–233

    Google Scholar 

  8. 8.

    Chen YC, Jiang JC, Peng WC, Lee SY (2010) An efficient algorithm for mining time interval-based patterns in large database. In: Proceedings of the 19th ACM international conference on information and knowledge management, pp 49–58

  9. 9.

    Cheng H, Yan XY, Han JW (2004) IncSpan: incremental mining of sequential patterns in large database. In: Proceedings of the 10th ACM SIGKDD international conference on knowledge discovery and data mining, pp 527–532

  10. 10.

    Erwin A, Gopalan RP, Achuthan NR (2008) Efficient mining of high utility itemsets from large datasets. In: Proceedings of the 12th Pacific-Asia conference on knowledge discovery and data mining, pp 554–561

  11. 11.

    Guyet T, Quiniou R (2008) Mining temporal patterns with quantitative intervals. In: Proceedings of the 18th IEEE international conference on data mining workshops, pp 218–227

  12. 12.

    Han JW, Pei J, Yin YW (2000) Mining frequent patterns without candidate generation. In: ACM special interest group on management of data, vol 29(2), pp 1–12

    Article  Google Scholar 

  13. 13.

    Huang F, Zou Z, Liu X, He J (2012) Association rules mining for academic cooperation based on time extension and duration accumulation. In: 2012 international conference on computer science and service system, pp 2007–2012. https://doi.org/10.1109/CSSS.2012.500

  14. 14.

    Huang JW, Tseng CY, Ou JC, Chen MS (2008) A general model for sequential pattern mining with a progressive database. IEEE Trans Knowl Data Eng 20(9):1153–1167

    Article  Google Scholar 

  15. 15.

    Kostakis O, Papapetrou P (2015) Finding the longest common sub-pattern in sequences of temporal intervals. Data Min Knowl Discov 29(5):1178–1210. https://doi.org/10.1007/s10618-015-0404-3

    MathSciNet  Article  MATH  Google Scholar 

  16. 16.

    Laxman S, Sastry P, Unnikrishnan K (2007) Discovering frequent generalized episodes when events persist for different durations. IEEE Trans Knowl Data Eng 19(9):1188–1201. https://doi.org/10.1109/TKDE.2007.1055

    Article  Google Scholar 

  17. 17.

    Liu J, Wang K, Fung BCM (2012) Direct discovery of high utility itemsets without candidate generation. In: 2012 IEEE 12th international conference on data mining, pp. 984–989

  18. 18.

    Liu J, Wang K, Fung BCM (2016) Mining high utility patterns in one phase without generating candidates. IEEE Trans Knowl Data Eng 28(5):1245–1257

    Article  Google Scholar 

  19. 19.

    Liu M, Qu J (2012) Mining high utility itemsets without candidate generation. In: Proceedings of the 21st ACM international conference on information and knowledge management, CIKM ’12. ACM, New York, pp 55–64

  20. 20.

    Mörchen F, Fradkin D (2010) Robust mining of time intervals with semi-interval partial order patterns. In: Proceedings of the 2010 SIAM international conference on data mining. SIAM, Philadelphia, pp 315–326

  21. 21.

    Nakagaito F, Ozaki T, Ohkawa T (2009) Discovery of quantitative sequential patterns from event sequences. In: Proceedings of the 19th IEEE international conference on data mining workshops, pp 31–36

  22. 22.

    Papapetrou P, Kollios G, Sclaroff S (2009) Mining frequent arrangements of temporal intervals. Knowl Inf Syst 21(2):133–171

    Article  Google Scholar 

  23. 23.

    Papapetrou P, Kollios G, Sclaroff S, Gunopulos D (2005) Discovering frequent arrangements of temporal intervals. In: Proceedings of the 15th IEEE international conference on data mining, pp 354–361

  24. 24.

    Patel D, Hsu W, Lee ML (2008) Mining relationships among interval-based events for classification. In: Proceedings of the ACM special interest group on management of data, pp 393–404

  25. 25.

    Pei J, Han J, Mortazavi-Asl B, Wang J, Pinto H, Chen Q, Dayal U, Hsu MC (2004) Mining sequential patterns by pattern-growth: the prefixspan approach. IEEE Trans Knowl Data Eng 16(11):1424–1440

    Article  Google Scholar 

  26. 26.

    Ryang H, Yun U (2015) Top-k high utility pattern mining with effective threshold raising strategies. Knowl Based Syst 76:109–126

    Article  Google Scholar 

  27. 27.

    Tseng VS, Wu CW, Fournier-Viger P, Yu PS (2016) Efficient algorithms for mining top-k high utility itemsets. IEEE Trans Knowl Data Eng 28(1):54–67

    Article  Google Scholar 

  28. 28.

    Tseng VS, Wu CW, Shie BE, Yu PS (2010) UP-Growth: an efficient algorithm for high utility itemset mining. In: Proceedings of the 16th ACM SIGKDD international conference on knowledge discovery and data mining, pp 253–262

  29. 29.

    Wu CW, Lin YF, Yu PS, Tseng VS (2013) Mining high utility episodes in complex event sequences. In: Proceedings of the 19th ACM SIGKDD international conference on knowledge discovery and data mining, KDD ’13. ACM, New York, pp 536–544

  30. 30.

    Wu CW, Shie BE, Tseng VS, Yu PS (2012) Mining top-K high utility itemsets. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 78–86

  31. 31.

    Wu SY, Chen YL (2007) Mining nonambiguous temporal patterns for interval-based events. IEEE Trans Knowl Data Eng 19(6):742–758

    Article  Google Scholar 

  32. 32.

    Yao H, Hamilton HJ, Geng L (2006) A unified framework for utility-based measures for mining itemsets. In: Proceedings of 2nd workshop on utility-based data mining, joint with the 11th ACM SIGKDD international conference on knowledge discovery and data mining, pp 28–37

  33. 33.

    Yin JF, Zheng ZG, Cao LB (2012) USpan: an efficient algorithm for mining high utility sequential patterns. In: Proceedings of the 18th ACM SIGKDD international conference on knowledge discovery and data mining, pp 660–668

  34. 34.

    Yin JF, Zheng ZG, Cao LB, Song Y, Wei W (2013) Efficiently mining top-K high utility sequential patterns. In: Proceedings of the 23th IEEE international conference on data mining, pp 1259–1264

Download references

Author information

Affiliations

Authors

Corresponding author

Correspondence to Bijay Prasad Jaysawal.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Huang, J., Jaysawal, B.P., Chen, K. et al. Mining frequent and top-K High Utility Time Interval-based Events with Duration patterns. Knowl Inf Syst 61, 1331–1359 (2019). https://doi.org/10.1007/s10115-019-01333-6

Download citation

Keywords

  • Temporal pattern mining
  • Time interval-based events with duration pattern mining
  • High utility time interval-based events pattern mining
  • Data mining