Skip to main content
Log in

A sequential tree approach for incremental sequential pattern mining

  • Published:
Sādhanā Aims and scope Submit manuscript

Abstract

“Sequential pattern mining” is a prominent and significant method to explore the knowledge and innovation from the large database. Common sequential pattern mining algorithms handle static databases. Pragmatically, looking into the functional and actual execution, the database grows exponentially thereby leading to the necessity and requirement of such innovation, research, and development culminating into the designing of mining algorithm. Once the database is updated, the previous mining result will be incorrect, and we need to restart and trigger the entire mining process for the new updated sequential database. To overcome and avoid the process of rescanning of the entire database, this unique system of incremental mining of sequential pattern is available. The previous approaches, system, and techniques are a priori-based frameworks but mine patterns is an advanced and sophisticated technique giving the desired solution. We propose and incorporate an algorithm called STISPM for incremental mining of sequential patterns using the sequence tree space structure. STISPM uses the depth-first approach along with backward tracking and the dynamic lookahead pruning strategy that removes infrequent and irregular patterns. The process and approach from the root node to any leaf node depict a sequential pattern in the database. The structural characteristic of the sequence tree makes it convenient and appropriate for incremental sequential pattern mining. The sequence tree also stores all the sequential patterns with its count and statistics, so whenever the support system is withdrawn or changed, our algorithm using frequent sequence tree as the storage structure can find and detect all the sequential patterns without mining the database once again.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7

Similar content being viewed by others

References

  1. Agarwal R and Srikant R 1995 Mining sequential pattern. In: Proceedings of the 11th International Conference on Data Engineering pp. 3–14

  2. Zhao Q and Bhowmick S S 2003 Sequential pattern mining: A survey. Technical Report, CAIS, Nanyang Technological University Singapore. No. 118

    Google Scholar 

  3. Han J, Pei J, Mortazavi Asl B, Chen Q, Dayal U and Hsu M C 2000a FreeSpan: Frequent pattern-projected sequential pattern mining. In: Proceedings of the 6th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining pp. 355–359

  4. Cheung D W, Lee D W and Kao S D 1997 A general incremental technique for maintaining discovered association rules. In: Proceedings of the 5th International Conference on Database System for Advanced Applications, pp. 185–194

  5. Lin M Y and Lee S Y 1998 Incremental update on sequential patterns in large databases. In: The 10th IEEE international conference on tools with artificial intelligence, pp. 24–31

  6. Sarda N L and Srinivas N V 1998 An adaptive algorithm for incremental mining of association rules. In: The 9th International Workshop on Database and Expert Systems, pp. 240–245

  7. Zhang S 1999 Aggregation and maintenance for database mining. Intell. Data Anal. 475–490

  8. Srikant R and Agrawal R 1996 Mining sequential patterns: Generalizations and performance improvements. In: Proceedings of the 5th International Conference on Extending Database Technology, pp. 3–17

  9. Garofalakis M N, Rastogi R and Shim K 1999 SPIRIT: Sequential pattern mining regular expression constraints. In: Proceedings of the 25th international conference on very large data base, pp. 223–234

  10. Parthasarathy S, Zaki M, Ogihara M and Dwarkadas S 1999 Incremental and interactive sequence mining. In: Proceedings of the 8th International Conference on Information and Knowledge Management, pp. 251–258

  11. Ayan N F, Tansel A U and Arkun E 1999 An Efficient algorithm to update large itemsets with early pruning. In: Proceedings of the 5th ACM SIGKDD International Conferences on Knowledge Discovery and Data Mining, pp. 287–291

  12. Omiecinski E and Savasere A 1998 Efficient mining of association rules in large dynamic databases. In: Proceedings of 16th British national conference on databases, pp. 49–63

  13. Cheng H, Yan X and Han J 2004 IncSpan: Incremental mining of sequential patterns in large database. In: Proceedings of the 10th ACM SIGKDD International Conference Knowledge Discovery and Data Mining, pp. 527–532

  14. Huang J W, Tseng C Y, Ou J C and Chen M S 2008 A general model for sequential pattern mining with a progressive database. IEEE Trans. Knowl. Data Eng. 20(9): 1153–1167

    Article  Google Scholar 

  15. Hong T P, Wang C Y and Tseng S S 2011 An incremental mining algorithm for maintaining sequential patterns using pre-large sequences. Expert Syst. Appl. 38: 7051–7058

    Article  Google Scholar 

  16. Pei J, Han J, Mortazavi Asl B, Pinto H, Chen Q, Dayal U and Hsu M C 2001 Prefixspan: mining sequential patterns efficiently by prefix-projected pattern growth. In: International Conference on Knowledge Discovery in Databases and Data Mining, pp. 215–224

  17. Ayres J, Flannick J, Gehrke J and Yiu T 2002 Sequential pattern mining using a bitmap representation. In: Proceedings of the 8th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 429–435

  18. Zaki M J 2001 SPADE: An efficient algorithm for mining frequent sequences. In: Proc. Mach. Learn. (special issue on unsupervised learning), 42: 31–60

    MATH  Google Scholar 

  19. Han J, Pei J and Yin Y 2000b Mining frequent patterns without candidate generation. In: The Proceedings of 2000 ACM SIGMOD International Conference on Management of Data pp. 1–12

  20. Lin C W, Hong T P and Lu W H 2009 The Pre-FUFP algorithm for incremental mining. J. Expert Syst. Appl. 36: 9498–9505

    Article  Google Scholar 

  21. Hong T P, Lin J W and Wu Y L 2006 A fast updated frequent pattern tree. In: The IEEE International Conference on System, Man, and Cybernetics, pp. 2167–2172

  22. Cheung D W, Han J, Ng V T and Wong C Y 1996 Maintenance of discovered association rules in large databases: An incremental updating approach. In: 12th IEEE International Conference on Data Engineering, pp. 106–114

  23. Agrawal R and Srikant R 1994 Fast algorithms for mining association rules. In: Proceedings of the 20th International Conference on Very Large Databases, pp. 487–499

  24. Thomas S, Bodagala S, Alsabti K and Ranka S 1997 An efficient algorithm for the incremental updating of association rules in large databases. In: Proceedings of 3rd International Conference on Knowledge Discovery and Data Mining, pp. 263–266

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Rajesh Kumar Boghey.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Boghey, R.K., Singh, S. A sequential tree approach for incremental sequential pattern mining. Sādhanā 41, 1369–1380 (2016). https://doi.org/10.1007/s12046-016-0567-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12046-016-0567-7

Keywords

Navigation