A sequential tree approach for incremental sequential pattern mining

Boghey, Rajesh Kumar; Singh, Shailendra

doi:10.1007/s12046-016-0567-7

Published: 25 November 2016

Volume 41, pages 1369–1380, (2016)

Sādhanā



Abstract

Sequential pattern mining is a prominent method for discovering knowledge in large databases. Common sequential pattern mining algorithms handle static databases. In practice, however, databases grow exponentially, which creates the need for mining algorithms designed for updated data. Once the database is updated, the previous mining result is no longer correct, and the entire mining process has to be restarted on the updated sequential database. Incremental mining of sequential patterns avoids rescanning the entire database. The previous approaches and techniques are Apriori-based frameworks, whereas more advanced pattern mining techniques give the desired solution. We propose an algorithm called STISPM for incremental mining of sequential patterns using the sequence tree structure. STISPM uses a depth-first approach along with backward tracking and a dynamic lookahead pruning strategy that removes infrequent and irregular patterns. The path from the root node to any leaf node represents a sequential pattern in the database. This structural characteristic makes the sequence tree well suited to incremental sequential pattern mining. The sequence tree also stores every sequential pattern together with its count, so whenever the support threshold is changed, our algorithm, which uses the frequent sequence tree as its storage structure, can find all the sequential patterns without mining the database again.
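The idea of keeping pattern counts in a tree, so that a new support threshold can be answered without mining the database again, can be illustrated with a small sketch. This is not the authors' STISPM algorithm: it has no backward tracking or lookahead pruning, it treats every element of a sequence as a single item, and it simply enumerates subsequences up to a fixed length. The names `SeqNode`, `SequenceTree`, `add_sequence`, `patterns`, and `max_len` are assumptions made for illustration only.

```python
from itertools import combinations


class SeqNode:
    """One tree node; the path from the root to this node spells a pattern."""

    def __init__(self):
        self.count = 0        # number of stored sequences containing the pattern
        self.children = {}    # item -> SeqNode


class SequenceTree:
    """Toy sequence tree holding support counts (illustrative, not STISPM)."""

    def __init__(self, max_len=3):
        self.root = SeqNode()
        self.max_len = max_len  # assumed cap that keeps enumeration small

    def add_sequence(self, sequence):
        """Add one new sequence; only this sequence is scanned."""
        subsequences = set()
        longest = min(self.max_len, len(sequence))
        for k in range(1, longest + 1):
            for idx in combinations(range(len(sequence)), k):
                subsequences.add(tuple(sequence[i] for i in idx))
        # Each distinct subsequence is counted once per sequence.
        for pattern in subsequences:
            node = self.root
            for item in pattern:
                node = node.children.setdefault(item, SeqNode())
            node.count += 1

    def patterns(self, min_support):
        """Read frequent patterns from the tree without touching the data."""
        result = {}

        def walk(node, prefix):
            for item, child in node.children.items():
                if child.count >= min_support:
                    path = prefix + (item,)
                    result[path] = child.count
                    # Extensions of an infrequent prefix cannot be frequent,
                    # so only frequent nodes are explored further.
                    walk(child, path)

        walk(self.root, ())
        return result


tree = SequenceTree()
for seq in (["a", "b", "c"], ["a", "c"], ["b", "c"]):
    tree.add_sequence(seq)

at_two = tree.patterns(2)       # query with one threshold
at_three = tree.patterns(3)     # change the threshold, no re-mining
tree.add_sequence(["a", "b"])   # incremental update with a new sequence
after_update = tree.patterns(3)
```

Because the counts live in the tree, the second query and the query after the update only traverse the stored nodes; the earlier sequences are never read again.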




Author information

Authors and Affiliations

Department of Computer Science and Engineering, Technocrats Institute of Technology (Excellence), Bhopal, 462021, India
Rajesh Kumar Boghey
Department of Computer Engineering and Application, National Institute of Technical Teachers Training and Research, Bhopal, 462002, India
Shailendra Singh


Corresponding author

Correspondence to Rajesh Kumar Boghey.


About this article

Cite this article

Boghey, R.K., Singh, S. A sequential tree approach for incremental sequential pattern mining. Sādhanā 41, 1369–1380 (2016). https://doi.org/10.1007/s12046-016-0567-7


Received: 21 October 2015
Revised: 04 May 2016
Accepted: 26 May 2016
Published: 25 November 2016
Issue Date: December 2016
DOI: https://doi.org/10.1007/s12046-016-0567-7
