iTM: An Efficient Algorithm for Frequent Pattern Mining in the Incremental Database without Rescanning

Dai, Bi-Ru; Lin, Pai-Yu

doi:10.1007/978-3-642-02568-6_77

Bi-Ru Dai²³ &
Pai-Yu Lin²³

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5579))

Included in the following conference series:

International Conference on Industrial, Engineering and Other Applications of Applied Intelligent Systems

1600 Accesses
1 Citations

Abstract

Frequent pattern mining plays an important role in the data mining community since it is usually a fundamental step in various mining tasks. However, maintenance of frequent patterns is very expensive in the incremental database. In addition, the status of a pattern changes with time. In other words, a frequent pattern is possible to become infrequent, and vice versa. In order to exactly find all frequent patterns, most algorithms have to scan the original database completely whenever an update occurs. In this paper, we propose a new algorithm iTM, stands for incremental Transaction Mapping algorithm for incremental frequent pattern mining without rescanning the whole database. It transfers the transaction dataset to the vertical representation such that the incremental dataset can be integrated to the original database easily. As demonstrated in our experiments, the proposed method is very efficient and suitable for mining frequent patterns in the incremental database.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules Between Sets of Items in Large Databases. In: 1993 ACM SIGMOD Conference, ACM SIGMOD, Washington, DC, pp. 207–216 (1993)
Google Scholar
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth International Group (1984)
Google Scholar
Lloyd, S.P.: Least Squares Quantization in PCM. IEEE Transactions on Information Theory it-28(2), 129–137 (1982)
Google Scholar
Song, M., Rajasekaran, S.: A Transaction Mapping Algorithm for Frequent Itemsets Mining. IEEE Transactions on Knowledge and Data Engineering 18(4), 472–481 (2006)
Article Google Scholar
Cheung, D.W., Han, J., Ng, V.T., Wong, C.Y.: Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique. In: 12th International Conference on Data Engineering, ICDE, New Orleans, pp. 106–114 (1996)
Google Scholar
Cheung, D.W., Lee, S.D., Kao, B.: A General Incremental Technique for Maintaining Discovered Association Rules. In: Fifth International Conference on Database Systems for Advanced Application, DASFAA, Melbourne, pp. 185–194 (1997)
Google Scholar
Chang, C.C., Li, Y.C., Lee, J.S.: An Efficient Algorithm for Incremental Mining of Association Rules. In: 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications, RIDE-SDMA 2005, pp. 3–10 (2005)
Google Scholar
Zhang, S., Zhang, J., Zhang, C.: EDUA: An Efficient Algorithm for Dynamic Database Mining. Information Sciences 177, 2756–2767 (2007)
Article Google Scholar
Liu, B., Hsu, W., Ma, Y.: Mining Association Rules With Multiple Minimum Supports. In: Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM SIGKDD, San Diego, pp. 337–341 (1999)
Google Scholar
Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-projected Pattern Growth. In: 17th International Conference on Data Engineering, ICDE 2001, pp. 215–224 (2001)
Google Scholar
Leung, C.K.S., Khan, Q.I., Hoque, T.: CanTree: A Tree Structure for Efficient Incremental Mining of Frequent Patterns. In: Fifth IEEE International Conference on Data Mining, pp. 274–281. IEEE Computer Society Press, Los Alamitos (2005)
Chapter Google Scholar
Thomas, S., Bodagala, S., Alsabti, K., Ranka, S.: An Efficient Algorithm for the Incremental Updation of Association Rules in Large Databases. In: Third International Conference on Knowledge Discovery and Data Mining, KDD 1997, Newport Beach, pp. 263–266 (1997)
Google Scholar
Lee, C.H., Lin, C.R., Chen, M.S.: Sliding-window Filtering: An Efficient Algorithm for Incremental Mining. In: Tenth International Conference on Information and Knowledge Management, CIKM 2001, Atlanta, pp. 263–270 (2001)
Google Scholar
Cheung, W., Zaiane, O.R.: Incremental Mining of Frequent Patterns Without Candidate Generation or Support Constraint. In: Seventh International Database Engineering and Applications Symposium, pp. 111–116. IEEE Computer Society Press, Los Alamitos (2003)
Google Scholar
Koh, J.-L., Shieh, S.-F.: An efficient approach for maintaining association rules based on adjusting FP-tree structures1. In: Lee, Y., Li, J., Whang, K.-Y., Lee, D. (eds.) DASFAA 2004. LNCS, vol. 2973, pp. 417–424. Springer, Heidelberg (2004)
Chapter Google Scholar
Frequent Itemset Mining Dataset Repository, http://fimi.cs.helsinki.fi/data/

Download references

Author information

Authors and Affiliations

Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, ROC
Bi-Ru Dai & Pai-Yu Lin

Authors

Bi-Ru Dai
View author publications
You can also search for this author in PubMed Google Scholar
Pai-Yu Lin
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Information Engineering, National University of Tainan, 700, Tainan, Taiwan
Been-Chian Chien
Department of Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan
Tzung-Pei Hong
Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan
Shyi-Ming Chen
Department of Computer Science, Texas State University-San Marcos, 601 University Drive, 78666-4616, San Marcos, TX, USA
Moonis Ali

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Dai, BR., Lin, PY. (2009). iTM: An Efficient Algorithm for Frequent Pattern Mining in the Incremental Database without Rescanning. In: Chien, BC., Hong, TP., Chen, SM., Ali, M. (eds) Next-Generation Applied Intelligence. IEA/AIE 2009. Lecture Notes in Computer Science(), vol 5579. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02568-6_77

Download citation

DOI: https://doi.org/10.1007/978-3-642-02568-6_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02567-9
Online ISBN: 978-3-642-02568-6
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics