Abstract
Frequent pattern mining plays an important role in the data mining community since it is usually a fundamental step in various mining tasks. However, maintenance of frequent patterns is very expensive in the incremental database. In addition, the status of a pattern changes with time. In other words, a frequent pattern is possible to become infrequent, and vice versa. In order to exactly find all frequent patterns, most algorithms have to scan the original database completely whenever an update occurs. In this paper, we propose a new algorithm iTM, stands for incremental Transaction Mapping algorithm for incremental frequent pattern mining without rescanning the whole database. It transfers the transaction dataset to the vertical representation such that the incremental dataset can be integrated to the original database easily. As demonstrated in our experiments, the proposed method is very efficient and suitable for mining frequent patterns in the incremental database.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agrawal, R., Imielinski, T., Swami, A.: Mining Association Rules Between Sets of Items in Large Databases. In: 1993 ACM SIGMOD Conference, ACM SIGMOD, Washington, DC, pp. 207–216 (1993)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann, San Francisco (1993)
Breiman, L., Friedman, J.H., Olshen, R.A., Stone, C.J.: Classification and Regression Trees. Wadsworth International Group (1984)
Lloyd, S.P.: Least Squares Quantization in PCM. IEEE Transactions on Information Theory it-28(2), 129–137 (1982)
Song, M., Rajasekaran, S.: A Transaction Mapping Algorithm for Frequent Itemsets Mining. IEEE Transactions on Knowledge and Data Engineering 18(4), 472–481 (2006)
Cheung, D.W., Han, J., Ng, V.T., Wong, C.Y.: Maintenance of Discovered Association Rules in Large Databases: An Incremental Updating Technique. In: 12th International Conference on Data Engineering, ICDE, New Orleans, pp. 106–114 (1996)
Cheung, D.W., Lee, S.D., Kao, B.: A General Incremental Technique for Maintaining Discovered Association Rules. In: Fifth International Conference on Database Systems for Advanced Application, DASFAA, Melbourne, pp. 185–194 (1997)
Chang, C.C., Li, Y.C., Lee, J.S.: An Efficient Algorithm for Incremental Mining of Association Rules. In: 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications, RIDE-SDMA 2005, pp. 3–10 (2005)
Zhang, S., Zhang, J., Zhang, C.: EDUA: An Efficient Algorithm for Dynamic Database Mining. Information Sciences 177, 2756–2767 (2007)
Liu, B., Hsu, W., Ma, Y.: Mining Association Rules With Multiple Minimum Supports. In: Fifth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, ACM SIGKDD, San Diego, pp. 337–341 (1999)
Pei, J., Han, J., Mortazavi-Asl, B., Pinto, H., Chen, Q., Dayal, U., Hsu, M.C.: PrefixSpan: Mining Sequential Patterns Efficiently by Prefix-projected Pattern Growth. In: 17th International Conference on Data Engineering, ICDE 2001, pp. 215–224 (2001)
Leung, C.K.S., Khan, Q.I., Hoque, T.: CanTree: A Tree Structure for Efficient Incremental Mining of Frequent Patterns. In: Fifth IEEE International Conference on Data Mining, pp. 274–281. IEEE Computer Society Press, Los Alamitos (2005)
Thomas, S., Bodagala, S., Alsabti, K., Ranka, S.: An Efficient Algorithm for the Incremental Updation of Association Rules in Large Databases. In: Third International Conference on Knowledge Discovery and Data Mining, KDD 1997, Newport Beach, pp. 263–266 (1997)
Lee, C.H., Lin, C.R., Chen, M.S.: Sliding-window Filtering: An Efficient Algorithm for Incremental Mining. In: Tenth International Conference on Information and Knowledge Management, CIKM 2001, Atlanta, pp. 263–270 (2001)
Cheung, W., Zaiane, O.R.: Incremental Mining of Frequent Patterns Without Candidate Generation or Support Constraint. In: Seventh International Database Engineering and Applications Symposium, pp. 111–116. IEEE Computer Society Press, Los Alamitos (2003)
Koh, J.-L., Shieh, S.-F.: An efficient approach for maintaining association rules based on adjusting FP-tree structures1. In: Lee, Y., Li, J., Whang, K.-Y., Lee, D. (eds.) DASFAA 2004. LNCS, vol. 2973, pp. 417–424. Springer, Heidelberg (2004)
Frequent Itemset Mining Dataset Repository, http://fimi.cs.helsinki.fi/data/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Dai, BR., Lin, PY. (2009). iTM: An Efficient Algorithm for Frequent Pattern Mining in the Incremental Database without Rescanning. In: Chien, BC., Hong, TP., Chen, SM., Ali, M. (eds) Next-Generation Applied Intelligence. IEA/AIE 2009. Lecture Notes in Computer Science(), vol 5579. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-02568-6_77
Download citation
DOI: https://doi.org/10.1007/978-3-642-02568-6_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-02567-9
Online ISBN: 978-3-642-02568-6
eBook Packages: Computer ScienceComputer Science (R0)