Abstract
Online mining the maximal frequent itemsets over data streams is an important problem in data mining. In order to solve mining maximal frequent itemsets from data streams using the Landmark Window model, Mao et al. propose the INSTANT algorithm. The structure of the INSTANT algorithm is simple and it can save much memory space. But it takes long time in mining the maximal frequent itemsets. When the new transaction comes, the number of comparisons between the old transactions of the INSTANT algorithm is too much. Therefore, in this chapter, we propose the Set-Checking algorithm to mine frequent itemsets from data streams using the Landmark Window model. We use the structure of the lattice to store our information. The structure of the lattice records the subset relationship between the child node and the parent node. From our simulation results, we show that the process time of our Set-Checking algorithm is faster than that of the INSTANT algorithm.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agrawal R, Srikant R (1994) Fast algorithm for mining association rules in large databases. In: 20th international conference on very large data bases. Morgan Kaufmann, San Francisco, pp 487–499
Li H, Zhang N (2010) Mining maximal frequent itemsets over a stream sliding window. In: IEEE youth conference on information computing and telecommunications. IEEE Press, New York, pp 110–113
Li JW, Lee GQ (2009) Mining frequent itemsets over data streams using efficient window sliding techniques. Int J Expert Syst Appl 36(2):1466–1477. Pergamon Press, New York
Lin KC, Liao IE, Chen ZS (2011) An improved frequent pattern growth method for mining association rules. Int J Expert Syst Appl 38(5):5154–5161. Pergamon Press, New York
Mao G, Wu X, Zhu X, Chen G, Liu C (2007) Mining maximal frequent itemsets from data streams. J Inf Sci 33(3):251–262. Sage, Thousand Oaks
Xin JW, Yang GQ, Sun JZ, Zhang YP (2006) A new algorithm for discovery maximal frequent itemsets based on binary vector sets. In: 5th international conference on machine learning and cybernetics. IEEE Press, New York, pp 1120–1124
Acknowledgments
The research was supported in part by the National Science Council of Republic of China under Grant No. NSC-101-2221-E-110-091-MY2.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer Science+Business Media New York
About this paper
Cite this paper
Chang, YI., Tsai, MH., Li, CE., Lin, PY. (2013). A Set-Checking Algorithm for Mining Maximal Frequent Itemsets from Data Streams. In: Juang, J., Huang, YC. (eds) Intelligent Technologies and Engineering Systems. Lecture Notes in Electrical Engineering, vol 234. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6747-2_29
Download citation
DOI: https://doi.org/10.1007/978-1-4614-6747-2_29
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-6746-5
Online ISBN: 978-1-4614-6747-2
eBook Packages: EngineeringEngineering (R0)