A Set-Checking Algorithm for Mining Maximal Frequent Itemsets from Data Streams

Chang, Ye-In; Tsai, Meng-Hsuan; Li, Chia-En; Lin, Pei-Ying

doi:10.1007/978-1-4614-6747-2_29

Part of the book series: Lecture Notes in Electrical Engineering (LNEE, volume 234)

Abstract

Online mining of maximal frequent itemsets over data streams is an important problem in data mining. To mine maximal frequent itemsets from data streams under the Landmark Window model, Mao et al. proposed the INSTANT algorithm. Its structure is simple and it saves considerable memory, but it takes a long time to mine the maximal frequent itemsets: when a new transaction arrives, INSTANT performs too many comparisons against the old transactions. Therefore, in this chapter, we propose the Set-Checking algorithm to mine maximal frequent itemsets from data streams using the Landmark Window model. We store our information in a lattice structure, which records the subset relationship between each child node and its parent node. Our simulation results show that the processing time of the Set-Checking algorithm is shorter than that of the INSTANT algorithm.
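
To make the problem statement concrete, the following is a minimal brute-force sketch of what "maximal frequent itemsets under a landmark window" means. It is neither the Set-Checking algorithm nor INSTANT, neither of which is specified in this abstract; the function name, the sample stream, and the absolute support threshold are all assumptions chosen for illustration. An itemset is frequent if at least `min_support` transactions contain it, and maximal if no proper superset is also frequent.

```python
from itertools import combinations


def maximal_frequent_itemsets(transactions, min_support):
    """Brute-force illustration (hypothetical helper, not the chapter's method).

    Counts the support of every itemset that occurs in some transaction,
    then keeps the frequent itemsets that have no frequent proper superset.
    """
    counts = {}
    for transaction in transactions:
        items = sorted(set(transaction))
        # Every non-empty subset of a transaction is supported by it once.
        for size in range(1, len(items) + 1):
            for combo in combinations(items, size):
                key = frozenset(combo)
                counts[key] = counts.get(key, 0) + 1

    frequent = [s for s, c in counts.items() if c >= min_support]
    # `s < other` is the proper-subset test on frozensets.
    return {s for s in frequent if not any(s < other for other in frequent)}


# Landmark window: every transaction since the landmark stays in the window.
stream = [{"a", "b", "c"}, {"a", "b"}, {"a", "c"}, {"b", "d"}]
window = []
for transaction in stream:
    window.append(transaction)
    result = maximal_frequent_itemsets(window, min_support=2)

print(sorted(sorted(s) for s in result))
```

Recomputing from scratch on every arrival, as this sketch does, is exponential in transaction length and is exactly the kind of cost an incremental structure such as the lattice described above aims to avoid.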

References

Agrawal R, Srikant R (1994) Fast algorithms for mining association rules in large databases. In: 20th international conference on very large data bases. Morgan Kaufmann, San Francisco, pp 487–499
Li H, Zhang N (2010) Mining maximal frequent itemsets over a stream sliding window. In: IEEE youth conference on information computing and telecommunications. IEEE Press, New York, pp 110–113
Li JW, Lee GQ (2009) Mining frequent itemsets over data streams using efficient window sliding techniques. Int J Expert Syst Appl 36(2):1466–1477. Pergamon Press, New York
Lin KC, Liao IE, Chen ZS (2011) An improved frequent pattern growth method for mining association rules. Int J Expert Syst Appl 38(5):5154–5161. Pergamon Press, New York
Mao G, Wu X, Zhu X, Chen G, Liu C (2007) Mining maximal frequent itemsets from data streams. J Inf Sci 33(3):251–262. Sage, Thousand Oaks
Xin JW, Yang GQ, Sun JZ, Zhang YP (2006) A new algorithm for discovery maximal frequent itemsets based on binary vector sets. In: 5th international conference on machine learning and cybernetics. IEEE Press, New York, pp 1120–1124

Acknowledgments

This research was supported in part by the National Science Council of the Republic of China under Grant No. NSC-101-2221-E-110-091-MY2.

Author information

Authors and Affiliations

Department of Computer Science and Engineering, National Sun Yat-Sen University, Kaohsiung, Taiwan, R.O.C.
Ye-In Chang, Meng-Hsuan Tsai, Chia-En Li & Pei-Ying Lin

Corresponding author

Correspondence to Ye-In Chang.

Editor information

Editors and Affiliations

School of Engineering, Mercer University, 151 Brookefield Drive, Macon, 31210, Georgia, USA
Jengnan Juang
National Changhua University of Education, No. 1, Jin De Road, Changhua City, 500, Taiwan R.O.C.
Yi-Cheng Huang

Cite this paper

Chang, YI., Tsai, MH., Li, CE., Lin, PY. (2013). A Set-Checking Algorithm for Mining Maximal Frequent Itemsets from Data Streams. In: Juang, J., Huang, YC. (eds) Intelligent Technologies and Engineering Systems. Lecture Notes in Electrical Engineering, vol 234. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6747-2_29

DOI: https://doi.org/10.1007/978-1-4614-6747-2_29
Published: 28 February 2013
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4614-6746-5
Online ISBN: 978-1-4614-6747-2
eBook Packages: Engineering, Engineering (R0)