Maximal Frequent Itemset Mining Using Breadth-First Search with Efficient Pruning

Sumathi, K.; Kannan, S.; Nagarajan, K.

doi:10.1007/978-981-10-8681-6_31

K. Sumathi⁶,
S. Kannan⁷ &
K. Nagarajan⁸

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 15))

1873 Accesses
1 Citations

Abstract

Maximal Frequent Patterns can be mined using breadth-first search or depth-first search. The pure BFS algorithms work well when all maximal frequent itemsets are short. The pure DFS algorithms work well when all maximal frequent itemsets are long. Both the pure BFS and pure DFS techniques will not be efficient, when the dataset contains some of long maximal frequent itemsets and some of short maximal frequent itemsets. Efficient pruning techniques are required to mine MFI from these kinds of datasets. An algorithm (MFIMiner) using Breadth-First search with efficient pruning mechanism that competently mines both long and short maximal frequent itemsets is proposed in this paper. The performance of the algorithm is evaluated and compared with GenMax and Mafia algorithms for T40I10D100K, T10I4D100K, and Retail dataset. The result shows that the proposed algorithm has significant improvement than existing algorithms for sparse datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bayardo, R.: Efficiently mining long patterns from databases. In ACM SIGMOD Conference (1998)
Google Scholar
Lin, D., Kedem, Z.M.: Pincer-Search: a new algorithm for discovering the maximum frequent set. In: Proceedings of VI International Conference on Extending Database Technology (1998)
Google Scholar
Agrawal, R., Aggarwal, C., Prasad, V.: Depth first generation of long patterns. In: 7th International Conference on Knowledge Discovery and Data Mining, pp. 108–118 (2000)
Google Scholar
Burdick, D., Calimlim, M., Gehrke, J.: MAFIA: a maximal frequent itemset algorithm for transactional databases. In: International Conference on Data Engineering, pp: 443–452, April 2001, doi: 10.1.1.100.6805
Google Scholar
Gouda, K., Zaki, M.J. (2005). GenMax: an efficient algorithm for mining maximal frequent itemsets. Proc. Data Min. Knowl. Discov, 11, 1–20 (2005)
Article MathSciNet Google Scholar
www.fimi.cs.helsinki.fi/fimi03/datasets.html
Agrawal, R., Imielienski, T., Swami, A.: Mining association rules between sets of items in largedatabases. In: Bunemann, P., Jajodia, S. (eds.) Proceedings of the 1993 ACM SIGMOD Conference on Management of Data, pp. 207–216. ACM Press, Newyork (1993)
Google Scholar
https://arxiv.org/ftp/arxiv/papers/1109/1109.2427.pdf
http://www.aaai.org/Papers/KDD/1997/KDD97-060.pdf
http://sci-hub.tw/http://www.aaai.org/Papers/KDD/1997/KDD97-060.pdf
http://www.philippe-fournier-viger.com/spmf/LCM2.pdf
https://link.springer.com/chapter/10.1007/3-540-44957-4_65

Download references

Author information

Authors and Affiliations

Kalasalingam Academy of Research and Education, Krishnan Koil, Virudhu Nagar, India
K. Sumathi
Madurai Kamaraj University, Madurai, India
S. Kannan
Tata Consultancy Service, Chennai, Tamil Nadu, India
K. Nagarajan

Authors

K. Sumathi
View author publications
You can also search for this author in PubMed Google Scholar
S. Kannan
View author publications
You can also search for this author in PubMed Google Scholar
K. Nagarajan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to K. Sumathi .

Editor information

Editors and Affiliations

Department of CSE, RVS Technical Campus, Coimbatore, Tamil Nadu, India
S. Smys
Department of Telecommunication Engineering, Czech Technical University in Prague, Czechia, Czech Republic
Robert Bestak
Department of Electrical Engineering, Dayeh University, Taiwan, Taiwan
Joy Iong-Zong Chen
Faculty of Informatics and Information Technology, Slovak University of Technology in Bratislava, Bratislava, Slovakia
Ivan Kotuliak

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Sumathi, K., Kannan, S., Nagarajan, K. (2019). Maximal Frequent Itemset Mining Using Breadth-First Search with Efficient Pruning. In: Smys, S., Bestak, R., Chen, JZ., Kotuliak, I. (eds) International Conference on Computer Networks and Communication Technologies. Lecture Notes on Data Engineering and Communications Technologies, vol 15. Springer, Singapore. https://doi.org/10.1007/978-981-10-8681-6_31

Download citation

DOI: https://doi.org/10.1007/978-981-10-8681-6_31
Published: 18 September 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-8680-9
Online ISBN: 978-981-10-8681-6
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics