FP-Tree and Its Variants: Towards Solving the Pattern Mining Challenges

Borah, Anindita; Nath, Bhabesh

doi:10.1007/978-981-10-5828-8_51

Anindita Borah⁷ &
Bhabesh Nath⁷

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 79))

1319 Accesses
7 Citations

Abstract

Mining patterns from databases is like searching for precious gems which is a gruesome task but still a rewarding one. The frequent patterns are believed to be valuable assets for the researchers that provide them useful information. The frequent and rare pattern mining paradigm is broadly divided into Apriori and FP-Tree-based approaches. Experimental results and performance evaluation available in the literature have established the fact that FP-Tree-based approaches are superior to the Apriori ones on various grounds. This paper explores the various modifications of FP-Tree that were developed to tackle the major pattern mining research challenges. Through this paper, an attempt has been made to review the usefulness and applicability of the most eminent data structure in the domain of pattern mining, the FP-Tree.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

References

Agrawal, R., Imielinski, T., Swami, A.: Mining association rules between sets of items in large databases. In: Acm sigmod record. vol. 22, pp. 207–216. ACM (1993).
Google Scholar
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: ACM Sigmod Record. vol. 29, pp. 1–12. ACM (2000).
Google Scholar
Liu, G., Lu, H., Yu, J.X., Wang, W., Xiao, X.: Afopt: An efficient implementation of pattern growth approach. In: FIMI (2003).
Google Scholar
Racz, B.: nonordfp: An fp-growth variation without rebuilding the fp-tree. In: FIMI (2004).
Google Scholar
Wang, K., Tang, L., Han, J., Liu, J.: Top down fp-growth for association rule mining. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining. pp. 334–340. Springer (2002).
Google Scholar
Sucahyo, Y.G., Gopalan, R.P.: Ct-pro: A bottom-up non recursive frequent Itemset mining algorithm using compressed fp-tree data structure. In: FIMI. vol. 4, pp. 212–223 (2004).
Google Scholar
Grahne, G., Zhu, J.: Efficiently using prefix-trees in mining frequent itemsets. In: FIMI. vol. 90 (2003).
Google Scholar
Pei, J., Han, J., Mao, R., et al.: Closet: An efficient algorithm for mining frequent closed itemsets. In: ACM SIGMOD workshop on research issues in data mining and knowledge discovery. vol. 4, pp. 21–30 (2000).
Google Scholar
Wang, J., Han, J., Pei, J.: Closet+: Searching for the best strategies for mining frequent closed itemsets. In: Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining. pp. 236–245. ACM (2003).
Google Scholar
Grahne, G., Zhu, J.: Fast algorithms for frequent itemset mining using fp-trees. IEEE transactions on knowledge and data engineering 17(10), 1347–1362 (2005).
Google Scholar
Adnan, M., Alhajj, R.: Drfp-tree: disk-resident frequent pattern tree. Applied Intelligence 30(2), 84–97 (2009).
Google Scholar
Bonchi, F., Goethals, B.: Fp-bonsai: the art of growing and pruning small fp-trees. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining. pp. 155–160. Springer (2004).
Google Scholar
Xu, B., Yi, T., Wu, F., Chen, Z.: An incremental updating algorithm for mining association rules. Journal of Electronics (China) 19(4), 403–407 (2002).
Google Scholar
Koh, J.L., Shieh, S.F.: An efficient approach for maintaining association rules based on adjusting fp-tree structures. In: International Conference on Database Systems for Advanced Applications. pp. 417–424. Springer (2004).
Google Scholar
Cheung, W., Zaiane, O.R.: Incremental mining of frequent patterns without candidate generation or support constraint. In: Database Engineering and Applications Symposium, 2003. Proceedings. Seventh International. pp. 111–116. IEEE (2003).
Google Scholar
Leung, C.K.S., Khan, Q.I., Li, Z., Hoque, T.: Cantree: a canonical-order tree for incremental frequent-pattern mining. Knowledge and Information Systems 11(3), 287–311 (2007).
Google Scholar
Tanbeer, S.K., Ahmed, C.F., Jeong, B.S., Lee, Y.K.: Cp-tree: a tree structure for single-pass frequent pattern mining. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining. pp. 1022–1027. Springer (2008).
Google Scholar
Leung, C.K.S., Carmichael, C.L., Hao, B.: Efficient mining of frequent patterns from uncertain data. In: Seventh IEEE International Conference on Data Mining Workshops (ICDMW 2007). pp. 489–494. IEEE (2007).
Google Scholar
Calders, T., Garboni, C., Goethals, B.: Efficient pattern mining of uncertain data with sampling. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining. pp. 480–487. Springer (2010).
Google Scholar
Tseng, V.S., Wu, C.W., Shie, B.E., Yu, P.S.: Up-growth: an efficient algorithm for high utility itemset mining. In: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining. pp. 253–262. ACM (2010).
Google Scholar
Tseng, V.S., Shie, B.E., Wu, C.W., Philip, S.Y.: Efficient algorithms for mining high utility itemsets from transactional databases. IEEE transactions on knowledge and data engineering 25(8), 1772–1786 (2013).
Google Scholar
Lin, C.W., Hong, T.P., Lu, W.H.: An effective tree structure for mining high utility itemsets. Expert Systems with Applications 38(6), 7419–7424 (2011).
Google Scholar
Lin, C.W., Hong, T.P., Lu, W.H., Lin, W.Y.: An incremental fusp-tree maintenance algorithm. In: 2008 Eighth International Conference on Intelligent Systems Design and Applications. vol. 1, pp. 445–449. IEEE (2008).
Google Scholar
Pei, J., Han, J., Mortazavi-Asl, B., Zhu, H.: Mining access patterns efficiently from web logs. In: Pacific-Asia Conference on Knowledge Discovery and Data Mining. pp. 396–407. Springer (2000).
Google Scholar
Grahne, G., Zhu, J.: High performance mining of maximal frequent itemsets. In:6th International Workshop on High Performance Data Mining (2003).
Google Scholar
Yan, Y.J., Li, Z.J., Chen, H.W.: Efficiently mining of maximal frequent item sets based on fp-tree. Ruan Jian Xue Bao (J. Softw.) 16(2), 215–222 (2005).
Google Scholar
Giannella, C., Han, J., Pei, J., Yan, X., Yu, P.S.: Mining frequent patterns in data streams at multiple time granularities. Next generation data mining 212, 191–212 (2003).
Google Scholar
Leung, C.K.S., Khan, Q.I.: Dstree: a tree structure for the mining of frequent sets from data streams. In: Sixth International Conference on Data Mining (ICDM’06). pp. 928–932. IEEE (2006).
Google Scholar
Tanbeer, S.K., Ahmed, C.F., Jeong, B.S., Lee, Y.K.: Efficient frequent pattern mining over data streams. In: Proceedings of the 17th ACM conference on Information and knowledge management. pp. 1447–1448. ACM (2008).
Google Scholar
Hu, Y.H., Chen, Y.L.: Mining association rules with multiple minimum supports: anew mining algorithm and a support tuning mechanism. Decision Support Systems 42(1), 1–24 (2006).
Google Scholar
Tsang, S., Koh, Y.S., Dobbie, G.: Rp-tree: rare pattern tree mining. In: Data Warehousing and Knowledge Discovery, pp. 277–288. Springer (2011).
Google Scholar
Bhatt, U., Patel, P.: A novel approach for finding rare items based on multiple minimum support framework. Procedia Computer Science 57, 1088–1095 (2015).
Google Scholar
Chen, M., Gao, X., Li, H.: An efficient parallel fp-growth algorithm. In: Cyber-Enabled Distributed Computing and Knowledge Discovery, 2009. CyberC’09. International Conference on. pp. 283–286. IEEE (2009).
Google Scholar
Leung, C.K.S., Hayduk, Y.: Mining frequent patterns from uncertain data with map reduce for big data analytics. In: International Conference on Database Systems for Advanced Applications. pp. 440–455. Springer (2013).
Google Scholar
Chang, H.Y., Lin, J.C., Cheng, M.L., Huang, S.C.: A novel incremental data mining algorithm based on fp-growth for big data. In: Networking and Network Applications (NaNA), 2016 International Conference on. pp. 375–378. IEEE (2016).
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science & Engineering, Tezpur University, Napaam, Sonitpur, 784028, Assam, India
Anindita Borah & Bhabesh Nath

Authors

Anindita Borah
View author publications
You can also search for this author in PubMed Google Scholar
Bhabesh Nath
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anindita Borah .

Editor information

Editors and Affiliations

College of Engineering, Iowa State University, Ames, Iowa, USA
Arun K. Somani
Department of Information Technology, Manipal University Jaipur, Jaipur, Rajasthan, India
Sumit Srivastava
Department of Information Technology, Manipal University Jaipur, Jaipur, Rajasthan, India
Ankit Mundra
Department of Electronics and Communication Engineering, Manipal University Jaipur, Jaipur, Rajasthan, India
Sanyog Rawat

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Borah, A., Nath, B. (2018). FP-Tree and Its Variants: Towards Solving the Pattern Mining Challenges. In: Somani, A., Srivastava, S., Mundra, A., Rawat, S. (eds) Proceedings of First International Conference on Smart System, Innovations and Computing. Smart Innovation, Systems and Technologies, vol 79. Springer, Singapore. https://doi.org/10.1007/978-981-10-5828-8_51

Download citation

DOI: https://doi.org/10.1007/978-981-10-5828-8_51
Published: 09 January 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-5827-1
Online ISBN: 978-981-10-5828-8
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics