Abstract
In actual association rule mining, data sets collected from enterprises or real life often have some problems, such as a large amount of data missing or data redundancy, which greatly increases the spatial complexity of mining association rules and makes mining efficiency inefficient. Not only that, some actual data set contain hundreds or even more attributes. Not only does it take too long to mine association rules, but there are too many association rules obtained, making it difficult for users to distinguish which is more valuable information in practical applications. It is difficult to apply these data to actual enterprises to get greater benefits. In response to these problems, this paper proposes an association rule algorithm based on the FP-Growth association rule algorithm of information gain ratio attribute reduction to extract more valuable information and improve the efficiency of association rule mining. Finally, through experiments and comparisons, it is verified that the algorithm proposed in this paper can effectively mine the association rule information of multi-attribute data sets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Rakesh, A., Tomasz, I., Arun, S.: Mining association rules between sets of items in large databases. Manage. Data 22(2), 207–216 (1993)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 16–18 May (2000)
Pawlak, Z.: Theoretical Aspects of Reasoning about Data. Kluwer Academic Publishers, Dordrecht (1991)
Pawlak, Z.: A rough set perspective. Comput. Intell. 11, 227–232 (1995)
Pawlak, Z.: Rough set approach to knowledge-based decision support. Eur. J. Oper. Res. 99, 48–57 (1997)
Pawlak, Z.: Rough set theory and its applications to data analysis. Cybernet. Syst. Int. J. 29, 661–688 (1998)
Xun, J., Xu, L., Qi, L.: Association rules mining algorithm based on rough set. In: Proceedings 2012 IEEE International Symposium on Information Technology in Medicine and Education, pp. 361−364 (2012)
Ma, J., Ge, Y., Pu, H.: Survey of attribute reduction methods. Data Anal. Knowl. Discovery 4(1), 40–50 (2020)
Dai, J., Xu, Q.: Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification. Appl. Soft Comput. J. 13(1), 211–221 (2013)
Pawlak, Z.: Rough sets. Int. J. Comput. Inf. Sci. 11(5), 341–356 (1982)
Liu, D., Li, T., Miao, D.: Three-Way Decision and Granular Computing. Science Press, Beijing (2013)
Hu, Q., Yu, D., Xie, Z., Liu, J.: Fuzzy probabilistic approximation spaces and their information measures. IEEE Trans. Fuzzy Syst. 14, 191–201 (2006)
Lee, T.: An information-theoretic analysis of relational databases-part I: data dependencies and information metric. IEEE Trans. Softw. Eng. 13, 1049–1061 (1987)
Miao, D., Hu, G.: A heuristic algorithm for knowledge reduction. Comput. Res. Dev. 36, 681–684 (1999)
Jia, P., Dai, J., Pan, Y., Zhu, M.: Novel algorithm for attribute reduction based on mutual-information gain ratio. J. Zhejiang Univ. (Engineering Edition) 40, 1041–1044 (2005)
Zhang, Q., Yang, J., Yao, L.: Attribute reduction based on rough approximation set in algebra and information views. IEEE Access 4, 5399–5407 (2016)
Wang, Q., Xie, F., Zhao, M.: Analysis of cross-selling model of digital tv package based on data mining. Radio Telev. Inf. 12, 57–59 (2016)
Yue, S.: Research on Association Rule Mining Algorithm Based on Frequent Pattern Tree. Tianjin Polytechnic University, Tianjin (2019)
Ranjith, K., Yang, Z., Caytiles, R., Iyengar, N.: Comparative analysis of association rule mining algorithms for the distributed data. Int. J. Adv. Sci. Technol. 102, 49–60 (2017)
Ma, R., Wu, H.: Research and application of association rules mining based on FP_growth algorithm. Jo. Taiyuan Normal Univ. (Natural Science Edition) 20(01), 19–22 (2021)
Yang, Z., Geng, X.: Research on mining association rules based on multi-granularity attribute reduction. Comput. Eng. Appl. 55(6), 133–139 (2019)
Nick, S., Wolberg, W., Mangasarian, O.: Nuclear feature extraction for breast tumor diagnosis. In: Biomedical Image Processing and Biomedical Visualization, International Society for Optics and Photonics, vol. 1905 (1993)
Moro, S., Cortez, P., Rita, P.: A data-driven approach to predict the success of bank telemarketing. Decis. Support Syst. Elsevier 62, 22–31 (2014)
Lin, T.Y., Yin, P.: Heuristically fast finding of the shortest reducts. In: Tsumoto, S., Słowiński, R., Komorowski, J., Grzymała-Busse, J.W. (eds.) RSCTC 2004. LNCS (LNAI), vol. 3066, pp. 465–470. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-25929-9_55
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Han, T., Wang, W., Guo, M., Ning, S. (2022). Association Rules Mining Algorithm Based on Information Gain Ratio Attribute Reduction. In: Hassanien, A.E., Xu, Y., Zhao, Z., Mohammed, S., Fan, Z. (eds) Business Intelligence and Information Technology. BIIT 2021. Lecture Notes on Data Engineering and Communications Technologies, vol 107. Springer, Cham. https://doi.org/10.1007/978-3-030-92632-8_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-92632-8_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92631-1
Online ISBN: 978-3-030-92632-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)