Association Rules Mining Algorithm Based on Information Gain Ratio Attribute Reduction

Han, Tongtong; Wang, Wenjing; Guo, Min; Ning, Shiyong

doi:10.1007/978-3-030-92632-8_18

Tongtong Han⁷,
Wenjing Wang⁷,
Min Guo⁷ &
…
Shiyong Ning⁸

Part of the book series: Lecture Notes on Data Engineering and Communications Technologies ((LNDECT,volume 107))

Included in the following conference series:

International Conference on Business Intelligence and Information Technology

1103 Accesses

Abstract

In actual association rule mining, data sets collected from enterprises or real life often have some problems, such as a large amount of data missing or data redundancy, which greatly increases the spatial complexity of mining association rules and makes mining efficiency inefficient. Not only that, some actual data set contain hundreds or even more attributes. Not only does it take too long to mine association rules, but there are too many association rules obtained, making it difficult for users to distinguish which is more valuable information in practical applications. It is difficult to apply these data to actual enterprises to get greater benefits. In response to these problems, this paper proposes an association rule algorithm based on the FP-Growth association rule algorithm of information gain ratio attribute reduction to extract more valuable information and improve the efficiency of association rule mining. Finally, through experiments and comparisons, it is verified that the algorithm proposed in this paper can effectively mine the association rule information of multi-attribute data sets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 169.00; Price excludes VAT (USA)

Softcover Book: USD 219.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Rakesh, A., Tomasz, I., Arun, S.: Mining association rules between sets of items in large databases. Manage. Data 22(2), 207–216 (1993)
Google Scholar
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, 16–18 May (2000)
Google Scholar
Pawlak, Z.: Theoretical Aspects of Reasoning about Data. Kluwer Academic Publishers, Dordrecht (1991)
MATH Google Scholar
Pawlak, Z.: A rough set perspective. Comput. Intell. 11, 227–232 (1995)
Article MathSciNet Google Scholar
Pawlak, Z.: Rough set approach to knowledge-based decision support. Eur. J. Oper. Res. 99, 48–57 (1997)
Article Google Scholar
Pawlak, Z.: Rough set theory and its applications to data analysis. Cybernet. Syst. Int. J. 29, 661–688 (1998)
Article Google Scholar
Xun, J., Xu, L., Qi, L.: Association rules mining algorithm based on rough set. In: Proceedings 2012 IEEE International Symposium on Information Technology in Medicine and Education, pp. 361−364 (2012)
Google Scholar
Ma, J., Ge, Y., Pu, H.: Survey of attribute reduction methods. Data Anal. Knowl. Discovery 4(1), 40–50 (2020)
Google Scholar
Dai, J., Xu, Q.: Attribute selection based on information gain ratio in fuzzy rough set theory with application to tumor classification. Appl. Soft Comput. J. 13(1), 211–221 (2013)
Article Google Scholar
Pawlak, Z.: Rough sets. Int. J. Comput. Inf. Sci. 11(5), 341–356 (1982)
Article Google Scholar
Liu, D., Li, T., Miao, D.: Three-Way Decision and Granular Computing. Science Press, Beijing (2013)
Google Scholar
Hu, Q., Yu, D., Xie, Z., Liu, J.: Fuzzy probabilistic approximation spaces and their information measures. IEEE Trans. Fuzzy Syst. 14, 191–201 (2006)
Article Google Scholar
Lee, T.: An information-theoretic analysis of relational databases-part I: data dependencies and information metric. IEEE Trans. Softw. Eng. 13, 1049–1061 (1987)
Article Google Scholar
Miao, D., Hu, G.: A heuristic algorithm for knowledge reduction. Comput. Res. Dev. 36, 681–684 (1999)
Google Scholar
Jia, P., Dai, J., Pan, Y., Zhu, M.: Novel algorithm for attribute reduction based on mutual-information gain ratio. J. Zhejiang Univ. (Engineering Edition) 40, 1041–1044 (2005)
MATH Google Scholar
Zhang, Q., Yang, J., Yao, L.: Attribute reduction based on rough approximation set in algebra and information views. IEEE Access 4, 5399–5407 (2016)
Article Google Scholar
Wang, Q., Xie, F., Zhao, M.: Analysis of cross-selling model of digital tv package based on data mining. Radio Telev. Inf. 12, 57–59 (2016)
Google Scholar
Yue, S.: Research on Association Rule Mining Algorithm Based on Frequent Pattern Tree. Tianjin Polytechnic University, Tianjin (2019)
Google Scholar
Ranjith, K., Yang, Z., Caytiles, R., Iyengar, N.: Comparative analysis of association rule mining algorithms for the distributed data. Int. J. Adv. Sci. Technol. 102, 49–60 (2017)
Article Google Scholar
Ma, R., Wu, H.: Research and application of association rules mining based on FP_growth algorithm. Jo. Taiyuan Normal Univ. (Natural Science Edition) 20(01), 19–22 (2021)
Google Scholar
Yang, Z., Geng, X.: Research on mining association rules based on multi-granularity attribute reduction. Comput. Eng. Appl. 55(6), 133–139 (2019)
Google Scholar
Nick, S., Wolberg, W., Mangasarian, O.: Nuclear feature extraction for breast tumor diagnosis. In: Biomedical Image Processing and Biomedical Visualization, International Society for Optics and Photonics, vol. 1905 (1993)
Google Scholar
Moro, S., Cortez, P., Rita, P.: A data-driven approach to predict the success of bank telemarketing. Decis. Support Syst. Elsevier 62, 22–31 (2014)
Article Google Scholar
Lin, T.Y., Yin, P.: Heuristically fast finding of the shortest reducts. In: Tsumoto, S., Słowiński, R., Komorowski, J., Grzymała-Busse, J.W. (eds.) RSCTC 2004. LNCS (LNAI), vol. 3066, pp. 465–470. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-25929-9_55
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Harbin University of Commerce, Harbin, 150028, China
Tongtong Han, Wenjing Wang & Min Guo
Heilongjiang Provincial Key Laboratory of Electronic Commerce and Information Processing, Harbin, 150028, China
Shiyong Ning

Authors

Tongtong Han
View author publications
You can also search for this author in PubMed Google Scholar
Wenjing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Min Guo
View author publications
You can also search for this author in PubMed Google Scholar
Shiyong Ning
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Shiyong Ning .

Editor information

Editors and Affiliations

Information Technology Department, Cairo University, Giza, Egypt
Aboul Ella Hassanien
School of Computer and Information Engineering, Harbin University of Commerce, Harbin, Heilongjiang, China
Yaoqun Xu
School of Computer and Information Engineering, Harbin University of Commerce, Harbin, Heilongjiang, China
Zhijie Zhao
Department of Computer Science, Lakehead University, Thunder Bay, ON, Canada
Sabah Mohammed
School of Computer and Information Engineering, Harbin University of Commerce, Harbin, Heilongjiang, China
Zhipeng Fan

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Han, T., Wang, W., Guo, M., Ning, S. (2022). Association Rules Mining Algorithm Based on Information Gain Ratio Attribute Reduction. In: Hassanien, A.E., Xu, Y., Zhao, Z., Mohammed, S., Fan, Z. (eds) Business Intelligence and Information Technology. BIIT 2021. Lecture Notes on Data Engineering and Communications Technologies, vol 107. Springer, Cham. https://doi.org/10.1007/978-3-030-92632-8_18

Download citation

DOI: https://doi.org/10.1007/978-3-030-92632-8_18
Published: 16 December 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92631-1
Online ISBN: 978-3-030-92632-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics