Abstract
Appropriate feature selection is an important aspect in the fields of data mining and machine learning. Feature selection reduces data dimensionality and produces simpler classification models that have lower variance. In this paper, we propose a robust feature selection method that produces smaller and yet more effective set of features. The proposed feature selection method leverages association rules to select the effective features for text classification. Our experiment shows that the proposed method outperforms its pears in terms of execution time and classification accuracy.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Cai, J., Luo, J., Wang, S., Yang, S.: Feature selection in machine learning: a new perspective. Neurocomputing 300, 70–79 (2018)
Al Aghbari, Z., Kamel, I., Elbaroni, W.: Energy-efficient distributed wireless sensor network scheme for cluster detection. Int. J. Parallel Emerg. Distrib. Syst. 28(1), 1–28 (2013)
Al Aghbari, Z., Kamel, I., Awad, T.: On clustering large number of data streams. Intell. Data Anal. 16(1), 69–91 (2012)
Hanif, S., Khedr, A.M., Al Aghbari, Z., Agrawal, D.P.: Opportunistically exploiting internet of things for wireless sensor network routing in smart cities. J. Sensor Actuator Netw. 7(4), 46 (2018)
Alkouz, B., Al Aghbari, Z.: SNSJam: Road traffic analysis and prediction by fusing data from multiple social networks. Inf. Process. Manage. 57(1), 102–139 (2020)
Sheydaei, N., Saraee, M., Shahgholian, A.: A novel feature selection method for text classification using association rules and clustering. J. Inf. Sci. 41(1), 3–15 (2015)
Şahin, D.O., Kılıç, E.: Two new feature selection metrics for text classification. Automatika 60(2), 162–171 (2019)
Al Aghbari, Z., Junejo, I.N.: DisCoSet: discovery of contrast sets to reduce dimensionality and improve classification. Int. J. Comput. Intell. Syst. 8(6), 1178–1191 (2015)
Uysal, A.K., Gunal, S.: Text classification using genetic algorithm oriented latent semantic features. Expert Syst. Appl. 41(13), 5938–5947 (2014)
Kim, K., Zang, S.Y.: Trigonometric comparison measure: a feature selection method for text categorization. Data Knowl. Eng. 119, 1–21 (2019)
Lee, J., Yu, I., Park, J., et al.: Memetic feature selection for multilabel text categorization label frequency difference. Inf. Sci. 485, 263–280 (2019)
Labani, M., Moradi, P., Ahmadizar, F., et al.: A novel multivariate filter method for feature selection in text classification problems. Eng. Appl. Artif. Intell. 70, 25–37 (2018)
Webb, G.I.: Discovering significant patterns. J. Mach. Learn. 68, 1–33 (2007)
Song, M., Song, I.Y., Hu, X., Allen, R.B.: Integration of association rules and ontologies for semantic query expansion. Data Knowl. Eng. 63, 63–75 (2007)
Kaoungku, N., Suksut, K., Chanklan, R., Kerdprasop, K., Kerdprasop, N.: Data classification based on feature selection with association rule mining. In: International MultiConference of Engineers and Computer Scientists, Hong Kong (2017)
Xie, J., Wu, J., Qian, Q.: Feature selection algorithm based on association rules mining method (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Aghbari, Z.A., Saeed, M.M. (2022). Leveraging Association Rules in Feature Selection to Classify Text. In: Smys, S., Bestak, R., Palanisamy, R., Kotuliak, I. (eds) Computer Networks and Inventive Communication Technologies . Lecture Notes on Data Engineering and Communications Technologies, vol 75. Springer, Singapore. https://doi.org/10.1007/978-981-16-3728-5_53
Download citation
DOI: https://doi.org/10.1007/978-981-16-3728-5_53
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-3727-8
Online ISBN: 978-981-16-3728-5
eBook Packages: EngineeringEngineering (R0)