Abstract
Data mining and machine learning fields are facing with a great challenge of mass data with high dimensionality. Feature selection can contribute a lot to address this issue with the concept of reducing the number of features by eliminating the redundant and irrelevant ones while preserving the information of original features maximally. This paper analyzes and compares two common feature selection methods, then puts forward a novel method for feature selection based on information gain and BP neural network (IGBP). The experimental result shows that IGBP method can reduce the time cost and improve the accuracy of the model at the meantime. The scientificity and superiority of IGBP are demonstrated in this paper, making it an efficient approach to deal with high-dimensional data.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Agarwal S. Data mining: data mining concepts and techniques. In: 2013 international conference on machine intelligence and research advancement (ICMIRA). IEEE; 2013. p. 203–7.
Almasoud AM, Al-Khalifa HS, Al-Salman A. Recent developments in data mining applications and techniques. In: 2015 tenth international conference on digital information management (ICDIM). IEEE; 2015. p. 36–42.
Liu H, Motoda H. Feature selection for knowledge discovery and data mining. Springer Science & Business Media; 2012.
Guyon I, Elisseeff A. An introduction to variable and feature selection. J Mach Learn Res. 2003;3(March):1157–82.
González A, Pérez R. Selection of relevant features in a fuzzy genetic learning algorithm. IEEE Trans Syst Man Cybern Part B (Cybern). 2001;31(3):417–25.
Yu L, Liu H. Efficient feature selection via analysis of relevance and redundancy. J Mach Learn Res. 2004;5(October):1205–24.
Sheikhpour R, Sarram MA, Gharaghani S, et al. A survey on semi-supervised feature selection methods. Pattern Recogn. 2017;64:141–58.
Kohavi R, John GH. Wrappers for feature subset selection. Artif Intell. 1997;97(1–2):273–324.
Wu G, Xu J. Optimized approach of feature selection based on information gain. In: 2015 international conference on computer science and mechanical automation (CSMA). IEEE; 2015. p. 157–61.
Najah A, El-Shafie A, Karim OA, et al. Application of artificial neural networks for water quality prediction. Neural Comput Appl. 2013;22(1):187–201.
Ennett CM, Frize M, Walker CR. Influence of missing values on artificial neural network performance. Stud Health Technol Inform. 2001;1:449–53.
Ding S, Li H, Su C, et al. Evolutionary artificial neural networks: a review. Artif Intell Rev. 2013;1–10.
Acknowledgements
This work was supported by The National Key Technology R&D Program of China (2015BAK36B04).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Wang, X., Zuo, M., Song, L. (2018). A Feature Selection Method Based on Information Gain and BP Neural Network. In: Jia, Y., Du, J., Zhang, W. (eds) Proceedings of 2017 Chinese Intelligent Systems Conference. CISC 2017. Lecture Notes in Electrical Engineering, vol 459. Springer, Singapore. https://doi.org/10.1007/978-981-10-6496-8_3
Download citation
DOI: https://doi.org/10.1007/978-981-10-6496-8_3
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6495-1
Online ISBN: 978-981-10-6496-8
eBook Packages: EngineeringEngineering (R0)