Abstract
The rapid advances in the field of information and communication technology has made the ubiquitous type of computing along with the internet of things extremely popular. Such applications have created the volumes of the data that are available for the analysis as well as the classification which is an aid to the process of decision making. Among the several methods that are used for the purpose of dealing with the big data, feature selection is found to be very effective. One of the common approaches that involve the searching using a subset of features that have been relevant to that of the topic or will represent an accurate description of this dataset. But unfortunately, the searching using this type of a subset is a problem that is combinatorial and may also be quite time consuming. The meta-heuristic algorithms have been commonly used for the purpose of facilitating the choice of features. Artificial fish swarm optimization (AFSO) algorithms will employ the fish swarming behavior to be the means of overcoming the combinatorial problems. The AFSA has now proved to be highly successful in the applications of a diverse nature. The results of the experiment show that this method proposed will achieve better performance than that of the other methods.
Similar content being viewed by others
Change history
01 December 2022
This article has been retracted. Please see the Retraction Notice for more detail: https://doi.org/10.1007/s10586-022-03889-5
References
Kune, R., Konugurthi, P.K., Agarwal, A., Chillarige, R.R., Buyya, R.: The anatomy of big data computing. Softw. Pract. Exp. 46(1), 79–105 (2016)
Tembely, M., Sadiku, M.N., Musa, S.M.: Big data: an introduction for engineers. J. Sci. Eng. Res. 3(2), 106–108 (2016)
Mujawar, S., Kulkarni, S.: Big data: tools and applications. Int. J. Comput. Appl. 115(23), 7–11 (2015)
Cheng, S., Liu, B., Shi, Y., Jin, Y., Li, B.: Evolutionary computation and big data: Key challenges and future directions. In: International Conference on Data Mining and Big Data, Springer International Publishing, pp. 3–14, June 2016
García, S., Luengo, J., Herrera, F.: Data Preprocessing in Data Mining, pp. 59–139. Springer, New York (2015)
Chan, J.O.: An architecture for big data analytics. Commun. IIMA 13(2), 1–14 (2014)
Peralta, D., del Río, S., Ramírez-Gallego, S., Triguero, I., Benitez, J.M., Herrera, F.: Evolutionary feature selection for big data classification: a mapreduce approach. Math. Probl. Eng. 501(246139), 1–11 (2015)
Sutha, K., Tamilselvi, J.J.: A review of feature selection algorithms for data mining techniques. Int. J. Comput. Sci. Eng. 7(6), 63 (2015)
Yu, K., Wu, X., Ding, W., Pei, J.: Towards scalable and accurate online feature selection for big data. In: 2014 IEEE International Conference on Data Mining, pp. 660–669. IEEE, 2014
Catak, F.O.U.: Genetic algorithm based feature selection in high dimensional text dataset classification. In: WSEAS Transactions on Information Sciences and Application, vol. 12, pp. 290–296 (2015)
Fong, S., Wong, R., Vasilakos, A.V.: Accelerated PSO swarm search feature selection for data stream mining big data. IEEE Trans. Serv. Comput. 9(1), 33–45 (2016)
Shivani, H., Vaishali, S.: ACO swarm search feature selection for data stream mining in big data. Int. J. Innov. Res. Comput. Commun. Eng. 3(12), 12087–12089 (2015)
Yang, B., Zhang, T.: A scalable feature selection and model updating approach for big data machine learning. In: IEEE International Conference on Smart Cloud (SmartCloud), pp. 146–151. IEEE, 2016
Vinod, D. F., Vasudevan, V.: A filter based feature set selection approach for big data classification of patient records. In: International Conference on IEEE, pp. 3684–3687, 2016
Zhao, L., Chen, Z., Hu, Y., Min, G., Jiang, Z.: Distributed Feature Selection for Efficient Economic Big Data Analysis. IEEE Transactions on Big Data, pp.1-1 (2016)
Wang, L., Wang, Y., Chang, Q.: Feature selection methods for big data bioinformatics: a survey from the search perspective. Methods 111, 21–31 (2016)
Villar-Rodriguez, E., Gonzalez-Pardo, A., Del Ser, J., Bilbao, M. N., Salcedo-Sanz, S.: A novel adaptive density-based ACO algorithm with minimal encoding redundancy for clustering problems. In: 2016 IEEE Congress on Evolutionary Computation (CEC), pp. 3139–3145, IEEE, 2016
Harde, S., Sahare, V.: ACO swarm search feature selection for data stream mining in big data. Int. J. Innov. Res. Comput. Commun. Eng. 2(12), 12087–12089 (2015)
Sabar, N.R., Abawajy, J., Yearwood, J.: Heterogeneous cooperative co-evolution memetic differential evolution algorithms for big data optimisation problems. IEEE Trans. Evolut. Comput. 21(2), 315–327 (2017)
Gülşen, E., Gündüz, H., Cataltepe, Z., Serinol, L.: Big data feature selection and projection for gender prediction based on user web behaviour. In: 2015 23nd Signal Processing and Communications Applications Conference (SIU), pp. 1545–1548. IEEE, 2015
Saranya, S., Austrina, S. J., Ravikumar, K.: Efficient feature subset selection using Kruskal’s process in big data. Int. J. Inn. Res. Comp. Comm. Eng. (2015). doi:10.15680/ijircce.2015.0304171
Baccarelli, E., Cordeschi, N., Mei, A., Panella, M., Shojafar, M., Stefa, J.: Energy-efficient dynamic traffic offloading and reconfiguration of networked data centers for big data stream mobile computing: review, challenges, and a case study. IEEE Netw. 30(2), 54–61 (2016)
Cordeschi, N., Shojafar, M., Amendola, D., Baccarelli, E.: Energy-saving QoS resource management of virtualized networked data centers for big data stream computing. Emerg. Res. Cloud Distrib. Comput. Syst. 122, 1–31 (2015). doi:10.4018/978-1-4666-8213-9.ch004
Shojafar, M., Javanmardi, S., Abolfazli, S., Cordeschi, N.: FUGE: a joint meta-heuristic approach to cloud job scheduling algorithm using fuzzy theory and a genetic method. Clust. Comput. 18(2), 829–844 (2015)
Neshat, M., Sepidnam, G., Sargolzaei, M., Toosi, A.N.: Artificial fish swarm algorithm: a survey of the state-of-the-art, hybridization, combinatorial and indicative applications. Artif. Intell. Rev 42(4), 965–997 (2014)
Lin, K.C., Chen, S.Y., Hung, J.C.: Feature selection and parameter optimization of support vector machines based on modified artificial fish swarm algorithms. Math. Probl. Eng. 2015, 1–9 (2015)
Wang, G., Dai, D.: Network intrusion detection based on the improved artificial fish swarm algorithm. J. Comput. 8(11), 2990–2996 (2013)
Huang, Z., Chen, Y.: An improved artificial fish swarm algorithm based on hybrid behavior selection. Int. J. Control Autom. 6(5), 103–116 (2013)
Ghosh, P.S.: Parallelization of particle swarm optimization algorithm using Hadoop Mapreduce. Circulation 701, 8888 (2016)
Yang, J.B., Ong, C.J.: An effective feature selection method via mutual information estimation. IEEE Trans. Syst. Man Cybern. Part B 42(6), 1550–1559 (2012). (Cybernetics)
Patel Brijain, R., Rana, K.K.: A survey on decision tree algorithm for classification. Int. J. Eng. Dev. Res. 2(1), 1–5 (2014)
Liu, Y.: Random forest algorithm in big data environment. CMNT 18(12A), 147–51 (2014)
Author information
Authors and Affiliations
Corresponding author
Additional information
This article has been retracted. Please see the retraction notice for more detail: https://doi.org/10.1007/s10586-022-03889-5
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Manikandan, R.P.S., Kalpana, A.M. RETRACTED ARTICLE: Feature selection using fish swarm optimization in big data. Cluster Comput 22 (Suppl 5), 10825–10837 (2019). https://doi.org/10.1007/s10586-017-1182-z
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10586-017-1182-z