Evolutionary Computation, Optimization, and Learning Algorithms for Data Science

Mohammadi, Farid Ghareh; Amini, M. Hadi; Arabnia, Hamid R.

doi:10.1007/978-3-030-34094-0_3

Farid Ghareh Mohammadi¹⁵,
M. Hadi Amini^16,17 &
Hamid R. Arabnia¹⁵

Part of the book series: Advances in Intelligent Systems and Computing ((AISC,volume 1123))

870 Accesses
12 Citations

Abstract

A large number of engineering, science, and computational problems have yet to be solved in a computationally efficient way. One of the emerging challenges is how evolving technologies grow towards autonomy and intelligent decision making. This leads to collection of large amounts of data from various sensing and measurement technologies, e.g., cameras, smart phones, health sensors, smart electricity meters, and environment sensors. Hence, it is imperative to develop efficient algorithms for generation, analysis, classification, and illustration of data. Meanwhile, data is structured purposefully through different representations, such as large-scale networks and graphs. Therefore, data plays a pivotal role in technologies by introducing several challenges: how to present, what to present, why to present. Researchers explored various approaches to implement a comprehensive solution to express their results in every particular domain, such that the solution enhances the performance and minimizes cost, especially time complexity. In this chapter, we focus on data science as a crucial area, specifically focusing on a curse of dimensionality (CoD) which is due to the large amount of generated/sensed/collected data, especially large sets of extracted features for a particular purpose. This motivates researchers to think about optimization and apply nature-inspired algorithms, such as meta-heuristic and evolutionary algorithms (EAs) to solve large-scale optimization problems. Building on the strategies of these algorithms, researchers solve large-scale engineering and computational problems with innovative solutions. Although these algorithms look un-deterministic, they are robust enough to reach an optimal solution. To that end, researchers try to run their algorithms more than usually suggested, around 20 or 30 times, then they compute the mean of result and report only the average of 20/30 runs’ result. This high number of runs becomes necessary because EAs, based on their randomness initialization, converge the best result, which would not be correct if only relying on one specific run. Certainly, researchers do not adopt evolutionary algorithms unless they face a problem which is suffering from placement in local optimal solution, rather than global optimal solution. In this chapter, we first develop a clear and formal definition of the CoD problem, next we focus on feature extraction techniques and categories, then we provide a general overview of meta-heuristic algorithms, its terminology, and desirable properties of evolutionary algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Hardcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

P. Marrow, Nature-inspired computing technology and applications. BT Technol. J. 18(4), 13–23 (2000)
Article Google Scholar
M.H. Amini, Distributed computational methods for control and optimization of power distribution networks, PhD Dissertation, Carnegie Mellon University, 2019
Google Scholar
M.H. Amini, J. Mohammadi, S. Kar, Distributed holistic framework for smart city infrastructures: tale of interdependent electrified transportation network and power grid. IEEE Access 7, 157535–157554 (2019)
Article Google Scholar
M.H. Amini, J. Mohammadi, S. Kar, Distributed intelligent algorithm for interdependent electrified transportation and power networks, Proceedings of the 9th ACM Symposium on Design and Analysis of Intelligent Vehicular Networks and Applications (ACM, New York, 2019)
Google Scholar
A. Imteaj, M.H. Amini, J. Mohammadi, Leveraging decentralized artificial intelligence to enhance resilience of energy networks. arXiv preprint arXiv:1911.07690 (2019)
Google Scholar
M.H. Amini, B. Nabi, M.R. Haghifam, Load management using multi-agent systems in smart distribution network, in 2013 IEEE Power and Energy Society General Meeting (IEEE, New York, 2013)
Google Scholar
M.H. Amini, S. Bahrami, F. Kamyab, S. Mishra, R. Jaddivada, K. Boroojeni, P. Weng, Y. Xu, Decomposition methods for distributed optimal power flow: panorama and case studies of the DC model, in Classical and recent aspects of power system optimization (Academic Press, Cambridge, 2018), pp. 137–155
Google Scholar
A. Imteaj, M.H. Amini, Distributed sensing using smart end-user devices: pathway to federated learning for autonomous IoT, in 2019 International Conference on Computational Science and Computational Intelligence (Las Vegas, 2019)
Google Scholar
F.G. Mohammadi, H.R. Arabnia, ISEA: image steganalysis using evolutionary algorithms. arXiv preprint, arXiv:1907.12914 (2019)
Google Scholar
N. Maheswaranathan, L. Metz, G. Tucker, J. Sohl-Dickstein, Guided evolutionary strategies: escaping the curse of dimensionality in random search. arXiv preprint, arXiv:1806.10230 (2018)
Google Scholar
M.J.L.F. Cruyff, U. Böckenholt, P.G.M. Van Der Heijden, L.E. Frank, A review of regression procedures for randomized response data, including univariate and multivariate logistic regression, the proportional odds model and item response model, and self-protective responses, in Handbook of Statistics, vol. 34 (Elsevier, Amsterdam, 2016), pp. 287–315
MATH Google Scholar
F. Zhuang, X. Cheng, P. Luo, S.J. Pan, Q. He, Supervised representation learning: transfer learning with deep autoencoders, in Twenty-Fourth International Joint Conference on Artificial Intelligence (2015)
Google Scholar
J. Yang, S. Shebalov, D. Klabjan, Semi-supervised learning for discrete choice models, in IEEE Transactions on Intelligent Transportation Systems (2018)
Google Scholar
N. Altman, M. Krzywinski, The curse (s) of dimensionality. Nat. Methods 15, 399–400 (2018)
Article Google Scholar
W. Guo, G. Lynch, J.P. Romano, A new approach for large scale multiple testing with application to FDR control for graphically structured hypotheses. arXiv preprint, arXiv:1812.00258 (2018)
Google Scholar
S. Gupta, S. Bhardwaj, P.K. Bhatia, A reminiscent study of nature inspired computation. Int. J. Adv. Eng. Technol. 1(2), 117 (2011)
Google Scholar
R. Balamurugan, A.M. Natarajan, K. Premalatha, Stellar-mass black hole optimization for biclustering microarray gene expression data. Appl. Artif. Intell. 29(4), 353–381 (2015)
Article Google Scholar
C. Blum, A. Roli, Metaheuristics in combinatorial optimization: overview and conceptual comparison. ACM Comput. Surv. 35(3), 268–308 (2003)
Article Google Scholar
S.F. Razavi, H. Sajedi, SVSA: a semi-vortex search algorithm for solving optimization problems. Int. J. Data Sci. Anal. 8, 1–18 (2018)
Google Scholar
P. Moscato, C. Cotta, An accelerated introduction to memetic algorithms, in Handbook of Metaheuristics (Springer, Cham, 2019), pp. 275–309
Google Scholar
T. Bäck, D.B. Fogel, Z. Michalewicz, Evolutionary Computation 1: Basic Algorithms and Operators (CRC Press, Boca Raton, 2018)
MATH Google Scholar
J. Pierezan, L.D.S. Coelho, Coyote optimization algorithm: a new metaheuristic for global optimization problems, in 2018 IEEE Congress on Evolutionary Computation (CEC) (IEEE, Rio de Janeiro, 2018), pp. 1–8
Google Scholar
F.G. Mohammadi, M.S. Abadeh, Image steganalysis using a bee colony based feature selection algorithm. Eng. Appl. Artif. Intell. 31, 35–43 (2014)
Article Google Scholar
F.G. Mohammadi, M.S. Abadeh, A new metaheuristic feature subset selection approach for image steganalysis. J. Intell. Fuzzy Syst. 27(3), 1445–1455 (2014)
Article Google Scholar
D. Wunsch, R. Nigro, G. Coussement, C. Hirsch, Uncertainty quantification in an engineering design software system, in Uncertainty Management for Robust Industrial Design in Aeronautics (Springer, Cham, 2019), pp. 747–754
Google Scholar
D.L. Barbour, Precision medicine and the cursed dimensions. npj Digit. Med. 2(1), 4 (2019)
Google Scholar
S.N. Karpagam, S. Raghavan, Automated diagnosis system for Alzheimer disease using features selected by artificial bee colony. J. Comput. Theor. Nanosci. 16(2), 682–686 (2019)
Article Google Scholar
W.K. Vong, A.T. Hendrickson, D.J. Navarro, A. Perfors, Do additional features help or hurt category learning? The curse of dimensionality in human learners. Cogn. Sci. 43(3), e12724 (2019)
Article Google Scholar
N.P. Patel, E. Sarraf, M.H. Tsai, The curse of dimensionality. Anesthesiol. J. Am. Soc. Anesthesiol. 129(3), 614–615 (2018)
Google Scholar
M. Oudah, A. Henschel, Taxonomy-aware feature engineering for microbiome classification. BMC Bioinf. 19(1), 227 (2018)
Google Scholar
A. Serani, M. Diez, J. Wackers, M. Visonneau, F. Stern, Stochastic shape optimization via design-space augmented dimensionality reduction and RANS computations, in AIAA SciTech 2019 Forum (2019), pp. 2218
Google Scholar
S.L. Gupta, A.S. Baghel, A. Iqbal, Big data classification using scale-free binary particle swarm optimization, in Harmony Search and Nature Inspired Optimization Algorithms (Springer, Singapore, 2019), pp. 1177–1187
Book Google Scholar
H. Shi, H. Li, D. Zhang, C. Cheng, X. Cao, An efficient feature generation approach based on deep learning and feature selection techniques for traffic classification. Comput. Netw. 132, 81–98 (2018)
Article Google Scholar
U. Khurana, H. Samulowitz, D. Turaga, Feature engineering for predictive modeling using reinforcement learning, in Thirty-Second AAAI Conference on Artificial Intelligence (2018)
Google Scholar
D. Zhang, J. Yin, X. Zhu, C. Zhang, Network representation learning: a survey, in IEEE Transactions on Big Data (2018)
Google Scholar
R. Vanaja, S. Mukherjee, Novel wrapper-based feature selection for efficient clinical decision support system, in International Conference on Intelligent Information Technologies (Springer, Singapore, 2018), pp. 113–129
Google Scholar
E. Hancer, B. Xue, M. Zhang, D. Karaboga, B. Akay, Pareto front feature selection based on artificial bee colony optimization. Inf. Sci. 422, 462–479 (2018)
Article Google Scholar
X.-Y. Liu, Y. Liang, S. Wang, Z.-Y. Yang, H.-S. Ye, A hybrid genetic algorithm with wrapper-embedded approaches for feature selection. IEEE Access 6, 22863–22874 (2018)
Article Google Scholar
V. Rostami, A.S. Khiavi, Particle swarm optimization based feature selection with novel fitness function for image steganalysis, in 2016 Artificial Intelligence and Robotics (IRANOPEN) (IEEE, Qazvin, 2016), pp. 109–114
Google Scholar
S. Jiang, S. Yang, A steady-state and generational evolutionary algorithm for dynamic multiobjective optimization. IEEE Trans. Evol. Comput. 21(1), 65–82 (2016)
Article Google Scholar
M.H. Amini, M.P. Moghaddam, O. Karabasoglu, Simultaneous allocation of electric vehicles parking lots and distributed renewable resources in smart power distribution networks. Sustain. Cities Soc. 28, 332–342 (2017)
Article Google Scholar
L.Y. Zhang, G. Luo, L.N. Lu, Genetic algorithms in resource optimization of construction project. J. Tianjin Univ. (Sci. Technol.) 34(2), 188–192 (2001)
Google Scholar
M.H. Amini, A. Islam, Allocation of electric vehicles’ parking lots in distribution network, in ISGT 2014 (IEEE, Washington, DC, 2014), pp. 1–5
Google Scholar
J.R. Koza, Genetic Programming: On the Programming of Computers by Means of Natural Selection, vol. 1 (MIT Press, Cambridge, 1992)
MATH Google Scholar
I.L.S. Russo, H.S. Bernardino, H.J.C. Barbosa, Knowledge discovery in multiobjective optimization problems in engineering via genetic programming. Expert Syst. Appl. 99, 93–102 (2018)
Article Google Scholar
L.J. Fogel, Artificial intelligence through a simulation of evolution, in Proceedings of the 2nd Cybernetics Science Symposium (1965)
Google Scholar
D. Karaboga, B. Gorkemli, C. Ozturk, N. Karaboga, A comprehensive survey: artificial bee colony (ABC) algorithm and applications. Artif. Intell. Rev. 42(1), 21–57 (2014)
Article Google Scholar
D. Karaboga, An idea based on honey bee swarm for numerical optimization. Technical report, Technical report-tr06, Erciyes University, Engineering Faculty, Computer (2005)
Google Scholar
F. Zabihi, B. Nasiri, A novel history-driven artificial bee colony algorithm for data clustering. Appl. Soft Comput. 71, 226–241 (2018)
Article Google Scholar
Y. Cao, Y. Lu, X. Pan, N. Sun, An improved global best guided artificial bee colony algorithm for continuous optimization problems. Clust. Comput. 22, 3011–3019 (2018)
Article Google Scholar
Y. Xue, J. Jiang, B. Zhao, T. Ma. A self-adaptive artificial bee colony algorithm based on global best for global optimization. Soft Comput. 22, 2935–2952 (2018)
Article Google Scholar
F. Harfouchi, H. Habbi, C. Ozturk, D. Karaboga, Modified multiple search cooperative foraging strategy for improved artificial bee colony optimization with robustness analysis. Soft Comput. 22(19), 6371–6394 (2018)
Article Google Scholar
H. Wang, J.-H. Yi, An improved optimization method based on krill herd and artificial bee colony with information exchange. Memet. Comput. 10(2), 177–198 (2018)
Article Google Scholar
K. Chen, F.-Y. Zhou, X.-F. Yuan, Hybrid particle swarm optimization with spiral-shaped mechanism for feature selection. Expert Syst. Appl. 128, 140–156 (2019)
Article Google Scholar
J. Kennedy, R. Eberhart, Particle swarm optimization, in Proceedings of IEEE International Conference on Neural Networks, vol. 4 (IEEE Press, Perth, 1995), pp. 1942–1948
Google Scholar
J. Kennedy, Particle swarm optimization, in Encyclopedia of Machine Learning (Springer, Boston, 2010), pp. 760–766
Google Scholar
Z.-F. Hao, Z.-G. Wang, H. Huang, A particle swarm optimization algorithm with crossover operator, in 2007 International Conference on Machine Learning and Cybernetics, vol. 2 (IEEE, Hong Kong, 2007), pp. 1036–1040
Google Scholar
Y. Zhang, S. Wang, P. Phillips, G. Ji, Binary PSO with mutation operator for feature selection using decision tree applied to spam detection. Knowl. Based Syst. 64, 22–31 (2014)
Article Google Scholar
A. Agrawal, S. Tripathi, Particle swarm optimization with probabilistic inertia weight, in Harmony Search and Nature Inspired Optimization Algorithms (Springer, Singapore, 2019), pp. 239–248
Book Google Scholar
M.A. Abido, Optimal design of power-system stabilizers using particle swarm optimization. IEEE Trans. Energy Convers. 17(3), 406–413 (2002)
Article Google Scholar
S. Naka, T. Genji, T. Yura, Y. Fukuyama, Practical distribution state estimation using hybrid particle swarm optimization, in 2001 IEEE Power Engineering Society Winter Meeting. Conference Proceedings (Cat. No. 01CH37194), vol. 2 (IEEE, Columbus, 2001), pp. 815–820
Google Scholar
H. Yoshida, K. Kawata, Y. Fukuyama, S. Takayama, Y. Nakanishi, A particle swarm optimization for reactive power and voltage control considering voltage security assessment. IEEE Trans. Power Syst. 15(4), 1232–1239 (2000)
Article Google Scholar
M. Dorigo, G.D. Caro, Ant colony optimization: a new meta-heuristic, in Proceedings of the 1999 Congress on Evolutionary Computation-CEC99 (Cat. No. 99TH8406), vol. 2 (IEEE, Washington, DC, 1999), pp. 1470–1477
Google Scholar
K. Socha, M. Dorigo, Ant colony optimization for continuous domains. Eur. J. Oper. Res. 185(3), 1155–1173 (2008)
Article MathSciNet MATH Google Scholar
M.M. Kabir, M. Shahjahan, K. Murase, A new hybrid ant colony optimization algorithm for feature selection. Expert Syst. Appl. 39(3), 3747–3763 (2012)
Article Google Scholar
S. Mirjalili, S.M. Mirjalili, A. Lewis, Grey wolf optimizer. Adv. Eng. Softw. 69, 46–61 (2014)
Article Google Scholar
X. Meng, Y. Liu, X. Gao, H. Zhang, A new bio-inspired algorithm: chicken swarm optimization, in International Conference in Swarm Intelligence (Springer, Cham, 2014), pp. 86–94
Google Scholar
W. Shi, Y. Guo, S. Yan, Y. Yu, P. Luo, J. Li, Optimizing directional reader antennas deployment in UHF RFID localization system by using a MPCSO algorithm. IEEE Sensors J. 18(12), 5035–5048 (2018)
Article Google Scholar
K. Ahmed, A.E. Hassanien, E. Ezzat, P.-W. Tsai, An adaptive approach for community detection based on chicken swarm optimization algorithm, in International Conference on Genetic and Evolutionary Computing (Springer, Cham, 2016), pp. 281–288
Google Scholar
X.-B. Meng, H.-X. Li, Dempster-Shafer based probabilistic fuzzy logic system for wind speed prediction, in 2017 International Conference on Fuzzy Theory and Its Applications (iFUZZY) (IEEE, Pingtung, 2017), pp. 1–5
Google Scholar
X.-L. Li, An optimizing method based on autonomous animats: fish-swarm algorithm. Syst. Eng. Theory Pract. 22(11), 32–38 (2002)
Google Scholar
Y. Chen, Z. Zeng, J. Lu, Neighborhood rough set reduction with fish swarm algorithm. Soft Comput. 21(23), 6907–6918 (2017)
Article Google Scholar
I. Rahman, J. Mohamad-Saleh, N. Sulaiman, Artificial fish swarm-inspired whale optimization algorithm for solving multimodal benchmark functions, in 10th International Conference on Robotics, Vision, Signal Processing and Power Applications (Springer, Singapore, 2019), pp. 59–65
Google Scholar
F.G. Mohammadi, M.H. Amini, Promises of meta-learning for device-free human sensing: learn to sense, in Proceedings of the 1st ACM International Workshop on Device-Free Human Sensing (DFHS’19) (ACM, New York, 2019), pp. 44–47. https://doi.org/10.1145/3360773.3360884
Google Scholar
F.G. Mohammadi, M.H. Amini, H.R. Arabnia, Applications of nature-inspired algorithms for dimension reduction: enabling efficient data analytics. arXiv preprint, arXiv: 1908.08563 (2019)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Franklin College of Arts and Sciences, University of Georgia, Athens, GA, USA
Farid Ghareh Mohammadi & Hamid R. Arabnia
School of Computing and Information Sciences, Florida International University, Miami, FL, USA
M. Hadi Amini
Sustainability, Optimization, and Learning for InterDependent Networks Laboratory (solid lab), Florida International University, Miami, FL, USA
M. Hadi Amini

Authors

Farid Ghareh Mohammadi
View author publications
You can also search for this author in PubMed Google Scholar
M. Hadi Amini
View author publications
You can also search for this author in PubMed Google Scholar
Hamid R. Arabnia
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to M. Hadi Amini .

Editor information

Editors and Affiliations

School of Computing and Information Sciences, Florida International University, Miami, FL, USA, Sustainability, Optimization, and Learning for InterDependent Networks Laboratory (solid lab), Florida International University, Miami, FL, USA
M. Hadi Amini

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Mohammadi, F.G., Amini, M.H., Arabnia, H.R. (2020). Evolutionary Computation, Optimization, and Learning Algorithms for Data Science. In: Amini, M. (eds) Optimization, Learning, and Control for Interdependent Complex Networks. Advances in Intelligent Systems and Computing, vol 1123. Springer, Cham. https://doi.org/10.1007/978-3-030-34094-0_3

Download citation

DOI: https://doi.org/10.1007/978-3-030-34094-0_3
Published: 23 February 2020
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34093-3
Online ISBN: 978-3-030-34094-0
eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics