Reinforcement learning marine predators algorithm for global optimization

Wang, Jianlan; Wang, Zhendong; Zhu, Donglin; Yang, Shuxin; Wang, Junling; Li, Dahai

doi:10.1007/s10586-024-04381-y

Reinforcement learning marine predators algorithm for global optimization

Published: 17 April 2024

(2024)
Cite this article

Cluster Computing Aims and scope Submit manuscript

Jianlan Wang¹,
Zhendong Wang¹,
Donglin Zhu²,
Shuxin Yang¹,
Junling Wang¹ &
…
Dahai Li¹

42 Accesses
Explore all metrics

Abstract

Given the weak convergence, limited balance capacity, and optimization limitations observed in the Marine Predators Algorithm (MPA), which draws inspiration from the predatory behavior of marine organisms during evolutionary processes, this study introduces a Reinforcement Learning Marine Predators Algorithm (RLMPA). Firstly, based on the predatory characteristics at different stages, we have designed three location update strategies for search agents aimed at creating high-quality candidate solutions from three perspectives. In particular, ranking paired mutually beneficial learning is specifically designed to expand the scope of exploration to generate as many high-quality candidate solutions as possible for future generations. The Gaussian random walk learning is specifically designed to achieve better optimization in the transitional phase by adjusting the step-size control parameters, successfully completing the transition from exploration to local exploitation phase. Additionally, modified somersault foraging strategy is introduced to accelerate local convergence and perform more extensive local exploitation. Secondly, we integrate reinforcement learning into MPA and use Q-learning mechanism to adaptively select location update strategies. Agents fully utilize the collected information to evaluate the next action of the agents, coordinate the exploration phase and exploitation phase, and enhance the global optimization ability. Finally, compared with 10 competitive algorithms, RLMPA achieves better comprehensive performance in global optimization ability, search efficiency and convergence speed on 41 test functions and 5 practical engineering problems. In the Friedman rank sum tests, RLMPA achieves a preferable overall ranking, and has certain ascendant preponderances in solving practical problems with stability, effectiveness and robustness.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Spider wasp optimizer: a novel meta-heuristic optimization algorithm

Article 13 March 2023

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

Article 09 April 2023

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Data availability

All data for this study are available from the corresponding author.

References

Zhou, Y., He, X., Chen, Z., Jiang, S.: A neighborhood regression optimization algorithm for computationally expensive optimization problems. IEEE Trans. Cybern. 52(5), 3018–3031 (2022)
Article Google Scholar
Zhu, D., et al.: Human memory optimization algorithm: a memory-inspired optimizer for global optimization problems. Expert Syst. App. 237, 121597 (2023)
Article Google Scholar
Sun, G., Han, R., Deng, L., Li, C., Yang, G.: Hierarchical structure-based joint operations algorithm for global optimization. Swarm Evol. Comput. 79, 101311 (2023)
Article Google Scholar
Li, C., Sun, G., Deng, L., Qiao, L., Yang, G.: A population state evaluation-based improvement framework for differential evolution. Inf. Sci. 629, 15–38 (2023)
Article Google Scholar
Zhu, D., et al.: A multi-strategy particle swarm algorithm with exponential noise and fitness-distance balance method for low-altitude penetration in secure space. J. Comput. Sci. 74, 102149 (2023)
Article Google Scholar
Zhu, D., Wang, S., Zhou, C., et al.: Manta ray foraging optimization based on mechanics game and progressive learning for multiple optimization problems. Appl. Soft Comput. (2023). https://doi.org/10.1016/j.asoc.2023.110561
Article Google Scholar
Shami, T.M., Mirjalili, S., Al-Eryani, Y., Daoudi, K., Izadi, S., Abualigah, L.: Velocity pausing particle swarm optimization: a novel variant for global optimization. Neural Comput App 35(12), 1–31 (2023)
Google Scholar
Ma, C., Huang, H., Fan, Q., Wei, J., Du, Y., Gao, W.: Grey wolf optimizer based on Aquila exploration method. Expert Syst. App. 205, 117629 (2022)
Article Google Scholar
Houssein, E.H., Oliva, D., Çelik, E., Emam, M.M., Ghoniem, R.M.: Boosted sooty tern optimization algorithm for global optimization and feature selection. Expert Syst App 213, 119015 (2023)
Article Google Scholar
Too, J., Mafarja, M., Mirjalili, S.: Spatial bound whale optimization algorithm: an efficient high-dimensional feature selection approach. Neural Comput App 33, 16229–16250 (2021)
Article Google Scholar
Wang, J., Bei, J., Song, H., Zhang, H., Zhang, P.: A whale optimization algorithm with combined mutation and removing similarity for global optimization and multilevel thresholding image segmentation. Appl. Soft Comput. 137, 110130 (2023)
Article Google Scholar
Elgamal, Z., Md Sabri, A.Q., Tubishat, M., Tbaishat, D., Makhadmeh, S.N., Alomari, O.A.: Improved reptile search optimization algorithm using chaotic map and simulated annealing for feature selection in medical field. IEEE Access 10, 51428–51446 (2022)
Article Google Scholar
Nematollahi, A.F., Rahiminejad, A., Vahidi, B.: A novel meta-heuristic optimization method based on golden ratio in nature. Soft. Comput. 24(2), 1117–1151 (2020)
Article Google Scholar
Li Y, Zhao L, Zhou S. Review of genetic algorithm. Mater. Sci Eng, PTS1–22011, 365–367
Storn, R., Price, K.: Differential evolution—a simple and efficient heuristic for global optimization over continuous spaces. J. Global Optim. 11(4), 341–359 (1997). https://doi.org/10.1023/A:1008202821328
Article MathSciNet Google Scholar
Hwang, C.-R.: Simulated annealing: theory and applications. Acta Appl. Math. 12(1), 108–111 (1988). https://doi.org/10.1007/BF00047572
Article MathSciNet Google Scholar
Saryazdi, S.: GSA: a gravitational search algorithm. Inf. Sci. 179(13), 2232–2248 (2009)
Article Google Scholar
Rao, R.V., Savsani, V.J., Vakharia, D.P.: Teaching-learning-based optimization: an optimization method for continuous non-linear large scale problems (article). Inf. Sci. 183(1), 1–15 (2012)
Article Google Scholar
Kennedy, J., Eberhart, R.: Particle swarm optimization. In: IEEE International Conference on Neural Networks, pp. 1942–1948. IEEE (1995)
Google Scholar
Mirjalili, S., Lewis, A.: Grey wolf optimizer. Adv. Eng. Softw. 69(3), 46–61 (2014)
Article Google Scholar
Mirjalili, S., Lewis, A.: The whale optimization algorithm. Adv. Eng. Softw. 95(5), 51–67 (2016)
Article Google Scholar
Gandomi, A.H., Yang, X.-S., Alavi, A.H.: Cuckoo search algorithm: a metaheuristic approach to solve structural optimization problems. Eng. Comput. 29(1), 17–35 (2013)
Article Google Scholar
Karaboga, D., Akay, B.: A comparative study of artificial bee colony algorithm. Appl. Math. Comput. 214(1), 108–132 (2009)
MathSciNet Google Scholar
Heidari, A.A., Mirjalili, S., Faris, H., Aljarah, I., Mafarja, M., Chen, H.: Harris hawks optimization: algorithm and applications. Fut Gen Comput Syst 97, 849–872 (2019)
Article Google Scholar
Zhao, W., Zhang, Z., Wang, L.: Manta ray foraging optimization: an effective bio-inspired optimizer for engineering applications. Eng App Artif Intell 87, 103300 (2020)
Article Google Scholar
Yang, X.S.: Firefly algorithm, stochastic test functions and design optimization. Int J Bio-Inspired Comput 2(2), 78–84 (2010)
Article Google Scholar
Mirjalili, S., Gandomi, A.H., Mirjalili, S.Z., Saremi, S.: Salp swarm algorithm: a bio-inspired optimizer for engineering design problems. Adv. Eng. Softw. 114, 163–191 (2017)
Article Google Scholar
Faramarzi, A., Heidarinejad, M., Mirjalili, S., Gandomi, A.H.: Marine predators algorithm: a nature-inspired metaheuristic. Expert Syst. Appl. 152, 113377 (2020)
Article Google Scholar
Yousri, D., AbdElaziz, M., Oliva, D., Abraham, A., Alotaibi, M.A., Hossain, M.A.: Fractional-order comprehensive learning marine predators algorithm for global optimization and feature selection. Knowl Based Syst 235, 107603 (2022)
Article Google Scholar
Abd Elaziz, M., Mohammadi, D., Oliva, D., Salimifard, K.: Quantum marine predators algorithm for addressing multilevel image segmentation. Appl. Soft Comput. 110, 107598 (2021)
Article Google Scholar
ZakiDiab, A.A., Tolba, M.A., El-Magd, A.G.A., Zaky, M.M., El-Rifaie, A.M.: Fuel cell parameters estimation via marine predators and political optimizers. IEEE Access 8, 166998–167018 (2020)
Article Google Scholar
Fan, Q., Huang, H., Chen, Q., Yao, L., Yang, K., Huang, D.: A modified self-adaptive marine predators algorithm: framework and engineering applications. Eng. Comput. 38(4), 3269–3294 (2022)
Article Google Scholar
Shaheen, A.M., Elsayed, A.M., Ginidi, A.R., El-Sehiemy, R.A., Alharthi, M.M., Ghoneim, S.S.M.: A novel improved marine predators algorithm for combined heat and power economic dispatch problem. Alexandria Eng J 61(3), 1834–1851 (2022)
Article Google Scholar
Qin, C., Han, B.: A novel hybrid quantum particle swarm optimization with marine predators for engineering design problems. IEEE Access 10, 1 (2022)
Article MathSciNet Google Scholar
Wolpert, D.H., Macready, W.G.: No free lunch theorems for optimization. IEEE Trans. Evol. Comput. 1(1), 67–82 (1997)
Article Google Scholar
Shakya, A.K., Pillai, G., Chakrabarty, S.: Reinforcement learning algorithms: a brief survey. Expert Syst App 231, 120495 (2023)
Article Google Scholar
Song, Y., Wei, L., Yang, Q., Wu, J., Xing, L., Chen, Y.: RL-GA: a reinforcement learning-based genetic algorithm for electromagnetic detection satellite scheduling problem. Swarm Evol. Comput. 77, 101236 (2023)
Article Google Scholar
Zamfirache, I.A., Precup, R.-E., Roman, R.-C., Petriu, E.M.: Reinforcement Learning-based control using Q-learning and gravitational search algorithm with experimental validation on a nonlinear servo system. Inf. Sci. 583, 99–120 (2022)
Article Google Scholar
Wang, B., Feng, K., Wang, X.: Bi-objective scenario-guided swarm intelligent algorithms based on reinforcement learning for robust unrelated parallel machines scheduling with setup times. Swarm Evol. Comput. 80, 101321 (2023)
Article Google Scholar
Li, W., Liang, P., Sun, B., Sun, Y., Huang, Y.: Reinforcement learning-based particle swarm optimization with neighborhood differential mutation strategy. Swarm Evol. Comput. 78, 101274 (2023)
Article Google Scholar
Kumar, S., Yildiz, B.S., Mehta, P., Panagant, N., Sait, S.M., Mirjalili, S., Yildiz, A.R.: Chaotic marine predators algorithm for global optimization of real-world engineering problems. Knowl Based Syst 261, 110192 (2023)
Article Google Scholar
Zhao, S., Wu, Y., Tan, S., Wu, J., Cui, Z., Wang, Y.-G.: QQLMPA: a quasi-opposition learning and Q-learning based marine predators algorithm. Expert Syst App 213, 119246 (2022)
Article Google Scholar
Yousri, D.A., Fathy, A.A., Rezk, H.: A new comprehensive learning marine predator algorithm for extracting the optimal parameters of supercapacitor model. J Energy Storage 42, 103035 (2021)
Article Google Scholar
Shen, B., Khishe, M., Mirjalili, S.: Evolving marine predators algorithm by dynamic foraging strategy for real-world engineering optimization problems. Eng App Artif Intell 123, 106207 (2023)
Article Google Scholar
Han, M., Du, Z., Zhu, H., Li, Y., Yuan, Q., Zhu, H.: Golden-sine dynamic marine predator algorithm for addressing engineering design optimization. Expert Syst App 210, 118460 (2022)
Article Google Scholar
AS Sadiq; AA Dehkordi; S Mirjalili; Q-V Pham. Nonlinear marine predator algorithm: a cost-effective optimizer for fair power allocation in NOMA-VLC-B5G networks. Expert Syst App, 2022, 203.
Oszust, M.: Enhanced marine predators algorithm with local escaping operator for global optimization. Knowl Based Syst 232, 107467 (2021)
Article Google Scholar
Hassan, M.H., Yousri, D., Kamel, S., Rahmann, C.: A modified Marine predators algorithm for solving single- and multi-objective combined economic emission dispatch problems. Comput. Ind. Eng. 164, 107906 (2022)
Article Google Scholar
Wang, M., Li, X., Chen, L., Chen, H.: Medical machine learning based on multiobjective evolutionary algorithm using learning decomposition. Expert Syst App 216, 119450 (2023)
Article Google Scholar
Gibson, S., Issac, B., Zhang, L., Jacob, S.M.: Detecting spam email with machine learning optimized with bio-inspired metaheuristic algorithms. IEEE Access 8, 187914–187932 (2020)
Article Google Scholar
Ma, J., Xia, D., Wang, Y., Niu, X., Jiang, S., Liu, Z., Guo, H.: A comprehensive comparison among metaheuristics (MHs) for geohazard modeling using machine learning: Insights from a case study of landslide displacement prediction. Eng App Artif Intell 114, 105150 (2022)
Article Google Scholar
Esmaeili, H., Bidgoli, B.M., Hakami, V.: CMML: combined metaheuristic-machine learning for adaptable routing in clustered wireless sensor networks. Appl. Soft Comput. 118, 108477 (2022)
Article Google Scholar
Deng, H., Peng, L., Zhang, H., Yang, B., Chen, Z.: Ranking-based biased learning swarm optimizer for large-scale optimization. Inf Sci: Int J 493, 120–137 (2019)
Article MathSciNet Google Scholar
Onay, F.K.: A novel improved chef-based optimization algorithm with Gaussian random walk-based diffusion process for global optimization and engineering problems. Math Comput Simulation 212, 195–223 (2023)
Article MathSciNet Google Scholar
Peng, H., Zeng, Z., Deng, C., Wu, Z.: Multi-strategy serial cuckoo search algorithm for global optimization. Knowl Based Syst 214, 106729 (2021)
Article Google Scholar
Duan, Y., Liu, C., Li, S., Guo, X., Yang, C.: Manta ray foraging and Gaussian mutation-based elephant herding optimization for global optimization. Eng. Comput. 39(2), 1085–1125 (2023)
Article Google Scholar
Liu, J., Wu, Y.: An improved lion swarm optimization algorithm with chaotic mutation strategy and boundary mutation strategy for global optimization. IEEE Access 10, 1 (2022)
Article Google Scholar
Feng, Z.-k, Duan, J.-f, Niu, W.-j, Jiang, Z.-q, Liu, Y.: Enhanced sine cosine algorithm using opposition learning, adaptive evolution and neighborhood search strategies for multivariable parameter optimization problems. Appl. Soft Comput. 119, 108562 (2022)
Article Google Scholar
Ai, C., He, S., Fan, X.: Parameter estimation of fractional-order chaotic power system based on lens imaging learning strategy state transition algorithm. IEEE Access 11, 13724–13737 (2023)
Article Google Scholar
Jiao, K., Chen, J., Xin, B., Li, L.: A reference vector based multiobjective evolutionary algorithm with Q-learning for operator adaptation. Swarm Evol. Comput. 76, 101225 (2023)
Article Google Scholar
Zhang, H., Sun, J., Bäck, T., Zhang, Q., Xu, Z.: Controlling sequential hybrid evolutionary algorithm by Q-learning. IEEE Comput Intell Magaz 18(1), 84–103 (2023)
Article Google Scholar
Hamad, Q.S., Samma, H., Suandi, S.A., Mohamad-Saleh, J.: Q-learning embedded sine cosine algorithm (QLESCA). Expert Syst App 193, 116417 (2022)
Article Google Scholar
Baykasoglu, A., Ozsoydan, F.B.: Adaptive firefly algorithm with chaos for mechanical design optimization problems. Appl. Soft Comput. 36(1), 152–164 (2015)
Article Google Scholar
Kalyanmoy, D.: An efficient constraint handling method for genetic algorithms. Comput. Methods Appl. Mech. Eng. 186, 311–338 (2000)
Article Google Scholar

Download references

Funding

This research was funded by the Natural Science Foundation of China (Grant Nos. 62062037, 61562037, 72261018); the Natural Science Foundation of Jiangxi Province (Grant Nos. 20212BAB202014, 20171BAB202026).

Author information

Authors and Affiliations

School of Information Engineering, Jaingxi University of Science and Technology, Ganzhou, 341000, China
Jianlan Wang, Zhendong Wang, Shuxin Yang, Junling Wang & Dahai Li
College of Mathematics and Computer Science, Zhejiang Normal University, Jinhua, 321004, China
Donglin Zhu

Authors

Jianlan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Zhendong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Donglin Zhu
View author publications
You can also search for this author in PubMed Google Scholar
Shuxin Yang
View author publications
You can also search for this author in PubMed Google Scholar
Junling Wang
View author publications
You can also search for this author in PubMed Google Scholar
Dahai Li
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

JW: Conceptualization, Methodology, Software, Data curation, Writing—Original draft preparation. ZW: Conceptualization, Supervision, Funding acquisition. DZ: Visualization, Investigation. SY: Conceptualization, Methodology. JW: Methodology, supervision. DL: Methodology, supervision.

Corresponding author

Correspondence to Zhendong Wang.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendix

See Figs.

14,

15,

16,

17,

18, and

19; Table

Table 15 The results of 11 algorithms on CEC 2022 (20-dimension)

Full size table

15 and

Table 16 The results of 11 algorithms on CEC 2017 (50-dimension)

Full size table

16.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Wang, J., Wang, Z., Zhu, D. et al. Reinforcement learning marine predators algorithm for global optimization. Cluster Comput (2024). https://doi.org/10.1007/s10586-024-04381-y

Download citation

Received: 27 August 2023
Revised: 23 January 2024
Accepted: 24 February 2024
Published: 17 April 2024
DOI: https://doi.org/10.1007/s10586-024-04381-y

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Reinforcement learning marine predators algorithm for global optimization

Abstract

Access this article

Similar content being viewed by others

Spider wasp optimizer: a novel meta-heuristic optimization algorithm

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

Multi-Agent Reinforcement Learning: A Selective Overview of Theories and Algorithms

Data availability

References

Funding