A competitive learning-based Grey wolf Optimizer for engineering problems and its application to multi-layer perceptron training

Aala Kalananda, Vamsi Krishna Reddy; Komanapalli, Venkata Lakshmi Narayana

doi:10.1007/s11042-023-15146-x

A competitive learning-based Grey wolf Optimizer for engineering problems and its application to multi-layer perceptron training

Published: 22 March 2023

Volume 82, pages 40209–40267, (2023)
Cite this article

Download PDF

Multimedia Tools and Applications Aims and scope Submit manuscript

A competitive learning-based Grey wolf Optimizer for engineering problems and its application to multi-layer perceptron training

Download PDF

Aala Kalananda Vamsi Krishna Reddy¹ &
Komanapalli Venkata Lakshmi Narayana¹

1472 Accesses
4 Citations
Explore all metrics

Abstract

This article presents a competitive learning-based Grey Wolf Optimizer (Clb-GWO) formulated through the introduction of competitive learning strategies to achieve a better trade-off between exploration and exploitation while promoting population diversity through the design of difference vectors. The proposed method integrates population sub-division into majority groups and minority groups with a dual search system arranged in a selective complementary manner. The proposed Clb-GWO is tested and validated through the recent CEC2020 and CEC2019 benchmarking suites followed by the optimal training of multi-layer perceptron’s (MLPs) with five classification datasets and three function approximation datasets. Clb-GWO is compared against the standard version of GWO, five of its latest variants and two modern meta-heuristics. The benchmarking results and the MLP training results demonstrate the robustness of Clb-GWO. The proposed method performed competitively compared to all its competitors with statistically significant performance for the benchmarking tests. The performance of Clb-GWO the classification datasets and the function approximation datasets was excellent with lower error rates and least standard deviation rates.

Dung beetle optimizer: a new meta-heuristic algorithm for global optimization

Article 27 November 2022

Learning to optimize: A tutorial for continuous and mixed-integer optimization

Article 08 May 2024

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

Article 09 April 2023

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

1.1 Introduction to meta-heuristics

Optimization through meta-heuristics has emerged as a prominent trend for problem-solving and systematic resource management in multi-disciplinary research and real-world scenarios with numerous applications. Optimization has been adopted and encouraged by researchers and experts apprehensive of its simplicity and efficacy in solving complex problems with a greater degree of success [1, 10, 11, 13, 36, 45]. The development of Genetic algorithm (GA) [38], Particle swarm optimization (PSO) [74] marks a watershed in the history of optimization with a myriad of other techniques to follow soon. The optimization techniques iterate sequentially to determine the best optimal solutions concerning the objective function as it explores a complex labyrinth of peaks and valleys known as the “search-landscape/search space”. With a fraction of the knowledge required to determine the solutions while considering the ambivalent state of the problems, optimization is a promising aberration and has been at the forefront of the many-sided research avenues that continue to thrive toward the perfection of the existing and forthcoming systems [45].

1.2 A review of meta-heuristics in multimedia applications and artificial intelligence

Literature in recent times [11, 23, 71] provides an outlook on the widespread applications of meta-heuristic-based stochastic optimizers in multimedia tools and artificial intelligence (AI). The adaption of meta-heuristic solvers coupled with IoT [58] in big-data data analytics [52], block-chain [11], video games [10], artificial intelligence [24], feature selection [15], machine learning [14, 53], and deep learning has gained immense popularity on account of its simplicity, accuracy in training and testing and robustness to entrapment [7, 58]. Figure 1 provides a classification of various areas within the realm of multimedia and AI adopting meta-heuristic algorithms.

Moreover, multiple other domains including medicine and health sectors have been increasingly relying on the utilization of AI and multimedia tools to improve their accuracy and efficiency while working with large datasets. A few examples from the literature lately include the development of a multi-feature fusion convolution neural network (CNN) framework to represent the complex morphology and gene expression patterns [68], fall prediction based on key points of human bones through bone map dataset and CNN to prevent the damage to the elderly [70] etc. Computing systems for image and vision integrate AI to enhance the detection capabilities and these include a neural network-based edge-oriented framework for saliency detection enhancement for complex images [69], Deep Convolutional Generative Adversarial Networks (DCGAN) with TensorFlow deep learning framework for virtual face generation [31] etc. The financial engineering domain with AI-based prediction has been of increasing concern as they help to track and predict the trends of various markets and effectively design strategic products and services to maximize profits. For example, the research at [67] developed a novel mobile personalized recommended method based on the money flow model for the stock exchange to provide investors with reliable practical investment guidance and receive more returns. Machine learning strategies based on two-dimensional numerical models in financial engineering [66] to reduce the prediction error and improve forecasting precision for major U.S. stock market index fall under the same category.

The efficacy of meta-heuristics in multimedia tools and AI has been well-researched and documented in the literature. Compared to the traditional solvers such as the gradient-descent method or back-propagation method where the initial solutions play a crucial role in the outcome of optimization, the stochasticity of meta-heuristics forms its major strength in exploring the various possibilities of solution combinations with very little to no dependence on the initial guess. Examples include (i) a continuation approach for training Artificial Neural Networks (ANNs) with meta-heuristics by J. R-Delado et al. in [55] including Particle Swarm Optimization (PSO), Firefly Algorithm (FA) and Cuckoo Search. The execution times were lowered by about 5–30% without statistically significant loss of accuracy for the public benchmark datasets and this was achieved by the accelerated convergence of the meta-heuristics. (ii) Automated fine-tuning compiler heuristics through meta-optimization and machine learning to reduce compiler design complexity and tedium of heuristic tuning is implemented in [61]. The resulting framework improved the average speedup of the heuristic compilation by 23% with an average performance improvement of 25% on the training set and 9% on the test set. (iii) In other works, ANN training integrating a hybrid meta-heuristic combining the exploration and exploitation capabilities of Invasive Weed Optimization and Differential Evolution (DE) was realized in [46]. Benchmarked against the 5 and 10-layered multi-layer perceptron training, the hybrid algorithm lowered the training and testing errors by about 5 to 10% for the three different datasets with a faster rate of convergence. (iv) In a similar development, S. Benabderrahmane in 2017 [5], combined machine learning and swarm intelligence for real-time object detection and tracking to accelerate time processing and enhance the extraction efficiency of the classifier. Experimenting with genetic algorithms (GA), particle swarm optimization (PSO), random walk and a novel hybrid combination of these methods, significant improvements were observed in computation time, efficiency and accuracy. (v) Interactive software design incorporating meta-heuristic algorithms for search engines with user-provided evaluation and rating systems to develop Interactive Evolutionary Algorithms (IEAs) is implemented in [59]. A comparative analysis between greedy local search, an evolutionary algorithm and ant colony optimization (ACO) showed that ACO-based interactive search outperformed the latter for the software design problem.

Examples of integration of meta-heuristics for the optimization of multimedia tools include (i) An optimized network flow wavelet-based image coding for multipath selection to maximize the received multiple description coding (MDCs) in a lossy network model in [28]. The multi-objective optimization problem was tackled using GA and PSO-based simulations for various random network topologies. PSO delivered the most optimal routings with reduced packet loss and increased throughput. (ii) In other works, a video watermarking scheme for anti-piracy protection incorporates Squirrel Search Algorithm (SSA) with constraints on video quality and other thresholds [6]. The embedded watermarking scheme utilized the frame selection method on five different videos and the proposed SSA-based framework recorded a Peak Signal to Noise ratio (PSNR) of 71.06 dB outperforming eight methods from the literature. (iii) In similar developments, phishing website detection integrating a support vector machine and an Improved spotted hyena optimization (ISHO) algorithm is proposed to select proper features for classifying phishing websites [56]. Compared to PSO, FA and bat algorithm-based SVM classifiers, the ISHO-based SVM achieved higher classification accuracy compared to others.

1.3 Challenges associated with meta-heuristic algorithms

Although the efficacy of the swarm-based nature-inspired optimization algorithms is imputed to its multifarious search mechanisms, the same mechanisms are prone to a myriad of problems and complications that require addressal for these algorithms to maximize their potential. The search mechanisms devised to mimic the processes in nature, which account for the core of any nature-inspired meta-heuristic, may not be competent at optimizing every class of optimization problem. Even though they are meticulously crafted, the crucial and conflicting case of exploration and exploitation has been eluding researchers and motivating them towards the realization of a near-perfect optimization strategy. Grounding on this, lately, there has been a gold rush for the development of better/improved variants of nature-inspired algorithms best suited to the search landscape of the problem being dealt with. Simultaneously, there is an upsurge in the publications relating to the development of better/improved variants tackling the following aspects.

The deterioration in the performance (stagnation of fitness or sluggish convergence characteristics) of an optimization technique with the increasing number of problem dimensions is more often than not ascribed as “the curse of dimensionality”, coined by Richard E. Bellman. The manifold reason is that there could be several possibilities of every decision variable for each combination of values and the fitness of all such possibilities are to be computed within a present number of function evaluations resulting in solutions very far from the global optimum.
The swarm intelligent optimization algorithms are inherent to the conflicting case of the balancing of exploration (global search) and exploitation (local search). An ideal trade-off between exploration (diversification) and exploitation (intensification) is needed such that the algorithm is capable of understanding the condition to explore further or improve the existing solutions [2, 23].
Another issue managed through the improvement of meta-heuristics is the near-perfect coordination of the tuning criterion (otherwise called “algorithm-specific parameters”) and, notably, the requirement to tune several of such parameters to extract the best possible performance is often a tedious and time-consuming one and improper or inappropriate tuning has often been the major reason for the algorithms’ failure.
Algorithms excelling at unconstrained cases may not perform equally well for a constrained problem and similarly, algorithms with quick convergent characteristics may not deliver the best optimality compared to others. Furthermore, algorithms designed to explore efficiently over complex landscapes may not be efficient at local search and vice-versa. This is commonly alluded to as the “No free lunch theory” [64], which expresses that the perfect optimization algorithm is not practically realizable and no meta-heuristics can deliver the best performance for every optimization task.

To address the aforementioned issues and extract the best performance for the chosen optimization problem, several improved/upgraded meta-heuristics have been proposed and have gained significance owing to their superior performance in terms of optimality, consistency and robustness [8]. The improvement of any meta-heuristic algorithm for the specific application is predominantly done through the introduction and empirical establishment of special techniques/operators that advance the exploration to newer areas within the search space while at the same time balancing the exploitation/local search. This procedure of altering a meta-heuristic to upgrade its existing abilities for better and more robust performance is known as “improving” or “enhancing” or “modifying” [9]. Researchers frequently turn towards improvisation and improving the existing meta-heuristics to guarantee that a near-perfect compromise between the exploration and exploitation is achieved to a good extent such that the need for the tedious tuning process is reduced through adaptive techniques that are capable of dynamically adapting to the search landscape and also work towards restructuring them to eliminate the algorithms’ weaknesses in dealing with complex optimization tasks. This is represented in Fig. 2.

1.4 Contributions of the proposed work

In this work, Grey Wolf Optimizer (GWO) has been studied extensively with its merits and demerits have been analysed and an improved version is realised to overcome the various shortcomings associated with it. The proposed algorithm has been named Competitive-learning GWO (Clb-GWO) and it employs four major modifications.

1.
Dual search mechanisms with new techniques for global and local search have been devised and arranged in a selective complementary fashion.
2.
Population sub-grouping into major and minor groups is considered to dedicate sections of the population to learn and adapt with respect to the problem landscape.
3.
Novel difference vectors are designed to promote population diversity and prevent population stagnation.
4.
Non-linear hunting and competitive learning strategies are formulated and integrated systemically with adaptive mechanisms to achieve equilibrium between exploration and exploitation.

The underlying reasons for the choice of GWO over the other optimization paradigms are as follows. (1) GWO is one of the most successful state-of-the-art optimization techniques with impeccable performance in multi-disciplinary applications and stands unabated with incredible competence outperforming other paradigms as outlined in the literature survey. (2) The simple structure of GWO is easier to be realized in any programming language of choice and can be deployed to various optimization problems in accordance with the researchers’ interests. (3) There exists a plethora of publications wherein the performance of GWO has been greatly improved through either application-specific enhancements or hybridization indicating a greater scope of its re-usability for a potentially robust variant of GWO for the many-sided research avenues. (4) The tuning of GWO has been experimented with quite often to improve the accuracy, population diversity, lower its susceptibility to “the curse of dimensionality”, overthrow local entrapment etc. There is always room for improvement considering the applicability of the variant aimed at, e.g., complex constrained optimization problems with a higher dimensional count could require additional modifications to the algorithmic structure and dynamic tuning resulting in a greater search gradation. (5) The selection and population updating strategies and their coherence to the performance of the algorithms, and the outcome of the optimization have been reviewed and analysed leading to a multitude of variants that exploit various population selection and updating techniques to leverage the algorithm’s full potential.

1.4.1 Highlights of the current work

The highlights of the proposed work are outlined as follows.

A novel version of GWO immune to the curse of dimensionality and premature convergence is designed through multi-population and adaptive learning mechanisms
Extensive testing through the latest benchmarking (2020 and 2019) suites is carried out to determine the suitability and effectiveness of the proposed algorithm while proving an updated overview of GWO’s performance for the latest benchmarking standards.
Comprehensive comparisons are made with the recent and advanced variants of GWO with have been evaluated for older benchmarking (2005) suites so far. This study compares multiple aspects of the variants of GWO to establish their performance standards for the latest in benchmarking.
The validation of the proposed method’s performance for complex real-world problems in multimedia tools and artificial intelligence is established through MLP training (5 classification datasets and 3 function approximation datasets).

The remainder of this article is organized as follows. Section 2 focuses on the working of GWO followed by a discussion of its merits and demerits. Section 3 discusses the formulation of the competitive learning-based GWO technique with a detailed description of its various attributes. The performance of Clb-GWO with ten different meta-heuristics (including five variants of GWO, two modern meta-heuristics and two state-of-the-art advanced meta-heuristics) is analysed in Section 4 through CEC2020 and CEC2019 benchmarking suites. Additionally, Section 4 analyses the sensitivity of the various tuning parameters on the outcome of optimization and the effect of population size, and number of iterations on the exploration and exploitation. Section 5 analyses the performance of the proposed method with real-world complex optimization tasks (MLP training for five classification datasets and three function approximation datasets). The conclusion, merits and demerits of Clb-GWO potential applications and the future scope of the current work are given in Section 5.

2 Grey wolf Optimizer

Grey Wolf Optimizer, referred to as GWO, is a swarm-based, nature-inspired meta-heuristic optimization algorithm based on the leadership hierarchy and hunting mechanism of grey wolves (Canis lupus). Developed in 2014 by Seyedali Mirjalili, Seyed Mohammad Mirjalili and Andrew Lewis, GWO has risen to become one of the prominent state-of-the-art optimizers. [42]. GWO is unique with its excellently crafted social hierarchical system as it groups the grey wolves into alpha, beta, delta, and omega and explores and exploits the search space. The tuning requisites of GWO constitute the basic specification of the population size and iteration count and an optional control vector. The balance of the exploration and exploitation is achieved through a linearly decreasing nature of the control vector which is set to decrement from 2 to 0 over the course of iterations. The simplicity of its algorithmic structure and its outstanding performance towards optimization of both unconstrained and constrained with good convergence properties has attracted researchers and practitioners from various fields to opt for it. Computer Science, Machine Learning and Artificial Intelligence, Engineering, Mathematics, Energy, Materials Science, Physics and Astronomy, etc., are some of the applications of GWO across various disciplines.

2.1 Working of GWO

To understand the working of GWO, it is essential to gain insight into how the social hierarchy of wolves is considered in mathematical modelling. GWO considers the alpha wolves (male/female) as the leader (the dominant wolves) as they dictate the functioning of the group and are predominantly responsible for decision-making and managing the group. The second-order consists of the beta wolves which are the subordinates and the advisors and also command the other lower order of wolves. The third in the line-up is the omega wolves which form the lowest ranking group and often assume the roles of a scapegoat or a babysitter. Additionally, the delta wolves which don’t identify themselves as alpha, beta or omega are the Scouts, sentinels, elders, hunters, and caretakers in the group. The delta wolves dominate the omegas but obey the betas and alphas forming an intermediate between the beta wolves and delta wolves. The collective foraging activity based on the social hierarchy forms the core of GWO. Figure 3 depicts the social dominance based hierarchical system of the grey wolves.

In GWO, the best solution is considered as the alpha, the second-best solution is beta and the third-best solution is delta respectively. The latter of the population is considered the omegas. A comprehensive description of the various aspects of the mathematical modelling of GWO is as follows.

Encircling the prey

The first phase of GWO is aimed at determining the position of the prey. Initially assumed to be unknown, the algorithm explores the search space considering that the prey’s position is located near the optimal solution. Once, the location of the prey is found, they encircle it as a part of the hunting process. To locate a better solution, grey wolves explore the area around the location of prey.

Eq. (1) and Eq. (2) constitute the mathematical model for the encircling of the prey in GWO.

$$ \overrightarrow{P_{gw}\ }\left(t+1\right)=\overrightarrow{P_p(t)}-\overrightarrow{A}\ \overrightarrow{.\mathrm{d}} $$

(1)

$$ \overrightarrow{d}=\left|\ \overrightarrow{E\ }.\overrightarrow{P_p(t)}-\overrightarrow{P_{gw}(t)\ }\right| $$

(2)

where, $ \overrightarrow{P_{gw}} $ is the position of the grey wolf,$ \overrightarrow{A\ } $and $ \overrightarrow{E\ } $ are coefficient vectors, t is the present iteration, $ \overrightarrow{P_p(t)} $is the position of the prey, || is the modulus operator to determine the absolute value and’.’ represents multiplication in an element-to-element manner.

Eq. (3) and Eq. (4) describe the mathematical formulation of the co-efficient vectors $ \overrightarrow{A\ } $ and $ \overrightarrow{E} $.

$$ \overrightarrow{A} = 2\overrightarrow{a}.\overrightarrow{\ {\mathit{\operatorname{rand}}}_1} - \overrightarrow{a} $$

(3)

$$ \overrightarrow{E} = 2.\overrightarrow{\ {\mathit{\operatorname{rand}}}_2} $$

(4)

where,$ \overrightarrow{\ a} $ is the control vector whose value tends to linearly decrease from an initial value of 2 to a final value of 0 over the course of iterations and $ \overrightarrow{\ \mathit{\operatorname{rand}}} $ denotes a random vector in [0, 1].

Hunting

As soon as the location of the prey has been recognized, the hunting proves commences guided by the alpha. Supported by the beta, delta and on rare occasions by the omega, the positions of the omegas are updated in conjuncture with the mean position of the alpha, beta and delta. The best three solutions obtained are saved as described in the hierarchical dominance of the wolves to further estimate the location of prey and guide the omegas to update their positions around it in the subsequent iterations.

The distances between the current grey wolf and the three dominant wolves are given in Eq. (5) and the positions formulated based on the distances are given in Eq. (6).

$$ {\displaystyle \begin{array}{c}\overrightarrow{d_{\alpha }}=\left|\ \overrightarrow{E_1\ }.\overrightarrow{P_{\alpha }}-\overrightarrow{P_{gw}\ }\right|\\ {}\overrightarrow{d_{\beta }}=\left|\ \overrightarrow{E_2\ }.\overrightarrow{P_{\beta }}-\overrightarrow{P_{gw}\ }\right|\\ {}\overrightarrow{d_{\delta }}=\left|\ \overrightarrow{E_2\ }.\overrightarrow{P_{\delta }}-\overrightarrow{P_{gw}\ }\right|\end{array}} $$

(5)

$$ {\displaystyle \begin{array}{c}\overrightarrow{P_1} = \overrightarrow{P_{\alpha }}-\overrightarrow{A_1}.\left(\overrightarrow{d_{\alpha }}\right)\\ {}\overrightarrow{P_2} = \overrightarrow{P_{\beta }}-\overrightarrow{A_2}.\left(\overrightarrow{d_{\beta }}\right)\\ {}\overrightarrow{P_3} = \overrightarrow{P_{\delta }}-\overrightarrow{A_3}.\left(\overrightarrow{d_{\delta }}\right)\end{array}} $$

(6)

Finally, the position of the grey wolf is given by Eq. (7)

$$ \overrightarrow{P_{gw}\ }\left(t+1\right)=\left[\frac{\overrightarrow{P_1} + \overrightarrow{P_2} + \overrightarrow{P_3\ }}{3}\right] $$

(7)

where, $ \overrightarrow{P_{gw}} $ is the position of the grey wolf, $ \overrightarrow{P_{\alpha }} $, $ \overrightarrow{P_{\beta }} $ and $ \overrightarrow{P_{\delta }} $ represent the positions of the alpha, beta and delta wolves, $ \overrightarrow{A\ } $ and $ \overrightarrow{E} $ are the co-efficient vectors.

2.2 Demerits of the canonical GWO

Although efficient in several applications, the shortcomings of GWO include a lack of population diversity, local entrapment, premature convergence, lack of a stronger exploitation system etc. to name a few. Several review articles [14, 21, 22, 26, 44, 49] have outlined the limitations of GWO and there has been a greater focus towards the improvement of GWO to achieve a reliable and robust variant.

A summarization of the critical limitations of GWO from various review and work articles has been listed below

GWO has been susceptible to the curse of dimensionality in several benchmark and real-world applications. The performance has been deteriorating in problems with multiple constrained and higher independent decision variables owing to the selection and the population updating strategy. The algorithm’s inability to manage multiple dimensions as it may not reposition all its search agents appropriately has been researched extensively to realise a better variant immune to such drawbacks.
The convergence speeds are slower compared to other algorithms depending on sorting techniques for benchmarking and real-world scenarios.
The system of splitting iterations aimed at an explorative search for the first half and exploitative intensification for the next half does not necessarily guarantee that the majority of the search space has been covered and the conflicting aspects of exploration versus exploitations have not been perfectly balanced despite its good performance compared to the classical paradigms.
Complex and multi-modal search landscapes have been a challenging aspect as the algorithm is more likely to fall prey to local entrapment leading to premature convergence.
The exploration system being robust initially, narrows down to the location of the three dominant wolves with the progression of iterations and the wolves may not move far away from each other beyond the initial stages resulting in premature convergence.
The higher dependence on the three dominant wolves in the wolf pack localizes the population towards the end of iterations causing local entrapment inevitable. If entrapment occurs at an earlier stage, there are no adaptive techniques to escape it.

3 Proposed method: Competitive learning-based Grey wolf Optimizer

In this work, a competitive learning GWO is proposed after a comprehensive analysis of GWO, its other state-of-the-art variants, review articles, and publications related to GWO and its applications. The improved algorithm, named Competitive-learning based GWO is devised to address the various limitations of GWO as mentioned previously to improve its immunity to the curse of dimensionality and local entrapment with an accelerated convergence towards the global optimum and to achieve the desired equilibrium between the global (exploration) and local search (exploitation) with an enhanced population diversity.

3.1 Analysis and deductions from the previous publications aimed at improving GWO

Although very successful at multi-disciplinary optimization, GWO has its fair share of criticism and controversies surrounding its algorithmic structure for having a one-sided search system known to favour the geometric centre of the search landscapes. There have been several publications demonstrating its demerits that have pointed out it’s weaknesses, including the lack of a strong exploration system for multi-modal and complex landscapes. The population progression system in GWO favours diversity in the initial stages and quickly converges to the surroundings of the dominant wolves leading to stagnation and loss of population diversity. The analysis at [50] demonstrated this tendency of GWO to slide to the geometric centre (Functions with 0 as the location of the global optimum in the publication) and proposed a verification method through a set of nine modified test functions with varying degrees of shifted global optimum positions. The study concluded that the performance deterioration of GWO was proportional to the degree of shift in the global optimum from 0 and deduced the linearly decreasing nature of the control variable as one of the possible reasons. Multiple works have focussed on improving the population diversity with the intent of lowering the algorithm’s dependence on the three dominant wolves in the wolf pack [4, 34, 73]. This has also been referred to as the reason for the algorithm’s premature convergence which happens a result of local entrapment [19, 76]. The algorithm’s lack of immunity towards the curse of dimensionality has also been the centre of focus as well, with studies indicating the excessive dependence on the dominant wolves and the lack of elitism among the population as the reasons for a poor exploratoty sysem [21, 22, 26, 44]. Slower convergence has been reported in several cases as the algorithm accepts all the population members to replace the older population (Mu, Lambda (μ,λ) selection) despite their inferior fitness [63].

On the other hand, several publications have credited the linear control strategy as it provided a basic foundation that can be further improved to ensure robustness in the performance of GWO for other complex landscapes. Reference [47] adopted the standard GWO foraging techniques with an additional dimensional learning strategy based on Euclidian distance and greedy selection to improve the performance of GWO for complex landscapes with an improved GWO algorithm. The performance of IGWO has been verified against the CEC2017 benchmarking suite where none of the benchmarking functions had their global optimum at ‘0’. In [16], one of the more popular variants of GWO, a random walk GWO with greedy selection is proposed and tested against the CEC2014 suite (also with functions having no optima located at ‘0’) and demonstrated its robustness with complex landscapes. Additionally, selective opposition-based GWO in [12] incorporating the Spearman coefficient in an opposition-based learning scheme to improve the fitness of omegas with respect to the difference between the alpha and omegas is proposed. In [75], hybridization of GWO with Biogeography-Based Optimization (BBO) algorithm to enhance the population diversity of GWO and accelerate the convergence speeds has been proposed. It compared the hybrid GWO with EPSDE, SHADE and SinDE for the CEC2014 benchmarking suite where it outperformed them by a large margin for the thirty benchmark functions.

In other developments, non-linear control strategies have been very popular to establish a solid balance between exploration and exploitation for several multi-disciplinary applications. W.Long et al. in [32] proposed an exploration-enhanced GWO by experimenting with multiple non-linear modulation indices for the control parameter ‘a’ and deduced that an initial value of 1 or higher nearer to 1.5 is promising for multi-modal landscapes. The article at [73] proposed an improved GWO with exponential control vectors based on the current and final iterations to enhance the exploration quality for the truss optimization. Research has also been directed at balancing the exploration and the exploitation system of the GWO through the introduction of chaotic strategies to mitigate local entrapment [3, 18, 27, 33, 51, 57]. Hybridization/combinatorial variants of GWO with the existing swarm and evolutionary algorithms have been an ongoing trend since the publication of GWO. The combinatorial variants operate in synergy combining the best aspects of both their parent algorithms with robust and consistent performance across all standards and have considerable performance improvement for the conflicting cases of exploration versus exploitation [60, 63, 77].

3.2 Motivation

In addition to the aforementioned aspects and a myriad of publications of GWO, the motivation for the current work is as follows:

1.
Although GWO is a relatively old meta-heuristics, its search process can be efficiently improved to enhance its robustness and consistency for multi-modal and complex landscapes.
2.
To combat the demerits associated with the linear exploration method, other non-linear schemes can be experimented with and incorporated into the search mechanism to promote the diversity of the population.
3.
A balance between exploration and exploitation can be promoted through the design of suitable difference vectors which can be incorporated strategically and systematically at different stages in the search process.
4.
The segregation of the population into two groups has not been experimented with GWO so far to dedicate smaller sections of the population to achieve a specific purpose. The previous multi-strategy ensemble variants have aimed at modifying the structure of GWO completely and have not experimented with population sub-division as of lately.
5.
The inclusion of a second position updation strategy and greedy selection has been proven to be beneficial for most complex search spaces and grounding on this the competitive learning phase with distinct strategies is laid out.
6.
Success and failure-based strategy adaption, the most popular with the state-of-the-variants of DE have been experimented with to allow the algorithm to learn and adapt to any possible scenario.

Hence, based on the above developments, the current study proposes an improved GWO with an ensemble of strategies to improve population diversity and exploration quality. The following aspects have been considered for the development of the proposed method:

1.
The proposed method is built on the merits of the standard GWO linear hunting scheme and a second learning phase with greedy selection follows it as has been the most successful for composite and complex landscapes as seen with the various improved variants.
2.
The proposed method follows the population sub-division and is implemented in two phases with each phase complimenting the other in terms of the search strategy and selection mechanisms.
3.
The enhancement of population diversity has been the core of the current method.
4.
A non-linear control strategy and competitive learning strategy have been added to the standard GWO to improve its robustness and immunity against the curse of dimensionality
5.
Benchmarking is done through the recent CEC2020 suite and CEC2019 suites, neither of which have been considered for benchmarking so far and none of the benchmarking functions in them has their global optimum located at ‘0’.
6.
Comparisons are not only made against the advanced variants of GWO but also with the state-of-the-art advanced meta-heuristics from the literature and these algorithms are kept consistent throughout the entire benchmarking and real-world testing.

3.3 Implementation

Clb-GWO is implemented in two phases and in both phases, the population is divided into two subgroups. The algorithm’s phases and population groups are structured in a selective complementary fashion with the aim of promoting population diversity over convergence. The population is divided into a majority group with 90 % of the wolves and a minority group with 10 % of the remaining wolves. The majority group is mainly responsible for the large-scale exploration and exploitation while the minority groups are reserved to promote divergence and convergence as per their formulation. The advantages of population sub-division have been highlighted in several of the state-of-the-art advanced meta-heuristics such as EPSO [35], EPSDE [37], MPEDE [65] etc.

3.3.1 Modified GWO phase

i)
Majority Group 1 / Hunting group:

As discussed earlier, the linear hunting strategy from the standard GWO has its own set of merits and demerits and grounding on this, the first phase considers a linear hunting scheme complemented by a non-linear hunting scheme. The hunting scheme, either linear hunting or non-linear hunting is selected with a random probability such that both the schemes contribute to the generation of a new population as represented by Eq. (8).

$$ \overrightarrow{P_{hunt}\ }\left(t+1\right)=\left\{\begin{array}{cc} Linear\ GWO\ hunting\ \left(\overrightarrow{{P_{gw}}^{lin}\ }\right)& p{r}_1>0.5\\ {} Non- linear\ GWO\ huting\ \left(\overrightarrow{{P_{gw}}^{nl}\ }\right)& otherwise\end{array}\right. $$

(8)

where, $ \overrightarrow{P_{hunt}\ }\left(t+1\right) $ is the updated position of the grey wolf through the various hunting schemes, pr₁ is a random number in 0 and 1 generated through uniform distribution.

Linear hunt

The linear hunting scheme is the same as the encircling and hunting of prey technique from the standard GWO algorithm. The distance and position vectors are described in Eq. (9) and Eq. (10) respectively.

$$ {\displaystyle \begin{array}{c}\overrightarrow{d_{\alpha }}=\left|\ \overrightarrow{E_1\ }.\overrightarrow{P_{\alpha }}-\overrightarrow{P_{gw}\ }\right|\\ {}\overrightarrow{d_{\beta }}=\left|\ \overrightarrow{E_2\ }.\overrightarrow{P_{\beta }}-\overrightarrow{P_{gw}\ }\right|\\ {}\overrightarrow{d_{\delta }}=\left|\ \overrightarrow{E_2\ }.\overrightarrow{P_{\delta }}-\overrightarrow{P_{gw}\ }\right|\end{array}} $$

(9)

$$ {\displaystyle \begin{array}{c}\overrightarrow{P_1} = \overrightarrow{P_{\alpha }}-\overrightarrow{A_1}.\left(\overrightarrow{d_{\alpha }}\right)\\ {}\overrightarrow{P_2} = \overrightarrow{P_{\beta }}-\overrightarrow{A_2}.\left(\overrightarrow{d_{\beta }}\right)\\ {}\overrightarrow{P_3} = \overrightarrow{P_{\delta }}-\overrightarrow{A_3}.\left(\overrightarrow{d_{\delta }}\right)\end{array}} $$

(10)

The coefficient vectors are described by Eq. (11) and Eq. (12) respectively.

$$ \overrightarrow{A} = 2\overrightarrow{a}.\overrightarrow{\ {r}_1} - \overrightarrow{a} $$

(11)

$$ \overrightarrow{E} = 2.\overrightarrow{\ {r}_2} $$

(12)

The positions of grey wolves are updated as per Eq. (13)

$$ \overrightarrow{{P_{gw}}^{lin}\ }\left(t+1\right)=\left[\frac{\overrightarrow{P_1} + \overrightarrow{P_2} + \overrightarrow{P_3\ }}{3}\right] $$

(13)

where, $ \overrightarrow{{P_{gw}}^{lin}} $ is the updated position of the grey wolf through the linear hunting scheme, $ \overrightarrow{P_{gw}} $ is the position of the grey wolf, $ \overrightarrow{P_{\alpha }} $, $ \overrightarrow{P_{\beta }} $ and $ \overrightarrow{P_{\delta }} $ represent the positions of the alpha, beta and delta wolves, $ \overrightarrow{A\ } $ and $ \overrightarrow{E} $ are the co-efficient vectors, $ \overrightarrow{\ a} $ is the control vector whose value tends to linearly decrease from an initial value of 2 to a final value of 0 over the course of iterations ‘t’ and $ \overrightarrow{\ r} $ denotes a random vector in [0, 1].

Non-linear hunt

The non-linear hunting scheme is considered to improve the population diversity based on an exponentially decreasing vector through the course of iterations. This strategy includes the worst solution to form the differential vector while also considering the selection of randomized omega wolves to prevent local stagnation which is often the result of the dominant wolves converging quickly to a single point in the search space. The non-linearity associated with the control vector prevents the solutions from sliding towards the geometric centre of the search space which has been known to severely impact its performance for shifted and rotated benchmark functions. The distance and position vectors are described in Eq. (14) and Eq. (15) respectively.

$$ {\displaystyle \begin{array}{c}\overrightarrow{d_{\alpha }}=\left|\ \left[\overrightarrow{P_{\alpha }} - \overrightarrow{P_W\ }\right]-\overrightarrow{P_{gw}\ }\right|\\ {}\overrightarrow{d_{\beta }}=\left|\ \left[\overrightarrow{P_{\beta }} - \overrightarrow{P_W\ }\right]-\overrightarrow{P_{gw}\ }\right|\\ {}\overrightarrow{d_{\delta }}=\left|\ \left[\overrightarrow{P_{\delta }} - \overrightarrow{P_W\ }\right]-\overrightarrow{P_{gw}\ }\right|\end{array}} $$

(14)

$$ {\displaystyle \begin{array}{c}\overrightarrow{P_1} = \overrightarrow{\phi .}\overrightarrow{P_{\omega (r1)}}-\overrightarrow{r_3}.\left(\overrightarrow{d_{\alpha }}\right)\\ {}\overrightarrow{P_2} = \overrightarrow{\phi .}\overrightarrow{P_{\omega (r2)}}-\overrightarrow{r_4}.\left(\overrightarrow{d_{\beta }}\right)\\ {}\overrightarrow{P_3} = \overrightarrow{\phi .}\overrightarrow{P_{\omega (r3)}}-\overrightarrow{r_5}.\left(\overrightarrow{d_{\delta }}\right)\end{array}} $$

(15)

where, $ \overrightarrow{P_W\ } $ position of the grey wolf with the worst fitness value, $ \overrightarrow{\phi\ } $ is the control vector whose value tends to exponentially decrease from an initial value of 1 to a final value of 0 over the course of iterations ‘t’.

The exponential control vector decreases from 1 to 0 over the course of iterations as described in Eq. (16).

$$ \overrightarrow{\phi} = {e}^{\left(-0.05\times t\right)} $$

(16)

The final position of the grey wolf is the average of the three positions described by Eq. (17).

$$ \overrightarrow{{P_{gw}}^{nl}\ }\left(t+1\right)=\left[\frac{\overrightarrow{P_1} + \overrightarrow{P_2} + \overrightarrow{P_3\ }}{3}\right] $$

(17)

where, $ \overrightarrow{{P_{gw}}^{nl}} $ is the updated position of the grey wolf through the non-linear hunting scheme.

ii)
Minority Group 1 / Diverging group:

The first minority group comprising the remaining 10 % of wolves is retained for re-initialization and random repositioning to diverge the wolves and prevent local entrapment. The divergence is achieved through the described equations in Eq. (18) chosen at random. The divergence vector $ \overrightarrow{P_{\Omega}\ } $ is formulated to push the wolves far away from each other.

$$ \overrightarrow{P_{hunt}\ }\left(t+1\right)=\left\{\begin{array}{cc}\overrightarrow{P_{gw}} + \Delta .\left[\overrightarrow{P_{\Omega}\ }\right]& p{r}_2>0.5\\ {} lb+\overrightarrow{r_6}.\left[ ub- lb\right]& otherwise\end{array}\right. $$

(18)

where

$$ \overrightarrow{P_{\Omega}} = \overrightarrow{P_{\omega \left({r}_a\right)}}-\overrightarrow{P_{\omega \left({r}_b\right)}} $$

where, is a random vector in [1, 13], $ \overrightarrow{P_{\Omega}\ } $ denotes the difference vector of any two randomly chosen omega wolves $ \overrightarrow{P_{\omega \left({r}_a\right)}} $ and $ \overrightarrow{P_{\omega \left({r}_b\right)}} $, lb and ub denote the lower and upper bounds for the decision variables.

The diversity preserving Mu, Lambda (μ, λ) selection follows the modified GWO phase for the population updation wherein every new solution is accepted to replace its parent solution despite its improved or deteriorated fitness value. However, the memory of the three dominant wolves is updated when a wolf with better fitness than their respective fitness is found.

3.3.2 Competitive learning phase

The competitive learning process follows the standard GWO procedure to further improve the quality of solutions, expand the solution space and ensure a better balance of exploration and exploitation. The search processes are synchronized to allow for the exploration of the search space in both the GWO and the competitive learning phases with a higher emphasis on exploration through the majority competitive learning group followed by a greedy selection process to ensure those fitter solutions replace the older ones.

Similar to the first phase, the competitive learning phase also divides the population into two sub-groups i.e., the Minority group 2 and majority group 2.

i) Minority Group 2 /Converging group:

The second minority group considered the first 10 % of the wolves to improve the local search and accelerate the convergence in a controlled manner. Here, fitness-based repositioning is implemented to guide the wolves to the promising areas of the search space. A single-dimensional update strategy is followed to generate one random number for all the problem dimensions as it ensures accelerated convergence for multi-modal and separable functions.

Every wolf in the second minority group is compared with a random wolf other than the three dominant wolves and repositioned closer to the alpha wolf with respect to its fitness. Fitter wolves are allowed to migrate slowly while the non-fitter wolves are given a higher degree of freedom to reposition themselves much closer to the alpha. Local search around the current position and exploitation of the best solutions is facilitated through the $ \overrightarrow{{P_{\Omega}}^{gw}\ } $ and $ \overrightarrow{{P_{\Omega}}^{\alpha }\ } $ vectors respectively as described below in Eq. (19).

$$ {\displaystyle \begin{array}{c}\overrightarrow{P_{learn}\ }\left(t+1\right)=\left\{\begin{array}{cc}\overrightarrow{P_{\omega {(r)}_{hunt}}} + \overrightarrow{r_7}.\left[\overrightarrow{{P_{\Omega}}^{gw}\ }\ \right]+\overrightarrow{r_8}.\left[\overrightarrow{{P_{\Omega}}^{\alpha }\ }\ \right]&\ Fit(i)< Fit(r)\\ {}\overrightarrow{P_{gw}} - \overrightarrow{r_9}.\left[\overrightarrow{{P_{\Omega}}^{\alpha }\ }\ \right]& Fit(i)> Fit(r)\end{array}\right.\\ {}\overrightarrow{{P_{\Omega}}^{gw}} = \overrightarrow{P_{gw}} - \overrightarrow{P_{\omega (r)}}\\ {}\overrightarrow{{P_{\Omega}}^{\alpha }} = \overrightarrow{P_{\alpha }} - \overrightarrow{P_{\omega (r)}}\end{array}} $$

(19)

where, $ \overrightarrow{P_{\mathrm{learn}}\ }\left(t+1\right) $ is the updated position of the grey wolf through the various learning schemes, $ \overrightarrow{{P_{\Omega}}^{\alpha }\ } $denotes the difference vector of the alpha wolf $ \overrightarrow{P_{\alpha }\ } $and a randomly chosen omega wolf $ \overrightarrow{P_{\omega (r)}} $ and $ \overrightarrow{{P_{\Omega}}^{gw}\ } $ denotes the difference vector of the current wolf $ \overrightarrow{P_{gw}\ } $and a randomly chosen omega wolf $ \overrightarrow{P_{\omega (r)}} $.

The inclusion of at least one omega wolf whose position has not been modified from the previous hunting phase is made sure to prevent the loss of diversity during the repositioning process.

ii) Majority group 2 / Learning group:

This learning phase is selected based on the success and failure rates that serve as the moderators to switch between the linear learning and adaptive learning techniques that have been described below. Multi-dimensional update strategy which has been proven to be excellent for non-separable functions has been applied to the learning group, wherein the random numbers are unique for each dimension such that it expands the search space around them for a stronger global search emphasis. Initially, the selection is made probabilistically, and a learning parameter named ‘competitive rate’ controls the selection of the schemes best suited to ensure that the exploration goes on in a smooth and undisturbed manner as per Eq. (20). The competitive rate is the sum of the number of consecutive failures (fr) and success (sr) corresponding to each of the strategies. A detailed description of the competitive rate and its impact on the learning outcomes are discussed in the upcoming sub-sections.

$$ {\displaystyle \begin{array}{c}\overrightarrow{P_{learn}\ }\left(t+1\right)=\left\{\begin{array}{cc} Linear\ GWO\ learning\left(\overrightarrow{{P_{clb}}^{lin}\ }\right)& If\ fr>10\ or\kern0.5em If\ sr>5\\ {} Adaptive\ GWO\ learning\left(\overrightarrow{{P_{clb}}^{adapt}\ }\right)& otherwise\end{array}\right.\\ {} Comp= fr+ sr\end{array}} $$

(20)

where, Comp denotes the competitive rate fr stands for the failure rate and sr stands for the success rate respectively.

Linear learning

The linear GWO learning scheme adopts the linearly decreasing control vector from the standard GWO phase to search for new solutions around the most promising areas in the search space and has a good global search ability. The second technique comprising random omega wolves from the current and hunting population is simply added to prevent the one-sided search progression associated with the linear control strategy and hence has been given a lower priority. The linear search process is prone to drive the population to the geometric centre and to avoid this the difference vectors $ \overrightarrow{P_{\Omega}\ } $ and $ \overrightarrow{{P_{\Omega}}^{hunt}\ } $ are designed. Linear hunting is described by Eq. (21).

$$ \overrightarrow{{P_{clb}}^{lin}\ }\left(t+1\right)=\left\{\begin{array}{cc}\overrightarrow{P_{\alpha }} + \overrightarrow{a}.\left[\overrightarrow{P_{\Omega}\ }\ \right]& p{r}_3<0.75\\ {}\overrightarrow{P_{\alpha }} + \overrightarrow{r_{10}}.\left[\overrightarrow{P_{\Omega}\ }\ \right]+\overrightarrow{r_{11}}.\left[\overrightarrow{{P_{\Omega}}^{hunt}\ }\ \right]& otherwise\end{array}\right. $$

(21)

where

$$ \overrightarrow{{P_{\Omega}}^{hunt}} = \overrightarrow{P_{\omega {(r)}_{hunt}}} - \overrightarrow{P_{\omega (r)}} $$

where, $ \overrightarrow{{P_{clb}}^{lin}\ } $ is the updated position of the grey wolf through the linear learning scheme, $ \overrightarrow{\ a} $ is the control vector whose value tends to linearly decrease from an initial value of 2 to a final value of 0 over the course of iterations ‘t’, $ \overrightarrow{P_{\omega {(r)}_{hunt}}\ } $ denotes a randomly chosen omega wolf from the previous hunting schemes and $ \overrightarrow{{P_{\Omega}}^{hunt}\ } $ denotes the difference vector of a randomized omega wolf $ \overrightarrow{P_{\omega {(r)}_{hunt}}\ } $ from the hunting schemes and a randomly chosen omega wolf $ \overrightarrow{P_{\omega (r)}} $.

Adaptive learning

The adaptive hunting scheme comprises an adaptive cooperative learning technique (the first technique) with the alpha, beta and delta wolves to form the solution vector while the second technique involves the selection of only randomised omega wolves from the current and the previous hunting populations. The first technique is prioritized over the second as the knowledge of the alpha, beta and delta can be exploited efficiently in guiding the omegas to more promising areas. The second strategy serves the purpose of diversity enhancement and prevents excessive dependence on the dominant wolves at all times in the search process through the divergence vectors $ \overrightarrow{P_{\Omega_1}} $ and $ \overrightarrow{P_{\Omega_2}} $ respectively and hence its priority is set to be lower for its selection. Adaptive learning is achieved through the vectors $ \overrightarrow{{P_{\Omega}}^{\alpha \prime }\ } $ and $ \overrightarrow{{P_{\mathrm{gw}}}^{\beta, \delta }\ } $ wherein the information from the three dominant wolves is used to reposition the wolves from the previous phases. Adaptive hunting is described by Eq. (22).

$$ \overrightarrow{{P_{clb}}^{adapt}\ }\left(t+1\right)=\left\{\begin{array}{cc}\overrightarrow{P_{gw}} + \overrightarrow{R_1}.\left[\overrightarrow{{P_{\Omega}}^{\upalpha \prime }\ }\ \right]-\overrightarrow{R_2}.\left[\overrightarrow{{P_{\mathrm{gw}}}^{\beta, \delta }\ }\ \right]& p{r}_4<0.75\\ {}\overrightarrow{P_{\omega {(r)}_{hunt}}} + \overrightarrow{r_{12}}.\left[\ \overrightarrow{P_{\Omega_1}}\right]+\overrightarrow{r_{13}}.\left[\overrightarrow{P_{\Omega_2}}\right]& otherwise\end{array}\right. $$

(22)

$$ \overrightarrow{R}=\mathit{\operatorname{rand}}\left(1,D\right) $$

$$ \overrightarrow{{P_{\Omega}}^{\alpha \prime }} = \overrightarrow{P_{\omega {(r)}_{hunt}}} - \overrightarrow{P_{\alpha }\ } $$

$$ \overrightarrow{{P_{\mathrm{gw}}}^{\beta, \delta }} = \overrightarrow{P_{gw}} + \left(\overrightarrow{P_{\beta }} + \overrightarrow{P_{\delta }\ }\right) $$

where, $ \overrightarrow{{P_{clb}}^{adapt}\ } $ is the updated position of the grey wolf through the adaptive learning scheme, $ \overrightarrow{R} $ is a random vector comprising random numbers in [0,1] of the size of 1 by D, with D representing the problem dimensions.

The final step is the fitness evaluations of all the newer population members. The greedy selection technique follows the competitive learning phase to update the population pool with superior solutions from the competitive learning phase. The greedy selection allows for the population members from the competitive learning strategies with better fitness compared to the one from the modified GWO process. The survival of the fittest strategy is followed to select the fitter population members and discard the rest. In the case of inferior solutions, the positions from the modified GWO procedure are retained as given by Eq. (23).

$$ \overrightarrow{P_{gw}\ }\left(t+1\right)=\left\{\begin{array}{cc}\overrightarrow{P_{hunt}\ }\left(t+1\right)& if\ f\left(\overrightarrow{P_{hunt}\ }\right)<f\left(\overrightarrow{P_{learn}\ }\right)\\ {}\overrightarrow{P_{learn}\ }\left(t+1\right)& otherwise\end{array}\right. $$

(23)

where, $ f\left(\overrightarrow{P_{learn}\ }\right) $ is the fitness score of the decision variables obtained by the competitive learning strategy and $ f\left(\overrightarrow{P_{hunt}\ }\right) $ fitness score of the decision variables obtained by the modified GWO procedure.

The overall algorithmic structure of Clb-GWO is presented in Fig. 4.

3.3.3 Pseudocode of Clb-GWO

3.3.4 Analysis of the difference vectors

The difference vectors lie at the core of the proposed method and have been designed after a meticulous study and analysis of the various possible combinations used in previous advanced meta-heuristics. The primary function of the various vectors is to lower the dependence of the algorithms at all times on the three dominant wolves and eventuate to increased diversity in the population. Most of the difference vectors comprise a ransom omega wolf from the population pool which has been deliberately planned to eliminate the clustering of the wolves at any given time and extend the course of exploration over a greater interval of time. Although this can result in slower convergence, the implementation of the search mechanism with them is eliminating the one-side search system in GWO that has received a lot of criticism. The evolution of the wolfpack can be directed in the right direction to explore and exploit systematically and without being susceptible to entrapment. A tabulation of the various difference vectors designed for the proposed method is tabulated in Table 1.

Table 1 Tabulation of the various difference vectors implemented in Clb-GWO

A competitive learning-based Grey wolf Optimizer for engineering problems and its application to multi-layer perceptron training

Abstract

Similar content being viewed by others

Dung beetle optimizer: a new meta-heuristic algorithm for global optimization

Learning to optimize: A tutorial for continuous and mixed-integer optimization

An exhaustive review of the metaheuristic algorithms for search and optimization: taxonomy, applications, and open challenges

1 Introduction

1.1 Introduction to meta-heuristics

1.2 A review of meta-heuristics in multimedia applications and artificial intelligence

1.3 Challenges associated with meta-heuristic algorithms

1.4 Contributions of the proposed work

1.4.1 Highlights of the current work

2 Grey wolf Optimizer

2.1 Working of GWO

Encircling the prey

Hunting

2.2 Demerits of the canonical GWO

3 Proposed method: Competitive learning-based Grey wolf Optimizer

3.1 Analysis and deductions from the previous publications aimed at improving GWO

3.2 Motivation

3.3 Implementation

3.3.1 Modified GWO phase

Linear hunt

Non-linear hunt

3.3.2 Competitive learning phase

Linear learning

Adaptive learning

3.3.3 Pseudocode of Clb-GWO

3.3.4 Analysis of the difference vectors

3.3.5 Exploration and exploitation

3.3.6 Time complexity and computational complexity

4 Results and discussion

4.1 Description of benchmark functions and performance evaluation criteria

4.1.1 Algorithms in the benchmarking framework

4.1.2 Tuning settings of the algorithms

4.2 CEC2020 benchmarking suite

4.2.1 Results of benchmarking (CEC2020 test suite)

Analysis of results

4.3 CEC2019 benchmarking suite

Analysis of results

4.4 Sensitivity to tuning parameters

4.4.1 Influence of the non-linear control vector (\( \overrightarrow{\phi} \))

4.4.2 Influence of competition rate (comp)

4.4.3 Influence of the N:T ratio

5 Multi-layer perceptron training

5.1 Problem description

5.2 Experimental setup

6 Conclusion

6.1 Merits and demerits

6.1.1 Merits

6.1.2 Limitations

6.2 Future scope

References

CRediT taxonomy

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher’s note

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation