1 Introduction

Differential evolution (DE), proposed by Price et al. (2006), is a well-known population-based evolutionary algorithm for solving global optimisation problems over continuous spaces. The literature indicates that DE exhibits very good performance over a wide variety of optimisation problems (Das and Suganthan 2011). However, although it is a very efficient optimiser, its local search ability has long been questioned, and work has been done to improve its local convergence by combining DE with local optimisation strategies (Qing 2010).

In previous works by the authors, Locatelli and Vasile (2015) and Vasile et al. (2011), it was demonstrated that DE can converge to a fixed point, a level set or a hyperplane that does not contain the global minimum. The collapse of the population to a fixed point, or to a neighbourhood of a fixed point from which DE cannot escape, was one of the motivations for the development of the Inflationary Differential Evolution Algorithm (IDEA) (Vasile et al. 2011).

IDEA is based on the hybridisation of DE with the restarting procedure of monotonic basin hopping (MBH) (Wales and Doye 1997); it implements both a local restart in the neighbourhood of a local minimum and a global restart in the whole search space. IDEA was shown to give better results than a simple DE, but its performance depends on the parameters controlling both the DE and MBH heuristics (Vasile et al. 2011). These parameters are the crossover probability \(\textit{CR}\), the differential weight F, the radius of the local restart bubble \(\delta _{\mathrm{local}}\) and the number of local restarts \(n_{\mathrm{LR}}\), whose best settings are problem dependent. Different adaptive mechanisms for adjusting \(\textit{CR}\) and F during the search process can be found in the literature (Brest et al. 2006, 2013; Liu and Lampinen 2005; Omran et al. 2005); a parameter-less adaptive evolutionary algorithm has been presented in Papa (2013). However, no approach has been proposed so far to adapt \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\). In this paper, we present a simple mechanism to adapt \(\textit{CR}\) and F within a single population and a multi-population strategy to adapt \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\). The multi-population version of IDEA is hereafter called MP-AIDEA (Multi-Population Adaptive Inflationary Differential Evolution Algorithm).

The resulting algorithm was extensively tested over 51 test problems from the single-objective global optimisation competitions of the Congress on Evolutionary Computation (CEC) 2005, 2011 and 2014. Tests to assess the performance of the algorithm include rankings, the Wilcoxon test and the success rate. It will be shown that the adaptive version of IDEA always ranks among the three best algorithms in every competition and for every number of dimensions, except for the CEC 2014 test set in 30 dimensions. Furthermore, it will be shown that the simple adaptation of \(\textit{CR}\) and F within a single population can outperform the multi-population version with adaptation of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\) if these two parameters are properly chosen.

This paper extends the work presented in Di Carlo et al. (2015), where the basic mechanisms that constitute MP-AIDEA were introduced and the performance of MP-AIDEA was measured only by a relative ranking against other algorithms. This paper provides a more detailed explanation of all the mechanisms and heuristics inside MP-AIDEA; moreover, it presents an extensive empirical assessment of its performance, using several metrics in addition to the relative ranking. As part of this performance evaluation, we compare MP-AIDEA against a number of other algorithms and against a single-population version of MP-AIDEA with no adaptive local restart. Detailed results obtained for each test function are also presented, so that the paper can be used as a reference for comparison against other algorithms.

The paper starts by stating the problem we are trying to solve in Sect. 2 and by briefly introducing the basic principles and fundamental theoretical developments underneath inflationary differential evolution in Sect. 3. The adaptation mechanisms are presented in Sect. 4, and the resulting adaptive multi-population version of IDEA, MP-AIDEA, in Sect. 5. The test cases are presented in Sect. 6, and the corresponding results in Sect. 6.1. Finally, the paper presents the results of all the comparative tests in Sects. 6.2, 6.3 and 6.4. Section 7 concludes the paper.

2 Problem statement

This paper is concerned with the following class of global minimisation problems with box constraints:

$$\begin{aligned} \min _{{\mathbf {x}}\in B} f({\mathbf {x}}) \end{aligned}$$
(1)

with \(f:B \subseteq {\mathbb {R}}^{n_\mathrm{D}}\rightarrow {\mathbb {R}}\), \(n_\mathrm{D}\) the number of dimensions and the box B defined by the upper and lower boundaries \({\mathbf {x}}_{\mathrm{lower}} \le {\mathbf {x}} \le {\mathbf {x}}_{\mathrm{upper}}\). In the following, we will use a gradient-based local search algorithm; therefore, we further require that \(f\in C^2(B)\). Note, however, that this is not a strict requirement, as we can show that the algorithm also works when a finite number of non-differentiable points exists.

3 Inflationary differential evolution

This section briefly recalls the working principles of inflationary differential evolution and presents the parameters that the algorithm proposed in this paper adapts. Following the notation introduced in Vasile et al. (2011), we express the general DE process as a discrete dynamical system. The governing equation, for the i-th individual at generation k, is expressed as:

$$\begin{aligned} {\mathbf {x}}_{i,k+1}={\mathbf {x}}_{i,k}+S({\mathbf {x}}_{i,k}+{\mathbf {u}}_{i,k},{\mathbf {x}}_{i,k}) {\mathbf {u}}_{i,k} \end{aligned}$$
(2)

with

$$\begin{aligned} \begin{aligned} {\mathbf {u}}_{i,k} ={\mathbf {e}}&\left[ G {\mathbf {x}}_{r_1,k} + (1 - G) {\mathbf {x}}_{i,k} +F({\mathbf {x}}_{r_2,k}-{\mathbf {x}}_{r_3,k})\right. \\&+\,\left. (1-G) F ({\mathbf {x}}_{\mathrm{best},k}-{\mathbf {x}}_{i,k}) - {\mathbf {x}}_{i,k}\right] \end{aligned} \end{aligned}$$
(3)

where G can be either 0 or 1 [with \(G = 1\) corresponding to the DE strategy DE/rand and \(G = 0\) corresponding to the DE strategy DE/current-to-best (Price et al. 2006)]. In Eq. (3), \(r_1\), \(r_2\) and \(r_3\) are the indices of individuals randomly chosen in the population, and \({\mathbf {e}}\) is a mask of randomly generated 0s and 1s defined according to:

$$\begin{aligned} e_t= {\left\{ \begin{array}{ll} 1, &{} \text { if } U\le \textit{CR} \\ 0, &{} \text { if } U> \textit{CR} \end{array}\right. } \qquad t = 1, \dots , n_\mathrm{D} \end{aligned}$$
(4)

U is a random number drawn from a uniform distribution on [0, 1]. The product between \({\mathbf {e}}\) and the term in square brackets in Eq. (3) is to be understood component-wise. In this work, given \(u_{t,i,k}\), the t-th component of the trial vector \({\mathbf {u}}_{i,k}\), the following correction is applied to satisfy the box constraints (Zhang and Sanderson 2009):

$$\begin{aligned} u_{t,i,k} = {\left\{ \begin{array}{ll} \left( x_{t,i,k} + x_{t,\mathrm{lower}}\right) /2, &{} \text { if } u_{t,i,k} < x_{t,\mathrm{lower}}\\ \left( x_{t,i,k} + x_{t,\mathrm{upper}}\right) /2, &{} \text { if } u_{t,i,k} > x_{t,\mathrm{upper}} \end{array}\right. } \end{aligned}$$
(5)

The selection function S is defined as:

$$\begin{aligned} S({\mathbf {x}}_{i,k}+{\mathbf {u}}_{i,k},{\mathbf {x}}_{i,k})=\Big \{ \begin{array}{l} 1 \;\; \text {if} \;\; f({\mathbf {x}}_{i,k}+{\mathbf {u}}_{i,k})<f({\mathbf {x}}_{i,k})\\ 0 \;\; \text {otherwise} \end{array} \end{aligned}$$
(6)
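For illustration, one generation of the map in Eqs. (2)–(6) can be sketched as follows. This is a minimal Python sketch, not the authors' MATLAB implementation: the function name, scalar box bounds and default parameter values are assumptions, and the candidate increment is read as relative to \({\mathbf {x}}_{i,k}\), so that the mask \({\mathbf {e}}\) reproduces binomial crossover on the mutated components.

```python
import numpy as np

def de_step(pop, f, F=0.8, CR=0.9, G=1, lower=-5.0, upper=5.0, seed=0):
    """One generation of the DE map of Eqs. (2)-(6) for a single population.

    pop is an (N_pop, n_D) array; G=1 gives DE/rand, G=0 DE/current-to-best.
    """
    rng = np.random.default_rng(seed)
    n_pop, n_d = pop.shape
    fit = np.array([f(x) for x in pop])
    best = pop[np.argmin(fit)]
    out = pop.copy()
    for i in range(n_pop):
        # r1, r2, r3: indices of individuals other than i (Eq. (3))
        r1, r2, r3 = rng.choice(np.delete(np.arange(n_pop), i), size=3, replace=False)
        e = (rng.random(n_d) <= CR).astype(float)          # crossover mask, Eq. (4)
        u = e * (G * pop[r1] + (1 - G) * pop[i]
                 + F * (pop[r2] - pop[r3])
                 + (1 - G) * F * (best - pop[i])
                 - pop[i])                                 # increment, Eq. (3)
        trial = pop[i] + u
        # Box-constraint correction of Eq. (5), applied to the trial vector
        below, above = trial < lower, trial > upper
        trial[below] = 0.5 * (pop[i][below] + lower)
        trial[above] = 0.5 * (pop[i][above] + upper)
        if f(trial) < fit[i]:                              # selection S, Eq. (6)
            out[i] = trial
    return out
```

Note that the selection operator only ever accepts strict improvements, which is the property exploited by Proposition 1 below.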

In the general case in which the indices \(r_{1}\), \(r_{2}\) and \(r_{3}\) can assume any value, in Vasile et al. (2011) it was demonstrated that the population can converge to a fixed point different from a local minimum or to a level set. Furthermore, in Locatelli and Vasile (2015) it was demonstrated that DE can converge to a hyperplane that does not contain the global minimum. Finally, consider the following proposition.

Proposition 1

Consider the subset \(\varPsi =\{{\mathbf {x}}\in B: f({\mathbf {x}})\le {\bar{f}}\}\) and the superset \(\phi \) such that:

1. \(\varPsi \subset \phi \)

2. \({\mathbf {x}}_{i,k+1}\in \phi , \forall i\)

3. \(\forall {\mathbf {y}}\in \phi \setminus \varPsi , f({\mathbf {y}})>{\bar{f}}\)

Then, if the population at iteration k is entirely contained in \(\varPsi \), it cannot escape from \(\varPsi \) at any future iteration.

Proof

The proof descends from the definition of S. Suppose that a candidate individual \({\mathbf {x}}_{i,k}+{\mathbf {u}}_{i,k}\) generated by map (2) fell in \(\phi \setminus \varPsi \); because of point 3 of the proposition, its objective value would be greater than \({\bar{f}}\), and hence greater than that of its parent in \(\varPsi \), so the candidate would be rejected by the selection operator. \(\square \)

Therefore, when the population contracts within a ball \(B_\mathrm{c}\subseteq \varPsi \) of radius \(\rho _\mathrm{l}\), DE can only converge to a point or a subset within \(B_\mathrm{c}\). In the following, we call \(\rho _\mathrm{l}\) the contraction limit.

In inflationary differential evolution, the DE heuristic is iterated until the population reaches the contraction limit. A local search is then started from the best individual in the population \({\mathbf {x}}_{\mathrm{best}}\), the corresponding local minimum \({\mathbf {x}}_{\mathrm{LM}}\) is saved in an archive of local minima A, and the population is restarted in a bubble \(B_\mathrm{R}\) of radius \(\delta _{\mathrm{local}}\) around the local minimum \({\mathbf {x}}_{\mathrm{LM}}\). This mechanism is borrowed from the basic logic underneath monotonic basin hopping (Wales and Doye 1997). To assess whether the contraction condition is satisfied, the maximum distance between all possible pairs of individuals of the population at generation k, \(\rho ^{(k)}\), is computed:

$$\begin{aligned} \rho ^{(k)} = \max \left( || {\mathbf {x}}_{i,k} - {\mathbf {x}}_{l,k}||\right) ,\quad i,l = 1, \dots , N_{\mathrm{pop}} \end{aligned}$$
(7)

where \(N_{\mathrm{pop}}\) is the number of individuals in the population. Contraction is verified when \(\rho ^{(k)}\le {\bar{\rho }} \rho _{\mathrm{max}}\), where \(\rho _{\mathrm{max}}=\max _k \rho ^{(k)}\) is the maximum value of \(\rho ^{(k)}\) recorded up to generation k and \({\bar{\rho }}\), the contraction threshold, is one of the parameters of the algorithm. This contraction criterion is consistent with Proposition 1 under the assumption that \(\rho _\mathrm{l}={\bar{\rho }} \rho _{\mathrm{max}}\).
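The contraction test can be sketched as follows; this is an illustrative Python sketch, with an assumed function name and a hypothetical default value for the threshold \({\bar{\rho }}\).

```python
import numpy as np

def check_contraction(pop, rho_max, rho_bar=0.2):
    """Evaluate the contraction condition of Sect. 3.

    rho_k is the maximum pairwise distance in the population (Eq. (7));
    contraction holds when rho_k <= rho_bar * rho_max, with rho_max the
    largest rho recorded so far and rho_bar the contraction threshold
    (0.2 is a hypothetical value). Returns (contracted, updated rho_max).
    """
    diff = pop[:, None, :] - pop[None, :, :]
    rho_k = float(np.sqrt((diff ** 2).sum(axis=-1)).max())
    rho_max = max(rho_max, rho_k)
    return rho_k <= rho_bar * rho_max, rho_max
```

Since \(\rho _{\mathrm{max}}\) grows monotonically, a population that has spread out at any point in the past must shrink well below its largest recorded extent before being declared contracted.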

After a number \(n_{\mathrm{LR}}\) of such local restarts without any improvement of the current best solution, the population is restarted globally in the search space, so that every individual is initially at a distance of at least \(\sqrt{n_\mathrm{D}} \delta _{\mathrm{global}}\) from the centres of the clusters of the local minima in A, which by then collects all the local minima found so far. During local restarts, the most important information is preserved in the local minimum: the assumption is that the basin of attraction of that local minimum has already been explored and that this exploration led to the convergence of the population to \(B_\mathrm{c}\). When the population is restarted globally, the essential information, i.e. all the local minima, is stored in the archive A: here the assumption is that IDEA has completely explored a funnel structure, resulting in a cluster of minima.

These restart procedures were proven to be very effective on a series of difficult real-world problems in which the landscape presents multiple funnels (see Vasile et al. 2011 for additional details).

The complete inflationary differential evolution process with trial vector (3) is governed by the following key parameters: \(N_{\mathrm{pop}}\), \(\textit{CR}\), F, G, \({\bar{\rho }}\), \(\delta _{\mathrm{local}}\), \(n_{\mathrm{LR}}\) and \(\delta _{\mathrm{global}}\). From experience, we know that \(\delta _{\mathrm{global}}\) is not a critical parameter in most cases, while \(\textit{CR}\), F, \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\) play a significant role and are not always easy to define. The parameters \(\textit{CR}\) and F are applied to update each individual in a population, while \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\) are applied to restart the whole population. Therefore, in this paper we propose two adaptation mechanisms, one for \(\textit{CR}\) and F and one for \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\). The adaptation mechanisms for \(\textit{CR}\), F and \(\delta _{\mathrm{local}}\) produce the numerical values of these parameters to be used by the algorithm. On the contrary, \(n_{\mathrm{LR}}\) is replaced by a mechanism that allows the algorithm to decide when to perform a local or a global restart, so that the definition of a numerical value for \(n_{\mathrm{LR}}\) is no longer required.

4 Adaptation mechanisms

Because of the very nature of \(\textit{CR}\), F, \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\), the automatic adaptation of \(\textit{CR}\) and F requires only the evaluation of the success of each candidate increment \({\mathbf {u}}_{i,k}\), whereas the adaptation of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\) requires the evaluation of the success of the restart of an entire population. Therefore, in this paper it is proposed to extend the working principle of inflationary differential evolution by evolving \(n_{\mathrm{pop}}\) populations in parallel, where \(n_{\mathrm{pop}}\) is defined a priori.

Each population adapts its own values of \(\textit{CR}\) and F. We use a stigmergic approach in which the \(\textit{CR}\) and F of each individual are drawn from a joint probability distribution, over a set of possible values of \(\textit{CR}\) and F, that evolves with the population.

All populations then concurrently adapt \(\delta _{\mathrm{local}}\) and the number of local restarts. More specifically, the adaptation mechanism of the local restart bubble evolves a probability distribution function over a range of possible values of \(\delta _{\mathrm{local}}\). Each population draws values from that probability distribution and, at each local restart, increases the probability associated with the value of \(\delta _{\mathrm{local}}\) that led to a transition from one local minimum to another. The range of \(\delta _{\mathrm{local}}\) is also adapted, by taking the mean and the minimum distance among the local minima in A.

The number of local restarts, instead, is dictated by the contraction of a population within the basin of attraction of an already identified local minimum. Given a local minimum \({\mathbf {x}}_{\mathrm{LM}}\in A\) and a list of \(n_{\mathrm{best},LM}\) best individuals from which a local search converged to \({\mathbf {x}}_{\mathrm{LM}}\), the size of the basin of attraction of \({\mathbf {x}}_{\mathrm{LM}}\) is defined as

$$\begin{aligned} d_{\mathrm{basin},LM} = \min _j || {\mathbf {x}}_{\mathrm{best},j} - {\mathbf {x}}_{\mathrm{LM}}||,\;\; j\in \{1,\dots ,n_{\mathrm{best},LM}\} \end{aligned}$$
(8)

Each local minimum \({\mathbf {x}}_{\mathrm{LM}}\) in A is therefore associated with a particular \(d_{\mathrm{basin},LM}\). Figure 1 illustrates this mechanism. Once \(d_{\mathrm{basin},LM}\) is estimated, every time the condition \(\rho _m^{(k)}\le {\bar{\rho }} \rho _{m,\mathrm{max}}\) is satisfied for population m, if the best individual \({\mathbf {x}}_{\mathrm{best},m}\) is at a distance lower than \(d_{\mathrm{basin},LM}\) from \({\mathbf {x}}_{\mathrm{LM}}\), no local restart is performed and the population is restarted globally in the search space. The number \(n_{\mathrm{best},LM}\) is set to 4 in this implementation.
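The decision rule just described can be sketched as follows; the helper names and the archive layout (a list of minimum/radius pairs) are illustrative assumptions.

```python
import numpy as np

def basin_radius(x_lm, best_points):
    """Estimate d_basin_LM (Eq. (8)): the minimum distance from x_LM of the
    best individuals whose local searches converged to x_LM."""
    return min(float(np.linalg.norm(np.asarray(b) - x_lm)) for b in best_points)

def restart_type(x_best, archive):
    """Decide the restart type for a contracted population (Sect. 4).

    `archive` is a list of (x_LM, d_basin_LM) pairs. If the best individual
    falls inside the estimated basin of an archived minimum, that basin is
    assumed already explored and the population is restarted globally;
    otherwise a local restart (preceded by a local search) is performed.
    """
    for x_lm, d_basin in archive:
        if float(np.linalg.norm(x_best - x_lm)) < d_basin:
            return "global"
    return "local"
```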

The overall algorithm, called Multi-Population Adaptive Inflationary Differential Evolution Algorithm (MP-AIDEA), is described in more detail in the following section.

Fig. 1 Identification of the basin of attraction of local minimum \({\mathbf {x}}_{\mathrm{LM}}\)

5 Multi-population adaptive inflationary differential evolution

(Algorithm 1)

MP-AIDEA is described in Algorithm 1. Let \(n_{\mathrm{pop}}\) be the number of populations and m the index identifying each population. With reference to Algorithm 1, after the initialisation of the main parameters and functionalities (Algorithm 1, line 1), MP-AIDEA starts by running \(n_{\mathrm{pop}}\) DE processes in parallel, one per population (Algorithm 1, line 3). During each evolution process, the parameters F and \(\textit{CR}\) are automatically adapted following the approach presented in Sect. 5.2. When a population m contracts within a ball \(B_\mathrm{c}\) of radius \({\bar{\rho }}\; \rho _{m,\mathrm{max}}\), the evolution of that population is stopped. Once all the populations have contracted, the position of the best individual of each population, \({\mathbf {x}}_{\mathrm{best},m}\), relative to the local minima in A, \({\mathbf {x}}_{\mathrm{LM}}\), is assessed (Algorithm 1, line 7). This step makes use of all the minima found by all populations and, therefore, has to be regarded as an information-sharing mechanism among populations. If the best individual of population m is not within the basin of attraction of any previously detected local minimum (that is, \(\forall LM \; : \; \Vert {\mathbf {x}}_{\mathrm{best},m} - {\mathbf {x}}_{\mathrm{LM}} \Vert > d_{\mathrm{basin},LM}\)), then a local search is run (Algorithm 1, line 8) and the resulting local minimum is stored in the archive A (Algorithm 1, line 16). The flag for the local restart, \(LR_m\), is set to 1. On the contrary, if the best individual of population m is inside the basin of attraction of a previously detected local minimum, the local search is not performed and \(LR_m\) is set to 0 (Algorithm 1, line 20).

Before running a local or a global restart (Algorithm 1, line 24), the probability distribution associated with \(\delta _{\mathrm{local}}\) and its range are updated (Algorithm 1, line 23). After restarting the population, if the maximum number of function evaluations, \(n_{\mathrm{feval,max}}\), is not exceeded, the process restarts from line 2 of Algorithm 1. Each part of Algorithm 1 is explained in more detail hereafter.

5.1 Initialisation

The steps for the initialisation of MP-AIDEA are presented in Algorithm 2. MP-AIDEA starts with the initialisation of \(n_{\mathrm{pop}}\) populations, with \(N_{\mathrm{pop}}\) individuals each, in the search space B. The number of function evaluations for each population is set to zero, \(n_{\mathrm{feval},m} = 0\), and \({\bar{\rho }}\) and \(\delta _{\mathrm{global}}\) are initialised to the values specified by the user. The counter of the number of local searches per population, \(s_m\), is set to 0.

(Algorithm 2)

5.2 Differential evolution and the adaptation of \(\textit{CR}\) and F

For each population m, a DE process is run (Algorithm 3, line 6), using Eqs. (2), (3), (4) and (6). The parameter G in Eq. (3) takes the value 0 or 1, each with probability 0.5. During the advancement from parents to offspring, each individual of the population is associated with a different value of \(\textit{CR}\) and F, drawn from a distribution \(\mathbf {CRF}_m^{(k_m)}\) (Algorithm 3, lines 1–3). \(\mathbf {CRF}_m^{(k_m=1)}\) is initialised as a uniform distribution with \((n_\mathrm{D}+1)^2\) points in the space \(\textit{CR} \in [0.1, 0.99]\) and \(F \in [-0.5, 1]\) (Algorithm 3, line 1). A Gaussian kernel is then allocated to each node, and a probability density function is built with the Parzen approach (Minisci and Vasile 2014). The values of \(\textit{CR}\) and F to be associated with the individuals of the population are drawn from this distribution (Algorithm 3, line 4). A change value dd linked to each kernel is initialised to zero (Algorithm 3, line 3) and is used during the advancement of the population from parents to children to adapt \(\textit{CR}\) and F (Algorithm 3, line 8). The adaptation of \(\textit{CR}\) and F is summarised in Algorithm 4 and described in the following.
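A possible sketch of this initialisation and sampling step follows; the kernel bandwidths and function names are assumptions, not the settings of Minisci and Vasile (2014).

```python
import numpy as np

def init_crf(n_d, cr_range=(0.1, 0.99), f_range=(-0.5, 1.0)):
    """Initialise the CRF distribution as a uniform grid of (n_d+1)^2 nodes
    over CR in [0.1, 0.99] and F in [-0.5, 1], with the change value dd of
    every kernel set to zero (Algorithm 3, lines 1 and 3)."""
    cr_nodes = np.linspace(cr_range[0], cr_range[1], n_d + 1)
    f_nodes = np.linspace(f_range[0], f_range[1], n_d + 1)
    crf = np.array([[c, w] for c in cr_nodes for w in f_nodes])
    dd = np.zeros(len(crf))
    return crf, dd

def sample_crf(crf, n_samples, bandwidth=(0.05, 0.1), seed=0):
    """Draw (CR, F) pairs from a Parzen-style density: choose a node at
    random and perturb it with a Gaussian kernel. The bandwidths here are
    illustrative assumptions."""
    rng = np.random.default_rng(seed)
    idx = rng.integers(0, len(crf), size=n_samples)
    return crf[idx] + rng.normal(0.0, bandwidth, size=(n_samples, 2))
```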

(Algorithms 3 and 4)

For each individual i of each population m, the adaptation mechanism for \(\textit{CR}\) and F is started only if the child has an objective function value lower than the parent's, that is \(f({\mathbf {x}}_{m,i}^{(k_m+1)}) < f({\mathbf {x}}_{m,i}^{(k_m)}) \) (Algorithm 4, line 1). If this condition is verified, the difference in objective function between parent and child at subsequent generations, \(df_{m,i}^{(k_m+1)} = |f ( {\mathbf {x}}_{m,i}^{(k_m+1)} ) - f ({\mathbf {x}}_{m,i}^{(k_m)} ) |\), is computed (Algorithm 4, line 2). Then the sorted elements of \(\mathbf {CRF}_{m}^{(k_m)}\) are sequentially evaluated; the q-th value of \(\textit{CR}\) in \(\mathbf {CRF}_m^{(k_m)}\) is identified as \(\mathbf {CRF}_{m,q,1}^{(k_m)}\) and the q-th value of F is identified as \(\mathbf {CRF}_{m,q,2}^{(k_m)}\). The first time that \(dd_{m,q}^{(k_m)}\) (the dd value associated with the q-th row of \(\mathbf {CRF}_{m}^{(k_m)}\)) is lower than \(df_{m,i}^{(k_m+1)}\) (Algorithm 4, line 4), the differential weight \(F_{m,i}^{(k_m)}\) used for the individual \({\mathbf {x}}_{m,i}^{(k_m)}\) substitutes \(\mathbf {CRF}_{m,q,2}^{(k_m)}\) and \(df_{m,i}^{(k_m+1)}\) substitutes \(dd_{m,q}^{(k_m)}\) (Algorithm 4, lines 5 and 6). This is because \(F_{m,i}^{(k_m)}\) produced a larger decrease in the objective function than \(\mathbf {CRF}_{m,q,2}^{(k_m)}\) (as shown by \(df_{m,i}^{(k_m+1)} > dd_{m,q}^{(k_m)}\)). For \(\textit{CR}\), the value associated with \({\mathbf {x}}_{m,i}^{(k_m)}\) substitutes \(\mathbf {CRF}_{m,q,1}^{(k_m)}\) (Algorithm 4, line 8) only if \(df_{m,i}^{(k_m+1)}\) is greater than a given value CRC (Algorithm 4, line 7) (Minisci and Vasile 2014).
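The update just described can be sketched as follows; this is an illustrative reading of Algorithm 4, and the threshold CRC is left as a parameter since its value is not reported here.

```python
import numpy as np

def update_crf(crf, dd, cr_used, f_used, df, crc):
    """Sketch of the CR/F adaptation of Algorithm 4.

    Scans the nodes in order; at the first node q whose recorded improvement
    dd[q] is smaller than the improvement df achieved with (cr_used, f_used),
    f_used replaces the node's F value and df replaces dd[q]. CR is replaced
    only if df also exceeds the threshold crc.
    """
    for q in range(len(crf)):
        if dd[q] < df:
            crf[q, 1] = f_used
            if df > crc:
                crf[q, 0] = cr_used
            dd[q] = df
            break
    return crf, dd
```

In this way, nodes that produced small improvements are progressively overwritten by the \((\textit{CR}, F)\) pairs that proved more successful, which biases future sampling towards them.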

The DE stops according to the contraction condition presented in Sect. 3. In order to prevent an excessive use of resources when the population partitions, a fail-safe criterion was introduced that stops the DE after \(10\, n_\mathrm{D}\) generations (Algorithm 3, line 13).

5.3 Local search and restart mechanisms

After the evolution of all populations has stopped, MP-AIDEA checks whether the best individual of each population is inside the basin of attraction of any previously detected local minimum (see Algorithm 1, line 7). If that is not the case, a local search is performed from the best individual and the population is locally restarted within a hypercube with edge equal to \(2\delta _{\mathrm{local}}\) around the detected local minimum; otherwise, no local search is performed and the population is restarted globally in the whole search space (Algorithm 1, line 24). Prior to the implementation of the restart mechanisms, MP-AIDEA updates the estimation of the size of the basin of attraction of each minimum, the archive A (see Algorithm 1, lines 5 to 22) and the distribution over the possible values of \(\delta _{\mathrm{local}}\) (see Algorithm 1, line 23). In the following, the identification of the basin of attraction, the estimation of \(\delta _{\mathrm{local}}\) and the two restart mechanisms are described in more detail.

5.3.1 Identification of the basin of attraction

In order to mitigate the possibility of running multiple local searches that converge to already discovered local minima, MP-AIDEA estimates, for each local minimum in A, the radius of the basin of attraction of that local minimum. The radius of the basin of attraction is here defined as the distance \(d_{\mathrm{basin},LM}\) from a given local minimum \({\mathbf {x}}_{\mathrm{LM}}\) such that, if the best individual in population m, \({\mathbf {x}}_{\mathrm{best},m}\), is at a distance from \({\mathbf {x}}_{\mathrm{LM}}\) lower than \(d_{\mathrm{basin},LM}\), a local search starting from \({\mathbf {x}}_{\mathrm{best},m}\) would converge to \({\mathbf {x}}_{\mathrm{LM}}\).

The radius \(d_{\mathrm{basin},LM}\) is estimated with the simple procedure in Algorithm 1, lines 7 to 19. Once the evolution of all populations has stopped, the distance \(\Vert {\mathbf {x}}_{\mathrm{best},m}-{\mathbf {x}}_{\mathrm{LM}}\Vert \) of the best individual of each population with respect to all the minima in A is calculated and compared to the \(d_{\mathrm{basin},LM}\) associated with each local minimum in A; initially, all \(d_{\mathrm{basin},LM}\) are set to 0. If the distance \(\Vert {\mathbf {x}}_{\mathrm{best},m}-{\mathbf {x}}_{\mathrm{LM}}\Vert \) is greater than \(d_{\mathrm{basin},LM}\), a local search is started from \({\mathbf {x}}_{\mathrm{best},m}\). If the resulting local minimum \({\mathbf {x}}_{\mathrm{min},m}^{(s_m)}\) already belongs to A, the counter \(i_{\mathrm{LM}}\) is updated and the new estimate of the basin of attraction of \({\mathbf {x}}_{\mathrm{LM}}\) becomes \(d_{\mathrm{basin},LM}=\min [d_{\mathrm{basin},LM},\Vert {\mathbf {x}}_{\mathrm{best},m}-{\mathbf {x}}_{\mathrm{LM}}\Vert ]\). \({\mathbf {x}}_{\mathrm{min},m}^{(s_m)}\) belongs to A if \( \exists \; LM \; : \; \Vert {\mathbf {x}}_{\mathrm{min},m}^{(s_m)} - {\mathbf {x}}_{\mathrm{LM}} \Vert \le \varepsilon \varDelta \), where \(\varepsilon \) is set to \(10^{-3}\). If \(i_{\mathrm{LM}}\) exceeds a given maximum value and \(\Vert {\mathbf {x}}_{\mathrm{best},m}-{\mathbf {x}}_{\mathrm{LM}}\Vert < d_{\mathrm{basin},LM} \; \forall \; LM\), no local search and no local restart are performed. The counter \(i_{\mathrm{LM}}\) is initialised to 1 for every new local minimum and keeps track of the number of times a local minimum is discovered.

5.3.2 Adaptation of \(\delta _{\mathrm{local}}\)

When a population m is locally restarted, individuals are generated by taking a random sample, with Latin Hypercube sampling, within a hypercube with edge equal to \(2\delta _{local,m}\). The dimension \(\delta _{local,m}\) is drawn from a probability distribution that is progressively updated at every restart. We use a kernel approach with kernels centred in the elements of a vector \({\mathbf {B}}\) (see Algorithm 6) containing a range of possible values of \(\delta _{local,m}\). The vector \({\mathbf {B}}\) is initialised, with the procedure presented in Algorithm 5, once all populations have performed a local search for the first time and at every global restart. During initialisation, the distance between all the local minima in the archive A is computed (Algorithm 5, line 1) and \({\mathbf {B}}\) is initialised with values spanning the interval between the minimum and the mean distance among minima (Algorithm 5, lines 2–3). The mean value, instead of the maximum, is used to limit the size of the restart bubble and speed up convergence, under the assumption that a local restart needs to lead to the local exploration of the search space. In the experimental tests, it will be shown that this working assumption is generally verified and that \(\delta _{local,m}\) tends to converge to small values. Then, a second vector \({{\mathbf {d}}}{{\mathbf {d}}}_{b}\), with the same number of components as \({\mathbf {B}}\), is initialised to zero (Algorithm 5, line 4).

During the update phase of \(\delta _{local,m}\), MP-AIDEA uses the index \(s_m\) to keep track of the number of times population m performed a local search and calculates the difference \(p_m\) between two subsequent local minima (see Algorithm 6, line 5). The value \(p_m\) is then compared to the elements in \({{\mathbf {d}}}{{\mathbf {d}}}_{b}\): if \(dd_{b,q} < p_m\), then \(\delta _{local,m}\) replaces \(B_q\) and \(p_m\) replaces \(dd_{b,q}\) (Algorithm 6, lines 7–10). In other words, if the \(\delta _{local,m}\) used to restart population m led to a local minimum \({\mathbf {x}}_{\mathrm{min},m}^{(s_m)}\) different from \({\mathbf {x}}_{\mathrm{min},m}^{(s_m-1)}\), the local minimum previously identified by the same population, the probability of sampling \(\delta _{local,m}\) is increased.
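A sketch of the initialisation of \({\mathbf {B}}\) (Algorithm 5) and of its update (Algorithm 6) follows; the number of nodes and the function names are assumptions.

```python
import numpy as np

def init_b(minima, n_nodes=10):
    """Initialise the vector B (Algorithm 5): nodes spanning the interval
    between the minimum and the mean pairwise distance of the local minima
    in the archive A; dd_b starts at zero. n_nodes is an assumption."""
    minima = np.asarray(minima)
    dists = [float(np.linalg.norm(a - b))
             for i, a in enumerate(minima) for b in minima[i + 1:]]
    b = np.linspace(min(dists), float(np.mean(dists)), n_nodes)
    dd_b = np.zeros(n_nodes)
    return b, dd_b

def update_b(b, dd_b, delta_used, p_m):
    """Sketch of Algorithm 6: reward the delta_local that led from one local
    minimum to a different one. p_m, the difference between two subsequent
    local minima, overwrites the first node with dd_b[q] < p_m, and the
    delta actually used replaces the node value B_q."""
    for q in range(len(b)):
        if dd_b[q] < p_m:
            b[q] = delta_used
            dd_b[q] = p_m
            break
    return b, dd_b
```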

(Algorithms 5, 6 and 7)

5.3.3 Local and global restart

After the identification of the basin of attraction and the update of the value of \(\delta _{\mathrm{local}}\), populations undergo a restart process in which a new population is generated either by sampling a neighbourhood of a local minimum (local restart) or by sampling the whole search space (global restart). The two restart procedures are described in Algorithm 7.

The local restart procedure takes the latest identified local minimum \({\mathbf {x}}_{\mathrm{min},m}^{(s_m)}\) of population m and restarts the population with Latin Hypercube sampling in a box centred in \({\mathbf {x}}_{\mathrm{min},m}^{(s_m)}\) with edge length \(2\delta _{local,m}\).

The global restart procedure identifies clusters of local minima with a Fuzzy C-Mean algorithm (Bezdek 1981), computes the centre of each cluster and initialises population m so that each individual is at distance at least \(\sqrt{n_\mathrm{D}}\delta _{\mathrm{global}}\) from each of the centres of the clusters (Algorithm 7, lines 6 and 7).
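The two restart procedures can be sketched as follows; the Latin Hypercube construction and the rejection sampling for the global restart are implementation assumptions, and clustering (performed with Fuzzy C-Means in the paper) is abstracted into a list of precomputed cluster centres.

```python
import numpy as np

def local_restart(x_lm, delta, n_pop, seed=0):
    """Local restart (Algorithm 7): Latin Hypercube sample of n_pop
    individuals inside the hypercube of edge 2*delta centred at x_lm."""
    rng = np.random.default_rng(seed)
    n_d = len(x_lm)
    # one stratum per individual in each dimension, independently permuted
    perm = np.argsort(rng.random((n_pop, n_d)), axis=0)
    unit = (perm + rng.random((n_pop, n_d))) / n_pop      # LHS points in [0, 1)
    return np.asarray(x_lm) - delta + 2.0 * delta * unit

def global_restart(lower, upper, centres, delta_global, n_pop, seed=0):
    """Global restart (Algorithm 7): sample the whole box, rejecting points
    closer than sqrt(n_D)*delta_global to any cluster centre of the archived
    minima. Simple rejection sampling is an assumption of this sketch."""
    rng = np.random.default_rng(seed)
    d_min = np.sqrt(len(lower)) * delta_global
    pop = []
    while len(pop) < n_pop:
        x = rng.uniform(lower, upper)
        if all(float(np.linalg.norm(x - c)) >= d_min for c in centres):
            pop.append(x)
    return np.array(pop)
```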

At each local and global restart, the \(\mathbf {CRF}\) matrix is re-initialised, while the vector \({\mathbf {B}}\) is re-initialised only after every global restart. The motivation for re-initialising \(\mathbf {CRF}\) at every restart is twofold: on the one hand, different values of \(\textit{CR}\) and F might be optimal in different parts of the search space; on the other hand, convergence to the optimal values of \(\textit{CR}\) and F is not always guaranteed. In search spaces with uniform and homogeneous structures, re-initialising \(\mathbf {CRF}\) and \({\mathbf {B}}\) might introduce a computational overhead; therefore, in future implementations we will test the possibility of retaining \(\mathbf {CRF}\) and \({\mathbf {B}}\) across the restart process.

5.4 Computational complexity

The computational complexity of MP-AIDEA is determined by three main sets of operations:

  • Local search. The local search uses the Matlab fmincon function, which implements an SQP scheme with a BFGS estimation of the Hessian matrix. Since the matrix is generally dense, its decomposition is \({\mathcal {O}}(n_{D}^3)\).

  • Adaptation of \(\textit{CR}\) and F. The adaptation of \(\textit{CR}\) and F for each individual in each population is the other expensive part of the algorithm and is \({\mathcal {O}}(n_{\mathrm{pop}}N_{\mathrm{pop}}n_{D}^2)\) (see line 2 in Algorithm 1, line 8 in Algorithm 3 and line 3 in Algorithm 4). As a comparison, the computational complexity of the standard DE is \({\mathcal {O}}\left( N_{\mathrm{pop}} \right) \).

  • Restart mechanisms. The cost of the local restart procedure is limited to the generation of \(n_{\mathrm{pop}} N_{\mathrm{pop}}\) individuals, while the global restart has a cost associated with clustering, which is \({\mathcal {O}}(n_{\mathrm{LM}}^2 n_\mathrm{D} n_{\mathrm{iter}})\) (Bezdek 1981), where \(n_{\mathrm{iter}}\) is the number of iterations of the clustering, and a cost associated with the verification that the new population is far from the clusters, which is \({\mathcal {O}}(N_{\mathrm{pop}}n_{\mathrm{LM}})\) (see line 7 of Algorithm 7).

Overall, when \(n_{\mathrm{pop}}N_{\mathrm{pop}}<n_{D}\), the dominant algorithmic cost is the local search, while the adaptation of \(\textit{CR}\) and F becomes more expensive for large and numerous populations. Since in the experimental test cases we will use \(N_{\mathrm{pop}}=n_\mathrm{D}\) and \(n_{\mathrm{pop}}=4\), the overall algorithmic complexity remains \({\mathcal {O}}(n_{D}^3)\).

Table 1 Functions of the CEC 2005 test set
Table 2 Functions of the CEC 2011 test set
Table 3 Functions of the CEC 2014 test set

6 Experimental performance analysis

The effectiveness of MP-AIDEA is tested on a benchmark composed of three test sets, made of functions taken from three past competitions of the Congress on Evolutionary Computation (CEC). We took 20 functions from CEC 2005 (Suganthan et al. 2005), 9 real-world problems from CEC 2011 (Das and Suganthan 2010) and 22 functions from CEC 2014 (Liang et al. 2013), for a total of 51 different problems. The list of functions used in each test set is reported in Tables 1, 2 and 3. They include both academic test functions and real-world optimisation problems. Since we are interested in solving problem (1), all functions selected for this benchmark are continuous and differentiable.

We used four different metrics to evaluate MP-AIDEA against the algorithms that participated in the three CEC competitions:

  • Metric 1: Best, worst, median, mean and standard deviation of the best result over a given number of independent runs of the algorithm.

  • Metric 2: Ranking against the other algorithms using the same ranking approach proposed in the CEC 2011 competition.

  • Metric 3: Wilcoxon test. This is used to compare MP-AIDEA to the algorithms participating in the CEC 2011 and CEC 2014 competitions for which the source code is available online.

  • Metric 4: Success rate. This is used to compare MP-AIDEA to the algorithms participating in the CEC 2011 and CEC 2014 competitions for which the source code is available online.

Table 4 Settings for the CEC 2005, CEC 2011 and CEC 2014 test functions
Table 5 Objective functions error of the CEC 2005 test set in dimension 10D and 30D

The settings of MP-AIDEA were maintained constant for all problems within a particular test set and were changed going from one test set to another. This is in line with the way all the other algorithms competed. Table 4 summarises the parameters and settings used for the CEC 2005, CEC 2011 and CEC 2014 test functions. More details about the chosen parameters will be given in Sect. 6.1.

The ranking of the algorithms participating in every competition was adjusted to account only for their performance on the selected subset of differentiable functions.

It will be shown that all metrics lead to similar conclusions: MP-AIDEA ranks among the first four algorithms, if not first, in all three test sets and for all dimensions. We will also show that MP-AIDEA can detect previously undiscovered minima on some particularly difficult functions.

The current implementation of MP-AIDEA can be found open source at https://github.com/strath-ace/smart-o2c together with the benchmark of test cases.

6.1 Test sets

This section briefly describes each test set, the settings of MP-AIDEA and metric 1 for all test sets.

Table 6 Objective functions error of the CEC 2005 test set in dimension 50D
Table 7 CEC 2005 best objective function error values for functions 13 and 16, \(n_\mathrm{D}=10\)

6.1.1 CEC 2005 test set

Following the rules of the CEC 2005 competition, MP-AIDEA was applied to the solution of the problems in the CEC 2005 test set in dimension \(n_\mathrm{D} = 10\), 30 and 50, with a maximum number of function evaluations equal to \(n_{\mathrm{feval,max}} = 10000 n_\mathrm{D}\). The experiments were repeated for a total of \(n_{\mathrm{runs}} = 25\) independent runs for each function (Suganthan et al. 2005). Functions 4, 17, 24 and 25 of the CEC 2005 competition were not included in the test set because they are non-differentiable.

The number of populations in MP-AIDEA was set to \(n_{\mathrm{pop}} = 4\) and the number of individuals in each population was set to \(N_{\mathrm{pop}} = n_\mathrm{D}\). The number of populations to be deployed on a particular problem depends on the type and complexity of that problem, and on the available number of function evaluations. We tested MP-AIDEA with different numbers of populations from 1 to 4 (results using MP-AIDEA with one population are presented in Sect. 6.2). Results showed that MP-AIDEA with 4 populations performs consistently well on all benchmarks, and, thus, we decided to present our findings for \(n_{\mathrm{pop}}=4\). The contraction limit was set to \({\bar{\rho }} = 0.2\) and the global restart distance was set to \(\delta _{\mathrm{global}} = 0.1\) (Table 4). In line with the metrics presented at the CEC 2005 competition, Tables 5 and 6 report the difference, in the objective value, between the result obtained with MP-AIDEA and the known global minimum.

Table 8 Objective functions of the CEC 2011 test set

Table 7 reports the best objective function error values obtained by all the algorithms participating in the CEC 2005 competition and MP-AIDEA for functions 13 and 16 and \(n_\mathrm{D} = 10\). According to the CEC 2005 specifications, the accuracy level for the detection of the global minimum is \(10^{-2}\) for these functions. MP-AIDEA is able to identify the global minimum of both functions 13 and 16. Previously only EvLib (Becker 2005) succeeded in identifying the global minimum of function 13 and no other algorithm managed to find the global minimum of function 16.

6.1.2 CEC 2011 test set

Following the rules of the CEC 2011 competition (Das and Suganthan 2010), MP-AIDEA was run for \(n_{\mathrm{feval,max}} = 150000\) function evaluations on the CEC2011 test set. The experiments were repeated for \(n_{\mathrm{runs}} = 25\) independent runs. Test functions with equality and inequality constraints were not included in the tests. The number of populations \(n_{\mathrm{pop}}\) was set to 4 and the number of individuals in each population was set to \(N_{\mathrm{pop}}=30\) regardless of the dimensionality of the problem. The contraction limit and the global restart distance were set, respectively, to \({\bar{\rho }} = 0.2\) and \(\delta _{\mathrm{global}} = 0.1\) (Table 4). Table 8 reports the best, worst, median, mean objective function found by MP-AIDEA and the associated standard deviation.

6.1.3 CEC 2014 test set

In line with the rules of the CEC 2014 competition (Liang et al. 2013), MP-AIDEA was applied to the solution of the functions in the CEC 2014 test set in dimension \(n_\mathrm{D} = 10\), 30, 50 and 100, with maximum number of function evaluations \(n_{\mathrm{feval,max}} = 10000 n_\mathrm{D}\). The experiments were repeated for \(n_{\mathrm{runs}} = 51\) independent runs. Non-differentiable functions 6, 12, 19, 22, 26, 27, 29 and 30 were not included in the test set (see Table 3). The number of populations was set to \(n_{\mathrm{pop}} = 4\) and the number of individuals in each population was set to \(N_{\mathrm{pop}} = n_\mathrm{D}\). The contraction limit and the global restart distance were set, respectively, to \({\bar{\rho }} = 0.2\) and \(\delta _{\mathrm{global}} = 0.1\) (Table 4).

Tables 9 and 10 report the difference between the objective value found by MP-AIDEA and the known global minimum. In agreement with the guidelines of the competition, error values smaller than \(10^{-8}\) are reported as zero (Liang et al. 2013). Table 11 reports the best objective function values obtained by all the algorithms participating in the competition and MP-AIDEA for functions 9, 10, 11 and 15 in 10 dimensions. MP-AIDEA finds the global minimum of function 11, unlike all the other competing algorithms, and gives good results for the other functions.

Table 9 Objective functions error of the CEC 2014 test set in dimension 10D and 30D
Table 10 Objective functions error of the CEC 2005 test set in dimension 50D and 100D
Table 11 CEC 2014 best objective function error values for functions 9, 10, 11 and 15, \(n_\mathrm{D}=10\)

6.2 Ranking

In this section, MP-AIDEA is ranked against a group of algorithms participating in each CEC competition. The rankings include those algorithms that reported their results in a paper and MP-AIDEA with two different settings:

  • \(n_{\mathrm{pop}} = 4\) and \(N_{\mathrm{pop}} = n_\mathrm{D}\). These settings will be indicated as “MP-AIDEA” in the following and correspond to the settings used to generate the results in Sect. 6.1.

  • \(n_{\mathrm{pop}}=1\), \(N_{\mathrm{pop}} = 4 n_\mathrm{D}\); MP-AIDEA adapts \(\textit{CR}\) and F but uses fixed values for \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\). In particular, \(n_{\mathrm{LR}} = 10\) and \(\delta _{\mathrm{local}} = 0.1\), unless otherwise specified. These settings will be indicated as “MP-AIDEA, \(n_{\mathrm{pop}}=1\)” in the following.

The ranking method follows the rules of the CEC 2011 competition (Suganthan 2011). All algorithms are ranked on the basis of the best and mean values of the objective function obtained over a certain number of runs. The following procedure is used to obtain the ranking:

  • for each function, algorithms are ranked according to the best objective value;

  • for each function, algorithms are ranked according to the mean objective value;

  • the rankings for the best and mean objective values of a particular algorithm are added up over all the problems to obtain the absolute ranking.
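
For illustration, the procedure above can be sketched as follows. This is a minimal sketch with hypothetical data and function names of our choosing; the actual comparison uses the published CEC results.

```python
def absolute_ranking(best, mean):
    """CEC 2011-style ranking: rank the algorithms per function on the best
    and on the mean objective value, then sum the ranks over all problems.

    `best` and `mean` map algorithm name -> list of values, one value per
    test function (lower is better)."""
    algos = list(best)
    n_fun = len(next(iter(best.values())))
    total = {a: 0 for a in algos}
    for table in (best, mean):
        for f in range(n_fun):
            # rank 1 = best objective value on function f
            ordered = sorted(algos, key=lambda a: table[a][f])
            for rank, a in enumerate(ordered, start=1):
                total[a] += rank
    # absolute ranking: lowest total rank first
    return sorted(algos, key=lambda a: total[a])

# Hypothetical results on two functions for two algorithms
best = {"ALG-A": [0.1, 0.3], "ALG-B": [0.2, 0.5]}
mean = {"ALG-A": [0.2, 0.4], "ALG-B": [0.4, 0.6]}
print(absolute_ranking(best, mean))  # ['ALG-A', 'ALG-B']
```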

In the following, the rankings obtained for the CEC 2005, CEC 2011 and CEC 2014 test sets are presented.

6.2.1 CEC 2005 test set

The rankings obtained for \(n_\mathrm{D} = 10\), \(n_\mathrm{D}=30\) and \(n_\mathrm{D} = 50\) are reported in Table 12. Only the competing algorithms whose papers also report results for the hybrid functions of the CEC 2005 competition (Table 1) are considered. Results show that, for \(n_\mathrm{D} = 10\) and \(n_\mathrm{D} = 30\), MP-AIDEA with adaptation of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\) is ranked first, while for \(n_\mathrm{D} = 50\) results are better when using MP-AIDEA with non-adapted \(\delta _{\mathrm{local}}=0.1\) and \(n_{\mathrm{LR}} = 10\). In any case, both settings outperform the winning algorithm of the CEC 2005 competition.

Table 12 CEC 2005 algorithms ranking
Table 13 CEC 2011 algorithms ranking

6.2.2 CEC 2011 test set

The results obtained on the CEC 2011 test set are reported in Table 13. MP-AIDEA ranks first if problem 13 (the Cassini 2 Spacecraft Trajectory Optimisation Problem) is excluded from the test set and second if it is included.

The reason can be found in Fig. 2, which shows the convergence profile of the best solutions found by MP-AIDEA and by GA-MPC, the best algorithm of the competition, on function 13 for an increasing number of function evaluations (greater than the limit prescribed by the CEC 2011 competition). The results for GA-MPC are obtained using the code available online (http://www3.ntu.edu.sg/home/epnsugan/index_files/CEC11-RWP/CEC11-RWP.htm).

On this test problem, GA-MPC converges very rapidly to a local minimum but then stagnates. On the contrary, MP-AIDEA has a slower convergence for the first 200,000 function evaluations but then progressively finds better and better minima as the number of function evaluations increases. This demonstrates that in a realistic scenario in which function evaluations are not arbitrarily limited, MP-AIDEA would provide better results than the algorithm that won the competition.

Results in Table 13 show that MP-AIDEA with adaptation of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\) performs better than MP-AIDEA with fixed values of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\). The adaptation history of \(\delta _{\mathrm{local}}\) is shown in Fig. 3 for each of the four populations on test functions 12 and 13 and for 600,000 function evaluations.

Fig. 2 Best values of MP-AIDEA and GA-MPC for Function 13, CEC 2011

Fig. 3 \(\delta _{\mathrm{local}}\) for the four populations of MP-AIDEA for functions 12 (top) and 13 (bottom), CEC 2011

Table 14 CEC 2014 algorithms ranking

6.2.3 CEC 2014 test set

The ranking results for the CEC 2014 test set are reported in Table 14. MP-AIDEA with one population is tested in this case with \(\delta _{\mathrm{local}}=0.1\) and \(\delta _{\mathrm{local}} = 0.3\). For \(n_\mathrm{D} = 10\), the results of MP-AIDEA with adaptation of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\) are better than those of MP-AIDEA with fixed values of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\), for both \(\delta _{\mathrm{local}}=0.1\) and \(\delta _{\mathrm{local}}=0.3\). In the other cases, MP-AIDEA with fixed values of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\) outperforms MP-AIDEA with adaptation of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\) when \(\delta _{\mathrm{local}} = 0.1\) but not when \(\delta _{\mathrm{local}} = 0.3\). These results show the strong influence of this parameter on the results obtained by MP-AIDEA. The adaptation history of \(\delta _{\mathrm{local}}\) for test functions 9, 17 and 25 at \(n_\mathrm{D}=30\) and 300,000 function evaluations is shown in Fig. 4.

These figures show that the adaptation of \(\delta _{\mathrm{local}}\) is effective when a sufficient number of adaptation steps can be performed within the limit on the maximum number of function evaluations (300,000 in this case). For function 25, for example, only 7 adaptation steps are performed, compared to 11 for function 17 and 18 for function 9. In these last two cases \(\delta _{\mathrm{local}}\) converges to 0.1 and 0.04, respectively.

Fig. 4 \(\delta _{\mathrm{local}}\) for the four populations of MP-AIDEA for functions 9 (top), 17 (middle) and 25 (bottom), \(n_\mathrm{D}=30\), CEC 2014

The performance of MP-AIDEA on the 30D functions of the CEC 2014 test set is further investigated to test the dependence of the results on the two non-adapted parameters, \({\bar{\rho }}\) and \(\delta _{\mathrm{global}}\). Table 15 shows the ranking obtained when varying \({\bar{\rho }}\) and \(\delta _{\mathrm{global}}\).

Case B of Table 15 shows the ranking obtained when using \({\bar{\rho }} = 0.3\) instead of \({\bar{\rho }} = 0.2\). Comparing the results in Table 15 with those in Table 14, it is possible to see that MP-AIDEA performs better using \({\bar{\rho }} = 0.3\) than \({\bar{\rho }} = 0.2\), moving from the fourth to the third position in the ranking. At the same time, there is no significant dependence on the value of \(\delta _{\mathrm{global}}\), as shown by Cases C and D in Table 15, where \(\delta _{\mathrm{global}}\) is changed from its nominal value of 0.1 to 0.2 and 0.3.

Table 15 CEC 2014 algorithms ranking, 30D, \({\bar{\rho }}=0.1\) and \({\bar{\rho }}=0.3\)

6.3 Wilcoxon test

The Wilcoxon rank sum test is a nonparametric test for two populations when the samples are independent. In this case, the two populations of samples are, for each problem, the \(n_{\mathrm{runs}}\) values of the objective function obtained by MP-AIDEA and by another algorithm participating in the CEC 2011 and CEC 2014 competitions. No test is performed for the CEC 2005 test set, since the code of none of the algorithms participating in the CEC 2005 competition is available online.

The Wilcoxon test is realised using the Matlab\(^{\textregistered }\) function ranksum. ranksum tests the null hypothesis that data from two entries x and y are samples from continuous distributions with equal medians. Results from ranksum are presented in the following as values of p and h. p, ranging from 0 to 1, is the probability of observing a test statistic as extreme as, or more extreme than, the observed value under the null hypothesis. h is a logical value: \(h = 1\) indicates rejection of the null hypothesis at the \(100 \alpha \)\(\%\) significance level, while \(h = 0\) indicates a failure to reject the null hypothesis at the same level, where \(\alpha = 0.05\). When \(h=1\), the null hypothesis that distributions x and y have equal medians is rejected, and additional tests are conducted to assess which of the two distributions has the lower median. To do so, three types of tests are realised using ranksum for the two distributions x and y:

  • Two-sided hypothesis test: the alternative hypothesis states that x and y have different medians. Two distributions with equal medians will give \(p_B=1\) and \(h_B=0\) (failure to reject the null hypothesis that x and y have equal medians), while two distributions with different medians will give \(p_B = 0\) and \(h_B=1\) (rejection of the null hypothesis that x and y have equal medians). If the two-sided hypothesis test finds that the two distributions have equal medians (\(p_B=1\) and \(h_B=0\)), no further test is conducted. Otherwise, the left-tailed and right-tailed hypothesis tests are conducted.

  • Left-tailed hypothesis test: the alternative hypothesis states that the median of x is lower than the median of y. If the median of x is greater than that of y, the results will be \(p_L=1\) and \(h_L=0\) (failure to reject the hypothesis that x has median greater than y), while if the median of x is lower than that of y the results will be \(p_L=0\) and \(h_L=1\) (rejection of the hypothesis that x has median greater than y).

  • Right-tailed hypothesis test: the alternative hypothesis states that the median of x is greater than the median of y. If the median of x is lower than that of y, the results will be \(p_\mathrm{R}=1\) and \(h_\mathrm{R}=0\) (failure to reject the hypothesis that x has median lower than y), while if the median of x is greater than that of y the results will be \(p_\mathrm{R}=0\) and \(h_\mathrm{R}=1\) (rejection of the hypothesis that x has median lower than y).
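
The three-test procedure can be sketched in Python with a self-contained rank-sum routine. This is a minimal normal-approximation sketch of what MATLAB's ranksum computes, run on hypothetical data of our choosing; it is not a substitute for the validated routine.

```python
import math

def _midranks(values):
    """1-based ranks of `values`, averaging the ranks over ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks

def rank_sum_test(x, y, alternative="two-sided", alpha=0.05):
    """Wilcoxon rank-sum test via the normal approximation (no continuity
    correction). Returns (p, h) with h = 1 if the null hypothesis of equal
    medians is rejected at the 100*alpha % significance level."""
    n1, n2 = len(x), len(y)
    rx = sum(_midranks(list(x) + list(y))[:n1])      # rank sum of sample x
    mu = n1 * (n1 + n2 + 1) / 2                      # mean of rx under H0
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)  # std of rx under H0
    z = (rx - mu) / sigma
    cdf = 0.5 * math.erfc(-z / math.sqrt(2))         # standard normal CDF at z
    if alternative == "less":        # H1: median(x) < median(y)
        p = cdf
    elif alternative == "greater":   # H1: median(x) > median(y)
        p = 1 - cdf
    else:                            # H1: medians differ
        p = 2 * min(cdf, 1 - cdf)
    return p, int(p < alpha)

# Hypothetical objective-value samples from 5 runs of two algorithms
x = [0.11, 0.12, 0.13, 0.10, 0.14]   # stands in for MP-AIDEA
y = [0.52, 0.55, 0.49, 0.60, 0.51]   # stands in for a competitor
p_B, h_B = rank_sum_test(x, y)                 # two-sided test
if h_B:                                        # medians differ: find direction
    p_L, h_L = rank_sum_test(x, y, "less")     # here h_L = 1 (Case 3)
    p_R, h_R = rank_sum_test(x, y, "greater")  # here h_R = 0
```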

If x is the distribution of results of MP-AIDEA and y the distribution of results given by another algorithm, the possible results obtained from the ranksum tests are summarised in Table 16.

Table 16 Wilcoxon test: possible outcomes

Case 1 in Table 16 (\(h_B=0\)) represents a situation in which the distributions of results from MP-AIDEA and a competing algorithm have equal medians (failure to reject the null hypothesis that x and y have equal medians). Case 2 (\(h_B\)=1, \(h_L\)=0 and \(h_\mathrm{R}\)=1) represents a situation in which the median of MP-AIDEA is greater than the median of the other algorithm (rejection of the null hypothesis that x and y have equal medians, failure to reject the hypothesis that x has median greater than y, rejection of the hypothesis that x has median lower than y). Case 3 (\(h_B\)=1, \(h_L\)=1 and \(h_\mathrm{R}\)=0) represents instead a situation in which the median of MP-AIDEA is lower than the median of the other algorithm (rejection of the null hypothesis that x and y have equal medians, rejection of the hypothesis that x has median greater than y, failure to reject the hypothesis that x has median lower than y). In the following, test functions with results corresponding to Cases 1 and 3 are shown in bold (MP-AIDEA has median equal to or lower than the competing algorithm). For Case 3, results with \(p_B < 5 \cdot 10^{-2}\), \(p_L < 5\cdot 10^{-2}\) and \(p_\mathrm{R} > 9.5\cdot 10^{-1}\) are considered significant. Analogously, the competing algorithm has median lower than MP-AIDEA if \(p_B < 5 \cdot 10^{-2}\), \(p_L > 9.5\cdot 10^{-1}\) and \(p_\mathrm{R} < 5 \cdot 10^{-2}\).

6.3.1 CEC 2011 test set

For the CEC 2011 test set, we limited the comparison to the two top algorithms, GA-MPC and DE-\(\varLambda \), for which the code is available online (http://www3.ntu.edu.sg/home/epnsugan/index_files/CEC11-RWP/CEC11-RWP.htm; http://uk.mathworks.com/matlabcentral/fileexchange/39217-hybrid-differential-evolution-algorithm-with-adaptive-crossover-mechanism/content/DE_TCRparam.m). The outcome of the Wilcoxon test comparing MP-AIDEA against GA-MPC, the winning algorithm of the CEC 2011 competition, can be found in Table 17 for all the functions of the test set in Table 2.

Table 17 Outcome of the Wilcoxon test on the CEC 2011 test set: MP-AIDEA versus GA-MPC
Table 18 Outcome of the Wilcoxon test on the CEC 2011 test set: MP-AIDEA versus DE-\(\varLambda \)

The comparison of MP-AIDEA with GA-MPC shows that the median of MP-AIDEA is lower than the median of GA-MPC (Case 3) for functions 2, 5, 6 and 7, while it is higher (Case 2) for functions 1, 3 and 13. Results for functions 10 and 12 are not significant enough to obtain a clear indication.

The outcome of the Wilcoxon test for the comparison of MP-AIDEA with DE-\(\varLambda \) is reported in Table 18.

The comparison of MP-AIDEA with DE-\(\varLambda \) (Table 18) shows that the median of MP-AIDEA is lower than the median of DE-\(\varLambda \) for functions 3, 5, 6, 10, 12 and 13. Results for the remaining functions 1, 2 and 7 are not significant enough to obtain a clear indication.

Table 19 summarises the outcome of the Wilcoxon tests for the CEC 2011 test set. The table reports the number of functions for which the median of MP-AIDEA is lower, equal or higher than the median of the competing algorithm. The results in Table 19 show that MP-AIDEA clearly outperforms DE-\(\varLambda \) and has median lower than GA-MPC for 4 test functions.

Table 19 Summary of Wilcoxon test results, CEC 2011 test set: MP-AIDEA versus GA-MPC and DE-\(\varLambda \). The table reports the number of functions for which the median of MP-AIDEA is equal (Case 1), higher (Case 2) or lower (Case 3) than the median of the competing algorithm

6.3.2 CEC 2014 test set

The codes of the algorithms UMOEAs, CMLSP, L-SHADE and MVMO are available online (http://web.mysites.ntu.edu.sg/epnsugan/PublicSite/Shared%20Documents/Forms/AllItems.aspx). Wilcoxon test results for the comparison of MP-AIDEA with these algorithms at 10, 30, 50 and 100 dimensions are reported in Appendix A (Tables 24, 25, 26, 27, 28, 29, 30, 31).

A summary of the obtained results is given in Table 20, which reports the number of functions for which Cases 1, 2 or 3 in Table 16 occur, and the number of functions for which the results are not significant enough to judge, for \(n_\mathrm{D}\) equal to 10, 30, 50 and 100.

Table 20 Summary of Wilcoxon test results, CEC 2014. The table reports the number of functions for which the median of MP-AIDEA is equal (Case 1), higher (Case 2) or lower (Case 3) than the median of the competing algorithm

For \(n_\mathrm{D} = 10\), the median of MP-AIDEA is lower than that of UMOEAs in 11 cases, while in 3 cases the medians are equal and in 4 cases the median of UMOEAs is lower than that of MP-AIDEA. In 4 cases (functions 10, 17, 20 and 21), the results are not significant enough. For \(n_\mathrm{D}=30\) and \(n_\mathrm{D}=100\), the median of MP-AIDEA is lower than the median of UMOEAs in 9 cases and the median of UMOEAs is lower than that of MP-AIDEA for the other 9 functions; for 4 functions, the results are not significant enough to obtain a clear indication. The median of MP-AIDEA is lower than that of UMOEAs in 11 cases for \(n_\mathrm{D}=50\).

As regards the comparison with L-SHADE, MP-AIDEA has a lower median than L-SHADE on more functions only for \(n_\mathrm{D} = 10\) (9 functions).

Table 21 Success rate: CEC2011 test set. Highest success rates for each function are shown in bold, and their total is reported at the bottom of the table. MP-AIDEA* represents MP-AIDEA with settings \(n_{\mathrm{pop}}=1\), \(\delta _{\mathrm{local}}=0.1\) and \(n_{\mathrm{LR}}=10\)

For all dimensions except \(n_\mathrm{D} = 50\), the number of functions for which the median of MP-AIDEA is lower than the median of MVMO is greater than the number of functions for which the opposite holds.

Table 22 Success rate: CEC 2014, 10D and 30D

In all cases, MP-AIDEA has a lower median than CMLSP for the majority of the tested functions.

Summarising, the results of the Wilcoxon test show that MP-AIDEA clearly outperforms CMLSP for all values of \(n_\mathrm{D}\), gives similar or slightly better results than UMOEAs and MVMO, while it is outperformed by L-SHADE for \(n_\mathrm{D} = 30\), \(n_\mathrm{D} = 50\) and \(n_\mathrm{D}=100\).

Table 23 Success rate: CEC 2014, 50D and 100D

6.4 Success rate

In this section, we present the success rate of MP-AIDEA and of the top performing algorithms on the CEC 2011 and CEC 2014 test sets. As for the Wilcoxon test, no algorithm participating in the CEC 2005 competition was included in the comparison due to the lack of availability of the source code.

The computation of the success rate SR is reported in Algorithm 5 for a generic algorithm AG and a generic minimisation problem, where n is the number of runs (Vasile et al. 2011). In Algorithm 5, \(\bar{{\mathbf {x}}}\left( AG,i \right) \) denotes the lowest minimum observed during the i-th run of algorithm AG. The quantity \(f_{\mathrm{global}}\) is the known global minimum of the function and \(\hbox {tol}_f\) is a prescribed tolerance with respect to \(f_{\mathrm{global}}\). The index \(j_{sr}\) counts the number of times algorithm AG generates values lower than or equal to \(f_{\mathrm{global}}+\hbox {tol}_f\). For each test set, we also report the total number of problems in which each of the tested algorithms has the best success rate.

Algorithm 5: Computation of the success rate SR
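
A minimal Python sketch of the computation described above (the function and variable names are ours, not taken from the paper's code):

```python
def success_rate(best_per_run, f_global, tol_f):
    """Fraction of runs that reach the global minimum within tolerance,
    following the procedure described for Algorithm 5: j_sr counts the runs
    whose best objective value is at most f_global + tol_f."""
    n = len(best_per_run)                    # number of independent runs
    j_sr = sum(1 for f_best in best_per_run if f_best <= f_global + tol_f)
    return j_sr / n

# Hypothetical best objective values over n = 4 independent runs
print(success_rate([0.0, 0.5, 2.0, 0.9], f_global=0.0, tol_f=1.0))  # 0.75
```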

6.4.1 CEC 2011 test set

For the calculation of the success rate on the CEC 2011 test set, we consider the following algorithms: MP-AIDEA with 4 populations, adaptive \(\delta _{\mathrm{local}}\) and adaptive local restart (MP-AIDEA); MP-AIDEA with one population, \(n_{\mathrm{LR}}=10\) and \(\delta _{\mathrm{local}}=0.1\) (MP-AIDEA*); GA-MPC; and DE-\(\varLambda \). Table 21 shows the obtained values of SR and the value of \(\hbox {tol}_f\) used for each function; MP-AIDEA outperforms all the other algorithms on most of the functions. The result against GA-MPC would be even better if a higher number of function evaluations were considered, as explained in Sect. 6.2.2.

6.4.2 CEC 2014 test set

For the comparison on the CEC 2014 test set, we considered the following algorithms: MP-AIDEA, MP-AIDEA*, UMOEAs, CMLSP, L-SHADE and MVMO (http://web.mysites.ntu.edu.sg/epnsugan/PublicSite/Shared%20Documents/Forms/AllItems.aspx). The values of the success rates for all tested algorithms are shown in Tables 22 and 23, together with the associated values of \(\hbox {tol}_f\). The total number of problems for which an algorithm yields the best success rate is also reported.

For all dimensions, MP-AIDEA compares very well against the other algorithms. In low dimension, the fully adaptive setting is the most competitive, while as the number of dimensions increases the single-population version with \(\delta _{\mathrm{local}}=0.1\) proves to be the most successful. These results are in line with those in Sect. 6.2.3 and confirm the position of MP-AIDEA in the ranking.

7 Conclusions

This paper presented MP-AIDEA, an adaptive version of inflationary differential evolution which automatically adapts the two key parameters of differential evolution, \(\textit{CR}\) and F, together with the size of the restart bubble \(\delta _{\mathrm{local}}\) and the number of local restarts \(n_{\mathrm{LR}}\). The adaptation of the number of local restarts is implemented through a mechanism that mitigates the possibility of detecting the same local minimum multiple times. This mechanism allows MP-AIDEA to automatically identify when to switch from a local to a global restart of the population.

MP-AIDEA was tested on a total of 51 problems, taken from three CEC competitions, grouped in three test sets (named CEC 2005, CEC 2011 and CEC 2014) and compared against 53 algorithms that participated in those three competitions. Four different metrics were presented to assess the performance of MP-AIDEA. Results demonstrated that MP-AIDEA ranks first in the CEC 2005 test set, outperforming all the other algorithms for all problem dimensionalities. On the CEC 2011 test set, MP-AIDEA ranks second, after GA-MPC, if we restrict the number of function evaluations to that prescribed by the competition. However, it was demonstrated that, in problem 13, an increase in the number of function evaluations does not provide any improvement of the objective value returned by GA-MPC but greatly improves the result of MP-AIDEA. It was noted, in fact, that GA-MPC has a fast convergence but then tends to stagnate. On the contrary, the convergence profile of MP-AIDEA is slower but, thanks to the restart mechanism, achieves better objective values. In this test set, in particular, the adaptation of the local restart neighbourhood was shown to be effective, providing competitive results compared to the settings of MP-AIDEA with a single population and predefined values of \(\delta _{\mathrm{local}}\) and number of restarts. This is confirmed by the Wilcoxon test and the success rate.

On the CEC 2014 test set, results are not equally satisfactory for all dimensions. MP-AIDEA is among the top three algorithms except in dimension 30. When the number of populations is reduced to one and \(\delta _{\mathrm{local}}=0.1\), MP-AIDEA outperforms all other algorithms in dimension 50 and 100.

One part of the problem is the extra effort required by the multi-population adaptive algorithm to identify the correct value of \(\delta _{\mathrm{local}}\). Another part of the problem was found in the contraction limit. This is in line with the theoretical findings by the authors, who demonstrated that DE can converge to a level set in the general case. Furthermore, it was noted that the populations can naturally partition and form clusters that independently converge to separate points. This slow rate of convergence affects the restart and local search mechanisms and the associated adaptation machinery. Since the current implementation uses a synchronous restart and adaptation of \(\delta _{\mathrm{local}}\) and \(n_{\mathrm{LR}}\), the number of restarts might be limited by the fact that the evolution of all populations has to come to a stop before any of them can be restarted. Future work will be dedicated to improving these aspects of the algorithm.