Comparison of synchronous and asynchronous parallelization of extreme surrogate-assisted multi-objective evolutionary algorithm

Harada, Tomohiro; Kaidan, Misaki; Thawonmas, Ruck

doi:10.1007/s11047-020-09806-2

Comparison of synchronous and asynchronous parallelization of extreme surrogate-assisted multi-objective evolutionary algorithm

Open access
Published: 18 September 2020

Volume 21, pages 187–217, (2022)
Cite this article

Download PDF

You have full access to this open access article

Natural Computing Aims and scope Submit manuscript

Comparison of synchronous and asynchronous parallelization of extreme surrogate-assisted multi-objective evolutionary algorithm

Download PDF

2227 Accesses
6 Citations
Explore all metrics

Abstract

This paper investigates the integration of a surrogate-assisted multi-objective evolutionary algorithm (MOEA) and a parallel computation scheme to reduce the computing time until obtaining the optimal solutions in evolutionary algorithms (EAs). A surrogate-assisted MOEA solves multi-objective optimization problems while estimating the evaluation of solutions with a surrogate function. A surrogate function is produced by a machine learning model. This paper uses an extreme learning surrogate-assisted MOEA/D (ELMOEA/D), which utilizes one of the well-known MOEA algorithms, MOEA/D, and a machine learning technique, extreme learning machine (ELM). A parallelization of MOEA, on the other hand, evaluates solutions in parallel on multiple computing nodes to accelerate the optimization process. We consider a synchronous and an asynchronous parallel MOEA as a master-slave parallelization scheme for ELMOEA/D. We carry out an experiment with multi-objective optimization problems to compare the synchronous parallel ELMOEA/D with the asynchronous parallel ELMOEA/D. In the experiment, we simulate two settings of the evaluation time of solutions. One determines the evaluation time of solutions by the normal distribution with different variances. On the other hand, another evaluation time correlates to the objective function value. We compare the quality of solutions obtained by the parallel ELMOEA/D variants within a particular computing time. The experimental results show that the parallelization of ELMOEA/D significantly reduces the computational time. In addition, the integration of ELMOEA/D with the asynchronous parallelization scheme obtains higher quality of solutions quicker than the synchronous parallel ELMOEA/D.

HAS-EA: a fast parallel surrogate-assisted evolutionary algorithm

Article 11 October 2022

A Parallel Version of SMS-EMOA for Many-Objective Optimization Problems

Pseudo Expected Improvement Matrix Criteria for Parallel Expensive Multi-objective Optimization

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Evolutionary algorithms (EAs) have been applied to many real-world applications like engineering (Obayashi et al. 2010; Oyama et al. 2017), data mining (Devos et al. 2014; Soufan et al. 2015), electronics (Roberge et al. 2014), and nanoscience (Shayeghi et al. 2015; Davis et al. 2015) because of their high search capability without any preliminary knowledge of a target problem. Since many real-world applications include two or more objectives with conflict, i.e., a trade-off, multi-objective evolutionary algorithms (MOEAs) have attracted much attention for dealing with multiple objectives simultaneously. One problem when applying MOEAs to real-world applications is a necessity of enormous computing time to obtain optimal solutions. This is because MOEAs need to evaluate many tentative solutions through the optimization process, and the evaluation of tentative solutions is computationally expensive.

Previous works have proposed surrogate-assisted EAs (SAEAs) (Jin 2011; Haftka et al. 2016) to reduce the computational time required for the EA process. SAEAs construct a surrogate model that estimates the evaluation values of solutions from already evaluated solutions using machine learning techniques. A proper optimizer, like an MOEA, is executed using the evaluation estimated by a constructed surrogate model. Finally, the surrogate-assisted optimization obtains promising solutions that can exhibit superior actual evaluation value, and a time-consuming process actually evaluates them. The computational time of the estimation with a surrogate model is extremely shorter than the actual evaluation, and the actual evaluation is applied merely to promising solutions. Thus, an SAEA can reduce the computational time to obtain optimal solutions. Previous works have proposed some surrogate-assisted MOEAs, for example, MOEA/D-RBF (Zapotecas Martínez and Coello Coello 2013) or ParEGO (Knowles 2006), which use a Gaussian process or a radial based function (RBF) as a surrogate model. ELMOEA/D (Pavelski et al. 2014, 2016), extreme learning assisted MOEA/D, is one of the state-of-the-art as a novel SAEA. This method adopts the surrogate model based on an extreme learning machine (ELM) (Huang et al. 2004), and MOEA/D Zhang and Li (2007) generates promising solutions with the ELM surrogate evaluation. Their work demonstrated that ELMOEA/D outperforms ParEGO and MOEA/D-RBF on multi-objective optimization benchmarks especially when the dimension of the search space is large.

To further reduce computational time, the parallelization of SAEAs can be considered as a potential attempt. Parallel EAs (PEAs) (Alba and Tomassini 2002; Alba et al. 2013) also have been studied in the last decades. In particular, a master-slave parallelization is a straightforward approach to implement PEAs. A master-slave PEA performs the main procedure of EAs, i.e., initialization, selection, variation, and replacement, on one master node. Many slave nodes, on the other hand, execute evaluations of newly generated solutions in parallel. Since the solution evaluation is the most time-consuming part in EAs, PEAs can reduce the computational time compared to sequentially evaluating solutions. In recent years, several works have studied the integration of SAEA with parallelization for reducing the computational time of the EA process. For example, a parallel efficient global optimization has been proposed (Wang et al. 2016) and applied to the optimization of microwave antennas (Liu et al. 2019). The work of Akinsolu et al. (2019) proposed parallel surrogate-assisted MOEA with the Gaussian process and applied it to electromagnetic design optimization.

Master-slave PEAs can be classified into two schemes, synchronous PEAs (SPEAs) and asynchronous PEAs (APEAs). SPEAs wait for all evaluations of solutions done by slave nodes and generate a new population by utilizing all newly evaluated solutions. APEAs, on the other hand, continuously generate a new solution without waiting for evaluations of other solutions. Since APEAs do not wait for the slowest solution evaluation, it is possible to efficiently utilize the computing resource of slave nodes in a situation where the evaluation of solutions differs. The previous works that integrate SAEA and the parallelization have considered only the synchronous PEA. However, it is generally assumed that the evaluation time of solutions is enormous and generally differs. For this fact, we consider that the integration of an SAEA with the asynchronous parallelization scheme is effective. The purpose of this study is to clarify which scheme of the synchronous and the asynchronous scheme is suitable for SAEA. Specifically, considering the parallelization of ELMOEA/D, we compare a synchronous parallel ELMOEA/D (SP-ELMOEA/D) and an asynchronous parallel ELMOEA/D (AP-ELMOEA/D).

This paper conducts experiments that compare SP-ELMOEA/D with AP-ELMOEA/D to investigate which parallelization scheme is appropriate for ELMOEA/D. We test these methods on well-known multi-objective optimization benchmarks, ZDT series (Zitzler et al. 2000), WFG series (Huband et al. 2005), and DTLZ series (Deb et al. 2002). We also use a few benchmarks from real-world problems provided in the black-box optimization competition (Loshchilov and Glasmachers 2015). This paper employs a simulated master-slave parallel computing environment proposed in the work of Zăvoianu et al. (2015) to measure the computational time of the competitive methods. We simulate two settings of the evaluation time in the experiment to investigate the influence of the different evaluation time characteristics. One evaluation time is determined by the normal distribution with the different variances, while another evaluation time correlates to the objective function value. We employ a hypervolume (HV) indicator (Zitzler and Thiele 1998) to assess the quality of obtained solutions.

The remaining of this paper is organized as follows. Sect. 2 firstly presents the description of a multi-objective optimization problem and explains the detail of MOEA/D. Section 3 shows the detail of ELM and ELMOEA/D. Section 4 briefly explains PEAs and proposes integrated algorithms of SP-ELMOEA/D and AP-ELMOEA/D. Section 5 describes the experimental setup to compare the synchronous and asynchronous ELMOEA/D on multi-objective optimization problem (MOP) benchmarks and real-world problems. Section 6 shows the results on MOP benchmarks with the normally-distributed evaluation time, while Sect. 7 shows the one with the fitness-correlated evaluation time. Section 8 shows the results on real-world problems. Finally, Sect. 9 concludes this research and shows future work.

2 Multi-objective evolutionary algorithms

This section firstly explains what multi-objective optimization problem is, and then, we show a detailed algorithm of MOEA/D (Zhang and Li 2007), which is an underlying algorithm of ELMOEA/D.

2.1 Multi-objective optimization problem

A multi-objective optimization problem (MOP) is a problem of minimizing or maximizing m mutually competing objective functions ${\varvec{f}}(\mathbf{x})$ (Deb 2001) expressed as follows:

$$\begin{aligned}&\min _{{{\mathbf {x}}}} {\varvec{f}}({{\mathbf {x}}})=\{f_1({\mathbf{x}}),f_2({{\mathbf {x}}}),\ldots ,f_m({{\mathbf {x}}})\}^T \end{aligned}$$

(1)

$$\begin{aligned}&\text {Subject to:} \,\, x_{L,d} \le x_d \le x_{U,d} \,\, (d=1,\ldots ,D), \end{aligned}$$

(2)

where ${{\mathbf {x}}}=(x_1,\ldots ,x_{D})$ is a D dimensional decision variable. $x_{L,d}$ and $x_{U,d}$ are the lower and upper bounds of the dth variable, respectively. ${\varvec{f}}:\varOmega \rightarrow {\mathbb {R}}^m$ is a vector having m objective functions and maps the decision variable space $\varOmega$ to the objective space ${\mathbb {R}}^m$.

It is difficult to obtain a single optimal solution in MOPs because each objective function is in a trade-off relationship. Therefore, the goal of MOPs is to acquire the Pareto optimal solution set. The Pareto optimal solution set is defined with the dominance relationship of solutions in MOPs denoted as follows:

$$\begin{aligned} {{\mathbf {x}}}\prec {{\mathbf {y}}}\Leftrightarrow {\left\{ \begin{array}{ll} f_j({{\mathbf {x}}}) \le f_j({{\mathbf {y}}})& \forall j \in \{1,\ldots , m\} \\ f_j({{\mathbf {x}}}) < f_j({{\mathbf {y}}})&{} \exists j \in \{1,\ldots , m\} \end{array}\right. } \end{aligned}$$

(3)

where ${{\mathbf {x}}} \prec {{\mathbf {y}}}$ means ${{\mathbf {x}}}$ dominates ${{\mathbf {y}}}$. The Pareto optimal solution set consists of solutions that are not dominated by any other solutions, and it is denoted as follows:

$$\begin{aligned} P=\{{\mathbf {x}}\in \varOmega | {\mathbf {x}}^\prime \nprec {\mathbf {x}}\,\,\forall {\mathbf {x}}^\prime \in \varOmega \} \end{aligned}$$

(4)

2.2 MOEA/D

MOEA/D (Zhang and Li 2007) is a multi-objective evolutionary algorithm based on decomposition. It decomposes multiple objectives into many single objective sub-problems based on an aggregation function associated with the weight vector uniformly distributed on the objective space. MOEA/D simultaneously performs many single-objective searches with genetic operations within a local region determined by the neighborhood of the weight vectors. This paper adopts MOEA/D-DE (Li and Zhang 2009) that introduces a differential evolution (DE) operator (Storn and Price 1997; Das and Suganthan 2011) into MOEA/D as a crossover operator.

Algorithm 1 describes the detailed algorithm of MOEA/D-DE. Firstly, the weight vector sets W are generated uniformly on the objective space. W consists of N (the population size) weight vectors $\varvec{\lambda }_i\,(i=1, \ldots , m)$. The weight vector is used to decompose multiple objectives to a single objective with an aggregation function (detailed later). Then, the neighborhood set ${\mathbf {B}}(i)$ is generated, consisting of T weight vectors closest to the weight vector $\varvec{\lambda }_i$. Next, the initial population is generated and evaluated. The ideal point ${\mathbf {z}}^{*}$ is calculated from the objective function values of the initial population. The parent solutions for the DE operator are selected from the neighborhood set ${\mathbf {B}}(i)$ with the probability of $\delta$, while selected from the entire population with the probability of $(1-\delta )$. After applying the DE operator and the polynomial mutation (PM) (Deb and Goyal 1996), a newly generated solution is evaluated, and the ideal point ${\mathbf {z}}^*$ is updated. Finally, the population is updated according to the aggregation function value of a new solution. These procedures are repeated until the stopping criteria are satisfied.

Although previous works have proposed several aggregation functions, this research uses the Penalty Boundary Intersection (PBI) aggregation function. PBI is defined as Eqs. (6, 7 and 8):

$$\begin{aligned} g_{PBI}({{\mathbf {x}}}|\varvec{\lambda } ,{{\mathbf {z}}^*})& = d_1+\theta d_2 \end{aligned}$$

(6)

$$\begin{aligned} d_1& = \frac{\left\| ({{\mathbf {f}}}({{\mathbf {x}}})-{{\mathbf {z}}^*})^T \varvec{\lambda }\right\| }{\Vert \varvec{\lambda }\Vert } \end{aligned}$$

(7)

$$\begin{aligned} d_2& = \left\| {{\mathbf {f}}}({{\mathbf {x}}})-\left( {\mathbf{z}^*}+d_1\varvec{\lambda }\right) \right\| \end{aligned}$$

(8)

where $\theta$ is the penalty factor.

3 Extreme learning surrogate-assisted MOEA/D

This section introduces the original extreme learning assisted MOEA/D, ELMOEA/D (Pavelski et al. 2016). ELMOEA/D uses an extreme learning machine (ELM) (Huang et al. 2004) as a surrogate model and generates promising solutions by using MOEA/D-DE (Li and Zhang 2009) according to objective function values estimated by a constructed ELM model. In the following subsections, we first explain the overview of ELM, and then we provide the detail of ELMOEA/D.

3.1 Extreme learning machine (ELM)

ELM is a kind of machine learning technique (Huang et al. 2004). ELM is constructed as a single layer feed-forward neural network that consists of L hidden layer neurons weighted by ${\mathbf {a}}_j\,(j=1, \ldots , L)$, output weights $\varvec{\beta }$, and an activation function $G({{\mathbf {x}}}, {{\mathbf {a}}}, b)$. ${{\mathbf {a}}}$ indicates weights to a hidden neuron, while b indicates a bias value. The output of ELM is calculated as $Y={\mathbf {H}}\varvec{\beta }$, where $\varvec{\beta }$ is a learned parameter, while ${\mathbf {H}}$ is a matrix of the activation function as ${\mathbf {H}}=\{G({\mathbf {x}}_i, {\mathbf {a}}_j, b_j)\}$.

The most notable feature of ELM is that the hidden layer weights ${{\mathbf {a}}}$ and biases b are randomly assigned and are not learned. The output weights ${\varvec{\beta }}$ are only parameters to be learned from N distinct training samples of n-dimensional inputs ${{\mathbf {x}}}_i$ with m dimensional outputs ${{\mathbf {t}}}_i$. In particular, ${\varvec{\beta }}$ is calculated as $\varvec{\beta }=\left( {\mathbf {I}}/C +{\mathbf {H}}{\mathbf {H}}^T\right) ^{-1}{\mathbf {H}}^TT$, where C is a regularization parameter, ${{\mathbf {I}}}$ indicates an $N\times N$ unit matrix, and ${{\mathbf {T}}}=\left[ {{\mathbf {t}}}_1, \ldots , {\mathbf{t}}_N\right]$ is the matrix of desired outputs.

ELM can use any piece-wise nonlinear continuous activation functions in the hidden layer, i.e., $G({{\mathbf {x}}}, {{\mathbf {a}}}, b)$. The original work of ELMOEA/D used the following three activation functions:

Sigmoid (SIG):
$$\begin{aligned} G_{SIG}({{\mathbf {x}}}, {{\mathbf {a}}}, b)=\frac{1}{1+\exp \left( -\left( {{\mathbf {a}}}\cdot {{\mathbf {x}}} + b\right) \right) } \end{aligned}$$
(9)
Gaussian (GAU):
$$\begin{aligned} G_{GAU}({{\mathbf {x}}}, {{\mathbf {a}}}, b)=\exp \left( -b||{\mathbf{x}}-{{\mathbf {a}}}||^2\right) \end{aligned}$$
(10)
Multiquadric (MQ):
$$\begin{aligned} G_{MQ}({{\mathbf {x}}}, {{\mathbf {a}}}, b)=\left( ||{{\mathbf {x}}} - {\mathbf{a}}||^2+b^2\right) ^{\frac{1}{2}} \end{aligned}$$
(11)

The advantages of ELM to neural networks and support vector machines (SVMs) are summarized as follows (Huang et al. 2004; Huang 2015):

The learning speed of ELM is extremely fast. It can train single-layer feed-forward networks much faster than classical learning algorithms.
Unlike traditional classic gradient-based learning algorithms, which intend to reach minimum training error but do not consider the magnitude of weights, ELM tends to reach not only the smallest training error but also the smallest norm of weights. Thus, ELM tends to have better generalization performance for feed-forward neural networks.
Unlike the traditional classic gradient-based learning algorithms that only work for differentiable activation functions, the ELM learning algorithm can be used to train single-layer feed-forward networks with non-differentiable activation functions.
Unlike traditional classic gradient-based learning algorithms facing several issues like local minimum, improper learning rate, and overfitting, etc, ELM tends to reach the solutions straightforward without such trivial issues. The ELM learning algorithm looks much simpler than most learning algorithms for feed-forward neural networks.
Unlike SVM, which does not consider the feature representation and functioning roles of each inner hidden layer, ELM studies feature representations in each layer.

3.2 Algorithm of ELMOEA/D

Algorithm 2 briefly describes the algorithm of ELMOEA/D (see detail in the work of (Pavelski et al. 2016)).

ELMOEA/D uses Latin hypercube sampling (LHS) (McKay et al. 1979) as the sampling method in the initialization step. ELMOEA/D uses two weight vector sets, one is used for the evolution of MOEA/D, while another is used to select promising solutions. A weight vector set W used for the evolution of MOEA/D-DE is generated that uniformly spread on the objective space. The weight vector set consists of N weight vectors ($W=\{{\varvec{\lambda }}_1, \ldots , {\varvec{\lambda }}_N\}$). Subsequently, a selector weight vector set $W_s$ used for the selection of $N_s$ promising solutions is generated, which consists of $N_s$ selector weight vectors $W^s=\{{\varvec{\lambda }}^{s}_{1}, \ldots , {\varvec{\lambda }}^{s}_{N_s}\}\subset W$. Finally, a neighborhood vector set for the promising solution selection ${\mathbf {B}}^s(i)=\{{\varvec{\lambda }}_{i,1},\ldots ,{\varvec{\lambda }}_{i,K_s}\}\,(\varvec{\lambda }_{i, j}\in W)$ is associated with each selector weight vector $\varvec{\lambda }_i^s\in W_s$. Each neighborhood vector set ${\mathbf {B}}^s(i)$ consists of $K_s(=N/N_s)$ weight vectors in W that are closest to each selector weight vector $\varvec{\lambda }_i^s$.

After the initialization, ELMOEA/D constructs an ELM surrogate model by using a training set $P_t$. ELMs are trained with different activation functions and different normalization parameters C in this step. The best ELM model that performs the smallest mean square error (MSE) for the training set is selected and used as a surrogate model.

The constructed ELM model is used for evaluating solutions during the MOEA/D-DE procedure. The initial population is randomly selected from the non-dominated solutions in the archive population $P_a$ if its size exceeds the population size N. Otherwise, LHS generates the remaining solutions in the bound of $[\varvec{\mu }-\varvec{\sigma }, \varvec{\mu }+\varvec{\sigma }]$, where ${\varvec{\mu }}$ and ${\varvec{\sigma }}$ are the mean and the standard deviation calculated from the non-dominated solutions in $P_a$.

The promising solutions for the actual evaluation are selected from the final population after the optimization of MOEA/D-DE. The aggregation function values of solutions in the final population are calculated for each ${\varvec{\lambda }}_{i, j} \in {\mathbf{B}}^s(i)$ based on the estimated objective values by the surrogate model. Then, the best solution is selected from each neighborhood vector set ${\mathbf {B}}^s(i)$, and totally, $N_s$ promising solutions are selected for the actual evaluation.

The previous work (Pavelski et al. 2016) reported that ELMOEA/D has the following advantages:

Since ELMOEA/D uses a simpler surrogate model with less user-specific parameters, it consumes less computational resources and is faster than the conventional surrogate-assisted EA.
ELMOEA/D showed promising results on the MOP benchmarks with high dimensional design variables compared with ParEGO and MOEA/D-RBF.

For these advantages, our research employs ELMOEA/D.

4 Parallelization of ELMOEA/D

This paper considers two schemes of ELMOEA/D parallelization, a synchronous parallel ELMOEA/D (SP-ELMOEA/D), and an asynchronous parallel ELMOEA/D (AP-ELMOEA/D). This section firstly introduces parallel EAs briefly, and then, we explain the detailed description of SP-ELMOEA/D and AP-ELMOEA/D. We finally introduce two selection mechanisms for AP-ELMOEA/D.

4.1 Parallel evolutionary algorithm

When applying EAs to real-world optimization problems, solution evaluations may take much computational time because of, for example, a requirement of physical simulation. Previous works have studied parallel EAs (PEAs) (Alba and Tomassini 2002; Alba et al. 2013) to speed up the optimization process of EAs by using multiple computing resources. In recent years, parallel MOEAs have been studied to reduce the computing time to approximate the Pareto front (Talbi 2019).

A master-slave parallelization is one of the most typical and straightforward approaches of PEAs. A master-slave PEA performs the main procedure of EAs like initialization, selection, variation, and replacement on a single master computing node. At the same time, many slave computing nodes execute fitness evaluations in parallel. Such a master-slave parallelization has been applied to solving real-world problems. For example, the work of Barbera et al. (2018) applied the massively parallel EA to the phylogenetic placement problem. The work of Strofylas et al. (2018) proposed a parallel differential evolution method for calibrating a second-order macroscopic traffic flow.

Master-slave PEAs can be classified into two schemes, a synchronous evolution and an asynchronous one. Synchronous PEAs (SPEAs) is similar to generation based EAs, where the new population is generated after waiting for all solution evaluations. On the other hand, asynchronous PEAs (APEAs) is similar to steady-state EAs. A new solution is immediately generated whenever one solution evaluation completes without waiting for other solution evaluations. In general, an SPEA has a higher search capability than an APEA because an SPEA can utilize many evaluation information simultaneously when generating a new population. On the other hand, when the evaluation time of solutions differs from each other, i.e., the variance of the evaluation time is large, an APEA shows a higher search efficiency than in terms of the computational time (Harada and Takadama 2013, 2014; Scott and De Jong 2015a, b). This is because APEAs can reduce the idling time of slave nodes that quickly complete the evaluation tasks, unlike SPEAs need to wait for the slowest solution evaluations.

4.2 Synchronous parallel ELMOEA/D

Algorithm 3 shows a master-slave synchronous parallel ELMOEA/D (SP-ELMOEA/D). SP-ELMOEA/D is an extension of ELMOEA/D, and the underlined texts denote the difference from Algorithm 2. At the initialization procedure, all generated solutions are sent to slave nodes in Step 1–2, and their evaluations are firstly waited for in Step 1–3. Then, the remaining initialization procedure is executed. After evolving solutions on MOEA/D-DE with an ELM surrogate, $N_s$ promising solutions are selected, and they are sent to slave nodes in Step 5–1. SP-ELMOEA/D waits for all evaluations from slave nodes in Step 5–2 and updates the archive population and the training set. SP-ELMOEA/D repeats these procedures until the terminal condition is satisfied.

4.3 Asynchronous parallel ELMOEA/D

AP-ELMOEA/D evaluates solutions in parallel as same as SP-ELMOEA/D but waits for only one solution evaluation every step. When one solution evaluation completes, it is appended to the archive population, and MOEA/D-DE with the ELM surrogate model generates a new promising solution. The differences between AP-ELMOEA/D and SP-ELMOEA/D are in Steps 4 and 5 in Algorithm 3. Concretely, these procedures are modified as Algorithm 4.

First, AP-ELMOEA/D waits for only one evaluation from a slave node in Step 5’–2, unlike SP-ELMOEA/D waits for all $N_s$ solution evaluations. In response to this, Steps 4’ and 5’ of AP-ELMOEA/D are modified to address one solution. In Step 4–1 in Algorithm 3, SP-ELMOEA/D selects $N_s$ promising solutions in each generation and actually evaluates them in parallel. On the other hand, AP-ELMOEA/D selects only one promising solution in Step 4’–3. For this selection, AP-ELMOEA/D needs to select an index $k_s$ of the selector weight vector in Step 4’–2 instead of considering all $N_s$ selector vectors. We take this into account in the next subsection. In addition to this, AP-ELMOEA/D addresses only one solution in the update of the archive population $P_a^{g+1}$ and the training set $P_t^{g+1}$ in Steps 5’–3 and 5’-4. Besides, because only one solution is selected for an actual evaluation every MOEA/D-DE optimization step, the surrogate model is constructed every $N_s$ actual evaluations in AP-ELMOEA/D not to increase the number of the ELM training.

4.4 Promising solution selection of AP-ELMOEA/D

When applying the asynchronous evolution scheme to ELMOEA/D, we need to determine how to select one promising solution from the population optimized by MOEA/D-DE on the surrogate evaluation model. In particular, an index $k_s$ should be chosen Step 4’ in Algorithm 4 whenever one solution evaluation completes. This paper explores simple two selection mechanisms of an index $k_s$: the index order based selection and the random order selection.

4.4.1 Index order based selection:

A straightforward way to select an index $k_s$ is to use the order of the index of the neighborhood vector set. This paper denotes AP-ELMOEA/D with the promising selection using the order of the index of the neighborhood vector set as AP-ELMOEA/D-IO. AP-ELMOEA/D-IO selects a promising solution depending on the order of the index of the neighborhood vector set. AP-ELMOEA/D-IO initially set an index $k_s$ to 1, and whenever a promising solution is selected, $k_s$ is incremented as $k_s\leftarrow k_s+1$. If $k_s$ exceeds $K_s$, which is the number of the neighborhood vector set, $k_s$ is reset to 1.

4.4.2 Random order selection

The second way is to choose an index $k_s$ every selection procedure randomly. This paper denotes this as AP-ELMOEA/D-RO. Whenever an index $k_s$ is selected, it is uniformly chosen from 1 to $N_s$ without considering the previous index, i.e., consecutive selection of the same index is allowed. Since an actual implementation of MOEA/D often uses a random permutation to select the aggregation vector during the optimization Fan et al. (2019); Nebro et al. (2015), the random order selection can be a valid strategy in AP-ELMOEA/D.

5 Experiment

We conduct an experiment to compare three different parallel ELMOEA/D variants, i.e., SP-ELMOEA/D, AP-ELMOEA/D-IO, and AP-ELMOEA/D-RO, to clarify which parallelization scheme is suitable for ELMOEA/D. Additionally, we compare the sequential ELMOEA/D (i.e., run on a single CPU) to confirm the effectiveness of the parallelization. The following subsections explain the benchmark problems used in the experiment, the experimental setting, and the evaluation criteria.

5.1 Problem instance

We use well-known multi-objective benchmarks, the ZDT test suite (Zitzler et al. 2000), the WFG test suite (Huband et al. 2005), and the DTLZ test suite (Deb et al. 2002). This experiment considers bi-objective minimization for all test suites. The dimension of the decision variable D is as:

ZDT1–3: $D=30$
ZDT4 and 6: $D=10$
WFG1–9: $D=24$
DTLZ1–7: $D=5$.

Additionally, we use four real-world problems provided by the black box optimization competition (Loshchilov and Glasmachers 2015). In particular, three are from the engineering design, a heat exchanger problem, a hydro-dynamics problem, and a vibrating platform problem, while one is from the operational research, a facility placement problem. All of them are bi-objective minimization, and $D=16$ for the heat exchanger problem, $D=6$ for the hydro-dynamics problem, $D=5$ for the vibrating platform problem, and $D=20$ for the facility placement problem.

5.2 Experimental setting

We conduct the experiment on the pseudo (simulated) master-slave parallel computing environment that measures the computational time according to the model proposed in the work of Zăvoianu et al. (2015). In the experiment, we define the unit of the computing time as “simulation time”. In particular, we define one unit simulation time as the time to complete the sequential tasks on the master node, i.e., the ELM learning and the MOEA/D optimization with the surrogate evaluation.

We test two settings of evaluation time of solutions. One determines the evaluation time of any solutions by the normal distribution of ${\mathcal {N}}(t_p, c_v\times t_p)$ simulation time on the distributed slave nodes. $t_p$ ($\gg 1$) is the mean evaluation time, while the random variance is added according to the normal distribution with the standard deviation $c_v\times t_p$, where $c_v\,(c_v\ge 0)$ is a parameter to determine the variance of the evaluation time. In this setting, the evaluation time does not depend on the feature of evaluated solutions. We call this evaluation time normally-distributed evaluation time.

Another setting defines the evaluation time of solutions correlates to the objective function value. In particular, the evaluation time is calculated as:

$$\begin{aligned} t_{cor}({\mathbf {x}})={\left\{ \begin{array}{ll} t_p-c_gt_p+c_gt_pf_1({\mathbf {x}}) & 0\le f_1({\mathbf {x}})\le 2\\ t_p-c_gt_p & f_1({\mathbf {x}}) < 0\\ t_p+c_gt_p & \hbox {otherwise} \end{array}\right. } \end{aligned}$$

(12)

where $f_1({\mathbf {x}})$ indicates the first objective function value of a solution ${\mathbf {x}}$, $t_p$ ($\gg 1$) is the base evaluation time when $f_1({\mathbf {x}})=1$, while $c_g$ denotes the parameter to decides the gradient of the evaluation time. If $c_g>0$, the evaluation time positively correlates to the first objective function value, i.e., the greater the objective function value, the longer the evaluation time. If $c_g<0$, on the other hand, the evaluation time negatively correlates to the first objective function value, i.e., the greater the objective function value, the shorter the evaluation time. Since the evaluation time depends on the objective function value, the search direction of the AP-ELMOEA/D variants has the possibility to biased toward the search space where solutions quickly complete their evaluations (Scott and De Jong 2015a).

This experiment uses the parameters shown in Table 1, most of which are the same as the original work (Pavelski et al. 2016). Notably, we use the MOEA/D-DE parameter settings that the original MOEA/D-DE work recommended (Li and Zhang 2009). Ten slave nodes are used for solution evaluations, which means $N_s=10$ selected promising solutions can be evaluated simultaneously. To clarify how the variance of the evaluation time on the parallel computing environment affects the search performance, we consider different magnitudes of the variance $c_v=\{0.02, 0.05, 0.10, 0.20\}$ in the normally-distributed evaluation time. On the other hand, in the fitness-correlated evaluation time, we use the different gradient parameters $c_g=\{\pm 0.2, \pm 0.5\}$ to clarify the influence of the evaluation time gradient.

Table 1 Parameter settings

Comparison of synchronous and asynchronous parallelization of extreme surrogate-assisted multi-objective evolutionary algorithm

Abstract

Similar content being viewed by others

HAS-EA: a fast parallel surrogate-assisted evolutionary algorithm

A Parallel Version of SMS-EMOA for Many-Objective Optimization Problems

Pseudo Expected Improvement Matrix Criteria for Parallel Expensive Multi-objective Optimization

1 Introduction

2 Multi-objective evolutionary algorithms

2.1 Multi-objective optimization problem

2.2 MOEA/D

3 Extreme learning surrogate-assisted MOEA/D

3.1 Extreme learning machine (ELM)

3.2 Algorithm of ELMOEA/D

4 Parallelization of ELMOEA/D

4.1 Parallel evolutionary algorithm

4.2 Synchronous parallel ELMOEA/D

4.3 Asynchronous parallel ELMOEA/D

4.4 Promising solution selection of AP-ELMOEA/D

4.4.1 Index order based selection:

4.4.2 Random order selection

5 Experiment

5.1 Problem instance

5.2 Experimental setting

6 Results on benchmark problems with the normally-distributed evaluation time

6.1 Hypervolume at the maximum number of the actual evaluations

6.2 Hypervolume at the maximum elapsed simulation time

6.3 Transition of HV over the elapsed simulation time

7 Results on benchmark problems with the fitness-correlated evaluation time

7.1 Hypervolume at the maximum number of the actual evaluations

7.2 Hypervolume at the maximum elapsed simulation time

8 Results on real-world problems

8.1 Hypervolume at the maximum number of the actual evaluations

8.2 Hypervolume at the maximum elapsed simulation time

9 Conclusion

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation