A two-stage infill strategy and surrogate-ensemble assisted expensive many-objective optimization

Zhao, Yi; Zhao, Jian; Zeng, Jianchao; Tan, Ying

doi:10.1007/s40747-022-00751-4


Original Article
Open access
Published: 03 May 2022

Volume 8, pages 5047–5063, (2022)

Complex & Intelligent Systems




Abstract

Many optimization problems in practical applications are expensive to evaluate. Surrogate-assisted optimization methods have attracted extensive attention because they can obtain satisfactory solutions with limited computing resources. In this paper, we propose a two-stage infill strategy and surrogate-ensemble assisted optimization algorithm for solving expensive many-objective optimization problems. In this method, the population is optimized by a surrogate ensemble. Then a two-stage infill strategy is proposed to select individuals for real evaluations. The infill strategy considers individuals with better convergence or greater uncertainty. To calculate the uncertainty, we consider two aspects: one is the approximate variance of the current surrogate ensemble, and the other is the approximate variance of the historical surrogate ensemble. Finally, the population is revised by the recently updated surrogate ensemble. In the experiments, we test our method on two sets of many-objective benchmark problems. The results demonstrate the superiority of our proposed algorithm over state-of-the-art algorithms for solving computationally expensive many-objective optimization problems.


Introduction

In industrial optimization problems, such as electric vehicle control problems [4], industrial scheduling [12], and robotics [24], there are multiple objectives to be optimized simultaneously. These multi-objective optimization problems (MOPs) can be mathematically formulated as follows:

$$\begin{aligned}&\text {minimize } F(\mathbf {x})=(f_1(\mathbf {x}), f_2(\mathbf {x}),\ldots ,f_M(\mathbf {x})),\nonumber \\&\text {subject to } \mathbf {x} \in R^{D} \end{aligned}$$

(1)

where $R^{D}$ represents the D-dimensional decision space, $f_i(\mathbf {x})$ represents the ith objective of individual $\mathbf {x}$, and M is the number of objectives. The objectives often conflict with each other, so improving one objective may cause another to deteriorate. The Pareto optimal solutions obtained by multi-objective optimization algorithms trade off the different objectives; they are called the Pareto set (PS) in the decision space and the Pareto front (PF) in the objective space. Various methods, such as the fast and elitist multiobjective genetic algorithm (NSGA-II) [8], the multiobjective evolutionary algorithm based on decomposition (MOEA/D) [38], and indicator-based selection in multiobjective search (IBEA) [42], have been proposed for solving MOPs with two or three objectives. These multi-objective evolutionary algorithms (MOEAs) suffer from a lack of selection pressure as the number of objectives increases. To solve many-objective optimization problems (MaOPs) with more than three objectives, many methods have been proposed, for example, NSGA-II/SDR [31], which is based on the strengthened dominance relation (SDR); AR-MOEA [30], which is based on the IGD non-contributing solution detection (IGD-NS) indicator; and the reference vector guided evolutionary algorithm (RVEA) [3], which decomposes a MOP into several single-objective optimization problems.

Usually, MOEAs require a large number of real evaluations of the objective functions before finding a set of Pareto optimal solutions, which restricts their use in expensive engineering problems such as computational fluid dynamics (CFD) simulation [10] and engineering design optimization [11]. In these expensive problems, one simulation could take from minutes to hours [14]. Surrogate-assisted evolutionary algorithms (SAEAs) have been proposed to address the optimization of expensive problems. Commonly used surrogate models include the polynomial regression model (PR) [12], the radial basis function (RBF) [13], and the Gaussian process (GP) [14]. For more descriptions of surrogate models, readers can refer to the review articles [13, 14].

Surrogate-assisted evolutionary algorithms (SAEAs) can be divided into on-line and off-line optimization methods [33]. In on-line surrogate-assisted optimization methods, a small number of expensive real fitness evaluations can be conducted during the optimization, and the newly evaluated samples can be used to update the surrogates. Conversely, in off-line surrogate-assisted optimization algorithms, no new samples are available [35]. In on-line surrogate-assisted optimization algorithms, the method used to select individuals for expensive evaluation is known as the infill sampling criterion, infill strategy, or surrogate management. Infill sampling criteria such as expected improvement (EI) [21, 25], lower confidence bound (LCB) [18], and probability of improvement (PoI) [9] are widely used in Kriging or Gaussian process (GP) assisted optimization algorithms.

On-line SAEAs are more flexible than off-line SAEAs as they have additional samples for surrogate management during the optimization process, which may have more opportunities to improve the performance of the algorithm than off-line SAEAs [14]. Therefore, in this paper, our work mainly focuses on the on-line surrogate-assisted optimization methods. A lot of on-line surrogate-assisted single-objective optimization methods have been proposed. In [16], Li selected the individuals with the best approximated fitness and maximum uncertainty for expensive evaluations and used the distance and fitness value information to calculate the uncertainty. In [29], Tian proposed a multi-objective infill strategy that considers both approximated fitness and uncertainty for solving high-dimensional expensive problems. In [22], Pan used teaching-learning-based optimization (TLBO) or differential evolution (DE) alternately to search for the best candidate solution, and the generation-based and individual-based strategies are used for surrogate management. In [37], Yu used a GP model as a coarse surrogate to learn the global landscape and an RBF model as a fine surrogate to learn the local feature of the fitness landscape. In the coarse search, approximate fitness together with the uncertainty of GP is used for environmental selection. In [19], Liu used the affinity propagation clustering technique to partition the population into several subpopulations and proposed an RBF-assisted learning strategy-based particle swarm optimizer (PSO) to update the particle in each subpopulation. For more methods of on-line single-objective SAEAs, please refer to Refs. [2, 18, 27, 28, 32, 36].

Some on-line surrogate-assisted multi-objective optimization methods have been proposed. In ParEGO [15], a weight vector is selected at each iteration for optimization by efficient global optimization (EGO). In MOEA/D-EGO [39], a Gaussian process is built for each objective of the MOP, and maximization of the expected improvement metric is used to select test points for expensive evaluations. In KRVEA [5], the Kriging model is used to approximate each objective of the MOP, and the approximated values or the uncertainties provided by the Kriging model are adaptively used to select individuals for expensive evaluations. In CSEA [23], Pan used a feedforward neural network (FNN) as a classifier to identify good solutions from the whole population. In [11], Guo used an efficient dropout neural network (EDN) to approximate the fitness of individuals and obtained their approximate uncertainties by randomly ignoring neurons in the neural network. In [34], Wang proposed an adaptive acquisition function in the Bayesian approach to solve expensive multi-objective optimization problems. In [26], Song used the two-archive evolutionary algorithm to optimize the population, and the differences between the individuals in the two archives are used for the infill strategy. In [17], Lin used a global Kriging model and several sub-models to construct a surrogate ensemble that approximates the objectives of expensive multi-objective problems and proposed a reference vector-based infill strategy. In [10], Gu used the Kriging model to optimize the population for several generations; then the crowding degrees of the individuals in the radial space and the uncertainty information provided by the Kriging model are used for surrogate management. ParEGO and MOEA/D-EGO cannot obtain good performance on expensive MaOPs with more than three objectives. In KRVEA, the population is optimized by a set of uniformly distributed reference vectors, and the individuals for true evaluation are selected from the population. When the shapes of the reference vectors and the PF are not consistent with each other, the performance of the algorithm will be affected.

As discussed above, there have been some studies on expensive multi-objective optimization problems, but some challenges remain in this field. First, different surrogate models are suitable for different types of expensive problems. Selecting an appropriate surrogate for a given problem is therefore important, yet difficult, as there is no established criterion for this choice. Second, the multi-objective optimization framework also has an impact on the performance obtained on expensive MOPs. Third, as to surrogate management, the infill sampling criterion directly affects the approximation accuracy of the surrogate and the optimal solutions finally found by the method.

On one hand, considering the first challenge described above, a natural idea is to combine multiple base learners to form a strong learner. In our previous work [40], an ensemble that constructs a strong learner by combining multiple base learners proved to be superior to a single learner in terms of accuracy and robustness. Furthermore, an ensemble surrogate can provide the approximate variance among different surrogates, which is important for surrogate management. On the other hand, in SAEAs, the update process of the surrogate model contains some historical information. At present, there is no method that uses this historical information for surrogate management. Motivated by these observations, we propose a method to solve expensive many-objective optimization problems, named the two-stage infill strategy and surrogate-ensemble assisted expensive many-objective evolutionary optimization algorithm (TSEMO). The main work of this paper consists of two aspects:

(1) RBF models based on two different kernel functions are used as the base learners to construct the surrogate ensemble, and the surrogate ensemble is used to optimize the population.

(2) A two-stage infill strategy is proposed to select individuals for expensive function evaluations. In the first stage, the individuals with the minimum approximate fitness or the maximum uncertainty are selected for expensive evaluations. The uncertainty is the approximate variance of the surrogate ensemble. In the second stage, the individual with the maximum uncertainty is selected for expensive evaluations and the uncertainty is the approximate variance of the historical surrogate ensemble.

The rest of this paper is organized as follows. Section “Related work” introduces the relevant work. In “A two-stage infill strategy and surrogate-ensemble assisted expensive many-objective optimization”, the details of the proposed algorithm TSEMO are described. In “Numerical experiments”, the numerical experiments are carried out, and the results are compared with several methods on multi-objective benchmark problems. Finally, the conclusion and future work are drawn in “Conclusion”.

Related work

Different multi-objective optimization algorithms have a certain degree of impact on expensive multi-objective optimization problems. To solve the expensive many-objective optimization problems, we used NSGA-II/SDR [31] as the underlying optimization framework of our method. And the NSGA-II/SDR algorithm is described below.

NSGA-II/SDR

Before describing NSGA-II/SDR, we first introduce the original dominance relation used in NSGA-II, which is defined below.

Definition 1: an individual $\mathbf {x}$ is said to dominate another individual $\mathbf {y}$ (denoted as $\mathbf {x}\prec \mathbf {y}$) if and only if $f_i(\mathbf {x})\le f_i(\mathbf {y})$ for all $i\in \{1,2,\ldots ,M\}$, and there is at least one $j\in \{1,2,\ldots ,M\}$ satisfying $f_j(\mathbf {x})< f_j(\mathbf {y})$.
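As a minimal sketch of Definition 1 for minimization problems, the check below compares two objective vectors. The function name `dominates` is illustrative and not taken from the paper.

```python
from typing import Sequence


def dominates(fx: Sequence[float], fy: Sequence[float]) -> bool:
    """Return True if objective vector fx Pareto-dominates fy (minimization)."""
    if len(fx) != len(fy):
        raise ValueError("objective vectors must have the same length")
    # fx must be no worse than fy in every objective ...
    no_worse = all(a <= b for a, b in zip(fx, fy))
    # ... and strictly better in at least one objective.
    strictly_better = any(a < b for a, b in zip(fx, fy))
    return no_worse and strictly_better
```

Two identical vectors do not dominate each other, and neither do vectors that each win on a different objective.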

Due to the lack of selection pressure, the performance of the traditional NSGA-II algorithm, which is based on the dominance relation, deteriorates when solving MaOPs with more than three objectives. To solve this problem, Tian [31] proposed a strengthened dominance relation (SDR), which replaces the original dominance relation in NSGA-II and can greatly improve the selection pressure in many-objective optimization problems, thus improving the performance of the algorithm. SDR balances convergence and diversity on MaOPs well by adopting a specific niching technique. The definition of the strengthened dominance relation is given below.

Definition 2: an individual $\mathbf {x}$ is said to dominate another individual $\mathbf {y}$ under the strengthened dominance relation (denoted as $\mathbf {x}\prec _{SDR} \mathbf {y}$) if and only if

$$\begin{aligned} {\left\{ \begin{array}{ll} \mathrm{{Con}}(\mathbf {x})<\mathrm{{Con}}(\mathbf {y}) &{}\theta _{\mathbf {xy}}<\bar{\theta } \\ \mathrm{{Con}}(\mathbf {x})\cdot \frac{\theta _{\mathbf {xy}}}{\bar{\theta }} \le \mathrm{{Con}}(\mathbf {y}) &{}\theta _{\mathbf {xy}}\ge \bar{\theta } \end{array}\right. } \end{aligned}$$

(2)

where $\mathrm{{Con}}(\mathbf {x})=\sum _{i=1}^{M}f_i(\mathbf {x})$, $\theta _{\mathbf {xy}}$ denotes the acute angle between the two individuals $\mathbf {x}$ and $\mathbf {y}$ in the objective space, and $\bar{\theta }$ is a parameter that is set to the $\lfloor (|P|/2)\rfloor $th minimum element of

$$\begin{aligned} \left\{ \min \limits _{\mathbf {q}\in P \setminus \{\mathbf {p}\} } \quad \theta _{\mathbf {pq}} \quad | \quad \mathbf {p}\in P\right\} . \end{aligned}$$

(3)

SDR is irreflexive, antisymmetric, and nontransitive.
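The following sketch illustrates Eqs. (2) and (3) as they are written above. It assumes that objective vectors are non-negative (for example, after normalization) and that the niche angle is strictly positive. The helper names `con`, `angle`, `niche_angle`, and `sdr_dominates` are illustrative only.

```python
import math
from typing import List, Sequence


def con(f: Sequence[float]) -> float:
    """Convergence measure Con(x): the sum of the objective values."""
    return sum(f)


def angle(fx: Sequence[float], fy: Sequence[float]) -> float:
    """Angle (radians) between two objective vectors."""
    dot = sum(a * b for a, b in zip(fx, fy))
    nx = math.sqrt(sum(a * a for a in fx))
    ny = math.sqrt(sum(b * b for b in fy))
    if nx == 0.0 or ny == 0.0:
        return 0.0  # assumption: a zero vector is treated as having angle 0
    return math.acos(max(-1.0, min(1.0, dot / (nx * ny))))


def niche_angle(pop: List[Sequence[float]]) -> float:
    """theta_bar of Eq. (3): floor(|P|/2)-th smallest minimum angle."""
    if len(pop) < 2:
        raise ValueError("need at least two individuals")
    mins = sorted(
        min(angle(p, q) for j, q in enumerate(pop) if j != i)
        for i, p in enumerate(pop)
    )
    return mins[max(1, len(pop) // 2) - 1]


def sdr_dominates(fx: Sequence[float], fy: Sequence[float],
                  theta_bar: float) -> bool:
    """Strengthened dominance relation of Eq. (2)."""
    if theta_bar <= 0.0:
        raise ValueError("theta_bar must be positive in this sketch")
    theta = angle(fx, fy)
    if theta < theta_bar:
        # Same niche: compare convergence directly.
        return con(fx) < con(fy)
    # Different niches: penalize x by the angle ratio before comparing.
    return con(fx) * theta / theta_bar <= con(fy)
```

With this form, two individuals in the same niche are compared purely on convergence, while an individual far away in angle must be much better converged to dominate.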

NSGA-II/SDR is based on the algorithm framework of NSGA-II. Such multi-objective algorithms obtain a set of non-dominated solutions that are evenly distributed and close to the Pareto front by performing environmental selection. During the environmental selection, NSGA-II/SDR first uses the strengthened dominance relation to sort the individuals into several non-dominated layers $(L_1, L_2, \ldots ,L_i, \ldots )$. Suppose that the parent population size is N. The first $i-1$ layers are put into the next generation, where $|L_1\cup L_2\cup \cdots \cup L_{i-1}|<N$ and $|L_1\cup L_2\cup \cdots \cup L_i|>N$. The crowding distance [8], which is the same as in the original NSGA-II, is then used to select individuals from the ith layer one by one until the population size reaches N. More details of NSGA-II/SDR can be found in [31].
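The truncation of the last layer can be sketched with the standard NSGA-II crowding distance. The code below is an illustration under the assumption that the layer is given as a list of objective vectors. The function names, and the choice to rank once by descending distance, are ours rather than the paper's.

```python
import math
from typing import List, Sequence


def crowding_distance(front: List[Sequence[float]]) -> List[float]:
    """Standard NSGA-II crowding distance for one non-dominated layer."""
    n = len(front)
    if n == 0:
        return []
    dist = [0.0] * n
    for k in range(len(front[0])):
        order = sorted(range(n), key=lambda i: front[i][k])
        lo, hi = front[order[0]][k], front[order[-1]][k]
        # Boundary individuals are always kept.
        dist[order[0]] = dist[order[-1]] = math.inf
        if hi == lo:
            continue
        for pos in range(1, n - 1):
            i = order[pos]
            gap = front[order[pos + 1]][k] - front[order[pos - 1]][k]
            dist[i] += gap / (hi - lo)
    return dist


def fill_from_last_layer(layer: List[Sequence[float]], slots: int) -> List[int]:
    """Indices of the `slots` least crowded individuals in the layer."""
    d = crowding_distance(layer)
    order = sorted(range(len(layer)), key=lambda i: d[i], reverse=True)
    return order[:slots]
```

Here `slots` would be N minus the number of individuals already taken from layers $L_1$ to $L_{i-1}$.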

A two-stage infill strategy and surrogate-ensemble assisted expensive many-objective optimization

The radial basis function (RBF) has been widely used in SAEAs. RBF fits nonlinear and higher-order problems well and has a low training complexity [41]. We use RBF models with two different kernel functions, the cubic and the Gaussian function, as the base learners to construct the surrogate ensemble. The flowchart of TSEMO is given in Fig. 1 and the pseudocode of TSEMO is in Algorithm 1. In the main loop, TSEMO first uses Latin hypercube sampling (LHS) [20] to generate $11D-1$ individuals. These individuals are evaluated by the expensive functions and kept in archive $A_1$. The non-dominated individuals in $A_1$ are kept in $A_2$. We conduct the environmental selection of NSGA-II/SDR on the individuals in $A_1$, and N individuals are selected as the initial population. Then the individuals in $A_1$ are used to train the surrogate ensemble for each objective of the MaOP. After that, the population is optimized for $w_{\max }$ generations. Next, a two-stage infill strategy is used to select individuals for expensive evaluations. These new samples are added to archive $A_1$ and used to update archive $A_2$. The surrogate ensemble is re-trained with the samples in $A_1$. At last, the population is revised with the surrogate ensemble. This process is repeated until the maximum number of expensive evaluations $FE_{\max }$ is reached. In the subsequent subsections, we give a detailed description of the surrogate-ensemble based optimization, the selection of individuals for exact function evaluations (the two-stage infill strategy), and the revision of the population, respectively.
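The initial sampling step can be illustrated with a basic Latin hypercube sampler on the unit hypercube. This is a generic sketch, not the authors' implementation. It assumes that the decision variables are later scaled to the problem bounds, and the function name is hypothetical.

```python
import random
from typing import List, Optional


def latin_hypercube(n: int, dim: int,
                    rng: Optional[random.Random] = None) -> List[List[float]]:
    """Draw n points in [0, 1)^dim with one point per stratum per dimension."""
    rng = rng or random.Random()
    samples = [[0.0] * dim for _ in range(n)]
    for d in range(dim):
        strata = list(range(n))
        rng.shuffle(strata)  # random pairing of strata across dimensions
        for i, s in enumerate(strata):
            # Uniform draw inside stratum [s/n, (s+1)/n).
            samples[i][d] = (s + rng.random()) / n
    return samples


# Example: D = 3 decision variables gives 11*D - 1 = 32 initial samples.
initial = latin_hypercube(11 * 3 - 1, 3, random.Random(0))
```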

Surrogate-ensemble based optimization

In the surrogate-ensemble based optimization, all the individuals are evaluated by the RBF ensemble instead of the exact objective functions. Algorithm 2 gives the pseudocode of the surrogate-ensemble based optimization. We use simulated binary crossover [6] and polynomial mutation [7] to generate the offspring population Q, and population Q is evaluated by the RBF ensemble. The ith objective of individual $\mathbf {x}$ is calculated below:

$$\begin{aligned} \widetilde{f}_i(\mathbf {x})=\frac{\widetilde{f}_i^{\mathrm{{cubic}}}(\mathbf {x})+\widetilde{f}_i^{\mathrm{{Gaussian}}}(\mathbf {x})}{2} \end{aligned}$$

(4)

where $\widetilde{f}_i^{\mathrm{{cubic}}}(\mathbf {x})$ and $\widetilde{f}_i^{\mathrm{{Gaussian}}}(\mathbf {x})$ denote the ith objective of the individual $\mathbf {x}$ evaluated by the RBF models with the cubic kernel function and the Gaussian kernel function, respectively. Then the objective functions of $P\cup Q$ are normalized by

$$\begin{aligned} F(\mathbf {x})^{'}=\frac{F(\mathbf {x})-F_{\min }}{F_{\max }-F_{\min }} \end{aligned}$$

(5)

where $F_{\max }$ and $F_{\min }$ are the vectors of the maximum and minimum objective values of population $P\cup Q$, respectively. Then the environmental selection of NSGA-II/SDR is applied to the combined population $L=P\cup Q$ to select the next generation. When the maximum number of generations $w_{\max }$ is reached, the surrogate-ensemble based optimization ends and the population $P^{'}$ is returned as the output.
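Equations (4) and (5) can be sketched as follows. The trained RBF models are represented here by plain callables, one per objective, which is an assumption made only for illustration. Returning 0 for an objective with zero range is also our choice, since the paper does not discuss that case.

```python
from typing import Callable, List, Sequence

Surrogate = Callable[[Sequence[float]], float]  # hypothetical model interface


def ensemble_predict(x: Sequence[float],
                     cubic_models: List[Surrogate],
                     gaussian_models: List[Surrogate]) -> List[float]:
    """Eq. (4): average the cubic and Gaussian RBF predictions per objective."""
    return [(c(x) + g(x)) / 2.0 for c, g in zip(cubic_models, gaussian_models)]


def normalize(objs: List[List[float]]) -> List[List[float]]:
    """Eq. (5): min-max normalization over the combined population."""
    m = len(objs[0])
    f_min = [min(f[i] for f in objs) for i in range(m)]
    f_max = [max(f[i] for f in objs) for i in range(m)]
    return [
        [
            (f[i] - f_min[i]) / (f_max[i] - f_min[i]) if f_max[i] > f_min[i]
            else 0.0  # assumption: a constant objective maps to 0
            for i in range(m)
        ]
        for f in objs
    ]
```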

A two-stage infill strategy

After the surrogate-ensemble assisted optimization, u individuals are selected for exact evaluations. In this paper, we propose a two-stage infill strategy for selecting individuals, and the pseudocode of the infill strategy is given in Algorithm 3.

In the first stage, the convergence of population or the uncertainty of the surrogate ensemble is adaptively considered according to the distribution of the population. By considering the diversity of the population, we use k-means to divide the population $P^{'}$ into K clusters in objective space. An individual from each cluster with the minimum Euclidean distance (ED) to the origin or the maximum approximate variance $\varepsilon _1$ is selected for expensive function evaluations and $\varepsilon _1$ is calculated as follows:

$$\begin{aligned} \varepsilon _1(\mathbf {x})=\frac{1}{M}\sum _{i=1}^{M}\sqrt{\left( \widetilde{f}_i^{\mathrm{{cubic}}}(\mathbf {x})-\widetilde{f}_i^{\mathrm{{Gaussian}}}(\mathbf {x})\right) ^2} \end{aligned}$$

(6)

To select individuals from each cluster, the population $P^{'}$ is sorted into several non-dominated fronts by the original dominance relation in NSGA-II. For each cluster, if there is only one non-dominated front, which means the individuals are non-dominated with each other, the individual with the maximum uncertainty will be selected. Otherwise, the individual with the minimum Euclidean distance (ED) to the origin will be selected. Then the distance $d_1$, which is the minimum distance in the decision space from each of the K selected individuals to the samples in $A_1$, is calculated. If the distance $d_1$ is greater than the threshold $\delta $, the individual selected in the first stage is evaluated by the exact objective function and added to $A_1$. Otherwise, if $d_1$ is less than or equal to $\delta $, the individual is not re-evaluated. Here we set the value of $\delta $ to $10^{-6}$. A simple example is given in Fig. 2 to show the criterion for selecting individuals in each cluster. In Fig. 2, there are three clusters, $c_1$, $c_2$, $c_3$, and one individual from each cluster will be selected. In cluster $c_1$, because there are two non-dominated fronts, the individual $a_1$ with the smallest ED value will be selected. In cluster $c_2$, there is only one non-dominated front, thus the individual $a_3$ with the largest $\varepsilon _1$ will be selected. In cluster $c_3$, $a_6$ will be selected in the same way as in cluster $c_2$.
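A possible reading of the first stage is sketched below. It assumes that the k-means clusters are already available as lists of population indices, that the objective vectors are the normalized surrogate predictions, and that "only one non-dominated front" means all cluster members share the same front rank in the population-wide sort. All function names are illustrative.

```python
import math
from typing import List, Sequence


def dominates(fx: Sequence[float], fy: Sequence[float]) -> bool:
    return (all(a <= b for a, b in zip(fx, fy))
            and any(a < b for a, b in zip(fx, fy)))


def front_ranks(objs: List[Sequence[float]]) -> List[int]:
    """Simple O(n^2) per front non-dominated sorting; rank 0 is the best front."""
    ranks = [-1] * len(objs)
    remaining = set(range(len(objs)))
    rank = 0
    while remaining:
        front = [i for i in remaining
                 if not any(dominates(objs[j], objs[i])
                            for j in remaining if j != i)]
        for i in front:
            ranks[i] = rank
        remaining -= set(front)
        rank += 1
    return ranks


def epsilon1(cubic: Sequence[float], gauss: Sequence[float]) -> float:
    """Eq. (6): mean absolute disagreement between the two RBF models."""
    return sum(abs(c - g) for c, g in zip(cubic, gauss)) / len(cubic)


def first_stage_pick(clusters: List[List[int]],
                     objs: List[Sequence[float]],
                     eps1: Sequence[float]) -> List[int]:
    """Pick one index per cluster following the rule described above."""
    ranks = front_ranks(objs)
    picks = []
    for members in clusters:
        if len({ranks[i] for i in members}) == 1:
            # One front only: favor the most uncertain individual.
            picks.append(max(members, key=lambda i: eps1[i]))
        else:
            # Several fronts: favor the individual closest to the origin.
            picks.append(min(members, key=lambda i: math.hypot(*objs[i])))
    return picks


def is_novel(x: Sequence[float], archive: List[Sequence[float]],
             delta: float = 1e-6) -> bool:
    """True if x is farther than delta from every archived sample."""
    return min(math.dist(x, a) for a in archive) > delta
```

Only the picked individuals for which `is_novel` returns True would then be passed to the expensive objective functions.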

In the second stage, the $u-K$ individuals with the maximum uncertainty of $\varepsilon _2$ are selected. In this stage, the surrogate ensemble is updated by $A_1$ and the population $P^{'}$ is re-evaluated by the surrogate ensemble. The uncertainty $\varepsilon _2$ is calculated below:

$$\begin{aligned} \varepsilon _2(\mathbf {x})=\frac{1}{M}\sum _{i=1}^{M}\sqrt{\left( \widetilde{f}_i^{1}(\mathbf {x})-\widetilde{f}_i^{2}(\mathbf {x})\right) ^2} \end{aligned}$$

(7)

where $\widetilde{f}_i^1(\mathbf {x})$ denotes the ith objective value of the individual $\mathbf {x}$ approximated by the surrogate ensemble in the first stage and $\widetilde{f}_i^{2}(\mathbf {x})$ denotes the ith objective value of the individual $\mathbf {x}$ approximated by the surrogate ensemble in the second stage. The distance $d_2$ of $u-K$ individuals is calculated in the same way as $d_1$. If $d_2$ is greater than the threshold $\delta $, the individuals will be evaluated by the exact objective function and they will be added to $A_1$. The approximated variance from the first and the second stage is used to select individuals for exact evaluation and the aim is to exploit the historical information of the surrogate ensemble.
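The second stage can be sketched in the same style. The code assumes that the ensemble predictions of $P^{'}$ before and after the update are stored as lists of objective vectors. Excluding the individuals already chosen in the first stage is our assumption, and the names are illustrative.

```python
from typing import Iterable, List, Sequence


def epsilon2(pred_stage1: Sequence[float], pred_stage2: Sequence[float]) -> float:
    """Eq. (7): mean absolute change between the two ensemble predictions."""
    return (sum(abs(a - b) for a, b in zip(pred_stage1, pred_stage2))
            / len(pred_stage1))


def second_stage_pick(preds_stage1: List[Sequence[float]],
                      preds_stage2: List[Sequence[float]],
                      n_pick: int,
                      exclude: Iterable[int] = ()) -> List[int]:
    """Indices of the n_pick individuals with the largest epsilon2."""
    skipped = set(exclude)
    eps2 = [epsilon2(p, q) for p, q in zip(preds_stage1, preds_stage2)]
    candidates = [i for i in range(len(eps2)) if i not in skipped]
    candidates.sort(key=lambda i: eps2[i], reverse=True)
    return candidates[:n_pick]
```

With $u=5$ and $K=4$, `n_pick` would be $u-K=1$, and the chosen individual would still have to pass the distance check against $A_1$ before being evaluated.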

Revising the population

In surrogate-assisted optimization, the approximation errors of the objectives may lead the search in the wrong direction. In addition, the objectives approximated by the surrogate ensemble may become increasingly accurate as the number of training samples in $A_1$ increases. Considering these two points, we propose a strategy to revise the population. First, the population P before the surrogate-based optimization and the population $P^{'}$ are combined into $A_3$. After the two-stage infill strategy, the surrogate ensemble is updated with the samples in $A_1$. Then the objectives of the individuals in $A_3$ are re-evaluated by the updated surrogate ensemble. Finally, the environmental selection of NSGA-II/SDR is performed on $A_3$ and N individuals with better convergence and diversity are retained. Figure 3 gives an example to explain the effect of revising the population. Figure 3 shows a two-dimensional decision space. In Fig. 3, region 1 represents the region occupied by individuals with worse convergence and smaller approximation error, region 2 represents the region occupied by individuals with better convergence and larger approximation error, region 3 represents the region occupied by individuals with better convergence and smaller approximation error, and region 4 represents the region occupied by individuals with worse convergence and larger approximation error. The five-pointed stars represent individuals whose exact function values have good convergence. In the process of surrogate-ensemble based optimization, the optimization of the population is based on the values approximated by the surrogate ensemble. Because the training samples are limited at the initial stage of the optimization, the approximation error of the surrogate ensemble is large, and the individuals in region 4 with worse approximated convergence will be discarded. However, region 4 may contain some individuals with good convergence. By revising the population, the initial population is re-evaluated with the updated surrogate ensemble after the two-stage infill strategy. At this time, the approximation accuracy of the surrogate ensemble has improved due to the increased number of samples. In this way, the individuals represented by the five-pointed stars in region 4 can be retained. The effectiveness of this step will be validated in the experimental section.

Numerical experiments

In this section, we first describe the performance metrics and parameter settings. Then we test the performance of TSEMO by comparing it with several state-of-the-art many-objective optimization algorithms on the DTLZ and MaF test problems. After that, we conduct experiments on different values of K to determine the thresholds used in the two-stage infill strategy in TSEMO. Then the effectiveness of the two-stage infill strategy is studied. At last, the efficiency of revising the population is examined. The Wilcoxon rank-sum test is used to compare the results of different algorithms, and each algorithm is run 20 times independently. The significance level is set to 0.05. The symbol ‘+’ indicates that the compared algorithm performs statistically better than TSEMO, ‘−’ indicates that the compared algorithm performs statistically worse than TSEMO, and ‘$\approx $’ means that there is no significant difference between the performance of TSEMO and the compared algorithm.

Performance metrics and experimental settings

In this paper, inverted generational distance (IGD) [1] is used as the performance indicator to compare the performance of different algorithms. The IGD indicator can provide information about convergence and diversity of the non-dominated solutions, and it is defined as follows:

$$\begin{aligned} \mathrm{{IGD}}(P^*,\varOmega )=\frac{\sum _{x\in P^*}\mathrm{{dis}}(x,\varOmega )}{|P^*|} \end{aligned}$$

(8)

where $P^*$ denotes the evenly distributed point set on the true PF, $\varOmega $ is the set of non-dominated solutions obtained by the algorithm, and $\mathrm{{dis}}(x,\varOmega )$ represents the minimum Euclidean distance between x and the points in $\varOmega $. The number of reference points on the true PF is set as recommended in KRVEA [5].
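As an illustration, the IGD indicator in its usual form averages, over the reference points, the distance from each reference point to its nearest obtained solution. The sketch below assumes both sets are given as lists of objective vectors, and the function name is ours.

```python
import math
from typing import List, Sequence


def igd(reference: List[Sequence[float]],
        solutions: List[Sequence[float]]) -> float:
    """Mean distance from each reference point to its nearest solution."""
    return sum(
        min(math.dist(r, s) for s in solutions) for r in reference
    ) / len(reference)
```

A smaller value indicates that the obtained solutions are both close to the reference set and cover it well.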

We also use the hypervolume (HV) as a performance indicator to compare our method with other algorithms. The HV indicator measures the volume of the objective-space region that is dominated by the non-dominated solutions obtained by the algorithm and bounded by a reference point, and it is defined below:

$$\begin{aligned} \begin{aligned}&HV(\varOmega |\mathbf {z}^r)\\&\quad = \mathrm{{Vol}}\Bigg ( \bigcup \limits _{\mathbf {x}\in \varOmega }[f_1(\mathbf {x}), z_1^r]\times [f_2(\mathbf {x}),z_2^r]\times \cdots \times [f_M(\mathbf {x}),z_M^r]\Bigg ) \end{aligned} \end{aligned}$$

(9)

where $\mathrm{{Vol}}(\cdot )$ indicates the Lebesgue measure, and $\mathbf {z}^r=(z_1^r,z_2^r,\ldots ,z_M^r)$ is a reference point in the objective space that is dominated by $\varOmega $. $\mathbf {z}^r$ is set as recommended in AR-MOEA [30].
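For intuition, Eq. (9) can be computed exactly with a simple sweep in the special case of $M=2$ objectives. The sketch below is restricted to that case and is not the routine used in the experiments, which involve up to 10 objectives. The function name is illustrative.

```python
from typing import List, Sequence, Tuple


def hypervolume_2d(points: List[Sequence[float]],
                   ref: Tuple[float, float]) -> float:
    """Exact hypervolume for two minimization objectives."""
    # Only points that strictly dominate the reference point contribute.
    pts = sorted(
        (p[0], p[1]) for p in points if p[0] < ref[0] and p[1] < ref[1]
    )
    hv = 0.0
    best_f2 = ref[1]
    for f1, f2 in pts:  # sweep in order of increasing f1
        if f2 < best_f2:
            # New horizontal slab between the previous best f2 and this f2.
            hv += (ref[0] - f1) * (best_f2 - f2)
            best_f2 = f2
    return hv
```

Dominated points add no slab, so they leave the value unchanged.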

The experimental settings are given below:

(1) Genetic operators: in this paper, the simulated binary crossover [6] and polynomial mutation [7] are used to generate the offspring population in all the algorithms. The distribution index of crossover is set to 20, and the distribution index of mutation is set to 20. The crossover probability $p_c$ is set to 1.0, and the mutation probability $p_m$ is set to 1/D.

(2) Population size: the population size of TSEMO is set to 50, and the population sizes of the compared algorithms are set as in their corresponding references.

(3) Termination condition: the maximum number of function evaluations is used as the termination condition and it is set to 300.

(4) The number of evolutionary generations assisted by the surrogate ensemble, $w_{\max }$, is set to 20, following the empirical value in KRVEA [5].

Table 1 Statistical results for IGD values obtained by KTA2, KRVEA, CSEA, NSGA-II/SDR, RVEA and TSEMO

Table 2 Statistical results for HV values obtained by KTA2, KRVEA, CSEA, NSGA-II/SDR, RVEA and TSEMO

Table 3 Statistical results for IGD values obtained by KTA2, KRVEA, CSEA, NSGA-II/SDR, RVEA and TSEMO

Experimental results

In this section, we compare TSEMO with three surrogate-assisted algorithms, namely KTA2, CSEA, and KRVEA, and two MOEAs, NSGA-II/SDR and RVEA, on DTLZ1–DTLZ7. The comparison results are given in Tables 1 and 2. The numbers of objectives of the DTLZ problems are 3, 4, 6, 8, and 10. We also test the performance of TSEMO and the compared algorithms on MaF1–MaF7. The numbers of objectives of the MaF problems are 3, 5, and 10. The number of decision variables is 10 for both test suites.

Table 4 Statistical results for IGD values obtained by TSEMO-I and TSEMO

Table 5 Statistical results for IGD values obtained by TSEMO-II and TSEMO

The statistical results of IGD in Table 1 show that TSEMO obtained the best results on all the test instances of DTLZ2, DTLZ5, and DTLZ6, and on most of the test instances of DTLZ4 and DTLZ7. Compared with the two multi-objective algorithms NSGA-II/SDR and RVEA, TSEMO obtained 25 better results among 35 test problems, which demonstrates that surrogate models are effective on most test problems. For a few test problems, such as DTLZ1 and DTLZ3, TSEMO needs more expensive function evaluations to find the true PF. TSEMO obtained 20 better results and 12 worse results than KTA2, and 18 better results and 6 worse results than KRVEA. Compared to CSEA, TSEMO obtained 23 better results and 11 worse results. The statistical results of HV in Table 2 also show that TSEMO performed the best among all the algorithms. Therefore, on the DTLZ test suite, TSEMO is more efficient than the three representative surrogate-assisted many-objective optimization algorithms, KTA2, KRVEA, and CSEA. Next, the parallel coordinates plot of the final non-dominated solutions obtained by TSEMO and the compared algorithms for DTLZ2 with 10 objectives is presented in Fig. 4. From it, we can see that the maximum objective value over all the non-dominated solutions obtained by TSEMO is smaller than that of the compared algorithms, and the non-dominated solutions of TSEMO have a higher density than those of the two non-surrogate-assisted MOEAs. This means that TSEMO can achieve a set of well-converged and evenly distributed solutions and has a better capability in terms of convergence and diversity. The MOEAs NSGA-II/SDR and RVEA have a lower density of solutions, which means they have a worse distribution.

Furthermore, to verify the effectiveness of TSEMO on many-objective optimization problems, we conduct experiments on MaF1–MaF7, which are modified versions of the DTLZ problems. The results are presented in Table 3. Compared with the MOEAs without surrogates, TSEMO obtained 15 better results than RVEA and NSGA-II/SDR among 21 test problems, which further verifies the effectiveness of adding the surrogate to MOEAs. Among the 21 test problems, TSEMO obtained 11 and 10 better results than KRVEA and CSEA, respectively, and lost on 6 and 7 problems, respectively, which shows that our proposed method is more efficient than both of these algorithms on the MaF1–MaF7 test problems.

Parameter analysis

In the two-stage infill strategy, a total of u individuals are selected for expensive objective evaluations. For a fair comparison, we set u to 5, the same value as in KRVEA [5]. In the first stage of the infill strategy, K individuals are selected for real evaluations, and the remaining $u-K$ individuals are selected in the second stage. The number of individuals selected in the first stage affects which individuals with large uncertainty are found in the second stage and ultimately affects the quality of the obtained Pareto front. Therefore, we first conducted experiments with $u=5$ and different numbers of individuals selected in the first stage for expensive evaluations, i.e., $K = 5, 4, 3, 2, 1$. Setting $K=5$ means that individuals are selected for real evaluations only in the first stage. Figure 5 shows the mean IGD values over 20 independent runs obtained by TSEMO with the different numbers of individuals selected for exact evaluations in the first stage. From Fig. 5, we can see that TSEMO performs best when the number of individuals selected for exact evaluation in the first stage is set to 4.

We then analyze how the performance of the algorithm varies when u and K increase further while the number of infill samples in the second stage remains unchanged. Figure 6 shows the mean IGD values over 20 independent runs obtained by TSEMO with different numbers u of individuals selected for exact evaluations. From Fig. 6, we can see that there are no significant differences in the performance of TSEMO on DTLZ2 with 3, 4, 6, 8, and 10 objectives when u increases further. On DTLZ1 with 3 objectives, the algorithm performs best when $u=6$, $K=5$. However, on DTLZ1 with 4, 6, 8, and 10 objectives, the algorithm performs best when $u=5$, $K=4$. Thus, we set $u=5$ and $K=4$ in our method for solving expensive many-objective optimization problems.
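The roles of u and K can be illustrated with a short Python sketch of how an evaluation budget of u individuals is split between the two stages. This is a simplification under stated assumptions, not the authors' implementation: the function `two_stage_infill`, the scalar `convergence_score` (smaller is better) and the scalar `uncertainty` (larger means more uncertain) are hypothetical stand-ins for the criteria used in each stage of the paper.

```python
def two_stage_infill(candidates, convergence_score, uncertainty, u=5, k=4):
    """Pick k candidates by convergence (stage 1), then u - k by uncertainty (stage 2).

    candidates: list of hashable candidate ids
    convergence_score: dict id -> score, smaller is better (assumed criterion)
    uncertainty: dict id -> uncertainty estimate, larger is more uncertain (assumed criterion)
    """
    if not 0 <= k <= u:
        raise ValueError("need 0 <= k <= u")
    # Stage 1: the k candidates with the smallest convergence score.
    stage_one = sorted(candidates, key=lambda c: convergence_score[c])[:k]
    # Stage 2: among the remaining candidates, the u - k with the largest uncertainty.
    chosen = set(stage_one)
    remaining = [c for c in candidates if c not in chosen]
    stage_two = sorted(remaining, key=lambda c: uncertainty[c], reverse=True)[:u - k]
    return stage_one, stage_two


# Hypothetical candidate pool, for illustration only.
candidates = ["a", "b", "c", "d", "e", "f", "g"]
convergence_score = {"a": 0.1, "b": 0.2, "c": 0.3, "d": 0.4, "e": 0.5, "f": 0.6, "g": 0.7}
uncertainty = {"a": 0.9, "b": 0.1, "c": 0.1, "d": 0.1, "e": 0.2, "f": 0.8, "g": 0.3}

first, second = two_stage_infill(candidates, convergence_score, uncertainty, u=5, k=4)
only_first, none_second = two_stage_infill(candidates, convergence_score, uncertainty, u=5, k=5)
```

With $u=5$ and $K=4$, one evaluation is reserved for the most uncertain remaining candidate; with $K=5$, the second stage selects nothing, which corresponds to using the first stage only.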

Effectiveness of the two-stage infill strategy

In this part, we investigate the effect of the two-stage infill strategy by comparing TSEMO with a variant, TSEMO-I, which selects individuals for exact evaluations only in the first stage. Table 4 gives the statistical results of TSEMO and TSEMO-I. From Table 4, we can see that TSEMO obtained better results than TSEMO-I on 7 of the 35 problems, comparable results on 28, and worse results on none. TSEMO-I, which lacks the second-stage sampling strategy, performs worse mainly on DTLZ1 with 8 objectives, DTLZ2 with 3, 4, 6, 8, and 10 objectives, DTLZ3 with 8 objectives, and DTLZ5 with 10 objectives. A likely reason is that DTLZ2 is used to investigate the diversity and distribution achieved by an algorithm, and the second-stage sampling strategy can improve diversity. On other problems, such as DTLZ4 and DTLZ7, TSEMO-I showed performance comparable to that of TSEMO. Therefore, the second-stage infill sampling strategy makes a clear contribution, and the two-stage infill strategy is adopted in our method.

Effectiveness of the two different kernel functions used in the surrogate ensemble

In this paper, we adopt RBF models with two different kernel functions to construct a surrogate ensemble and use the ensemble to approximate each objective function. To assess the benefit of the surrogate ensemble, we compare TSEMO with a variant, TSEMO-II, in which the surrogate ensemble is replaced with GP and the uncertainty is provided by GP, while the two-stage infill strategy and the strategy of revising the population are retained. The average IGD results of TSEMO-II and TSEMO over 20 independent runs are presented in Table 5. As can be seen in Table 5, TSEMO obtained 12 better, 3 comparable, and 0 worse results than TSEMO-II among a total of 15 problems. TSEMO performs well on most test problems, and the results show that it can achieve a better balance between convergence and diversity. This may be because the RBF models with two different kernels provide more accurate fitness predictions and because the uncertainty information provided by our method is useful. Thus, we use the surrogate ensemble to approximate each objective function.
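To make the idea of ensemble-based uncertainty concrete, the following Python sketch combines the predictions of two member models into a mean prediction and a variance that serves as the uncertainty estimate. It is a sketch under stated assumptions: `member_a` and `member_b` are hypothetical stand-ins for two fitted RBF models with different kernels, the unweighted mean and the population variance are assumed aggregation rules, and none of the names come from the paper.

```python
from statistics import fmean, pvariance


def ensemble_predict(members, x):
    """Return (mean prediction, variance across members) for one objective at point x."""
    predictions = [member(x) for member in members]
    # Disagreement between members is used as the uncertainty estimate (assumption).
    return fmean(predictions), pvariance(predictions)


# Hypothetical stand-ins for two fitted RBF surrogates with different kernels.
def member_a(x):
    return sum(v * v for v in x)


def member_b(x):
    return sum(v * v for v in x) + 0.2 * x[0]


members = [member_a, member_b]
mean_1, var_1 = ensemble_predict(members, (1.0, 2.0))  # members disagree here
mean_2, var_2 = ensemble_predict(members, (0.0, 2.0))  # members agree here
```

Points where the members disagree receive a larger variance and are therefore natural candidates for an uncertainty-driven infill criterion, whereas a GP supplies its own predictive variance from a single model.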

Effectiveness of revising the population

In this section, we investigate the effect of revising the population by comparing TSEMO with a variant, TSEMO-R, from which the strategy of revising the population is removed. The average IGD results of TSEMO-R and TSEMO over 20 independent runs on the DTLZ1–DTLZ7 problems with 3, 4, 6, 8, and 10 objectives are presented in Table 6. As can be seen in Table 6, TSEMO obtained 5 better, 25 comparable, and 0 worse results than TSEMO-R among a total of 35 problems. TSEMO performs better on DTLZ2 with 10 objectives, DTLZ5 with 8 and 10 objectives, and DTLZ6 with 3 and 4 objectives. The results show that the strategy of revising the population is effective in surrogate-assisted multi-objective optimization.

Table 6 Statistical results for IGD values obtained by TSEMO-R and TSEMO for DTLZ1-DTLZ7

Conclusion

In this paper, we propose a two-stage infill strategy and surrogate-ensemble assisted expensive many-objective optimization (TSEMO) method. To improve the robustness of the surrogate model, we use RBF models with two kernel functions to construct a surrogate ensemble. After optimization, a two-stage infill strategy is used to select individuals for expensive evaluations. The approximated variance of the surrogate ensemble and the approximated fitness are considered adaptively in the first stage, and historical information on the variance approximated by the surrogate ensemble is considered in the second stage. To prevent approximation errors from misleading the search direction, a strategy of revising the population is proposed.

We compare our method with three surrogate-assisted MOEAs and two non-surrogate-assisted MOEAs on two sets of test problems. The results show that TSEMO achieves better performance than the compared algorithms under a limited number of expensive function evaluations. In our work, the surrogate ensemble, the two-stage infill strategy, and the strategy of revising the population are all effective for solving expensive MOPs. However, for multimodal problems such as DTLZ3 and MaF4, TSEMO still needs more expensive function evaluations to find the true PFs. Therefore, a more effective optimization process and infill strategy should be developed for solving these complex problems.

References

1. Bosman P, Thierens D (2003) The balance between proximity and diversity in multiobjective evolutionary algorithms. IEEE Trans Evol Comput 7(2):174–188
2. Cai X, Gao L, Li X (2020) Efficient generalized surrogate-assisted evolutionary algorithm for high-dimensional expensive problems. IEEE Trans Evol Comput 24(2):365–379
3. Cheng R, Jin Y, Olhofer M, Sendhoff B (2016) A reference vector guided evolutionary algorithm for many-objective optimization. IEEE Trans Evol Comput 20(5):773–791
4. Cheng R, Rodemann T, Fischer M, Olhofer M, Jin Y (2017) Evolutionary many-objective optimization of hybrid electric vehicle control: from general optimization to preference articulation. IEEE Trans Emerg Top Comput Intell 1(2):97–111
5. Chugh T, Jin Y, Miettinen K, Hakanen J, Sindhya K (2018) A surrogate-assisted reference vector guided evolutionary algorithm for computationally expensive many-objective optimization. IEEE Trans Evol Comput 22(1):129–142
6. Deb K (2001) Multi-objective optimization using evolutionary algorithms, vol 16. Wiley, Chichester
7. Deb K, Goyal M (1999) A combined genetic adaptive search (GeneAS) for engineering design. Comput Sci Inform 26:30–45
8. Deb K, Pratap A, Agarwal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput 6(2):182–197
9. Emmerich M, Giannakoglou K, Naujoks B (2006) Single- and multiobjective evolutionary optimization assisted by Gaussian random field metamodels. IEEE Trans Evol Comput 10(4):421–439
10. Gu Q, Zhou Y, Li X, Ruan S (2021) A surrogate-assisted radial space division evolutionary algorithm for expensive many-objective optimization problems. Appl Soft Comput 111:107703
11. Guo D, Wang X, Gao K, Jin Y, Ding J, Chai T (2022) Evolutionary optimization of high-dimensional multiobjective and many-objective expensive problems assisted by a dropout neural network. IEEE Trans Syst Man Cybern Syst 52(4):2084–2097
12. Jia L, Wang Y, Fan L (2014) Multiobjective bilevel optimization for production-distribution planning problems using hybrid genetic algorithm. Integrat Comput Aided Eng 21(1):77–90
13. Jin Y (2011) Surrogate-assisted evolutionary computation: recent advances and future challenges. Swarm Evol Comput 1(2):61–70
14. Jin Y, Wang H, Chugh T, Guo D, Miettinen K (2019) Data-driven evolutionary optimization: an overview and case studies. IEEE Trans Evol Comput 23(3):442–458
15. Knowles J (2006) ParEGO: a hybrid algorithm with on-line landscape approximation for expensive multiobjective optimization problems. IEEE Trans Evol Comput 10(1):50–66
16. Li F, Shen W, Cai X, Gao L, Gary Wang G (2020) A fast surrogate-assisted particle swarm optimization algorithm for computationally expensive problems. Appl Soft Comput 92:106303
17. Lin Q, Wu X, Ma L, Li J, Gong M, Coello CAC (2021) An ensemble surrogate-based framework for expensive multiobjective evolutionary optimization. IEEE Trans Evol Comput 20:1
18. Liu B, Zhang Q, Gielen GGE (2014) A Gaussian process surrogate model assisted evolutionary algorithm for medium scale expensive optimization problems. IEEE Trans Evol Comput 18(2):180–192
19. Liu Y, Liu J, Jin Y (2021) Surrogate-assisted multipopulation particle swarm optimizer for high-dimensional expensive optimization. IEEE Trans Syst Man Cybern Syst 20:1–14
20. Mckay MD, Beckman RJ, Conover WJ (1979) A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 21(2):239–245
21. Namura N, Shimoyama K, Obayashi S (2017) Expected improvement of penalty-based boundary intersection for expensive multiobjective optimization. IEEE Trans Evol Comput 21(6):898–913
22. Pan JS, Liu N, Chu SC, Lai T (2021) An efficient surrogate-assisted hybrid optimization algorithm for expensive optimization problems. Inf Sci 561:304–325
23. Pan L, He C, Tian Y, Wang H, Zhang X, Jin Y (2019) A classification-based surrogate-assisted evolutionary algorithm for expensive many-objective optimization. IEEE Trans Evol Comput 23(1):74–88
24. Pires EJS, de Moura Oliveira PB, Machado JAT (2004) Multi-objective genetic manipulator trajectory planner. In: Raidl GR, Cagnoni S, Branke J, Corne DW, Drechsler R, Jin Y, Johnson CG, Machado P, Marchiori E, Rothlauf F, Smith GD, Squillero G (eds) Appl Evol Comput. Springer, Heidelberg, pp 219–229
25. Ponweiser W, Wagner T, Vincze M (2008) Clustered multiple generalized expected improvement: a novel infill sampling criterion for surrogate models. In: 2008 IEEE congress on evolutionary computation (IEEE world congress on computational intelligence), pp 3515–3522
26. Song Z, Wang H, He C, Jin Y (2021) A kriging-assisted two-archive evolutionary algorithm for expensive many-objective optimization. IEEE Trans Evol Comput 25(6):1013–1027
27. Sun C, Jin Y, Zeng JC, Yu Y (2015) A two-layer surrogate-assisted particle swarm optimization algorithm. Soft Comput 19(6):1461–1475
28. Sun C, Jin Y, Cheng R, Ding J, Zeng J (2017) Surrogate-assisted cooperative swarm optimization of high-dimensional expensive problems. IEEE Trans Evol Comput 21(4):644–660
29. Tian J, Tan Y, Zeng J, Sun C, Jin Y (2019) Multiobjective infill criterion driven Gaussian process-assisted particle swarm optimization of high-dimensional expensive problems. IEEE Trans Evol Comput 23(3):459–472
30. Tian Y, Cheng R, Zhang X, Cheng F, Jin Y (2018) An indicator-based multiobjective evolutionary algorithm with reference point adaptation for better versatility. IEEE Trans Evol Comput 22(4):609–622
31. Tian Y, Cheng R, Zhang X, Su Y, Jin Y (2019) A strengthened dominance relation considering convergence and diversity for evolutionary many-objective optimization. IEEE Trans Evol Comput 23(2):331–345
32. Wang H, Jin Y, Doherty J (2017) Committee-based active learning for surrogate-assisted particle swarm optimization of expensive problems. IEEE Trans Cybern 47(9):2664–2677
33. Wang H, Jin Y, Sun C, Doherty J (2019) Offline data-driven evolutionary optimization using selective surrogate ensembles. IEEE Trans Evol Comput 23(2):203–216
34. Wang X, Jin Y, Schmitt S, Olhofer M (2020) An adaptive Bayesian approach to surrogate-assisted evolutionary multi-objective optimization. Inf Sci 519:317–331
35. Yang C, Ding J, Jin Y, Chai T (2020) Offline data-driven multiobjective optimization: knowledge transfer between surrogates and generation of final solutions. IEEE Trans Evol Comput 24(3):409–423
36. Yu H, Tan Y, Zeng J, Sun C, Jin Y (2018) Surrogate-assisted hierarchical particle swarm optimization. Inf Sci 454–455:59–72
37. Yu H, Kang L, Tan Y, Zeng JC, Sun C (2021) A multi-model assisted differential evolution algorithm for computationally expensive optimization problems. Complex Intell Syst 7:2347–2371
38. Zhang Q, Li H (2007) MOEA/D: a multiobjective evolutionary algorithm based on decomposition. IEEE Trans Evol Comput 11(6):712–731
39. Zhang Q, Liu W, Tsang E, Virginas B (2010) Expensive multiobjective optimization by MOEA/D with Gaussian process model. IEEE Trans Evol Comput 14(3):456–474
40. Zhao Y, Sun C, Zeng J, Tan Y, Zhang G (2021) A surrogate-ensemble assisted expensive many-objective optimization. Knowl Based Syst 211:106520
41. Zhao Y, Zeng J, Tan Y (2021) Neighborhood samples and surrogate assisted multi-objective evolutionary algorithm for expensive many-objective optimization problems. Appl Soft Comput 105:107268
42. Zitzler E, Künzli S (2004) Indicator-based selection in multiobjective search. Parallel problem solving from nature. Springer, Berlin, pp 832–842

Author information

Authors and Affiliations

College of Mechanical Engineering, Taiyuan University of Science and Technology, Taiyuan, 030024, China
Yi Zhao & Jian Zhao
Department of Computer Science and Control Engineering, North University of China, Taiyuan, 030051, China
Jianchao Zeng
Department of Computer Science and Technology, Taiyuan University of Science and Technology, Taiyuan, 030024, China
Ying Tan


Corresponding author

Correspondence to Yi Zhao.

Ethics declarations

Competing interests

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Zhao, Y., Zhao, J., Zeng, J. et al. A two-stage infill strategy and surrogate-ensemble assisted expensive many-objective optimization. Complex Intell. Syst. 8, 5047–5063 (2022). https://doi.org/10.1007/s40747-022-00751-4

Received: 05 October 2021
Accepted: 08 April 2022
Published: 03 May 2022
Issue Date: December 2022
DOI: https://doi.org/10.1007/s40747-022-00751-4


Introduction