Two-stage many-objective evolutionary algorithm: enhanced dominance relations and control mechanisms for separated balance

Li, Wei; Niliang, Qilin; Wang, Lei; Jiang, Qiaoyong

doi:10.1007/s40747-024-01505-0

Two-stage many-objective evolutionary algorithm: enhanced dominance relations and control mechanisms for separated balance

Original Article
Open access
Published: 15 June 2024

(2024)
Cite this article

Download PDF

You have full access to this open access article

Complex & Intelligent Systems Aims and scope Submit manuscript

Two-stage many-objective evolutionary algorithm: enhanced dominance relations and control mechanisms for separated balance

Download PDF

Wei Li ORCID: orcid.org/0000-0002-4336-5582¹,
Qilin Niliang¹,
Lei Wang² &
…
Qiaoyong Jiang¹

150 Accesses
Explore all metrics

Abstract

Although the multiobjective evolutionary algorithms (MOEAs) have been proved to bring promising prospects for solving multiobjective optimization problems (MOPs), the performance of the algorithm deteriorates sharply in high-dimensional objective space due to the weak selection pressure and the unregulated balance, which is caused by the increase of objective space dimension. Some current MOEAs with two-stage strategy (TS) strive to address above issues by dividing the evolutionary process into two independent stages, in which convergence and diversity are handled separately within successive generations of different stages. However, TS-MOEAs have some weaknesses, such as sensitivity to stage division, and incomplete separation of convergence and diversity. In this paper, TS/KW-MaOEA is proposed for solving many-objective optimization problems (MaOPs), which keeps TS as the central and equips a perfect control mechanism for separated balance. More specifically, TS/KW-MaOEA can automatically adjust the balance trend and provide appropriate selection pressure for MaOPs according to the Kondratiev wave (KW) search model and the objective space dimension. To verify the effectiveness of the proposed algorithm, a series of experiments are carried out against seven state-of-the-art many-objective optimization algorithms on 15 benchmark problems with up to 30 objectives. Experimental results indicate that the proposed algorithm is highly competitive against peer competitors.

A Two-phase evolutionary algorithm framework for multi-objective optimization

Article 25 November 2020

Many-Objective Evolutionary Algorithm Based on Dominance and Objective Space Decomposition

Hybrid selection based multi/many-objective evolutionary algorithm

Article Open access 27 April 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

In recent years, multiobjective optimization problems have attracted a considerable interest of researchers due to their widespread appearance in the real world. A multiobjective optimization problem (MOP) [1] that involves two or more conflicting objectives to be optimized simultaneously is briefly stated as

$$\begin{array}{c}\underset{\mathbf{x}}{\text{min}}F\left(\mathbf{x}\right)=\left({f}_{1}\left(\mathbf{x}\right),{f}_{2}\left(\mathbf{x}\right),\dots ,{f}_{m}\left(\mathbf{x}\right)\right)\\ s.t. x\in \Pi \end{array}$$

(1)

where m is the number of objective functions, Π is the decision space with $\mathbf{x}=\left({x}_{1},{x}_{2},\dots ,{x}_{n}\right)$ being a decision vector of n decision variables. For a vector x in the decision space, there exists a corresponding objective vector, denoted by f(x), in the objective space. Multiobjective optimization problems (MOPs) with more than three objectives are often referred to as many-objective optimization problems (MaOPs) [2]. Without loss of generality, the MaOPs discussed in this paper are all minimization problems.

Since evolutionary algorithms (EAs) can obtain a set of solutions in a single run due to the population-based heuristic, multiobjective EAs (MOEAs) have developed quickly over the last 2 decades [3]. For many real-world engineering applications, any optimization of one objective often leads to a deterioration in at least one other objective due to the conflicting nature between objectives. Since the consideration of conflicting objectives is the key to deal with optimization problems, many MOEAs tend to a compromise search. Classical MOEAs can be roughly classified into three categories: Pareto-dominance-based MOEAs, decomposition- and indicator-based MOEAs.

The basic idea of Pareto-dominance-based MOEAs is to distinguish and select candidate solutions according to dominance relation, that is, the dominant solution dominates the inferior solution. Specifically, the solutions are sorted according to the Pareto dominance relation. Then, auxiliary strategies are applied to select the nondominated solutions, and finally a set of optimal solutions are obtained, which are distributed uniformly on the Pareto front (PF). An example of these algorithms is NSGA-II [4], designed by Deb et al., which is an elitist nondominated sorting genetic algorithm. Pareto-dominance-based MOEAs show significant effect in dealing with MOPs, however, the performance of these algorithms deteriorate severely when the number of objectives increases. With the increase of dimension and the mushrooming of nondominated solutions [5], MOEAs tend to search randomly. Since Pareto dominance is employed as a major selection criteria, the weakening of selection pressure will bring down the convergence ability of the solution toward the Pareto front.

The decomposition-based MOEAs decompose the MOP into a number of single objective optimization problems (SOPs) and optimize them simultaneously. An example of classical algorithms is MOEA/D proposed by Zhang and Li [6]. Compared with Pareto-dominance-based MOEAs, decomposition-based MOEAs are more efficient in dealing with MaOPs since they do not need to consider the conflict between objectives. However, decomposition-based MOEAs relies too much on the decomposition method and relevant mathematical methods. Whereas the weight aggregation method is not effective for the minimization problems with non-convex PF.

The indicator-based algorithms aim to enhance the environmental selection pressure to improve the undesirable effect of Pareto dominance relation in solving complex problems. The indicator-based MOEAs employ certain performance indicators, such as hypervolume (HV) indicator, R2 indicator and ∆p indicator [7,8,9] to measure the quality of solutions. A representative example of classical algorithms is IBEA [10], proposed by Zitzler et al., which uses fitness values based on scaled values to select individuals, regardless of Pareto-based ranking of the individuals. However, the computational complexity of most of the indicators is quite expensive, and there is no indicator preference for evaluating all problems.

Based on the above analysis, there still some challenges for MOEAs to deal with MaOPs. First, Pareto dominance lacks selection pressure in high-dimensional objective space [11]. Since most solutions of the population for next generation always come from nondominated solutions on the first PF, it is difficult to distinguish superior solutions and inferior solutions. Consequently, the performance of algorithms may worsen since their search ability are noticeably deteriorated. Second, it is difficult to make a good balance between convergence and diversity with the increase of spatial dimensions [12]. Third, how to reasonably allocate resources and reduce computational cost to improve algorithm performance becomes a challenge task as the number of objectives increases.

To solve the above problems, MaOEAs are proposed for MaOPs, most of which are the extension and improvement of traditional mainstream MOEAs. The main improvement methods can be summarized as follows.

1) Pareto-dominance-based MOEAs is extended to further balance the convergence and diversity in the evolutionary process. Three kinds of improvements can be observed, that is, evolution strategies, elite selection mechanism and diversity maintenance mechanism. For the first case, Lin et al. [13] proposed multiple search strategies to accelerate convergence and maintain population diversity. For the second case, the Pareto dominance relationship definition and the ranking methods of solutions are modified to solve MaOPs, such as g-dominance [14] and r-dominance [15] based on reference point, $\epsilon $-dominance [16] based on grid system design and fuzzy dominance with fuzzy logic [17, 18]. The dominance relations mentioned above are the relaxed form of Pareto dominance relation, which can more easily dominate solutions in a high-dimensional objective space. For the last case, Deb and Jain [19] proposed a reference-point-based many-objective NSGA-II, namely NSGA-III. NSGA-III uses a set of evenly distributed reference vectors to maintain diversity and modifies niche strategies simultaneously. Although the loss of selection pressure caused by Pareto dominance cannot be solved directly, a set of well-spread reference vectors is beneficial for balancing the convergence and diversity. Grid-based evolutionary algorithm (GrEA) [20] employs the grid adaptive strategy to enhance the selection pressure while ensuring uniform distribution among solutions. Knee point-driven evolution algorithm (KnEA) [21] employs knee points as a secondary selection criteria to strengthen the selection pressure. As the representative of the compromise solution, the knee points can accelerate convergence but have the tendency to prefer local optimum.

(2) The decomposition- and indicator-based MOEAs are improved to reduce the dimension and complexity of MaOPs. Three kinds of improvements are proposed, that is, decomposition strategies, reduction of redundant objectives and preference space, specific indicator-based MaOEAs. For the first case, the mainstream methods that decompose MaOP into SOPs include weighted sum approach, Tchebycheff approach and penalty-based boundary intersection approach, which have been fully discussed in [6]. Similarly, there are good prospects to explore the way in which MaOP is decomposed into MOPs. For the second case, Pal et al. [22] proposed a differential evolution using clustering based objective reduction algorithm (DECOR), which employs clustering method to eliminate minor objectives. However, it is difficult to reduce objectives in the real-world applications, since objective reduction methods only rely upon a relative order importance between the objectives [17]. For more problems, even if the number of objectives is fully reduced, it is also difficult to use PF in the low-dimensional objective space to portray the true PF in the original high-dimensional objective space. MaOEA based on objective space reduction and diversity improvement (MaOEA-RD) algorithm [23] incorporated the decomposition approach to reduce the objective space, which is more effective for specific PF. For the last case, the specific indicator-based EAs are developed for solving MaOPs, such as hypervolume-based MaOEA [7], pure diversity indicator (PD) [24] and IGD indicator-based MaOEA [25].

(3) Hybrid methods that combine several methods, such as Pareto dominance and performance indicators, are developed to pursue the promising convergence and diversity instead of a compromise between them. A representative example is a two-stage strategy (TS). TS divides the evolution into two stages, the convergence is considered in the former stage, while the diversity is addressed in the latter stage. In other words, there is no longer any compromise search tendency during the two-stage evolution, since the balance that is oriented toward which one will be adjusted by the separation control mechanism. Although this method is conducive to adjusting the trend of balance to satisfy the requirements of different MaOPs, most TSs cannot guarantee that the separated properties do not affect each other. Typical algorithms such as MaOEA-IT [26] should not only address the balance between convergence and diversity, but also reduce the complexity of MaOPs.

The above two-stage evolution studies provide an effective way to address MaOPs and bring a promising prospect to the domain. Nevertheless, as highlighted in [26], further research is still needed since the proposed methods cannot be effective in solving some MaOPs. In addition, the change of a single property in two sequential stages may influence the balance between convergence and diversity due to their conflicting nature. This observation greatly motivates us to exploit more effective control strategies for separated balance. Inspired by the theory of Kondratiev waves [27], a novel MaOEA with two-stage framework, termed TS/KW-MaOEA, is proposed in this paper. In the proposed TS/KW-MaOEA, the appropriate enhancement Pareto method is used to guide the search direction in each stage, which improves one feature of convergence and diversity without deteriorating the other. Furthermore, inspired by Kondratiev waves, the proposed algorithm can effectively deal with the problems caused by the lack of selection pressure in the high-dimension objective space and significantly alleviate the conflict between convergence and diversity in two stages. In conclusion, the contributions of TS/KW-MaOEA are highlighted as follows.

(1) A novel two-stage strategy model for dealing with MaOPs is proposed, which has dynamic periodicity similar to Kondratiev waves. The originally independent two stages alternate with each other according to the evolution process and the number of objectives. TS/KW model can promote the information transfer between different stages and weaken the negative effect on balance caused by the change of a single property.

(2) Two enhancement dominance methods that match the stage features are proposed, where the combination of decomposition approach and Pareto dominance is conducive to separation control. In the first stage, the hybrid dominance method is used to maintain population diversity without deterioration in convergence-dominated circumstances, which propels the obtained solutions to effectively approximate the true PF. In the second stage, the grid dominance method is employed to provide dynamic selection pressure.

(3) Two environmental selection components that match the dominance relation are proposed, which aims to maintain population diversity in the first stage, explicitly improves population diversity in the second stage, and reduces the computational complexity of the algorithm.

(4) This paper discusses the influence of different components in TS on algorithm performance, especially the importance of the first stage in TS. In addition, this paper studies the parameter settings of related components and search behavior, and gives rational explanations on advantages and deficiencies of TS/KW model.

The remaining of this paper is organized as follows. Section “Related works” reviews the related works and introduces the motivation of the proposed TS/KW-MaOEA. Section “Proposed algorithm” discusses TS/KW-MaOEA in detail. To evaluate the performance of TS/KW-MaOEA, a series of experiments is carried out and discussed in Section “Experimental results and discussion”. Finally, the conclusion and future works are given in Section “Conclusion”.

Related works

In this section, we briefly introduce some basic components related to our work, as well as their advantages and deficiencies.

Hybrid dominance

Pareto-dominance- and decomposition-based MOEAs have shown significant effects in dealing with simple MOPs. However, the former can hardly provide sufficient selection pressure for solutions to converge to the true PF due to its compromise nature, while the latter can hardly maintain the distribution of solutions due to its diversity characteristics. The two methods interact violently due to their failure to be effectively separated during the search, hence, they cannot work effectively for solving complex MOPs or MaOPs. Meanwhile, Pareto-dominance- or decomposition-based MOEAs usually employ distance metric or angle metric. However, with the increase of dimensions, the distribution of solutions becomes more complex and tends to be orthogonal, which significantly reduces the effectiveness of the methods. Therefore, many improvement strategies have been developed to make up for deficiencies. Zhang et al. [28] employed the information feedback model, which uses the information in the previous iteration to update the current individual. Yi et al. [29] proposed an array of improved crossover operators to improve the performance of the NSGA-III algorithm. Sun et al. [30] proposed an improved memetic algorithm (IMOMA-II), which uses the increment of the hypervolume to develop an activation strategy for every local search procedure.

The dominance relation that combines Pareto dominance and decomposition-based approach is termed as hybrid dominance. The hybrid dominance method decomposes the complex problem into a number of simple subproblems to reduce its computational complexity. In addition, the hybrid dominance method can guarantee the effectiveness of the combination metric of distance and angle, which is conductive to separated balance control. Research shows [19] that hybrid dominance method has a promising potential for higher-dimensional problems. Some examples are MOEA/D-D [31] and RP-dominance-based (NSGA-II-RPD) [32]. In the NSGA-II-RPD, the reference-point dominance (RPD) is designed. First, a set of uniformly distributed reference vectors is defined in the objective space [19, 33]. Then, the projection distance and vertical distance of the solution with respect to the reference vector are calculated by formulas (2) and (3). Each solution is associated with the reference vector with the minimum vertical distance.

$${d}_{1}\left(\mathbf{x}\right)=\frac{\Vert f{\left(\mathbf{x}\right)}^{T}w\Vert }{\Vert w\Vert }$$

(2)

$${d}_{2}\left(\mathbf{x}\right)=\Vert f\left(\mathbf{x}\right)-{d}_{1}\left(\mathbf{x}\right)\frac{w}{\Vert w\Vert }\Vert $$

(3)

where f(x) is an objective vector of a solution x, and w is a reference vector. d₁(x) and d₂(x) represent the projection distance and vertical distance of the solution x with respect to the reference vector w, respectively, as shown in Fig. 1.

The RP-dominance relation (RPD) is defined as follows.

Definition 1 (RPD):

For two solutions u and v, u is said to dominate v, denoted by $u{\prec }_{RP}v$ if one of the following conditions holds true.

(1)
$u\prec v$ or.
(2)
$u$ and v are equivalent in terms of Pareto.
1. (a)
  u and v are associated with the same reference vector, and ${d}_{1}\left(u\right)<{d}_{1}\left(v\right)$ or.
2. (b)
  u and v are associated with the different reference vector, ${d}_{1}\left(u\right)<{d}_{1}\left(v\right)$ and $SD\left(u\right)<SD\left(v\right)$.

where SD(u) represents the density of the subregion u belongs to, that is, the number of solutions associated with the reference vector of u.

RPD uses a set of uniformly distributed reference points to construct a strict partial order on a set of nondominated solutions [32]. However, the adverse effects caused by standardization operation are not considered. In addition, the dominance rule based regional density only considers the convergence measure d₁ and ignores the diversity measure d₂, which easily leads to the sparse distribution of the solution set.

Grid dominance

The grid-based Pareto-dominance relation is termed as grid-domination relation (GRD). A grid has the characteristics of reflecting the distribution of solutions based on coordinates [20]. The convergence of solutions can be estimated by the coordinate difference of solutions. The grid-domination system has taken shape in ϵ-MOEA and expanded in GrEA. GRD divides the objective space into a group of hyperboxes with the same size and assigns coordinates to each solution by formulas (4)–(7).

$${lb}_{k}={fmin}_{k}-\left({fmax}_{k}-{fmin}_{k}\right)/\left(2\times div\right)$$

(4)

$${ub}_{k}={fmax}_{k}+\left({fmax}_{k}-{fmin}_{k}\right)/\left(2\times div\right)$$

(5)

$${d}_{k}=\left({ub}_{k}-{lb}_{k}\right)/div$$

(6)

$$ G_{k} \left( {\mathbf{x}} \right) = \left\lfloor {\left( {f_{k} \left( {\mathbf{x}} \right) lb_{k} } \right)/d_{k} } \right\rfloor $$

(7)

where ub_k and lb_k represent the upper and lower bounds of the grid in the kth objective, respectively. fmax_k and fmin_k represent the maximum and minimum values of the solution in the kth objective, respectively. div represents the number of divisions of each objective dimension, while d_k represents the width of the hyperbox in the kth objective. G_k(x) and f_k(x) denote the coordinate and the function value of the solution x in the kth objective, respectively. Figure 2 shows an example of a grid setting in the kth objective.

RGD is defined as follows.

Definition 2 (RGD):

For two solutions u and v, u is said to dominate v, denoted by $u{\prec }_{GP}v$ if one of the following conditions holds true.

(1)
$u\prec v$ or.
(2)
$u$ and v are equivalent in terms of Pareto. For $\forall i\in \left(\text{1,2},\dots ,m\right)$ such that ${G}_{i}\left(u\right)\le {G}_{i}\left(v\right)$, and there is $j\in \left(\text{1,2},\dots ,m\right)$ such that ${G}_{j}\left(u\right)<{G}_{j}\left(v\right)$.

In addition, the coordinate difference is converted into grid difference (GD) by formula (8). The larger the div, the smaller the size of a unit hyperbox and the greater the difference between solutions.

$$GD\left(u,v\right)=\sum_{k=1}^{M}\left|{G}_{k}\left(u\right)-{G}_{k}\left(v\right)\right|$$

(8)

Although the grid-dominance relation achieves better results in dealing with MaOPs, the method based on hyperbox relies on data structure with exponential growth. In addition, since the grid system increases extra computation to deal with the solutions in the hyperbox, the computational cost is high.

Kondratiev wave

The statistical research concludes that Kondratiev wave is the pattern of long run fluctuations in economic growth [27]. As shown in Fig. 3, the cycle marked by T₁ includes four periods from t₁ to t₄, namely the recovery, prosperity, recession and depression of economic development, and the economic law follows this cycle.

Kondratiev waves are driven by technological innovations. The dividend brought by the new technology triggered the economic balance from depression to recovery, however, the saturation of dividends and the stagnation of technology have broken the balance, making the economy turn from prosperity to recession [34]. The Economics of Keynes advocates macro-control in the period of economic downturn. In short, the waves with opposite influence caused by discontinuous superposition can counteract or moderate the extent of recession.

In MaOEAs, convergence and diversity conflict with each other during the evolution [10, 35], and thus the solution with good convergence usually has poor diversity. In the second stage of TS, the adjustment method focusing on diversity often deteriorates the convergence. Similar to the Kondratiev waves cycle, TS-MaOEAs involve parameters that control resource allocation during the evolution. The parameters determine when the search enters the second stage, and then affect the balance of convergence and diversity. However, it is difficult to determine the computational resources required at each stage, and improper allocation will lead to performance degradation. Thus it can be seen that there are great commonalities between TS-MaOEAs evolution process and Kondratiev waves. It is feasible to study the evolution behavior of TS-MaOEAs with the assistance of Kondratiev waves.

Proposed algorithm

In this section, the TS/KW-MaOEA framework and its components are introduced, where the control mechanisms for separated balance are described in detail. To ensure the compatibility and robustness of TS/KW-MaOEA, the control mechanisms for separated balance can be summarized into the following three levels. In the first level, the equilibrium state of the solution set is separated into the convergence state and the distribution state, and the corresponding compromise search tendency is separated into convergence first and diversity first. Stages with different search tendencies are carried out alternately through TS/KW model. In the second level, the stages with different search tendencies employ appropriate dominance rules, selection criteria and maintenance strategies, which are designed in the dominance relation, mating selection components and environmental selection components. In the third level, the measurement values that correspond to the stage characteristics are used to represent the convergence state and distribution state at different stages.

The framework of TS/KW-MaOEA is outlined in Section “Framework of TS/KW-MaOEA.” Section Enhanced dominance relations describes the two enhanced dominance relations in detail. Finally, mating selection component, environmental selection component and Pareto-optimal subspace learning strategy are discussed in Sections 3.3 to 3.6.

Framework of TS/KW-MaOEA

Similar to NSGA-II, the framework of the proposed TS/KW-MaOEA is outlined in Algorithm 1, which consists of the following three steps. First, the Das and Dennis’s method is used to generate a set of reference vectors $W=\left\{{w}_{1},{w}_{2},\dots ,{w}_{N}\right\}$ with the same size as the population P. Meanwhile, an archive A is initialized for storing the nondominated solution obtained in the first phase (lines 1–4). Second, the current stage S is calculated with the TS/KW model (line 6). The construction of the TS/KW model will be described in the experiment in Section IV-E. Subsequently, the binary tournament selection, simulated binary crossover [36] and the polynomial mutation [37] operators are employed to generate the offspring U’ (lines 7–8). In the first and second stages, the mating selection component employs d₂ metric and SD metric, which are mentioned in Section Hybrid dominance, as the secondary criteria for selection, respectively. Finally, the solutions that are propagated to the next generation are selected from the union of the parent population and the offspring. The final solution set is obtained through continuous iteration (lines 9–12). In the first and second stages, the environmental selection component uses the truncation method and the allocation method as the elimination strategy, respectively. In addition, the archive A is used to learn the Pareto-optimal subspace in the second stage. An efficient nondominated sorting (ENS) approach [38] is used in the proposed TS/KW-MaOEA.

Enhanced dominance relations

(1) Subregion hybrid Dominance relation

Since the penalty-based boundary intersection (PBI) approach can obtain a set of Pareto optimal solutions that approximate the PF very well, the PBI-based subregion hybrid dominance relation (PHD) is introduced to control the convergence independently in the first stage of TS. Mathematically, PBI approach is in the form

$$PBI\left(\mathbf{x}\right)={d}_{1}\left(\mathbf{x}\right)+\theta {d}_{2}\left(\mathbf{x}\right)$$

(9)

where d₁(x) and d₂(x) are defined in Eqs. (2)–(3). θ is a penalty parameter, which can control the search direction and affect the convergence process [39]. When the θ is small, the search emphasizes the convergence, otherwise the diversity is emphasized. However, most MaOEAs usually set the parameter θ to 5 to maintain a compromise search tendency. In view of the balance tendency in the first stage, θ is set to 1 in this paper. PHD is described as follows, where $\left(\overline{\cdot}\right)$ denotes the normalization.

Definition 3 (PHD):

: For two solutions u and v, u is said to dominate v, denoted by $u{\prec }_{PH}v$ if one of the following conditions holds true.

(1) $u\prec v$ or.

(2) $u$ and v are equivalent in terms of Pareto.

a) u and v are associated with the same reference vector in the case of normalization.

$\overline{PBI }\left(u\right)<\overline{PBI }\left(v\right)$
$\overline{PBI }\left(u\right)>\overline{PBI }\left(v\right)$, u and v are associated with the same reference vector in the case of standardization; $PBI\left(u\right)<PBI\left(v\right)$, ${d}_{1}\left(u\right)<{d}_{1}\left(v\right)$ and $\overline{{d }_{1}}\left(u\right)<\overline{{d }_{1}}\left(v\right)$.

b) u and v are associated with the different reference vector in the case of normalization. However, u and v are associated with the same reference vector in the case of standardization; $\overline{PBI }\left(u\right)<\overline{PBI }\left(v\right)$ and $PBI\left(u\right)<PBI\left(v\right)$; $\overline{{d }_{1}}\left(u\right)<\overline{{d }_{1}}\left(v\right)$, ${d}_{1}\left(u\right)<{d}_{1}\left(v\right)$ and $\overline{SD }\left(u\right)<\overline{SD }\left(v\right)$.

As shown in Fig. 4, since the scale of each objective is different, the distribution of the solution and reference vector has changed in the case of standardization and normalization. In the Fig. 4a, the reference vectors with standardization are uniformly distributed in the entire objective space, while they are not necessarily uniformly distributed on the entire PF. As shown in Fig. 4b, the normalized solution set has the same span for each objective. However, the reference vectors are not uniformly distributed in the entire objective space, which is more likely to lead to the non-uniform distribution of solutions on the PF. Therefore, d₁ and SD are different in the case of standardization and normalization, and the dominance relation between solutions are also different [39, 40].

PHD follows the dominance rules of normalization first and standardization second. For two normalized solutions u and v that are associated with the same reference vector, the solution with smaller $\overline{PBI }$ is superior in comparison. To ensure the accuracy of the results, it is necessary to judge whether they are associated with the same reference vector in the case of standardization. If so, PBI values are further compared. To ensure better convergence, the inferior solution in the above cases should meet the requirements of the smaller PBI, d₁ and $\overline{{d }_{1}}$ are used to reverse its disadvantage position. Similarly, for two normalized solutions u and v that are associated with different reference vectors, strong selection pressure is required to distinguish them. The superior solution should meet the requirements of the smaller $\overline{PBI }$, PBI, $\overline{{d }_{1}}$, d₁ and $\overline{SD }$, which can avoid the deterioration of the diversity of the superior solution.

(2) Dual grid dominance relation

Since the grid can explicitly depict the distribution of solutions, the second stage of TS uses a dual grid dominance (DGD) relation to conduct separated control on diversity. DGD eliminates the comparison between the solutions inside the hyperbox to reduce the computational cost. Coordinate parameters of solution x, $GP\left(x\right)=\left\{{G}_{1}\left(x\right),{G}_{2}\left(x\right),..{G}_{m}\left(x\right)\right\}$, are obtained from formulas (4)–(7). A solution is assigned a level according to the coordinate parameters, and the second level grid consists of the hyperboxes with the same level.

Figure 5 shows an example of two-dimensional objective space. The first level distinguishes a pair of Pareto equivalent solutions in a similar way to GRD. For instance, the solution in hyperbox C can dominate the region marked by the dotted line. The sum of coordinate parameters of hyperbox is calculated by $GS\left(x\right)={G}_{1}\left(x\right)+{G}_{2}\left(x\right)+\dots +{G}_{m}\left(x\right)$. The solutions in the same hyperbox share the GS value. For example, for any solution x in hyperbox A, $GS\left(x\right)=7$. The hyperboxes at the same level in the second grid are labeled as connected regions with the same color and share the Level value. For instance, for any solution x in hyperbox D and E, $Level\left(x\right)=4$.

In view of the above-mentioned description, DGD is defined as follows.

Definition 4 (DGD):

For two solutions u and v, u is said to dominate v, denoted by $u{\prec }_{DG}v$ if one of the following conditions holds true.

(1) $u\prec v$ or.

(2) $u$ and v are equivalent in terms of Pareto.

(a) In the first level grid, $u{\prec }_{GR}v$.

(b) u and v are equivalent in terms of the first level grid; $GS\left(u\right)<GS\left(v\right)$ and $Level\left(u\right)<Level\left(v\right)$.

As shown in Fig. 5, any couple of solutions from hyperboxes A and C are equivalent in terms of Pareto, and the advantages and disadvantages of the two are hard to distinguish with GRD. Here, the square of distance metric δ² is used to indicate the convergence state of the solution, where δ is the Euclidean distance from the ideal point to the hyperbox. core_δ² denotes the convergence of the central solution of the hyperbox. max_δ² and min_δ² represent the maximum and minimum convergence of the solution inside the hyperbox, respectively. Then, the convergence of hyperboxes A and C is calculated respectively, that is, $coer\_{\delta }^{2}\left(A\right)=22.5$ and $coer\_{\delta }^{2}\left(C\right)=12.5$. Hyperbox C is significantly better than hyperbox A. From a solution perspective, ${min\_\delta }^{2}\left(a\right)=17$ and $max\_{\delta }^{2}\left(c\right)=18$, where a and c are the solutions in the hyperboxes A and C, respectively. Since the worst case of the solution in hyperbox C is almost the same as the best case of the solution in hyperbox A, there is a high probability that the solution from hyperbox C has better convergence than the solution from hyperbox A. In the case of sufficiently small hyperbox, each hyperbox can hold one solution at most, then hyperbox C has faster convergence than hyperbox A. For simplicity, the GS value is used to represent the convergence of the solution in the hyperbox. For instance, $GS\left(C\right)=6$ and $GS\left(A\right)=7$. Therefore, hyperbox C is better than hyperbox A in terms of convergence. It is worth noting that GS implicitly protects the hyperbox along the border while representing the convergence state. For example, the convergence of hyperboxes C and E is $core\_{\delta }^{2}(C)=core\_{\delta }^{2}(E)=12.5$, and $max\_{\delta }^{2}\left(c\right)=18$, $max\_{\delta }^{2}\left(e\right)=17$, $min\_{\delta }^{2}\left(c\right)=9$, $min\_{\delta }^{2}\left(e\right)=8$. It can be seen from the results that the convergence of hyperboxes C and E is almost the same. However, $GS\left(E\right)=5$ and $GS\left(C\right)=6$, due to the grid characteristics, the hyperbox E along the boundary wins the comparison, which is inconsistent with the result that the solution in hyperbox C is equivalent to the solution in hyperbox E in a large probability.

When comparing a couple of approximate Pareto equivalent solutions, since the GS metric overestimates the solution in the hyperbox along the boundary, the Level metric is used in the second level grid to complement the GS metric. The second level grid layers the hyperbox horizontally according to δ². The closer the hyperbox is to the ideal point, the smaller its Level value is, and the better the hyperbox convergence at this level is. In the comparison of a couple of approximate Pareto equivalent solutions, the Level metric may highly evaluate the hyperbox in the inner layer, which can guarantee the convergence and implicit loose diversity representation. Therefore, according to the DGD relation, hyperbox C can dominate hyperbox A and is equivalent to hyperbox E. For other simple cases, for example, hyperbox B can dominate hyperbox A.

Compared with GRD, DGD has simple structure and less computation, but it does not compare Pareto equivalent solutions within the same hyperbox. If the hyperbox is sufficiently small, each hyperbox can hold one solution at most. Accordingly, there are no two Pareto equivalent solutions in the same hyperbox. The number of hyperbox can be formed as

$$div=\alpha \cdot \left(1+\frac{\Vert {PF}_{1}\Vert }{\Vert {P}_{t}\Vert }\right)$$

(10)

where $\Vert \cdot \Vert $ denotes the number of vectors. α is the threshold factor related to the objective number m. PF₁ denotes nondominated solutions on the first PF, and P_t is the population at generation t. Considering the number of reference points with m objectives ($m\in [\text{3,15}]$) generated by the Das and Dennis method, α is set to $2m(m-2)$ in this paper. Clearly, formula (10) indicates that the number of hyperboxes can be controlled within the range [$\alpha ,2\alpha $]. After the above operations, Pareto equivalent solutions still exist in the same hyperbox, so the environmental selection components based on the decomposition method and allocation principle are used to select these solutions. The details are presented in Section III-D.

Mating selection

In TS/KW-MaOEA, the mating selection component uses the binary tournament, which follows convergence measure (CM) first and diversity measure (DM) second. The steps of mating selection is shown in Algorithm 2, which is composed of three steps: (1) initialize mating pool (line 1) and (2) different nondominated sorting methods are selected according to the evolution state (lines 3–9). For the first stage that focuses on convergence, the efficient nondominated sorting with PBI-subregion hybrid dominance (PHD-ENS) is used, where the nondominated rank representing convergence is stored in CM (line 4), and the d₂ metric representing diversity is stored in DM (line 5). For the second stage that focuses on diversity, the efficient nondominated sorting with dual grid dominance (DGD-ENS) is used, in which the nondominated rank representing convergence is stored in CM (line 7), and the SD metric representing diversity is stored in DM (line 8). $DM\left(x\right)$ represents the metric value obtained by normalizing the solution x and the associated reference vector. 3) Solution a and solution b are randomly selected from the current set P for comparison, and the solution with small CM value wins in comparison (lines 12–15). If the CM values of a and b are the same, the solution with smaller DM value wins in comparison (lines 17–20). Otherwise, one of the two is randomly selected as the winner (lines 22–26). The winner is put into the mating pool, and the comparison operation is repeated until the mating pool is saturated.

The mating selection component continues to consider diversity on the premise of ensuring convergence, and selects the appropriate dominant method for the search operation at each stage. The PHD method is conducive to selecting solutions that can effectively promote population convergence without deteriorating its distribution state, while the DGD method can select solutions with good diversity from approximate Pareto equivalent solutions. In addition, the d₂ metric and SD metric utilized in the components can be reused, which can replace the crowding-distance to reduce the consumption of computational resources.

Environmental selection

The environmental selection component in TS/KW-MaOEA uses the diversity maintenance strategy including truncation mode and allocation mode, and the diversity metric is used to replace the crowding-distance. The steps of environmental selection operation are shown in Algorithm 3. First, different nondominated sorting methods are selected according to the evolution state (lines 3–10). In the first stage that focuses on convergence, PHD-ENS is used as a nondominated sorting method. The nondominated rank representing convergence is stored in $CM^{\prime}$ (line 4), and the d₂ metric representing diversity is stored in $DM^{\prime}$ (line 5). In the second stage that focuses on diversity, DGD-ENS is used as a nondominated sorting method. The nondominated rank representing convergence is stored in $CM^{\prime}$ (line 8), and the cosine similarity value representing diversity is stored in $DM^{\prime}$ (line 9). $DM^{\prime}(x)$ represents the metric value of solution x and the associated reference vector in the case of normalization, and $DM^{\prime}\left(x,{w}_{j}\right)$ represents the metric value of solution x and the specified reference vector w_j in the case of normalization. Third, the solutions in each PF are added into the solution set P in the order of their rankings until no more solutions can be accommodated (lines 11–14). Finally, in the first stage, if the size of solution set P is larger than N, P is truncated in the ascending order of d₂ metric (lines 15–19). The nondominated solutions in the first PF are copied into the archive A (line 20). In the second stage, the cosine similarity of each solution and its specified reference vector w_j is calculated, and each reference vector is assigned a solution with the maximum cosine similarity.

In the second stage, the environmental selection component in TS/KW-MaOEA employs the allocation method based on cosine similarity metric to maintain diversity, that is, each reference vector is assigned a solution closest to it. Since the reference vector is the same size as the solution set, the closer the assigned solution is to the reference vector, the more the solution can maintain a similar uniform distribution with the reference vector. Figure 6 shows an example of solution selection based on cosine similarity metric.

In Fig. 6, solutions a and b with the same d₂ metric are on the right side of the reference vector w₁, while solutions c and d with the same d₂ metric are on the left side of the reference vector w₂. If d₂ metric and the truncation method are used, solutions b and d in the inner PF will be reserved. In the above cases, due to the small distance between the reserved solutions on the opposite side and the large distance between the solutions on the same side, it is easy to cause dense regions and sparse regions, which is not conductive to diversity maintenance. If the allocation method based on cosine similarity metric is employed, solutions a and c will be reserved, which can significantly improve the distribution state of solution set. However, the solutions with better convergence in the inner PF are eliminated, which will deteriorate the convergence of solutions.

Figure 7 shows an example of environmental selection process at different stages. As shown in Fig. 7a, in the first stage of focusing on convergence, since the environmental selection component focuses on preventing the deterioration of diversity, the traditional truncation method is used. In the second stage of focusing on diversity, since the environmental selection component aims to improve the distribution of solutions, the allocation method is used. As shown in Fig. 7b, the allocation method can provide opportunities for eliminated solutions that use the truncation method to enter the next generation. However, some solutions may far away from the true PF. To address this problem, a Pareto optimal subspace learning strategy is used before the nondominated sorting, which aims to find subregions that approximate the true PF and reduce the regions outside the subregions to compensate for the loss of convergence. In addition, the alternate execution of the two stages used in TS/KW-MaOEA can effectively alleviate the adverse effects on convergence caused by environmental selection in the second stage.

Pareto-optimal subspace learning

The principal component analysis approach (PCA) used in TS/KW-MaOEA aims to learn the Pareto optimal subspace. The execution process can be roughly divided into the following three steps, as shown in Algorithm 4. First, the archive A is used to construct the sample matrix K of PCA, and the threshold value ϵ is initialized. Each row of matrix K represents a solution x, and each column represents a component of the solution set. The second step is to conduct principal component analysis (lines 4–9). Specifically, the eigenvalue $V=\left\{{v}_{1},{v}_{2},\dots ,{v}_{M}\right\}$ of $K{\prime}$ is calculated, and then the principal component is determined based on the subscript and the threshold ϵ. Finally, the mean value is used to replace the upper and lower bounds of non-principal components. The space bounded by the new upper and lower bounds is called the Pareto optimal subspace (lines 10–14).

The Pareto optimal subspace learning strategy can locate the region where the true PF exists and reduce the region outside the located region. On the one hand, computational resources are concentrated on potential regions. On the other hand, it compensates for the convergence loss caused by the environmental selection component in the second stage. In addition, since both DGD and allocation strategies require more computational resources, the Pareto optimal subspace learning strategy uses the PCA dimensionality reduction method with low computational complexity to reduce the computational cost in the second stage.

Experimental results and discussion

To evaluate the performance of the proposed algorithm TS/KW-MaOEA in solving MaOPs, a series of experiments is performed against six state-of-the-art MaOEAs.

Experimental setting

(1) Benchmark Test Problems: Two popular test problems, DTLZ test suite [41] and WFG test suite [42], are used in the experiments. The DTLZ test suite is widely used as a benchmark for testing MOEA performance. The WFG test suite consists of a set of difficult benchmark test problems that involve deceptive problems. Compared with DTLZ test suite with separable variables, WFG test suite is more complex and difficult to deal with. The parameter settings for DTLZ1-DTLZ7 and WFG1-WFG8 are shown in Tables 1, 2, respectively.

Table 1 Settings of test problems DTLZ1-DTLZ7

Two-stage many-objective evolutionary algorithm: enhanced dominance relations and control mechanisms for separated balance

Abstract

Similar content being viewed by others

A Two-phase evolutionary algorithm framework for multi-objective optimization

Many-Objective Evolutionary Algorithm Based on Dominance and Objective Space Decomposition

Hybrid selection based multi/many-objective evolutionary algorithm

Introduction

Related works

Hybrid dominance

Definition 1 (RPD):

Grid dominance

Definition 2 (RGD):

Kondratiev wave

Proposed algorithm

Framework of TS/KW-MaOEA

Enhanced dominance relations

Definition 3 (PHD):

Definition 4 (DGD):

Mating selection

Environmental selection

Pareto-optimal subspace learning

Experimental results and discussion

Experimental setting

Influence of PBI parameter on equilibrium

Influence of normalized correction strategy on distribution

Comparison of the stage one in TS

Construction of the TS/KW Model

Results on the DTLZ and WFG Suite

Conclusion

Data availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation