1 Introduction

A min-max optimisation problem minimises, with respect to a vector \(\mathbf {d}\) defined in some space D, the maximum value of a given cost function with respect to a second vector \(\mathbf {u}\). In this paper we are interested in a class of constrained min-max problems where the constraints have to be satisfied for all values of \(\mathbf {u}\) in a given set U.

Min-max problems appear in a wide range of applications. They can be found in the computation of the optimal sequence of moves for a computer player in famous board games: the ancient Egyptian game Seega (Abdelbar et al. 2003), the game of checkers (Hughes 2005) and chess (Shannon 1950). They are used to formulate the design for robustness of systems affected by uncertainty: the design of electric circuits (Agnew 1981), of on-line controllers (Sebald and Schlenzig 1994), of aerodynamic shapes (Ong et al. 2006) and of sensor locations, as well as pursuit-evasion games between missile and target (Kim and Tahk 2001).

Due to their wide applicability, min-max problems appear in different forms. They can be seen as a special case of bi-level optimisation problems where two agents quantify their utility with opposite fitness functions. From a game-theoretic point of view, min-max problems are two-player zero-sum games where the optimal strategy brings the two agents to a Nash equilibrium. From an engineering point of view, min-max problems can be seen as deterministic approaches to decision making under uncertainty. In this case the engineering problem is a zero-sum game where the two antagonistic players are, respectively, the designer, handling the decision variables, and Nature, handling the uncertainty variables.

Mathematical Programming (Agnew 1981; Chaney 1982) has been widely used to solve min-max problems. An overview of classical discrete approaches can be found in Aissi et al. (2008). The use of Mathematical Programming, however, requires strong assumptions on the nature of the problem and tends to be problem specific.

On the other hand, heuristic approaches (Cramer et al. 2009; Lung and Dumitrescu 2011; Cavalier et al. 2007), and in particular evolutionary algorithms (Cramer et al. 2009; Lung and Dumitrescu 2011; Laskari et al. 2002; Zhou and Zhang 2010), appear to be more flexible. For example, one can find a form of Genetic Algorithm (GA) in Cramer et al. (2009), a special version of Differential Evolution (DE) called Crowding Differential Evolution (CrDE), combined with the Nash ascendancy relation, in Lung and Dumitrescu (2011), and a form of Particle Swarm Optimisation (PSO) in Laskari et al. (2002).

As stated in Zhou and Zhang (2010), evolutionary approaches for min-max problems can be divided into three branches. The first one includes problems with a discrete set of possible scenarios U. In Laskari et al. (2002) U is by definition discrete and small; in Cavalier et al. (2007) an initially continuous U is discretised on a uniform grid, while in Cramer et al. (2009) it is reduced to a discrete set by random selection. The second branch considers a continuous space U and directly solves a sequence of nested problems (Agnew 1981; Sebald and Schlenzig 1994). This approach can become computationally intractable if no strategy is implemented to alleviate its cost. Finally, a third category uses a form of co-evolutionary optimisation (Kim and Tahk 2001; Cramer et al. 2009; Tahk and Sun 2000; Shi and Krohling 2002) where two separate populations evolve simultaneously with a predator-prey interaction, one in D and one in U, each being competitor and environment to the other. As demonstrated in Jensen (2001), however, most co-evolutionary algorithms for min-max have to satisfy a symmetry assumption known as Isaacs' condition (Tamer Basar 1982). A class of problems exists that can be properly modelled under this assumption (Tahk and Sun 2000; Barbosa 1999), and for these the co-evolutionary approach is proven to converge to the global solution (Power 1998). This is not the general case, however, and methods to overcome this limitation are presented in Branke and Rosenbusch (2008).

In the interest of completeness, we also include the Best Replay approach cited by Marzat et al. (2013), where two optimisation problems are alternated until convergence, one minimising over D and the other maximising over U. It has been proven, however, that this approach often does not converge, or cycles through wrong design candidates, a problem known as the Red Queen Effect.
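The Red Queen Effect is easy to reproduce on a toy bilinear game. The sketch below is illustrative only (the function, grids and the helper name `best_replay` are not from the paper): it alternates full best replies over discretised D and U, and on \(f(d,u) = du\) it oscillates between the corners of the domain instead of converging to the min-max solution \(d=0\).

```python
import numpy as np

def best_replay(f, d0, u0, bounds_d, bounds_u, n_iter=6):
    """Alternate a full minimisation over D (u fixed) and a full
    maximisation over U (d fixed); this may cycle instead of converging."""
    d, u, history = d0, u0, []
    grid_d = np.linspace(*bounds_d, 201)
    grid_u = np.linspace(*bounds_u, 201)
    for _ in range(n_iter):
        d = grid_d[np.argmin(f(grid_d, u))]   # best reply of the designer
        u = grid_u[np.argmax(f(d, grid_u))]   # best reply of "Nature"
        history.append((d, u))
    return history

# f(d, u) = d*u on [-1, 1]^2: the min-max solution is d = 0, but
# best replay oscillates between opposite corners (Red Queen Effect).
hist = best_replay(lambda d, u: d * u, 1.0, 1.0, (-1, 1), (-1, 1))
```

The cycle is visible directly in `hist`: the same pair of corner points repeats forever, so no termination criterion based on stagnation of the iterates would be reached.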

Another research line, applicable to any of the previously presented methods, deals with the reduction of the computational complexity. It makes use of surrogates which approximate the fitness function (Lung and Dumitrescu 2011; Zhou and Zhang 2010; Marzat et al. 2013). In particular, Zhou and Zhang (2010) and Marzat et al. (2013) consider response surfaces within an Efficient Global Optimisation (EGO) framework, where they alternate the minimisation in D of the surrogate of the maxima in U with the maximisation in U of the real function for the optimal solution obtained in D.

When it comes to constraint handling, only a few papers in the existing literature have explored how to deal with constraints in min-max optimisation. Most of them start from strong assumptions on the nature of the constraints and cost functions, and have been developed for constrained bi-level problems rather than specifically for min-max problems (Sinha et al. 2020, 2018). In Kim and Tahk (2001) the duality between a primal constrained min-max problem and a dual unconstrained min-max problem with Lagrange multipliers is demonstrated, for both separable and non-separable constraints, under Isaacs' condition. The constrained problem is then translated into the unconstrained one and a co-evolutionary algorithm is used to converge to the saddle point. Other works, still dealing with both min-max problems and constraint handling, go in the opposite direction: they translate a single-level constrained problem into an unconstrained min-max problem and use the co-evolutionary approach to solve it (Tahk and Sun 2000; Shi and Krohling 2002).

This paper proposes a novel algorithm for the solution of a class of constrained min-max problems. The proposed algorithm is inspired by the procedure suggested by Shimizu and Aiyoshi (1980) and further elaborated in Zhou and Zhang (2010) and Marzat et al. (2013). The idea is to alternate a minimisation and a restoration (or maximisation) process. As the alternation of these two processes progresses, we incrementally build a discrete representation of the space of the maxima in U. This representation is then used in the minimisation process to converge to an approximation of the desired min-max solution. This mechanism keeps memory of previous solutions, preventing the optimiser from following the same path again and again; for this reason the proposed algorithm does not suffer from the Red Queen Effect. The main novelty is the extension of this logic to the handling of constraints that need to be satisfied for all maxima in a given set. The paper also proposes a constraint relaxation and a scalarisation technique that allow convergence to a solution when the given set U is too restrictive. A multi-population inflationary differential evolution algorithm is proposed to address both the constrained minimisation and the maximisation.

The paper is structured as follows. First we introduce the constrained min-max problems of interest. We then present the general algorithm together with a complexity analysis. The paper then introduces a benchmark of synthetic functions of increasing complexity. Finally, the proposed method is tested on a real application case of space systems engineering under epistemic uncertainty.

2 Problem statement

This paper is concerned with the following class of single-objective constrained min-max problems:

$$\begin{aligned} \begin{array}{l} \min _{\mathbf {d} \in D} \max _{ \mathbf {u} \in U} f(\mathbf {d},\mathbf {u})\\ s.t.\\ c_i(\mathbf {d},\mathbf {u})\le 0 \quad \forall \mathbf {u} \in U, \forall {i} \in I_c = [1,...,s]^T \end{array} \end{aligned}$$
(1)

where f is the objective function and \(c_i\) is the i-th constraint function. Both f and \(c_i\) are defined on the space \(D \times U\) and depend on a vector of design (or decision) variables \(\mathbf {d} \in D\subset \mathcal {R}^n\) and a vector of uncertain (or environmental) variables \(\mathbf {u} \in U\subset \mathcal {R}^m\). The solution \(\mathbf {d}_{\text {opt}}\) of Eq. (1) has two properties: it satisfies all the constraint functions \(c_i\) over the whole uncertain domain U and minimises the worst realisation of the objective function f over U. Furthermore, we assume that both f and \(c_i\) are locally \(\mathcal {C}^2\). This assumption can be relaxed if the local search can handle non-differentiable or discontinuous problems.
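For low-dimensional instances, problem (1) can be approximated by brute force on grids over D and U, which is useful to fix ideas before the algorithm is presented. The sketch below uses a hypothetical one-dimensional toy instance (functions `f` and `c` are illustrative, not from the paper): it retains only the designs that are feasible for every sampled \(\mathbf {u}\) and then minimises the sampled worst case of f.

```python
import numpy as np

# Hypothetical 1-D instance of problem (1): minimise over d the worst
# case of f over u, subject to c(d, u) <= 0 for every u in U.
f = lambda d, u: (d - 1.0)**2 + d * np.sin(3.0 * u)
c = lambda d, u: u - d          # feasible only for d >= max(U) = 1

D = np.linspace(0.0, 2.0, 401)  # discretised design space
U = np.linspace(0.0, 1.0, 401)  # discretised uncertainty space

# Keep only designs feasible for ALL sampled uncertain scenarios,
# then minimise the sampled worst case of the objective.
feasible = [d for d in D if np.max(c(d, U)) <= 0.0]
d_opt = min(feasible, key=lambda d: np.max(f(d, U)))
```

On this toy instance the feasible set is \(d \ge 1\) and the worst case of f grows with d there, so the grid search returns \(d_{\text{opt}} \approx 1\). The cost of this naive approach is the product of the two grid sizes, which is exactly what the algorithm of Sect. 3 is designed to avoid.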

3 A memetic single objective constrained min-max approach

The proposed algorithm is a bi-level optimisation procedure based on the alternation of a minimisation and a restoration step. The minimisation step searches for a global solution to the constrained min-max problem:

$$\begin{aligned} \begin{array}{l} \min _{\mathbf {d} \in D} \max _{ \mathbf {u}_{af} \in \bar{A}_{uf}} f(\mathbf {d},\mathbf {u}_{af})\\ s.t.\\ \max _{\mathbf {u}_{ac} \in \bar{A}_{uc}} \max _{ i \in I_c} c_i(\mathbf {d},\mathbf {u}_{ac})\le 0. \end{array} \end{aligned}$$
(2)

The restoration step then searches for a solution to the following two global maximisation problems, given the solution \(\bar{\mathbf{d }}\) of problem (2):

$$\begin{aligned}&\begin{array}{l} \max _{\mathbf {u} \in U} f(\bar{\mathbf{d }},\mathbf {u}) \\ s.t.\\ \max _{ i \in I_c} c_i(\bar{\mathbf{d }},\mathbf {u})\le 0 \end{array} \end{aligned}$$
(3)
$$\begin{aligned}&\max _{\mathbf {u} \in U} \max _{i \in I_c} c_i(\bar{\mathbf{d }},\mathbf {u}) \end{aligned}$$
(4)

The two archives \(\bar{A}_{uf}\) and \(\bar{A}_{uc}\) contain respectively the solutions of problem (3) and (4). By iteratively alternating the minimisation and restoration steps, one fills the two archives with the maxima of problems (3) and (4). Thus we can say that problem (2) searches for an optimal \(\mathbf {d}\) over a discrete representation of the space of the maxima of problems (3) and (4).
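The alternation can be sketched in a few lines. The following grid-based toy is illustrative only (the helper name `minmax_with_archives` and the test functions are not the paper's implementation, and the global optimisers of problems (2)–(4) are replaced by exhaustive search over small grids); it shows how the archives grow and how the minimisation of problem (2) only ever sees the archived scenarios.

```python
import numpy as np

def minmax_with_archives(f, c, D, U, n_loops=10):
    """Sketch of the minimisation/restoration alternation: A_uf and A_uc
    collect the maximisers of problems (3) and (4), and the minimisation
    of problem (2) runs over these discrete archives only."""
    A_uf, A_uc = [], []
    d = D[0]                                    # first design guess
    for _ in range(n_loops):
        # Restoration: worst feasible f (prob. 3), worst violation (prob. 4)
        feas = [u for u in U if c(d, u) <= 0.0]
        if feas:
            A_uf.append(max(feas, key=lambda u: f(d, u)))
        A_uc.append(max(U, key=lambda u: c(d, u)))
        # Minimisation (prob. 2) over the archived scenarios only
        cand = [dd for dd in D
                if max(c(dd, u) for u in A_uc) <= 0.0]
        if not cand:
            break       # no feasible design: relaxation (Sect. 3.4) would trigger
        d = min(cand,
                key=lambda dd: max((f(dd, u) for u in A_uf),
                                   default=-np.inf))
    return d, A_uf, A_uc

# Illustrative toy functions, not from the paper's benchmark
f = lambda d, u: (d - 1.0)**2 + 0.1 * u
c = lambda d, u: u - d
d_opt, A_uf, A_uc = minmax_with_archives(f, c,
                                         np.linspace(0.0, 2.0, 21),
                                         np.linspace(0.0, 1.0, 11))
```

Note that each outer minimisation evaluates f and c only on the archived scenarios, so the cost per candidate design grows with the archive size rather than with the size of U.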

This mechanism was first proposed for unconstrained problems by Shimizu and Aiyoshi (1980) and Marzat et al. (2013). A solution approach using a memetic algorithm was then introduced in Vasile (2014) and the constrained version was first formulated in Filippi and Vasile (2019). In this paper we provide five main contributions: (i) a detailed algorithmic presentation of a memetic solution approach, (ii) a constraint relaxation strategy, (iii) a scalarisation strategy, (iv) an algorithmic complexity analysis and (v) an extensive benchmark on which solvers of constrained min-max problems can be tested.

The solution approach is summarised in Algorithms 1–8 and explained in the following subsections with the help of the flow diagrams in Figs. 1 and 2 and the examples in Figs. 3, 5 and 6.

Fig. 1

Flow diagram of the alternative constrained min-max approaches described in Sects. 3.1, 3.2 and 3.4–3.7 and summarised in Algorithms 1–8. In particular, this diagram describes the sequence of optimisation problems for the standard approach and its alternative strategies: constraint relaxation and scalarisation. The first optimisation problem is defined in block A and refers to the general constrained min-max problem in Eq. (1). If a solution exists for this problem, no further analysis is required (link 1). Otherwise, two alternatives are given. The first (link 2) leads to the relaxation strategy (block B). Here an unconstrained min-max problem is solved to find \(\epsilon \), and the relaxed constrained min-max problem is then solved (Sect. 3.4). Link 3 instead leads to the scalarisation strategy (Sect. 3.5). Block C is first solved to find the reference points of the Pareto front (ideal and nadir). Finally, the scalarisation procedure is activated (block D)

Fig. 2

Flow diagram of the constrained min-max algorithm for the standard approach in block A in Fig. 1. The relaxation and scalarisation alternatives (blocks B, C and D) follow the same logic. Algorithms 1, 2, 4, 7 and 8 and Sects. 3.1, 3.2, 3.6 and 3.7 explain in detail all the represented blocks. Each block lists the main operations that are performed: the optimisation problems and the archive updates. The algorithm is first initialised; then the 'Restoration' and 'Minimisation' blocks are iterated; finally the 'Cross Check' is performed and the solution is chosen

3.1 Initialisation

The algorithm is initialised either by selecting a random value \(\bar{\mathbf {d}}\) or with a given first guess, and then by searching for a first solution to problems (3) and (4). These first solutions are saved into the two archives \({A}_{uf}\) and \({A}_{uc}\). Problems (3) and (4) are, in general, global optimisation problems. In the following we propose the use of a memetic global optimiser which combines Differential Evolution with a multi-restart mechanism and local search. Given the global nature of the search, one needs to set the amount of computational resources allocated to the solution of each of the minimisation and maximisation problems. We quantify this resource in terms of objective function evaluations. More specifically, with reference to problem (3), the maximum number of function evaluations \(n_{\text {feval,max}}\) is the number of calls to the objective function \(f(\bar{\mathbf {d}},\mathbf {u})\), while for problem (4) we count the calls to \(\max _i c_i(\bar{\mathbf {d}},\mathbf {u})\). Once the maximum number of function evaluations per sub-problem is defined, one needs to define the maximum number \(n_{\text {loop,max}}\) of minimisation/restoration iterations.

The steps of the initialisation are summarised in Algorithm 1. Line 1 specifies the information required to define the problem solved by Algorithm 2: the objective function \(f (\mathbf {d}, \mathbf {u})\), the set of constraint functions \(\mathbf {c} (\mathbf {d}, \mathbf {u})\), the dimensions \(n_D\) and \(n_U\) of the design and uncertain vectors \(\mathbf {d} \in D\subset \mathcal {R}^{n_D}\) and \(\mathbf {u} \in U\subset \mathcal {R}^{n_U}\), and the bounds of each variable in \(\mathbf {d}\) and \(\mathbf {u}\). Line 2 defines the constraints on the computational resources for the single sub-problems in Eqs. (2) – (4): the maximum number of function evaluations \(n_{\text {feval,max}}^{\text {outer}}\) for problem (2), \(n_{\text {feval,max}}^{\text {inner}}\) for problem (3) and \(n_{\text {feval,max}}^{\text {inner},c}\) for problem (4). Line 3 defines the threshold \(\sigma _{\text {stop}}\) below which Algorithm 2 is considered to have converged. Line 4 defines the limit on the computational cost of the whole procedure (1): the maximum number of function evaluations \(n_{\text {feval,max}}\) and the maximum number of iterations \(n_{\text {loop,max}}\) between minimisation and restoration. Line 5 defines the optimisation algorithms, and their parameter settings, that are used to solve the sub-problems in Eqs. (2) – (4). On line 6 the initial design vector is defined by the user or initialised randomly with a hyper-cube sampling procedure. If some important uncertain scenarios in U or design configurations in D are already known, the initial archives can be seeded with them; otherwise they are initialised as empty sets (line 7). The same is done for the archives relative to the functions f and \(\mathbf {c}\) (line 8). Lines 9 and 10 initialise the counters of the number of function evaluations \(n_{\text {feval}}\) and of the iterations \(n_{\text {loop}}\) between minimisation and restoration. The initial accepted constraint violation \(\epsilon \) (line 11) is initialised to zero.

Fig. 3

Algorithm 2 applied to test case MWP10&GFc1. Sub-figure a shows the test case's characteristics by plotting both the objective function f and the constraint function c. Sub-figure b shows the convergence of both the worst case conditions \(\max f\) and \(\max c\) for the design solutions found at each iteration. Sub-figures c and d plot the functions c and f, respectively, in the coupled space \(D \times U\), all the design solutions at the different iterations (vertical lines) and the corresponding worst cases for f and c (dots and stars). The white areas correspond to feasible solutions \(c \le 0\) in (c) and infeasible solutions \(f \text { s.t. } c > 0\) in (d). Sub-figure e represents, for each explored design configuration, the corresponding f (continuous lines) and c (dotted lines). Sub-figure f represents the space of the maxima of f and c over the design space

3.2 Minimisation-restoration loop

The main algorithm is summarised in Algorithm 2. Using the first design guess \(\bar{\mathbf {d}}\) from the initialisation, Eqs. (3) and (4) are solved in parallel (lines 4 and 5) within the first restoration, or inner, loop (lines 2 – 8). With \(\bar{\mathbf {d}}\) fixed, the former equation evaluates the feasible worst case condition of f while the latter determines the worst constraint violation. In the following we adopt a multi-population version of Inflationary Differential Evolution (Di Carlo et al. 2019) to solve Eqs. (3) and (4). In this case multiple solutions are returned. The solution \(\mathbf {u}_{af}\) is selected following Algorithm 3. In particular, if none of the outputs from the optimiser is feasible (no feasible element in any of the populations), the uncertain vector with the lowest constraint violation is chosen (lines 2 – 3 of Algorithm 3). Otherwise, within the set of feasible solutions, the uncertain vector with the highest value of f is selected (line 5 of Algorithm 3). The uncertain vector solutions are then stored in the archives as illustrated in Algorithm 4. Consider first lines 9 – 11 of Algorithm 4: the uncertain vector \(\mathbf{u} _{af}\) corresponding to the feasible maximum of f for the fixed \(\bar{\mathbf {d}}\) is saved in \(A_{uf}\), and the corresponding \(f(\bar{\mathbf {d}}, \mathbf {u}_{af})\) and \(\max _{i\in I_c}c_i(\bar{\mathbf {d}}, \mathbf {u}_{af})\) are saved in \(A_f\) and \(A_{cf}\) respectively. Consider now lines 12 and 13 of Algorithm 4: the uncertain vector \(\mathbf{u} _{ac}\) that maximises the constraint violation for the same fixed design vector is saved in \(A_{uc}\), while the corresponding \(\max _{i\in I_c} c_i(\bar{\mathbf {d}}, \mathbf {u}_{ac})\) is saved in \(A_c\).
Finally, on lines 14 and 15 of Algorithm 4, the archives \(\bar{A}_{uf}\) and \(\bar{A}_{uc}\) are created by removing from \(A_{uf}\) and \(A_{uc}\) all the repeated elements (with a tolerance of \(10^{-8}\) on the Euclidean distance between two elements of the archive).
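The archive pruning of lines 14 and 15 can be sketched as follows (the helper name `prune_archive` is hypothetical; only the stated tolerance of \(10^{-8}\) on the Euclidean distance comes from the text):

```python
import numpy as np

def prune_archive(A, tol=1e-8):
    """Remove repeated elements from an archive: two entries count as
    repeated when their Euclidean distance is below tol (Sect. 3.2)."""
    pruned = []
    for u in A:
        if all(np.linalg.norm(np.asarray(u) - np.asarray(v)) > tol
               for v in pruned):
            pruned.append(u)
    return pruned

# The near-duplicate of the first entry is dropped, the distinct one kept
A_bar = prune_archive([[0.5, 1.0], [0.5, 1.0 + 1e-12], [0.7, 1.0]])
```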

The main loop then starts (line 9 of Algorithm 2 and Eq. (2)) with an alternation of the minimisation, or outer, step (lines 10, 11 and 12) and the restoration, or inner, step (lines 13 to 17). The latter has already been described. A further check is, however, performed in lines 1 to 8 of Algorithm 4 before the archives are updated. If the condition in line 2 holds, the inner loop in Eq. (3) has failed: the solution generated in this loop is discarded and the one of the previous outer loop is kept. This condition follows the same criterion used in Algorithm 3, giving priority to constraint satisfaction. The condition in line 6 plays the same role for Eq. (4).

The outer step addresses the solution of the constrained minimisation in Eq. (2). For this minimisation too we propose the use of a multi-population version of Inflationary Differential Evolution (Di Carlo et al. 2019). For each new design vector generated by the optimiser, the cost function and constraints are evaluated for all the \(\mathbf {u}\) vectors in \(\bar{A}_{uf}\) and \(\bar{A}_{uc}\), and the worst cost function and constraint violation values are retained (line 11). Note that in some cases it is desirable to run a local search every time a vector in the two archives \(\bar{A}_{uf}\) and \(\bar{A}_{uc}\) is evaluated (see for example Vasile (2014)). This added local search significantly increases the computational cost of each evaluation of the outer loop but in some cases improves convergence to the point that the overall cost of the algorithm is reduced. For this reason we included this option in the algorithm, although it is not tested in this paper; it was, however, tested for the case of unconstrained min-max problems in previous work by the authors (Vasile 2014). At the end of the minimisation process, the archive \(A_d\) of design configurations is updated with the new design solution \(\bar{\mathbf{d }}\).

If the condition in line 18 of Algorithm 2 holds, the constraint relaxation strategy described in Algorithm 5 is activated. An alternative option, the trade-off strategy in Algorithm 6, is instead activated if the condition in line 21 is true. These alternatives can be visualised in Fig. 1. If the former condition holds, problem A is stopped, the relaxation step in the first block of B is performed to find \(\epsilon \), and the constrained min-max problem is updated as in the second block of B. If instead the second condition holds, block A is stopped and blocks C and D are solved.

The minimisation and restoration steps are alternated while the condition in line 9 is satisfied. In particular, the iteration continues while the number of calls to the function has not reached the maximum allowed (\(n_{\text {feval}} < n_{\text {feval,max}}\)), the number of iterations is below the upper bound (\(n_{\text {loop}} < n_{\text {loop,max}}\)) and the solutions saved in the archives have not converged (\(\max _A(\sigma ) > \sigma _{\text {stop}}\)), where \(\max _A(\sigma )\) is the maximum standard deviation across all the archives over the last three iterations. The violation of at least one of these conditions triggers termination.
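The loop condition could be sketched as follows (the function name, argument layout and threshold value are illustrative; the paper's implementation may differ):

```python
import numpy as np

def keep_iterating(n_feval, n_feval_max, n_loop, n_loop_max,
                   archive_tails, sigma_stop=1e-6):
    """Sketch of the condition in line 9 of Algorithm 2: continue while
    the budgets are not exhausted and the archived solutions still move.
    archive_tails holds, for each archive, its last three values."""
    max_sigma = max(np.std(tail) for tail in archive_tails)
    return (n_feval < n_feval_max
            and n_loop < n_loop_max
            and max_sigma > sigma_stop)
```

With a still-moving archive the loop continues; once every archive has been stationary for three iterations (or a budget is exhausted) the loop exits and the cross-check is performed.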

Then, the cross-check between all the design vectors archived in \(A_d\) is performed (line 26). The cross-check procedure is explained in Sect. 3.6 and summarised in Algorithm 7. Finally, the solution is chosen following Algorithm 8 (line 28).

An example of the application of Algorithm 2 to the two functions MWP10 and GFc1 (see Sect. 4.1), without constraint relaxation and trade-off, can be visualised in Fig. 3. In sub-figures c and d, the initial design guess \(\hat{\mathbf {d}}\) and each new design \(\bar{\mathbf{d }}\) proposed by the minimisation step are represented as vertical lines. The corresponding worst case scenarios for f and \(\mathbf {c}\) are plotted, with the same colour, as dots and stars respectively. Sub-figure e shows sections of f and \(\mathbf {c}\) over U for different design configurations. Sub-figure f shows the space of the maxima of f and \(\mathbf {c}\) as it appears to the minimisation process. In particular, sub-figure c shows that, for this test case, the algorithm is able to find at the first iteration the subspace \(\hat{D} \subset D\) in which the design solutions are feasible for any uncertain scenario. For the first design guess \(\hat{\mathbf {d}}\) (blue line), the worst scenario is found at \(\mathbf {u}=10\). Indeed, in sub-figure b the maximum constraint violation is brought to zero at the second iteration, and all subsequent iterations are used to minimise the worst case of f within the design domain \(\hat{D}\).


3.3 Memetic strategy

When the problems in Eqs. (2) to (4) within the min-max Algorithm 2 are global optimisation problems, we propose the use of the memetic Multi-Population Adaptive Inflationary Differential Evolution Algorithm (MP-AIDEA) (Di Carlo et al. 2019). It combines the Darwinian evolution of a number of populations of candidates through a Differential Evolution (DE) strategy with the Lamarckian evolution of the best agent in each converged population through a local search algorithm. In particular, the local refinement is performed only if the converged solution is not in the basin of attraction of a previously found local minimum. A number of local restarts are then performed before globally restarting.

MP-AIDEA depends on the following parameters: the maximum number of function evaluations \(n_{\text {feval,max}}\), the number of populations \(n_{\text {pop}}\), the number of agents in each population \(N_{\text {pop}}\), the convergence threshold \(\rho \) of DE and the radius of the global restart bubble \(\delta _{\text {global}}\).

An example of a run of MP-AIDEA with \(n_{\text {pop}}=2\) for problem MWP1&GFc1 is shown in Fig. 4, which shows, for the evolving populations, the alternation of DE steps and local refinements. For this test case the MATLAB solver fmincon has been used for the local search. Given the constraint on the maximum number of function evaluations, the algorithm is able to perform only one local restart in each population. Colours red, blue and green refer to the first population, while orange, yellow and brown refer to the second. As shown in the figure, the two populations are first initialised randomly and evolved independently until convergence is achieved (the parameter \(\rho \) being one of the termination criteria). In particular, the red (orange) area represents the evolution of the best agent in the population, the blue (yellow) area the mean value and the green (brown) area the worst agent. From the two converged solutions a local search is performed with fmincon until the green and brown points are obtained. From these two local minima the two populations are then locally reinitialised, and the overall process is repeated until convergence.
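A minimal sketch of the two ingredients follows, assuming a plain DE/rand/1/bin step and Nelder-Mead in place of the gradient-based local search; MP-AIDEA itself also adapts F and CR, manages several populations and the restart bubbles, all omitted here.

```python
import numpy as np
from scipy.optimize import minimize

rng = np.random.default_rng(0)

def de_step(pop, fvals, fun, F=0.8, CR=0.9):
    """One DE/rand/1/bin generation (the Darwinian part, simplified:
    a single population, no adaptation of F and CR)."""
    n, dim = pop.shape
    for i in range(n):
        idx = rng.choice([j for j in range(n) if j != i], 3, replace=False)
        a, b, c = pop[idx]
        mutant = a + F * (b - c)                 # differential mutation
        cross = rng.random(dim) < CR             # binomial crossover mask
        trial = np.where(cross, mutant, pop[i])
        if (ft := fun(trial)) < fvals[i]:        # greedy selection
            pop[i], fvals[i] = trial, ft
    return pop, fvals

def refine_best(pop, fvals, fun):
    """Lamarckian step: local search from the best agent (Nelder-Mead
    here, in place of the gradient-based search used in the paper)."""
    i = np.argmin(fvals)
    res = minimize(fun, pop[i], method="Nelder-Mead")
    pop[i], fvals[i] = res.x, res.fun
    return pop, fvals

# Illustrative usage on the sphere function
sphere = lambda x: float(np.sum(np.asarray(x)**2))
pop = rng.uniform(-5.0, 5.0, (10, 2))
fvals = np.array([sphere(x) for x in pop])
for _ in range(30):
    pop, fvals = de_step(pop, fvals, sphere)
pop, fvals = refine_best(pop, fvals, sphere)
```

The greedy selection inside `de_step` drives the population towards a basin of attraction; the local search then finishes the job far faster than DE alone would.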

Fig. 4

Convergence of MP-AIDEA with two populations in the outer loop of Algorithm 2 for test problem MWP1&GFc1. Coloured areas represent the convergence of the differential evolution steps (different set of colours for different populations) while dots represents optimal solutions of the local search

3.4 Constraint relaxation strategy

The min-max problem proposed in this paper imposes quite stringent conditions, as the constraints need to be satisfied for all possible values of \(\mathbf {u}\in U\). It is therefore possible that no solution \(\mathbf {d}_{\text {opt}}\) is feasible over the whole of U. Since we are interested in the worst case solution for both constraints and objective function, when no feasible \(\mathbf {d}\) exists we introduce an automatic relaxation of the constraints.

In order to find a set \(X\subset D\) that is feasible for all \(\mathbf {u}\in U\), we first solve the following min-max problem:

$$\begin{aligned} \min _{\mathbf {d} \in D} \max _{ \mathbf {u} \in U} \max _{ i \in I_c} c_i(\mathbf {d},\mathbf {u}) \end{aligned}$$
(5)

Problem (5) minimises the maximum violation of the constraints and returns the solution vectors \(\mathbf {d}_{\text {min,}c}\) and \(\mathbf {u}_{\text {min,}c}\). The vector \(\mathbf {d}_{\text {min,}c}\) satisfies the constraint:

$$\begin{aligned} \mathbf {c}_{\epsilon }\le 0 \end{aligned}$$
(6)

for all \(\mathbf {u}\in U\), where:

$$\begin{aligned} \mathbf {c}_{\epsilon }=\mathbf {c}-\epsilon \end{aligned}$$
(7)

and

$$\begin{aligned} \epsilon = \max _{i\in I_c}c_i(\mathbf {d}_{\text {min,}c},\mathbf {u}_{\text {min,}c}) \end{aligned}$$
(8)

The relaxation strategy is detailed in Algorithm 5. If the condition on line 18 of Algorithm 2 is satisfied, Algorithm 5 is triggered. Once in Algorithm 5, while the condition in line 2 holds, problem (5) is solved (see lines 3 – 8 of Algorithm 5) by iterating between the following minimisation

$$\begin{aligned} \min _{\mathbf {d} \in D} \max _{ \mathbf {u}_{ac} \in \bar{A}_{uc}} \max _{i \in I_c} c_i(\mathbf {d},\mathbf {u}_{ac}), \end{aligned}$$
(9)

and restoration step

$$\begin{aligned} \max _{\mathbf {u} \in U} \max _{i \in I_c} c_i(\bar{\mathbf{d }},\mathbf {u}) \end{aligned}$$
(10)

This is an unconstrained min-max formulation where the optimised function is the vector of constraints \(\mathbf {c}\). The solution at convergence is the minimum over D of the worst constraint violation over U. In line 10 this value is assigned to the relaxation parameter \(\epsilon \). In this way the iteration reproduces the solution of Eq. (5), and by means of the relaxation of the constraint violation Eq. (1) is translated into:

$$\begin{aligned} \begin{array}{l} \min _{\mathbf {d} \in D} \max _{ \mathbf {u} \in U} f(\mathbf {d},\mathbf {u})\\ s.t.\\ c_i(\mathbf {d},\mathbf {u})\le \epsilon \quad \forall \mathbf {u} \in U, \forall {i} \in I_c = [1,...,s]^T \end{array} \end{aligned}$$
(11)
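The relaxation can be sketched on a deliberately infeasible toy instance (the functions and grids below are illustrative, not from the paper): \(\epsilon \) is first obtained from the unconstrained min-max of the constraint violation, Eq. (5), and the relaxed problem (11) is then solved.

```python
import numpy as np

# Toy instance where c(d, u) = u - d + 0.5 has no d with c <= 0
# for all u in [0, 1]: the standard problem (1) is infeasible.
c = lambda d, u: u - d + 0.5
f = lambda d, u: d**2 + u

D = np.linspace(0.0, 1.0, 101)
U = np.linspace(0.0, 1.0, 101)

# Problem (5): epsilon is the min over D of the worst violation over U
eps = min(np.max(c(d, U)) for d in D)        # here 0.5, attained at d = 1
# Relaxed problem (11): the constraints become c(d, u) <= eps
feasible = [d for d in D if np.max(c(d, U)) <= eps + 1e-12]
d_opt = min(feasible, key=lambda d: np.max(f(d, U)))
```

By construction the relaxed feasible set is non-empty (it contains \(\mathbf {d}_{\text {min,}c}\)), so the min-max machinery of Sect. 3.2 can proceed on problem (11) exactly as it would on problem (1).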

3.5 Scalarisation strategy

This subsection presents a different point of view, based on the parameter-based scalarisation approach (Kasimbeyli et al. 2019), for the solution of Eq. (1). It is here assumed that the decision maker, who is the final user of this methodology, is able to assign preference weights to the different Quantities of Interest (QoIs) involved in the problem under analysis. The scalarisation approach has been preferred to a direct multi-objective optimisation because, for a given set of weights, it reduces the computational complexity. Indeed, problem (11) can be seen as an Epsilon-Constraint Scalarisation (ECS) (Filippi and Vasile 2020; Haimes et al. 1971) formulation of the bi-objective min-max problem:

$$\begin{aligned} \min _{\mathbf {d} \in D} \left[ \max _{ \mathbf {u} \in U} f(\mathbf {d},\mathbf {u}), \max _{ \mathbf {u} \in U} \max _{i\in I_c} c_i\right] \end{aligned}$$
(12)

Thus one could be interested in a trade-off between f and \(\epsilon \), accepting larger relaxations of the constraints in favour of a better objective. Note that in the context of uncertainty quantification this implies accepting a higher probability of violating the constraints in exchange for a better cost function.

We now introduce the assumption that a preference vector \(\varvec{\omega }=[\omega _f,\omega _c]^T\) can be defined a priori and explain the scalarisation approach summarised in Algorithm 6.

In the first part (lines 2 – 10) the reference points \(c_{\text {ideal}}\), \(f_{\text {nadir}}\), \(f_{\text {ideal}}\) and \(c_{\text {nadir}}\) are calculated. \(c_{\text {ideal}}\) is the minimum (best) over D of the worst case constraint violation in U:

$$\begin{aligned} c_{\text {ideal}} = \min _{\mathbf {d} \in D} \max _{ \mathbf {u} \in U} \max _{i \in I_c} c_i(\mathbf {d},\mathbf {u}) \end{aligned}$$
(13)

and it is equal to the relaxed constraint \(\epsilon \). For the corresponding design vector \(\mathbf {d}_{c\text {-ideal}}\) the worst scenario for the objective function is \(f_{\text {nadir}}\) (lines 3 to 8):

$$\begin{aligned} f_{\text {nadir}} = \max _{\mathbf {u}\in U} f(\mathbf {d}_{c\text {-ideal}},\mathbf {u}) \end{aligned}$$
(14)

In line 9 the unconstrained min-max problem

$$\begin{aligned} f_{\text {ideal}} = \min _{\mathbf {d} \in D} \max _{ \mathbf {u} \in U} f(\mathbf {d},\mathbf {u}) \end{aligned}$$
(15)

is solved to define the best design configuration \(\mathbf {d}_{f\text {-}ideal}\), the one that minimises the worst scenario of the objective function f regardless of the constraint violation. \(\mathbf {d}_{f\text {-}ideal}\) is then used in line 10 to calculate the corresponding worst constraint violation:

$$\begin{aligned} c_{\text {nadir}} = \max _{\mathbf {u} \in U}\max _{ i \in I_c} c_i(\mathbf {d}_{f\text {-ideal}},\mathbf {u}) \end{aligned}$$
(16)
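As an illustrative sketch (not the authors' implementation), the four reference points of Eqs. (13) – (16) can be computed with two min-max solves and two inner maximisations. Brute-force search over finite grids `D` and `U` stands in for the actual optimiser, and all function names are hypothetical:

```python
def worst_constraint(c_list, d, U):
    # max over u in U of the largest constraint violation max_i c_i(d, u)
    return max(max(ci(d, u) for ci in c_list) for u in U)

def reference_points(f, c_list, D, U):
    """Compute c_ideal, f_nadir, f_ideal and c_nadir over finite grids."""
    # Eq. (13): best (minimum over D) worst-case constraint violation
    d_c_ideal = min(D, key=lambda d: worst_constraint(c_list, d, U))
    c_ideal = worst_constraint(c_list, d_c_ideal, U)
    # Eq. (14): worst objective value for the constraint-ideal design
    f_nadir = max(f(d_c_ideal, u) for u in U)
    # Eq. (15): unconstrained min-max of the objective
    d_f_ideal = min(D, key=lambda d: max(f(d, u) for u in U))
    f_ideal = max(f(d_f_ideal, u) for u in U)
    # Eq. (16): worst constraint violation for the objective-ideal design
    c_nadir = worst_constraint(c_list, d_f_ideal, U)
    return c_ideal, f_nadir, f_ideal, c_nadir
```

In the paper the two minimisations are performed by the memetic optimiser rather than by enumeration; the sketch only fixes the order of the four solves.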

An example of the reference points for a generic Pareto front applied to the min-max problem is in Fig. 5, where \(\mathbf {z}^{\text {ideal}} = [f_{\text {ideal}}, c_{\text {ideal}}]\) and \(\mathbf {z}^{\text {nadir}} = [f_{\text {nadir}}, c_{\text {nadir}}]\). The second part (lines 13 – 27) describes the scalarisation procedure. Nadir and ideal points are here used to normalise f and c:

$$\begin{aligned} \bar{f}= & {} \frac{f-f_{\text {ideal}}}{f_{\text {nadir}}-f_{\text {ideal}}} \end{aligned}$$
(17)
$$\begin{aligned} \bar{c}= & {} \frac{\max _{i\in I_c}c_i-c_{\text {ideal}}}{c_{\text {nadir}}-c_{\text {ideal}}}. \end{aligned}$$
(18)
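The normalisation of Eqs. (17) – (18) is a simple affine map; a minimal sketch, with a guard against a degenerate ideal-nadir span (the guard is an assumption, not discussed in the text):

```python
def normalise(value, ideal, nadir):
    """Linearly map `value` from [ideal, nadir] to [0, 1], as in
    Eqs. (17)-(18); if ideal == nadir the map is undefined, so we
    return 0.0 by convention (an assumption of this sketch)."""
    span = nadir - ideal
    return 0.0 if span == 0 else (value - ideal) / span
```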

Three different methods were considered in this study: ECS, Weighted-Sum Scalarisation (WSS) (Gass and Saaty 1955) and Chebychev/Pascoletti-Serafini Scalarisation (CPSS), where the latter is a smooth combination of Weighted Chebyshev Scalarisation (WCS) (Bowman 1976) and Pascoletti-Serafini Scalarisation (PSS) (Bo et al. 2007). A comparison of the different approaches can be found in Filippi and Vasile (2020). The difference in Algorithm 6 is only within the minimisation loop (lines 13 – 22).

The WSS solves the minimisation over the design space D of the weighted sum of the worst case scenarios for f and c in the respective archives (line 18):

$$\begin{aligned} \min _{\mathbf {d} \in D} \big [ \omega _f \max _{ \mathbf {u} \in \bar{A}_{uf}} \bar{f}(\mathbf {d},\mathbf {u}) + \omega _{c} \max _{ \mathbf {u} \in \bar{A}_{uc}} \bar{c}(\mathbf {d},\mathbf {u}) \big ] \end{aligned}$$
(19)
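A sketch of the WSS objective of Eq. (19), with brute-force minimisation over a finite design grid standing in for the DE-based outer loop (helper names are illustrative):

```python
def wss_objective(d, f_bar, c_bar, A_uf, A_uc, w_f, w_c):
    # weighted sum of the worst cases over the two archives (Eq. 19);
    # f_bar and c_bar are the normalised objective and constraint
    worst_f = max(f_bar(d, u) for u in A_uf)
    worst_c = max(c_bar(d, u) for u in A_uc)
    return w_f * worst_f + w_c * worst_c

def wss_minimise(D, f_bar, c_bar, A_uf, A_uc, w_f, w_c):
    # outer minimisation over a finite set of candidate designs
    return min(D, key=lambda d: wss_objective(d, f_bar, c_bar,
                                              A_uf, A_uc, w_f, w_c))
```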

The CPSS solves two different problems. During the DE step of Inflationary Differential Evolution, the algorithm addresses the following problem:

$$\begin{aligned} \min _{\mathbf {d}} \max _i \omega _i \bar{z}_i \end{aligned}$$
(20)

where we have introduced the weight vector \(\varvec{\omega }=[\omega _f,\omega _c]^T\) and the objective vector \(\bar{\mathbf {z}}=[\max _{ \mathbf {u}_{af} \in A_{uf}} \bar{f} \big (\mathbf {d},\mathbf {u}_{af} \big )-z_f,\max _{ \mathbf {u}_{ac} \in A_{uc}} \bar{c}(\mathbf {d},\mathbf {u}_{ac})-z_c]^T\). During the local search step of Inflationary Differential Evolution, instead, the CPSS addresses the following constrained minimisation problem:

$$\begin{aligned} {\left\{ \begin{array}{ll} \min _{\mathbf {d} \in D} t\\ t< 0 \\ \omega _f \big ( \max _{ \mathbf {u}_{af} \in A_{uf}} \bar{f}(\mathbf {d},\mathbf {u}_{af})-z_f \big )< t \\ \omega _c \big ( \max _{ \mathbf {u}_{ac} \in A_{uc}} \bar{c}(\mathbf {d},\mathbf {u}_{ac}) - z_c \big ) < t \end{array}\right. } \end{aligned}$$
(21)

where \(z_f\) and \(z_c\) are the values of \(\bar{f}\) and \(\bar{c}\) respectively for the best solution obtained from the DE step in Eq. (20), from which the local search is initialised. For all three methods, the restoration step solves the two problems:

$$\begin{aligned}&\max _{\mathbf {u} \in U} f(\bar{\mathbf{d }},\mathbf {u}) \end{aligned}$$
(22)
$$\begin{aligned}&\max _{\mathbf {u} \in U} \max _{i\in I_c}c_i(\bar{\mathbf{d }},\mathbf {u}) \end{aligned}$$
(23)

For the given preference vector, the optimal worst case condition for f and the optimal relaxation of the constraint \(\epsilon \) are given, at convergence, by Eqs. (22) and (23).
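The Chebyshev-type objective used in the DE step, Eq. (20), is the larger of the two weighted, reference-shifted worst cases; a sketch (function and variable names are illustrative):

```python
def cpss_de_objective(d, f_bar, c_bar, A_uf, A_uc, w, z):
    """Weighted Chebyshev objective of Eq. (20).
    w = (w_f, w_c) is the preference vector and z = (z_f, z_c) the
    reference point taken from the previous DE step."""
    z_f_bar = max(f_bar(d, u) for u in A_uf) - z[0]   # shifted worst f
    z_c_bar = max(c_bar(d, u) for u in A_uc) - z[1]   # shifted worst c
    return max(w[0] * z_f_bar, w[1] * z_c_bar)
```

The constrained problem of Eq. (21) then pushes both weighted terms below a common slack variable t during the local search.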

Fig. 5

A generic Pareto front for the min-max problem. In this case the functions \(\max f\) and \(\max c\) are considered as conflicting objectives. The ideal \(\mathbf {z}_{\text {ideal}}\), nadir \(\mathbf {z}_{\text {nadir}}\) and utopian \(\mathbf {z}_{\text {utopian}}\) points are represented. They are theoretical points that summarise the extreme behaviour of the solutions in the Pareto front. \(\mathbf {z}_{\text {ideal}}\) is the combination of the best solutions for the different objectives, \(\mathbf {z}_{\text {nadir}}\) represents instead the worst possible combination, and \(\mathbf {z}_{\text {utopian}}\) is finally defined by means of an offset \(\epsilon \) from \(\mathbf {z}_{\text {ideal}}\)

The three possible scalarisation procedures ECS, WSS and CPSS can be used to reconstruct the Pareto front for the minimisation trade-off between the worst case scenarios \(\max f\) and \(\max \mathbf {c}\). Fig. 6 shows a comparison of the three methods, where the combination of the objective function GFf1 in Table 1 and the constraint function GFc1 in Table 2 has been chosen as representative test case. We used MP-AIDEA with the following settings: maximum number of function evaluations \(n_{\text {feval,max}}=3000\), number of populations \(n_{\text {pop}}=2\), number of agents in each population \(N_{\text {pop}} = 5\), dimension of the bubble for the global restart \(\delta _{\text {global}}=0.1\) and DE threshold for each population \(\rho =0.1\). The whole constrained min-max algorithm has then been run until convergence. For ECS the algorithm has been run for 60 different thresholds ranging between 1 and 6.2 (the calculated minimum and maximum constraint violations). For WSS and CPSS, instead, the trigonometric weights

$$\begin{aligned} \begin{array}{l} w_f = \frac{\cos {\theta }}{\cos {\theta }+\sin {\theta }}\\ w_c = \frac{\sin {\theta }}{\cos {\theta }+\sin {\theta }} \end{array} \end{aligned}$$
(24)

have been used, letting \(\theta \) vary from 0 to \(\frac{\pi }{2}\) with a discretisation of 60 intervals as well. Fig. 6 shows that WSS is not capable of finding optimal Pareto points in the non-convex part of the Pareto front (Das and Dennis 1997) and that, using equally spaced weights, it finds non-equally spaced points on the front. The ECS strategy gives better results. The best performance is, however, obtained with the CPSS. Indeed it allows the user to express a preference through the selection of the weights and, as shown in Fig. 6, with the same number of simulations (60 different descent directions for the CPSS and epsilon values for the ECS) it is able to find more Pareto optimal solutions than ECS. For these reasons CPSS was chosen and implemented in Algorithm 2.
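The weight sweep of Eq. (24) can be reproduced as follows (a sketch; the inclusion of both endpoints is an assumption):

```python
import math

def trigonometric_weights(n_intervals):
    """Generate the preference pairs (w_f, w_c) of Eq. (24) for theta
    swept from 0 to pi/2 over n_intervals equal intervals."""
    weights = []
    for k in range(n_intervals + 1):
        theta = (math.pi / 2) * k / n_intervals
        s = math.cos(theta) + math.sin(theta)
        weights.append((math.cos(theta) / s, math.sin(theta) / s))
    return weights
```

By construction each pair sums to one, so the sweep moves the preference smoothly from pure objective (1, 0) to pure constraint (0, 1).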

Fig. 6

Pareto front corresponding to the trade-off between the two conflicting goals \(\max f\) and \(\max c\). The results refer to test case \( GFf1 \& GFc1\). In particular, the \(\epsilon \)-constraint (EC) approach has been applied with different thresholds while the Chebychev/Pascoletti-Serafini (CPSS) and the Weighted-Sum (WSS) scalarisations have been applied with different preference vectors

Algorithm 6

3.6 Cross-check

All the optimisation problems in the min-max algorithm require the identification of a global maximum or a global minimum. Since a memetic algorithm is used, it is possible that some of the maxima or minima in the archive are only locally optimal. Note that the use of a deterministic global optimiser would remove this problem but would introduce a tractability problem due to the potential NP-hard nature of some optimisation problems.

In order to mitigate the occurrence of local minima/maxima in the archives we introduce a cross-check of the solutions following the procedure explained in Algorithm 7. It is performed for each design vector \(\bar{\mathbf {d}}\) proposed by the optimiser during the minimisation step and at the end of the whole algorithm (respectively in lines 9 and 19 of Algorithm 2). Referring to Algorithm 7, lines 1 – 7 concern the objective function f while lines 8 – 14 concern the constraint function \(\mathbf {c}\). In both cases, for a given \(\bar{\mathbf {d}}\), objectives and constraints are evaluated for all the \(\mathbf {u}\) vectors in the archives \(\bar{A}_{uf}\) and \(\bar{A}_{uc}\). We also introduced an option (through the local flag) to run a local search from each new pair [\(\mathbf {\bar{d}}\), \(\mathbf {u}\)]. This option slows down Algorithm 2 but improves the quality of the solution if the functions present nested minima/maxima. Finally, line 15 retains the worst values of f and \(\mathbf {c}\) over the archives \(\bar{A}_{uf}\) and \(\bar{A}_{uc}\) for each \(\bar{\mathbf {d}}\).
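A sketch of the re-evaluation at the core of Algorithm 7, without the optional local-search refinement (function names are illustrative):

```python
def cross_check(d, f, c_max, A_uf, A_uc):
    """Evaluate the candidate design d against every uncertainty
    vector stored in the archives and retain the worst case for the
    objective f and the aggregated constraint violation c_max."""
    u_f = max(A_uf, key=lambda u: f(d, u))       # worst archived u for f
    u_c = max(A_uc, key=lambda u: c_max(d, u))   # worst archived u for c
    return (f(d, u_f), u_f), (c_max(d, u_c), u_c)
```

When the local flag is active, each pair (d, u) would additionally seed a local maximisation before the worst case is retained.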

Algorithm 7

3.7 Selection of output solution

After the termination criterion in Algorithm 2 is applied and the cross-check over the archives is performed (line 21), the solution for the min-max problem is selected following Algorithm 8. In particular, if a feasible subset \(\hat{A}_d\) of the archive \(A_d\) of the design vectors exists (line 1) the selected solution vector is the one, within \(\hat{A}_d\), minimising the worst value of f (line 2). If, on the other hand, \(\hat{A}_d\) is an empty set, the design vector that minimises the constraint violation is selected (line 4).
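A sketch of this selection rule (Algorithm 8), assuming feasibility means a non-positive worst-case constraint violation:

```python
def select_solution(A_d, worst_f, worst_c):
    """Among archived designs, prefer the feasible ones (worst-case
    violation <= 0) and pick the one with the smallest worst-case f;
    if none is feasible, minimise the violation instead."""
    feasible = [d for d in A_d if worst_c(d) <= 0]
    if feasible:
        return min(feasible, key=worst_f)
    return min(A_d, key=worst_c)
```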

Algorithm 8

3.8 Computational complexity

The computational cost of Algorithm 2 is measured in terms of number of function calls. With reference to the minimisation step, the counter \(n_{\text {feval}}^{\text {outer}}\) takes into account, for both the constrained and the unconstrained min-max problems, the calls to \(\max _{\mathbf {u} \in \bar{A}_{uf}} f(\mathbf {d},\mathbf {u}_{af})\) in Eq. (2). The same criterion then holds for the constraint relaxation step in Eq. (9) and for the two trade-off steps in Eqs. (15) and (19). It has to be noted that, as the algorithm proceeds in the search for the global optimum, the archives \(\bar{A}_{uf}\) and \(\bar{A}_{uc}\) of the uncertainty vectors progressively grow in size. Each minimisation step explores a maximum number of possible design configurations which is limited by the input parameter \(n_{\text {feval,max}}^{\text {outer}}\). However, due to the growth of the archives with the solutions coming from the restoration step, each evaluation of the minimisation loop becomes increasingly expensive.

With reference to the maximisation step, instead, the cost of the two separate problems in Eqs. (3) and  (4) has to be considered. For the former \(n_{\text {feval}}^{\text {inner},f}\) counts the number of calls to the objective function \(f(\bar{\mathbf {d}},\mathbf {u}_{af})\) and it is limited by the input \(n_{\text {feval,max}}^{\text {inner},f}\). This holds true also for the two steps in the trade-off strategy in Eqs. (16) and (22) and for the relaxation step in Eq. (10) where the function c is considered instead of f. For the latter, \(n_{\text {feval}}^{\text {inner},c}\) counts the number of calls to \(\max _i c_i(\hat{\mathbf {d}},\mathbf {u})\) in Eq. (4) where the input \(n_{\text {feval,max}}^{\text {inner},c}\) is the upper limit.

Finally, the parameters \(n_{\text {feval,max}}\) and \(n_{\text {loop,max}}\) give an upper limit on the whole cost of Algorithm 2. Note that \(n_{\text {feval,max}}\) and \(n_{\text {loop,max}}\) represent an upper limit because, as it is shown in line 9 of Algorithm 2, we use an additional termination criterion that looks at the convergence of the solutions in the archives.

The computational complexity of the different parts of the overall algorithm is as follows:

  1.

    Local Search: the local search uses the Matlab fmincon function. All the alternative algorithms (interior-point, trust-region-reflective, sqp, sqp-legacy, active-set) can be selected. We use here interior-point that works well with both large sparse and small dense problems. The complexity is \(\mathcal {O}( n^3_D)\) or \(\mathcal {O}( n^3_U)\) depending on which step between minimisation and restoration is considered, where \(n_D\) is the design and \(n_U\) the uncertain vector’s dimension.

  2.

    Adaptation of CR and F: The DE parameters CR and F in MP-AIDEA are auto-adapted for each element of each population. For \(n_{\text {pop}}\) evolving populations with \(N_{\text {pop}}\) agents, the complexity is \(\mathcal {O}(n_{\text {pop}} N_{\text {pop}} n_D^2)\) and \(\mathcal {O}(n_{\text {pop}} N_{\text {pop}} n_U^2)\) for inner and outer problem respectively (Di Carlo et al. 2019).

  3.

    Restart mechanisms: the populations evolve with a differential evolution (DE) approach which is restarted, locally and globally, a number of times. The local restart has a cost proportional to \(n_{\text {pop}} N_{\text {pop}}\). The cost of the global restart, instead, has a component related to the clustering procedure, \(\mathcal {O}( N_{\text {pop}} n_D n_{\text {iter}})\) or \(\mathcal {O}( n_{LM} n_U^2 n_{\text {iter}})\), with \(n_{\text {iter}}\) the required number of iterations for the clustering and \(n_{LM}\) the number of local minima, and a component related to the verification that the new population is far from the clusters, \(\mathcal {O}( N_{\text {pop}} n_{LM})\) (Di Carlo et al. 2019).

  4.

    Outer-Loop: during the minimisation step there is a cost related to the testing of each design vector suggested by the optimiser in combination with all the uncertain vectors saved in the archives \(\bar{A}_{uf}\) and \(\bar{A}_{uc}\), and there is a cost due to the selection of their maximum. In both cases the complexity is \(\mathcal {O}(|| \bar{A}_{uf} || n_{\text {feval,max}}^{\text {outer}} ) \le \mathcal {O}( n_{\text {loop}} n_{\text {feval,max}}^{\text {outer}})\) for the objective function f and \(\mathcal {O}(|| \bar{A}_{uc} || n_{\text {feval,max}}^{\text {outer}} ) \le \mathcal {O}( n_{\text {loop}} n_{\text {feval,max}}^{\text {outer}})\) for the constraint function c.

  5.

    Cross-Check. As in the outer loop there is here a cost for the cross-check and a cost for the selection of the maxima. In both cases it is: \(\mathcal {O} ( || A_{d} || || \bar{A}_{uf} || ) \le \mathcal {O}( n_{\text {loop}}^2 )\) and \(\mathcal {O} ( || A_{d} || || \bar{A}_{uc} || ) \le \mathcal {O}( n_{\text {loop}}^2 )\) for f and c respectively because each design vector \(\mathbf {d} \in A_d\) is considered.

  6.

    Select Solution: after the final cross-check, the archives are updated and the set \(\hat{A}_d\) of design vectors feasible over the whole uncertain domain can be defined. The min-max solution is selected following Algorithm 8, sorting the feasible solutions by f (\(\mathcal {O}(||\hat{A}_d ||)\)) or minimising the constraint violation c (\(\mathcal {O}(||A_d ||)\)).

4 Testing procedure

Algorithm 2 has been tested on the benchmark described in this section. Each test case is a combination of an objective function f and a constraint function c. Depending on the mathematical features of each problem, a local optimiser or a global optimiser has been used for the three problems in Eqs. (2) to (4). The criterion used to choose the right optimiser is explained in Sect. 4.3.

Table 1 Test cases for the objective function f
Table 2 Test cases for the constraint functions c

Given the stochastic nature of MP-AIDEA, each optimisation for each problem has been repeated 100 times. Results are then reported in Sect. 5. For the evaluation of the algorithm’s performance, the Success Rate (SR) is used instead of the best value, mean, and variance. The SR was suggested in Vasile et al. (2011) for a generic minimisation problem and a generic algorithm. It is here generalised to account also for the handling of constraints. The definition of SR is in Sect. 4.2.

Fig. 7

GFf-1 in the case \(n_D=n_U=1\)

4.1 Benchmark

The equations of f and c are listed in Tables 1 and 2 respectively. The constraint functions c are more extensively presented in the following Sects. 4.1.1 – 4.1.10 and visualised in Figs. 7, 8, 9, 10, 11, 12 and 13 for the case \(n_D=n_U=1\). Table 3 lists lower and upper bounds, dimensions and optimal solutions for the unconstrained problems in Table 1. The same solutions hold also for the constrained min-max problems for which the constraint does not change the global optimum. Table 4 presents instead the reference solutions for the constrained min-max problems for which the constraint function changes the position of the global optimum.

MWP-1,2,...,7 are convex-concave test functions taken from chapter 5 of Baxter et al. (2008). Objective functions MWP-8,...,11 were first introduced in Power (1998) and then used also in Cramer et al. (2009), Lung and Dumitrescu (2011), Zhou and Zhang (2010), while MWP-12,13 are instead selected from Cramer et al. (2009), Lung and Dumitrescu (2011), Zhou and Zhang (2010). They have all been used together in Marzat et al. (2013) as a benchmark for the unconstrained min-max problem. Functions GFf-1,2 and GFc-1,2,...,8 have been specifically designed for the testing of Algorithm 2, given the lack of a benchmark in the literature for the constrained version of the min-max problem.

Fig. 8

GFf-2 in the case \(n_D=n_U=1\)

Both f and c are designed to include different structures that can be encountered in practice (Jamil and Yang 2013). In particular, they exhibit the following features:

  1.

    Modality: number of local optima that try to trap the algorithm in the wrong peak.

  2.

    Basin or plateau: a large flat area surrounded by a relatively steep decline. It provides no information to drive the algorithm.

  3.

    Valley: similar to the basin but restricted to a narrow area.

  4.

    Non separability: property related to the coupling between parameters. Non-separable functions are in general more difficult to optimise.

  5.

    Dimensionality: property related to the number of parameters, or dimension, of the problem. The search space grows with the dimension, which also increases the difficulty.

  6.

    Non differentiability: cusps, corners, tangents and discontinuities are features that make functions non-differentiable at some points. Some of the constraint functions present cusps, corners and discontinuities. In particular, a discontinuity is an abrupt change in the function values. Discontinuities are classified as jump, infinite, removable, endpoint, or mixed. Some of the constraint functions c present jump discontinuities.

Fig. 9

GFc-4 applied to MWP-11 in the case \(n_D=n_U=1\). Feasible areas are white

Fig. 10

GFc-5 applied to MWP-11 in the case \(n_D=n_U=1\). Feasible areas are white

MWP-9 is not differentiable. MWP-1,8,9,12,13 are uni-modal in both D and U. On the other hand MWP-4,5,6,7,10,11 are multi-modal in both D and U. MWP-2,3 are multi-modal in D only. The new test cases GFf-1, 2 and GFc-1, ..., 8 are explained in the following. They depend on the components \(d_i\) of the design vector \( \mathbf {d}\), the components \(u_i\) of the uncertainty vector \( \mathbf {u}\) and the combined vector \(\mathbf {x} = [\mathbf {d}, \mathbf {u}]^T\).

4.1.1 GFf-1

GFf-1 is a modification of the Rastrigin function where half of the variables are design parameters and the others are uncertain variables.

$$\begin{aligned} f(\mathbf {d},\mathbf {u}) = 10 ( n_D + n_U ) + \sum _{i=1}^{n_D} \big ( d_i^2 + u_i^2 - 10 \big [ \cos (2 \pi d_i) + \cos (2 \pi u_i ) \big ] \big ) -5 \end{aligned}$$
(25)

It is continuous, differentiable, scalable, without valleys and basins, and highly multi-modal with hundreds of local peaks.
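A direct transcription of Eq. (25), assuming \(n_D=n_U\) and that the closing bracket of the sum precedes the \(-5\) offset (the printed equation leaves the bracketing ambiguous):

```python
import math

def gff1(d, u):
    """GFf-1: a Rastrigin variant split between design variables d and
    uncertain variables u (assumes len(d) == len(u) and that the -5
    offset lies outside the sum)."""
    s = sum(di**2 + ui**2 - 10 * (math.cos(2 * math.pi * di)
                                  + math.cos(2 * math.pi * ui))
            for di, ui in zip(d, u))
    return 10 * (len(d) + len(u)) + s - 5
```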

4.1.2 GFf-2

GFf-2 is a variation of the saddle-point function MWP-8:

$$\begin{aligned} f(\mathbf {d},\mathbf {u}) = \sum _{k=1}^{n} \big [ (d_{k,\mathcal {R}} - 5)^2 - (u_{k,\mathcal {R}} - 5)^2 \big ] \end{aligned}$$
(26)

where both components \(d_{k,\mathcal {R}}\) and \(u_{k,\mathcal {R}}\) are obtained rotating \(d_k\) and \(u_k\) respectively by the angle

$$\begin{aligned} \theta _k = {\left\{ \begin{array}{ll} \pi /8 + (-1)^k d_{k+1}/20 + u_{k+1}/20 &{}\text {if } k<n \\ \pi /8 + (-1)^k d_{k}/20 + u_{k}/20 &{}\text {else} \end{array}\right. } \end{aligned}$$
(27)

GFf-2 is continuous, differentiable, non-separable, scalable, without valleys and basins, and uni-modal.

4.1.3 GFc-1

GFc-1 is a hyper-plane and it is a linear function in both \(\mathbf {d}\) and \(\mathbf {u}\):

$$\begin{aligned} c(\mathbf {d},\mathbf {u}) = \sum _{i=1}^{n} (d_i + u_i) + K \end{aligned}$$
(28)

where \( K = -\sum _i d_{u,i} -\sum _i u_{u,i} -0.05\), with \(d_{u,i}\) and \(u_{u,i}\) the upper bounds for the i-th design and uncertain variables. GFc-1 is continuous, differentiable, separable, scalable, without valleys and basins, and uni-modal.

Fig. 11

GFc-6 applied to MWP-10 in the case \(n_D=n_U=1\). Feasible areas are white

4.1.4 GFc-2

GFc-2 is a modification of GFc-1. It is a continuous piece-wise linear function where the feasible region is a plateau. It is the intersection of two hyper-planes, the second being at the border between the feasible and infeasible regions.

$$\begin{aligned} c(\mathbf {d},\mathbf {u}) = \max \big [ 0, \sum \limits _{i=1}^{n} (d_i + u_i) + K \big ] \end{aligned}$$
(29)

with K as in Eq. (28). GFc-2 is continuous, non differentiable, separable, scalable, with a plateau, without valleys and uni-modal.

4.1.5 GFc-3

GFc-3 presents jump discontinuities, valleys and plateaus. The feasible area is a narrow multidimensional region around the optimal design. The function is non-differentiable, scalable and uni-modal:

$$\begin{aligned} c(\mathbf {d},\mathbf {u}) = {\left\{ \begin{array}{ll} 0 &{}\text {if } \max _{i} (d_i - d_i^{opt}) \le 0.1 \\ 1 &{}\text {else} \end{array}\right. } \end{aligned}$$
(30)
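Eq. (30) transcribed literally; note that the one-sided test (no absolute value) follows the equation as printed:

```python
def gfc3(d, d_opt):
    """GFc-3: jump-discontinuous constraint, feasible (0) only when
    every design component is within 0.1 above its optimal value,
    exactly as Eq. (30) is written (one-sided, no absolute value)."""
    return 0.0 if max(di - oi for di, oi in zip(d, d_opt)) <= 0.1 else 1.0
```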

4.1.6 GFc-4

GFc-4 is a modification of the Rastrigin function where a jump discontinuity is introduced:

$$\begin{aligned} c(\mathbf {d},\mathbf {u}) = {\left\{ \begin{array}{ll} 0 &{}\text {if } \max _{i} (d_i - d_i^{opt}) \le 0.1 \\ 30 n + \sum \limits _{i=1}^{n} [x_i^2 - 30 \cos (2\pi x_i)] - 30 &{}\text {else} \end{array}\right. } \end{aligned}$$
(31)

It is highly multi-modal, discontinuous, not differentiable with valleys and plateaus, separable and scalable.
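Eq. (31) can be transcribed as follows; the combined vector x = [d, u] and the choice n = len(x) are assumptions of this sketch, since the equation does not state which dimension n refers to:

```python
import math

def gfc4(d, u, d_opt):
    """GFc-4: zero inside a band around the optimal design, a
    Rastrigin-like multi-modal penalty elsewhere (Eq. 31)."""
    if max(di - oi for di, oi in zip(d, d_opt)) <= 0.1:
        return 0.0
    x = list(d) + list(u)   # combined vector [d, u] (assumption)
    n = len(x)              # assumed dimension for the 30n term
    return 30 * n + sum(xi**2 - 30 * math.cos(2 * math.pi * xi) for xi in x) - 30
```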

Fig. 12

GFc-7 applied to MWP-10 in the case \(n_D=n_U=1\). Feasible areas are white

4.1.7 GFc-5

GFc-5 is a modification of Eq. (31). Here a rotation of the vectors \(\mathbf {d}\) and \(\mathbf {u}\) is also introduced.

$$\begin{aligned} c(\mathbf {d},\mathbf {u}) = {\left\{ \begin{array}{ll} 0 &{}\text {if } \max _{i} (d_i - d_i^{opt}) \le 0.1 \\ 30 n + \sum \limits _{k=1}^{n} [x_{k,\mathcal {R}}^2- 30 \cos (2\pi x_{k,\mathcal {R}})] - 30 &{}\text {else} \end{array}\right. } \end{aligned}$$
(32)

The rotated components \(d_{k,\mathcal {R}}\) and \(u_{k,\mathcal {R}}\) are given by the rotation angle \(\theta _k = d_k + 2u_k\).

GFc-5 is discontinuous, not differentiable, with valleys and plateaus, scalable, separable and multi-modal.

4.1.8 GFc-6

GFc-6 is a multi-dimensional peak function with high coupling between D and U. It is unfeasible in most of the domain and satisfied only in a few narrow non-linear valleys varying with \(\mathbf {d}\).

$$\begin{aligned} c(\mathbf {d},\mathbf {u}) = \left[ \sum _{i=1}^{2n} (x_i - x_i^{opt}) \sum _i (d_i^2-2u_i) \sum _i (d_i - d_i^{opt}) \right] ^{(2/7)} \end{aligned}$$
(33)
Fig. 13

GFc-8 applied to MWP-10 in the case \(n_D=n_U=1\). Feasible areas are white

It presents very narrow non-linear valleys. It is continuous, locally non-differentiable, without plateaus, scalable, non-separable and multi-modal.

4.1.9 GFc-7

GFc-7 is a multi-dimensional peak function with high coupling between D and U and narrow unfeasible areas varying with \(\mathbf {d}\).

$$\begin{aligned} \begin{aligned}&A = \sum _{i=1}^{n_D} \big ( (3/2\, d_i -(d_{u,i} - d_{opt,i})/2 ) (\sin (d_i)/d_i)-d_{opt,i}/2 \big )^{1/5} \\&\quad \quad +\sum _{i=1}^{n_U} \big ( (3/2\, u_i -(u_{u,i} - u_{opt,i})/2 ) \cdot 1-u_{opt,i}/2 \big )^{1/5} \\&B = \sum _{i=1}^{n_D} \big ( d_{u,i} (\sin (d_{u,i})/d_{u,i})-d_{opt,i}/2 \big )^{1/5} \\&\quad \quad + \sum _{i=1}^{n_U} \big ( u_{u,i} \cdot 1-u_{opt,i}/2 \big )^{1/5} \\&C = (d_1 -d_{opt,1}) ^{2/5} \\&D = (d_{u,1} -d_{opt,1}) ^{2/5} \\&c(\mathbf {d},\mathbf {u}) = 1- AC/BD \end{aligned} \end{aligned}$$
(34)

where \(d_{u,i}\) and \(u_{u,i}\) are the upper bounds for \(d_i\) and \(u_i\) respectively. GFc-7 has very narrow non-linear peaks with large plateaus. It is continuous, locally non-differentiable (due to the cusps), without valleys, scalable, separable and multi-modal in D.

4.1.10 GFc-8

GFc-8 is a rotated version of MWP-8:

$$\begin{aligned} c(\mathbf {d},\mathbf {u}) = \sum _{i=1}^{n} [ (d_{i,\mathcal {R}} - 5 )^2 - (u_{i,\mathcal {R}} - 5)^2 ] \end{aligned}$$
(35)

where \(d_{i,\mathcal {R}}\) and \(u_{i,\mathcal {R}}\) are obtained from \(d_i\) and \(u_i\) respectively with the rotation angle in Eq. (27). It is non-separable, scalable, continuous, differentiable and uni-modal and without valleys and plateaus.

Table 3 Reference solutions for the test cases in Table 1
Table 4 Reference solutions for the test cases in Table 1 with the constraint changing the global optimum

4.2 Success rate

The Success Rate (SR) is adopted here for the performance assessment of Algorithm 2. Its definition is given in Algorithm 9 for a generic algorithm \(\mathcal {A}\) applied to a generic constrained problem CP on the \(D \times U\) space. It is defined as the ratio \(\frac{j_s}{n}\) between the index of performance \(j_s\) and the number of independent experiments n.

First, all the parameters required by Algorithm 2 are fixed (refer to the initialisation in Algorithm 1). The following parameters are then defined: the number of repetitions of the experiments n, and the tolerances \(tol_f\), \(tol_d\) and \(tol_u\) on the solution error for the objective function f, the design vector d and the uncertain vector u respectively. The formula for SR is in line 15. It depends on the tolerances and on the errors \(\delta _c^k\), \(\delta _f^k\), \(\delta _d^k\) and \(\delta _u^k\) with respect to the reference solutions \(f_{\text {ref}}\), \(\mathbf {d}_{\text {ref}}\) and \(\mathbf {u}_{\text {ref}}\) in Tables 3 and 4. In particular, \(\delta _c^k\) depends on the uncertain vector \(\mathbf {u}_{\text {opt},c}\) that is worst for the constraint function c, while \(\delta _f^k\) depends on the vector \(\mathbf {u}_{\text {opt},f}\) that is worst for the objective function f. \(\delta _u\) is necessary to verify the convergence of the maximisation in the inner loop (restoration in Sect. 3) and thus to avoid counting as a success a solution \(f^k_{\text {opt}}\) close to \(f_{\text {ref}}\) that comes from a lucky combination of a wrong maximisation and a wrong minimisation in the outer loop (optimisation in Sect. 3).
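The acceptance test behind the SR can be sketched as follows; the dictionary layout and the exact combination of conditions are assumptions, since the formula itself appears in the algorithm listing rather than in the text:

```python
def success_rate(runs, tol_f, tol_d, tol_u, tol_c=0.0):
    """SR = j_s / n: the fraction of the n independent runs whose
    errors against the reference solution are all within tolerance.
    Each run is a dict with errors delta_f, delta_d, delta_u, delta_c
    (illustrative layout, not the paper's data structure)."""
    j_s = sum(1 for r in runs
              if r['delta_f'] <= tol_f and r['delta_d'] <= tol_d
              and r['delta_u'] <= tol_u and r['delta_c'] <= tol_c)
    return j_s / len(runs)
```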

Algorithm 9

4.3 Algorithm settings

An important feature of the proposed approach is its modularity, in the sense that any optimiser can be plugged in and used for the single optimisation problems in Eqs. (2) – (4). To enhance the efficiency of Algorithm 2, then, the right combination of optimisation solvers should be selected. An optimal choice would require prior knowledge of the main features of a given problem. For complex multi-modal functions, we suggest the use of the memetic optimisation solver MP-AIDEA because it has been shown to be efficient and effective, on average, on a wide range of problems mixing different characteristics. For continuous uni-modal functions we use instead the Matlab fmincon solver with an interior-point scheme. We give here the parameter settings of MP-AIDEA that have been used for all tests. The number of agents for each population \(N_{\text {pop}}\) and the maximum numbers of function evaluations were set respectively to \(N_{\text {pop}}=\max [5, n_D]\), \(n_{\text {feval,max}}^{\text {outer}}=500n_D\), \(n_{\text {feval,max}}^{\text {inner,f}}=500n_U\) and \(n_{\text {feval,max}}^{\text {inner,c}}=500n_U\). The dimension of the bubble for the global restart is \(\delta _{\text {global}}=0.1\), the number of populations is \(n_{\text {pop}}=2\) and the convergence threshold of DE is \(\rho = 0.25\).

5 Results

The results are presented and explained in this section. In particular, four sets of tests have been performed. In the first, Algorithm 2 has been combined with the optimiser fmincon, while in the other cases MP-AIDEA has been used. First we consider one uni-modal problem, assessing the performance of the algorithm as the dimension of the problem increases. Then we consider the worst-case complexity analysis on the benchmark presented in Sect. 4.1, which covers a wide variety of difficulties. A complexity analysis of the algorithm's convergence is then presented for a selected test case at different problem dimensions. Finally we apply Algorithm 2 to solve a realistic engineering problem: the design for robustness of a communication satellite.

5.1 Uni-modal test problem

For the first set of results, the test case used is given by the combination of the objective function GFf-2 and the constraint function GFc-8. They are both continuous, differentiable, uni-modal and non-separable. With these features a local optimiser is sufficient to solve Eqs. (2) – (4) at each iteration. The constraint function c admits only one feasible design vector, which is different from the unconstrained optimum of GFf-2. The local optimiser we used in this test is fmincon. The test functions are devised to be scalable with a predictable value of the exact min-max solution. Results are collected in Table 5 for a number of function evaluations up to 7e6. The table shows that up to dimension \(n_D =40\) and \(n_U=40\) the algorithm can achieve \(SR=1\) within the maximum number of function evaluations. For \(n_D =50\) and \(n_U=50\), 7e6 evaluations are not enough and the best result is a success rate of 30%.

Table 5 Success rates of GFf-2 and GFc-8 for different problem dimensions (rows) and limits on the maximum number of function evaluations (columns). Optimiser: fmincon. \(\delta _d = \delta _u = \delta _f = 0.1, \delta _c = 0\)

5.2 Multi-modal test problems

For the second set of experiments, Tables 6, 7, 8, 9, 10, 11 and 12 collect the results for all the test cases given by the combination of objective functions f from Table 1 and constraint functions c from Table 2. The last two columns of each table, \(n_{\text {iter,min}}\) and \(n_{\text {iter,max}}\), collect the minimum and maximum number of loops for which the algorithm achieves \(SR=1\) (rows with the symbol − correspond to problems for which \(SR=1\) has not been obtained in any of the 100 runs). For almost all the problems Algorithm 2 converges to the correct solution with \(SR=1\) within the maximum number of function evaluations. For some problems (namely GFF1-GFC1, MWP10-GFC4, MWP11-GFC5, MWP11-GFC2) a few of the runs did not converge to the correct minimum of f but the SR is still reasonably high.

Table 6 GFc-1, MP-AIDEA, 2 populations, \(\delta _d = \delta _u = \delta _f = 0.1, \delta _c = 0\)
Table 7 GFc-2, MP-AIDEA, 2 populations, \(\delta _d = \delta _u = \delta _f = 0.1, \delta _c = 0\)
Table 8 GFc-3, MP-AIDEA, 2 populations, \(\delta _d = \delta _u = \delta _f = 0.1, \delta _c = 0\)
Table 9 GFc-4, MP-AIDEA, 2 populations, \(\delta _d = \delta _u = \delta _f = 0.1, \delta _c = 0\)
Table 10 GFc-5, MP-AIDEA, 2 populations, \(\delta _d = \delta _u = \delta _f = 0.1, \delta _c = 0\)
Table 11 GFc-6, MP-AIDEA, 2 populations, \(\delta _d = \delta _u = \delta _f = 0.1, \delta _c = 0\)
Table 12 GFc-7, MP-AIDEA, 2 populations, \(\delta _d = \delta _u = \delta _f = 0.1, \delta _c = 0\)

5.3 Convergence complexity

Sections 5.1 and 5.2 show the performance of Algorithm 2 with respect to the worst-case computational complexity, where an upper bound on the number of function evaluations (\(n_{\text {feval,max}}\)) is fixed for each optimisation step of the approach: \(n_{\text {feval,max}}^{\text {outer}}\), \(n_{\text {feval,max}}^{\text {inner,f}}\) and \(n_{\text {feval,max}}^{\text {inner,c}}\). It is here interesting to show another computational complexity analysis, related to the order of magnitude of the number of function evaluations needed to converge to the optimal solution. In particular, Table 13 summarises the results for the test case \( GFf\text {-}1 \& GFc\text {-}1\). This test case has been selected as representative because it is scalable in both the design space D and the uncertain space U and it is also one of the most difficult within the proposed benchmark. Each row in Table 13 corresponds to a different problem dimension: \(n = n_{D}=n_{U} = 2,3,...,10\). The columns represent the average costs at convergence over 10 repetitions for the constrained minimisation in the outer loop in Eq. (2) (\(\overline{n}_{feval}^{out,f}\)), the constrained maximisation in the inner loop in Eq. (3) (\(\overline{n}_{feval}^{in,f}\)), the maximisation of the constraint function in the inner loop in Eq. (4) (\(\overline{n}_{feval}^{in,c}\)) and finally the average cost of the whole algorithm (\(\overline{n}_{feval}\)). The quantities \(\overline{n}_{feval}^{out,f}\), \(\overline{n}_{feval}^{in,f}\) and \(\overline{n}_{feval}^{in,c}\) have been determined by averaging the sum, over the algorithm's iterations, of the number of function evaluations at convergence. The optimiser MP-AIDEA has been used.
In order to ensure convergence in each optimisation step of Algorithm 2, the number of populations has been set equal to the problem dimension, \(n_{pop}=n\) (the problem is highly multi-modal), and the maximum allowed number of function evaluations of each step has been fixed at \(n_{\text {feval,max}}^{\text {outer}}=n_{\text {feval,max}}^{\text {inner,f}}=n_{\text {feval,max}}^{\text {inner,c}}=10^4\, n\), while for the whole algorithm it is \(n_{\text {feval,max}}=2\times 10^6\, n\). The remaining input parameters for MP-AIDEA have been fixed as in Sect. 4.3.
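As a quick illustration, the dimension-dependent budget settings above can be written as a small helper; the names below are illustrative and not taken from the authors' implementation:

```python
def budgets(n):
    """Evaluation budgets for problem dimension n = n_D = n_U
    (illustrative helper, not from the authors' code)."""
    return {
        "n_pop": n,                 # one population per dimension (highly multi-modal problem)
        "per_step": 10_000 * n,     # n_feval_max for the outer, inner-f and inner-c steps
        "total": 2_000_000 * n,     # n_feval_max for the whole algorithm
    }

for n in range(2, 11):              # the dimensions reported in Table 13
    b = budgets(n)
    print(n, b["n_pop"], b["per_step"], b["total"])
```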

Table 13 Problem \(GFf\text {-}1\)&\(GFc\text {-}1\). Convergence complexity

5.4 Space system design

The min-max approach in Algorithm 2 is finally tested on the design for robustness of a Complex Engineered System (CEdS) under uncertainty. The system under analysis is an observation spacecraft whose mission goal is fire detection within a belt centred at a latitude of 50 deg. The spacecraft is modelled as the network shown in Fig. 14, where the nodes correspond to its subsystems and the links to the couplings between them. The mathematical models used for the nodes are a modification of the ones the authors extensively presented in Filippi et al. (2019). The differences, described in the following, are the explicit definition of a node for the orbital dynamics and a modified payload subsystem. Design and uncertain variables are listed in Tables 14 and 15, respectively.

Fig. 14
figure 14

Representation of the spacecraft as a complex system. The two quantities of interest are the mass M and the percentage of coverage PC of the payload

Table 14 Spacecraft model - design parameters
Table 15 Spacecraft model - uncertain parameters

Within the orbit node, considering a circular Low Earth Orbit (LEO), the altitude h, the inclination i, the minimum elevation angle \(\epsilon _{\text {min}}\) at which the ground station is able to see the orbiting satellite and the magnetic latitude \(lat_{m}\) are used to evaluate the coupling variables with the Attitude and Orbit Control System (AOCS), the Telemetry, Tracking and Command System (TTC), the Power System and the Payload System (Fig. 14). These coupling variables are: the period of each orbit \(P_o\), the number of orbits \(N_o\) the satellite performs due to the shift of the longitude of the ascending node, the time of eclipse for each orbit \(T_{\text {ecl}}\), the dynamic pressure \(p_{\text {dyn}}\), the mean Earth magnetic field strength \(K_m\), the gravitational field \(K_g\), the maximum distance to the target \(D_{\text {max}}\), and the access time \(T_{\text {ac}}\) (or total time in view) between the target and the satellite, where the target is the ground station at 22 deg of latitude used for down-link and up-link. With the exception of \(T_{\text {ac}}\), the formulas can be found in Filippi et al. (2019). For \(T_{\text {ac}}\), considering that the satellite ground track is determined by the inclination i and by the longitude of the ascending node \(L_{\text {node}}\), and that the latter increases by 360 deg in 1436 min (the rotation of the Earth relative to the stars), \(T_{\text {ac}}\) is calculated by summing over the orbits that occur during this period: \(T_{\text {ac}}=\sum _i T_{\text {ac},i}\). Following Wertz et al. (1999), which describes the motion of the satellite as seen from a point on the Earth (the ground station), \(T_{\text {ac},i}\) is evaluated as:

$$\begin{aligned} T_{\text {ac},i} = \frac{P}{180 deg} \arccos \frac{\cos \lambda _{\text {max},i}}{\cos \lambda _{\text {min},i}} \end{aligned}$$
(36)

with \(\lambda _{\text {min},i}\) and \(\lambda _{\text {max},i}\) the minimum and maximum Earth Central Angle for the i-th orbit.
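The access-time computation in Eq. (36) can be sketched as follows. This is an illustrative Python implementation, not the authors' code; the per-orbit Earth central angles are assumed given, and the clamping of the arccos argument is an added numerical guard:

```python
import math

def t_ac_orbit(period_min, lam_min_deg, lam_max_deg):
    """Per-orbit time in view, Eq. (36): T = (P / 180 deg) * arccos(cos l_max / cos l_min),
    with the arccos evaluated in degrees and P the orbit period in minutes."""
    ratio = math.cos(math.radians(lam_max_deg)) / math.cos(math.radians(lam_min_deg))
    ratio = max(-1.0, min(1.0, ratio))  # guard against rounding just outside [-1, 1]
    return period_min / 180.0 * math.degrees(math.acos(ratio))

def t_ac_total(period_min, lam_pairs):
    """Total access time over the repeat period: T_ac = sum_i T_ac,i."""
    return sum(t_ac_orbit(period_min, lo, hi) for lo, hi in lam_pairs)
```

For example, a 100-minute orbit with \(\lambda _{\text {min}} = 0\) deg and \(\lambda _{\text {max}} = 60\) deg yields \(T = (100/180)\cdot 60 = 33.3\) min in view.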

The Payload System is an infrared camera used to detect possible fires; its target is the belt at 50 deg of latitude. Within the payload node, the model's parameters are h (shared with the orbit node), \(P_o\) (coupling parameter), the width d of the square detector, the quality factor Q, the operating wavelength \(\lambda \), the maximum incidence angle of the instrument \(IA_{\text {max}}\) and the maximum ground sampling distance \(Y_{\text {max}}\). The model evaluates the following coupling variables: the data volume DV, shared with the On-Board Data Handling (OBDH), and the power requirement P, shared with the Power System. The model also evaluates the payload mass and the percentage of coverage PC, i.e. the fraction of each orbit during which the payload target is seen. In particular, PC is calculated following Wertz et al. (1999) as a function of \(\lambda _{\text {max}}\), i and the latitude of the target Lat = 50 deg:

$$\begin{aligned} PC = {\left\{ \begin{array}{ll} 0 &{}\text {if } Lat> \lambda _{\text {max}}+i\\ \phi _1/180 &{} \text {if } i+\lambda _{\text {max}}> Lat> i-\lambda _{\text {max}} \\ (\phi _1 - \phi _2)/180 &{} \text {if } i-\lambda _{\text {max}}> Lat >0 \end{array}\right. } \end{aligned}$$
(37)

where

$$\begin{aligned} \cos \phi _1 = \frac{-\sin \lambda _{\text {max}} + \cos i \sin Lat}{\sin i \cos Lat} \end{aligned}$$
(38)

and

$$\begin{aligned} \cos \phi _2 = \frac{\sin \lambda _{\text {max}} + \cos i \sin Lat}{\sin i \cos Lat} \end{aligned}$$
(39)
Fig. 15
figure 15

Optimal Pareto points for the spacecraft design problem in Eq. (40), calculated with Algorithm 2 and applying the EC approach, running 30 optimisations with different thresholds \(\nu \) in the constraint function c. Sub-figure (a) shows the Pareto front representing the tension between \(\max c\) and \(\max f\). Sub-figures (b), (c) and (d) explain the shape of the Pareto front. The most important design parameters driving the trade-off between f and c are the altitude \(H=d_1\) and the inclination \(I=d_3\). Sub-figure (b) shows the increase of altitude (points) and inclination (stars) for the different solutions in sub-figure (a). Sub-figure (c) presents the time in view between the satellite and the ground station for a series of revolutions as a function of the inclination. Sub-figure (d) finally shows the percentage of land covered by the payload

The remaining couplings between nodes are the compressed data volume \(DV^c\) that the OBDH sends to the TTC for down-link to the ground station, and the power requirements P of all the nodes (orbit excluded) that the Power sub-system has to make available. Finally, the global outputs of the network are the overall mass M of the satellite, the sum of the masses of its components, and the percentage coverage PC of the payload target area. In the optimisation framework, M is considered to be the performance indicator while PC is the constraint to be satisfied. This mission design problem is translated into the following constrained min-max problem:

$$\begin{aligned} \begin{array}{l} \min _{\mathbf {d} \in D} \max _{ \mathbf {u} \in U} M(\mathbf {d},\mathbf {u})\\ s.t.\\ PC(\mathbf {d},\mathbf {u})\ge \nu \quad \forall \mathbf {u} \in U \end{array} \end{aligned}$$
(40)
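A minimal sketch of how a problem of the form of Eq. (40) can be attacked numerically. Plain random search stands in for the actual optimiser (MP-AIDEA), the uncertain set is approximated by sampling, and all names are illustrative only:

```python
import random

def minmax_sketch(f, c, d_bounds, u_bounds, nu, n_d=100, n_u=100):
    """Toy constrained min-max in the spirit of Eq. (40): minimise over d the
    worst-case f(d, u), subject to c(d, u) >= nu for all sampled u."""
    rng = random.Random(0)

    def sample(bounds):
        return [rng.uniform(lo, hi) for lo, hi in bounds]

    # Finite sample of the uncertain space; inner max/min are taken over it
    u_pool = [sample(u_bounds) for _ in range(n_u)]

    best_d, best_val = None, float("inf")
    for _ in range(n_d):
        d = sample(d_bounds)
        if min(c(d, u) for u in u_pool) < nu:   # constraint violated for some u
            continue
        val = max(f(d, u) for u in u_pool)      # worst-case objective
        if val < best_val:
            best_d, best_val = d, val
    return best_d, best_val
```

On a toy problem with \(f = (d_1-u_1)^2\) and \(c = d_1 + 2 - u_1\) over \([-1,1]\), the sketch recovers the expected min-max design \(d_1 \approx 0\) with worst-case value close to 1.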

In order to explore the conflict between f and c, the corresponding Pareto front has been reconstructed by applying the main min-max method presented in Algorithm 2 repeatedly, using 30 different values of the threshold \(\nu \) within an EC approach.
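The EC sweep can be sketched as follows; `solve_minmax_toy` is a purely hypothetical closed-form stand-in for one full constrained min-max solve of Eq. (40), used only to illustrate how the front is assembled:

```python
def solve_minmax_toy(nu):
    """Hypothetical stand-in for one constrained min-max solve: the returned
    worst-case mass grows as the coverage requirement nu tightens."""
    if nu > 1.0:
        return None                      # constraint cannot be met by any design
    return 100.0 + 50.0 * nu ** 2        # illustrative worst-case mass M

# Sweep 30 thresholds, as in the EC reconstruction of the Pareto front
front = [(nu, solve_minmax_toy(nu)) for nu in (k / 29.0 for k in range(30))]
```

The resulting list of \((\nu , \max M)\) pairs is non-decreasing in mass: tightening the constraint can never reduce the worst-case objective.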

The results for Eq. (40) are shown in Fig. 15, for which the optimiser MP-AIDEA has been used with the settings specified in Sect. 4.3. In particular, Fig. 15a presents the Pareto front reconstructed for the different values of \(\nu \), while the shape of the front can be understood by looking at Fig. 15b, c, d. There are indeed two different geographical targets on the Earth for the defined mission: the ground station (22 deg of latitude) used for up-link and down-link by the TTC, and the area that has to be monitored by the payload for possible fire detection (50 deg of latitude).

These two targets are quantified in the respective nodes' models by \(T_{\text {ac}}\) and PC, the latter being the constraint function c and the former having a high impact on the final mass M of the overall spacecraft, which is the objective function f. The most influential design parameters with regard to PC and \(T_{\text {ac}}\) in the trade-off within the set of optimal Pareto points are the altitude \(h=d_1\) and the inclination \(i=d_3\). Figure 15b shows their optimal values while moving along the front, while Fig. 15c, d show the corresponding values of \(T_{\text {ac}}\) and PC. For low values of \(\nu \) in the constraint function (left side of the Pareto front), the design solution selects the orbit inclination that maximises the amount of time available for the link between the spacecraft's antenna and the ground station. This configuration reduces the overall mass M at the expense of the capacity to detect fires (PC). As \(\nu \) increases, the solutions become sub-optimal with respect to the time in view in Fig. 15c, while maximising the covered area in Fig. 15d.

6 Conclusion

The paper has presented a new algorithm for the solution of a class of constrained min-max problems. This class of min-max problems emerges naturally from the need to make robust decisions under uncertainty when constraints need to be satisfied for all realisations of the uncertain variables. The method is based on the alternation of a minimisation and a restoration step. This scheme is fairly optimiser agnostic, and we demonstrated its applicability even when a simple gradient search is used. For the case in which the min-max solution requires the global exploration of a complex solution space, we have proposed the use of a memetic approach based on Inflationary Differential Evolution. Our complexity analysis has revealed that the algorithm is overall of polynomial complexity with maximum exponent equal to 2.

The combination of the proposed solution strategy and memetic global optimiser was extensively tested on a new benchmark of objective and constraint functions with a variety of features that can be encountered in real-life applications.

Results show that the algorithm we propose is successful at identifying the constrained min-max solution with a limited number of calls to the objective and constraint functions. Such a solution minimises the worst-case realisation of the objective function over the uncertain space while guaranteeing its feasibility in all possible scenarios. The benchmark is complemented by a real case of robust optimisation of a space system.

In the case in which a feasible solution over the whole uncertain domain cannot be found, we proposed a constraint relaxation procedure to automatically adapt the admissible region. Finally, we proposed a trade-off approach between the unconstrained min-max solution and constraint satisfaction, based on a combination of Chebyshev and Pascoletti-Serafini scalarisation. This approach is promising for cases in which a user-defined relaxation of the constraint is possible, as it allows one to explore the optimal trade-off curve between optimality and reliability. The use of this approach will be further developed in future work.