1 Introduction

A standard bilevel optimization problem involves the minimization of a real-valued function over a constraint set that is partly defined by the optimal solution set of a parametric optimization problem with a scalar objective function; see, e.g., [10] for the most recent surveys on the topic. However, in the last two to three decades, significant attention has been paid to the generalization of this model to the case where the upper- and/or lower-level objective functions are vector-valued. This generalization is precisely the main focus of this paper, as we consider the multiobjective optimization problem

$$\begin{aligned} \min _{x,y} \ F\left( x,y\right) :=\left( F_{1}\left( x,y\right) ,\cdots ,F_{p}\left( x,y\right) \right) ^{T} \ \ \text {s.t.} \ x\in X, \ \ y\in S\left( x\right) , \end{aligned}$$
(MUL)

where the vector-valued function \(F:{\mathbb {R}}^{n}\times {\mathbb {R}}^{m}\rightarrow {\mathbb {R}}^{p}\) (with p, m, \(n\in {\mathbb {N}}\), \(p\ge 2\)) represents the upper-level objective function, while \(X \subseteq {\mathbb {R}}^{n}\) corresponds to the upper-level feasible set. As for the set-valued mapping \(S:{\mathbb {R}}^n \rightrightarrows {\mathbb {R}}^m\), it collects the optimal solutions of the vector lower-level problem

$$\begin{aligned} {\displaystyle \min _{y}} \ f\left( x,y\right) :=\left( f_{1}\left( x,y\right) ,\cdots ,f_{q}\left( x,y\right) \right) ^{T} \ \ \text {s.t.} \ \ y\in Y\left( x\right) \qquad \qquad (\hbox {L}[x]) \end{aligned}$$

for a given \(x\in {\mathbb {R}}^n\). Here, \(f:{\mathbb {R}}^{n}\times {\mathbb {R}}^{m}\rightarrow {\mathbb {R}}^{q}\) (with q, m, \(n\in {\mathbb {N}}\), \(q\ge 2\)) represents the vector lower-level objective function, while the set-valued map \(Y:{\mathbb {R}}^n \rightrightarrows {\mathbb {R}}^m\) describes the lower-level feasible set.

We associate with the (multiobjective) lower-level problem (L[x]) the corresponding frontier map \(\varPhi :{\mathbb {R}}^{n} \rightrightarrows {\mathbb {R}}^{q}\) defined by

$$\begin{aligned} \varPhi \left( x\right) := \text {Eff}/\text{WEff } \left( f\left( x,Y\left( x\right) \right) ; \,{\mathbb {R}}_{+}^{q}\right) , \end{aligned}$$
(1.1)

where the notation \(\text {Eff}/\text{WEff }\) reflects the fact that optimality in (1.1) is understood in the sense of Pareto efficiency (Eff) or weak Pareto efficiency (\(\text{ WEff }\)). In the sequel, we will simply write \(\varPhi ^{E}\left( x\right) =\text {Eff}\left( f\left( x,Y\left( x\right) \right) ; \,{\mathbb {R}}_{+}^{q}\right)\) (resp. \(\varPhi ^{W}\left( x\right) =\text{ WEff } \left( f\left( x,Y\left( x\right) \right) ; \,{\mathbb {R}}_{+}^{q}\right)\)) when referring to Pareto (resp. weak Pareto) efficiency in situations where it is necessary to distinguish between these two concepts, which are defined in the next section. Obviously, based on (1.1), the set-valued mapping \(S: {\mathbb {R}}^{n} \rightrightarrows {\mathbb {R}}^{m}\) can be rewritten as

$$\begin{aligned} S\left( x\right) :=\left\{ y\in Y\left( x\right) : \;\; f\left( x,y\right) \in \varPhi \left( x\right) \right\} \;\, \text{ for } \;\, x\in {\mathbb {R}}^n. \end{aligned}$$
(1.2)

Hence, our problem (MUL)–(L[x]) can be equivalently written as

$$\begin{aligned} {\displaystyle \min _{x,y}} \ F\left( x,y\right) :=\left( F_{1}\left( x,y\right) ,\cdots ,F_{p}\left( x,y\right) \right) ^{T} \ \ \text {s.t.} \ x\in X, \;\; y\in Y\left( x\right) , \;\; f\left( x,y\right) \in \varPhi \left( x\right) . \end{aligned}$$
(1.3)

Note that when \(q=1\), i.e., when the lower-level problem (L[x]) is a standard parametric optimization problem with a scalar objective, the frontier map \(\varPhi\) reduces to the corresponding optimal value function. Therefore, problem (1.3) becomes the standard lower-level value function (LLVF) reformulation well known in bilevel optimization with scalar objective functions; see, e.g., [5, 8, 9, 29, 30] for more details on this class of problems. Hence, (1.3) is a natural extension of the LLVF reformulation to the multiobjective bilevel optimization problem (MUL)–(L[x]), and we label it as such throughout this paper.
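To make this reduction concrete, the following small numerical sketch (our own illustration, with a hypothetical scalar objective and a discretized feasible set) computes the optimal value function \(\varphi (x)=\min _{y\in Y(x)} f(x,y)\) on a grid and recovers the solution map through the LLVF-type constraint \(f(x,y)\le \varphi (x)\).

```python
# Toy sketch (not from the paper): scalar lower-level problem (q = 1) on a grid.
# Here the frontier map Phi(x) collapses to the optimal value function phi(x),
# and the constraint f(x, y) in Phi(x) reads f(x, y) <= phi(x).
import numpy as np

def f(x, y):                           # hypothetical scalar lower-level objective
    return (y - x) ** 2 + 0.1 * y

Y_grid = np.linspace(-2.0, 2.0, 401)   # discretized lower-level feasible set Y(x)

def phi(x):                            # optimal value function (frontier map for q = 1)
    return float(np.min(f(x, Y_grid)))

def S(x, tol=1e-9):                    # grid approximation of the solution map S(x)
    return Y_grid[f(x, Y_grid) <= phi(x) + tol]

for x in (-1.0, 0.0, 1.5):
    print(f"x = {x:4.1f}:  phi(x) = {phi(x):8.4f},  S(x) on the grid = {S(x)}")
```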

The number of publications on problem (MUL)–(L[x]), or on the semivectorial version of the problem, where only the lower-level problem is multiobjective, has been growing significantly over the last decade. Recent surveys on the subject include [11, 25], where overviews of different types of solution algorithms are given. However, our main interest here is in constructing necessary optimality conditions for problem (MUL)–(L[x]). A common point of most works on optimality conditions for this problem is that they rely on some form of scalarization to deal with the multiobjective nature of the lower-level problem (L[x]); for recent surveys on the subject, see, e.g., [6, 7] and references therein.

Additionally, in the latter references, the LLVF reformulation is common after the scalarization step, although [32] provides a different perspective. Subsequently, as in the case where \(p=1\) and \(q=1\), the standard approach to developing necessary optimality conditions for the corresponding model, after scalarization, has been the concept of partial calmness introduced in [29]. However, given that partial calmness is, in some sense, equivalent to a partial exact penalization of the corresponding value function constraint, it is unclear how such an approach can be directly applied to (1.3) when \(p>1\) or \(q>1\). Hence, our first main focus in this paper (see Sect. 3) is to study the possibility of applying the concept of calmness of set-valued mappings, which is closely related to partial calmness [9, 15]. In Sect. 3, we construct a tractable framework for a set-valued mapping tailored to (MUL)–(L[x]) to be used as a constraint qualification (CQ) for the problem. In Sect. 4, we show how this CQ can be used to develop necessary optimality conditions for problem (MUL)–(L[x]). As a by-product of the regularity condition studied in Sect. 4, we provide a new sufficient condition for the stability of the optimal solution set-valued mapping S (1.2); i.e., for the estimation of its coderivative and its Lipschitz-likeness.

Before developing these main results in Sects. 3 and 4, we collect the necessary background material from variational analysis and multiobjective optimization in Sect. 2. Finally, in Sect. 5, we apply our results to problems with smooth constraint functionals.

2 Preliminaries

2.1 Tools from variational analysis

In this subsection, we present basic tools from variational analysis that will be used throughout the paper; more on the material covered here can be found in [22, 23], for example. For some point \(x\in {\mathbb {R}}^{n}\) and a scalar \(\epsilon > 0\),

$$\begin{aligned} {\mathbb {U}}_{\epsilon }\left( x\right) :=\{y\in {\mathbb {R}}^{n}| \ \Vert y-x\Vert < \epsilon \}\;\, \text{ and } \; {\mathbb {B}}_{\epsilon }\left( x\right) :=\{y\in {\mathbb {R}}^{n}| \ \Vert y-x\Vert \le \epsilon \} \end{aligned}$$

denote the open and closed \(\epsilon\)-ball around x, respectively. For brevity, we make use of \({\mathbb {U}}_{n}={\mathbb {U}}_{1}\left( 0\right)\) and \({\mathbb {B}}_{n}={\mathbb {B}}_{1}\left( 0\right)\). For a set-valued mapping \(\Upsilon : {\mathbb {R}}^{n}\rightrightarrows {\mathbb {R}}^{m}\), its Painlevé-Kuratowski outer/upper limit at a point \({\bar{x}}\) is defined by

$$\begin{aligned} {\displaystyle \limsup _{x\rightarrow {\bar{x}}}} \ \Upsilon \left( x\right) :=\left\{ x^{*}\in {\mathbb {R}}^{m}:\;\, \exists x_{k} \rightarrow {\bar{x}}, \ x^{*}_{k}\rightarrow x^{*} \ \text {with} \ x^{*}_{k}\in \Upsilon \left( x_{k}\right) \ \text {for all} \ k\in {\mathbb {N}} \right\} . \end{aligned}$$

Next, consider a set \(\Omega \subset {\mathbb {R}}^{n}\), which is assumed to be closed around a point \({\bar{x}}\in \Omega\). The Fréchet normal cone to \(\Omega\) at \({\bar{x}}\in \Omega\) is defined by

$$\begin{aligned} {\widehat{N}}\left( {\bar{x}};\; \Omega \right) :=\left\{ x^{*}\in {\mathbb {R}}^{n}:\;\, {\displaystyle \limsup _{\begin{array}{c} {x\overset{\Omega }{\rightarrow }{\bar{x}}} \end{array}}} \ \frac{\left\langle x^{*},x-{\bar{x}}\right\rangle }{\parallel x-{\bar{x}}\parallel } \le 0\right\} , \end{aligned}$$
(2.1)

where \(x\overset{\Omega }{\rightarrow }{\bar{x}}\) means that \(x\rightarrow {\bar{x}}\) and \(x\in \Omega\). Based on this concept, we can introduce the limiting/Mordukhovich normal cone \(N\left( {\bar{x}};\Omega \right)\) to \(\Omega\) at \({\bar{x}}\), which can be obtained by taking the sequential Painlevé-Kuratowski upper limits of the Fréchet normal cone in (2.1):

$$\begin{aligned} N\left( {\bar{x}}; \; \Omega \right) := {\displaystyle \limsup _{\begin{array}{c} {x\overset{\Omega }{\rightarrow }{\bar{x}}} \end{array}}} \ {\widehat{N}}\left( x;\Omega \right) . \end{aligned}$$

If \({\bar{x}}\notin \Omega\), it is standard to set \(N\left( {\bar{x}}; \;\Omega \right) :=\emptyset\). We obviously have \({\widehat{N}}\left( {\bar{x}};\Omega \right) \subset N\left( {\bar{x}}; \; \Omega \right)\), and if this inclusion holds as an equality, then we say that \(\Omega\) is normally regular at \({\bar{x}}\). The class of normally regular sets includes convex sets and many other important sets in the field of variational analysis and optimization; see, e.g., [22] for more details.
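As a standard illustration of these notions, consider the convex (hence normally regular) set \(\Omega ={\mathbb {R}}_{+}^{n}\) and a point \({\bar{x}}\in \Omega\); a direct computation from (2.1) gives

$$\begin{aligned} {\widehat{N}}\left( {\bar{x}};\; {\mathbb {R}}_{+}^{n}\right) = N\left( {\bar{x}};\; {\mathbb {R}}_{+}^{n}\right) =\left\{ x^{*}\in {\mathbb {R}}^{n}:\;\, x^{*}_{i}\le 0 \ \text {if} \ {\bar{x}}_{i}=0, \ \ x^{*}_{i}=0 \ \text {if} \ {\bar{x}}_{i}>0\right\} . \end{aligned}$$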

Let \(\Upsilon : {\mathbb {R}}^{n}\rightrightarrows {\mathbb {R}}^{m}\) be a set-valued mapping with its graph

$$\begin{aligned} \text {gph} \ \Upsilon :=\left\{ \left( x,y\right) \in {\mathbb {R}}^{n} \times {\mathbb {R}}^{m}:\;\, y\in \Upsilon \left( x\right) \right\} , \end{aligned}$$

The normal coderivative of \(\Upsilon\) at \(\left( {\bar{x}},{\bar{y}}\right) \in \text {gph} \ \Upsilon\) is then defined by

$$\begin{aligned} D^{*}\Upsilon \left( {\bar{x}},{\bar{y}}\right) \left( y^{*}\right) :=\left\{ x^{*}\in {\mathbb {R}}^{n}:\;\, \left( x^{*},-y^{*}\right) \in N\left( \left( {\bar{x}},{\bar{y}}\right) ; \;\text {gph} \ \Upsilon \right) \right\} \ \ \text {for all} \ y^{*}\in {\mathbb {R}}^{m}. \end{aligned}$$
(2.2)

When \(\Upsilon\) is a single-valued mapping, to simplify the notation, one writes \(D^{*}\Upsilon \left( {\bar{x}}\right) \left( y^{*}\right)\) instead of \(D^{*}\Upsilon \left( {\bar{x}},\Upsilon \left( {\bar{x}}\right) \right) \left( y^{*}\right)\). Furthermore, for a function \(f: {\mathbb {R}}^{n}\rightarrow {\mathbb {R}}^{m}\) that is strictly differentiable at \({\bar{x}}\), we have the representation

$$\begin{aligned} D^{*}f\left( {\bar{x}}\right) \left( y^{*}\right) =\left\{ \nabla f\left( {\bar{x}}\right) ^{\top }y^{*}\right\} \ \ \ \text {for all} \ y^{*}\in {\mathbb {R}}^{m}. \end{aligned}$$

We conclude this subsection with some further properties of set-valued mappings. A set-valued mapping \(\Upsilon : {\mathbb {R}}^{n}\rightrightarrows {\mathbb {R}}^{m}\) is said to be Lipschitz-like around \(\left( {\bar{x}},{\bar{y}}\right) \in \text {gph} \ \Upsilon\) if there exist neighbourhoods U of \({\bar{x}}\), V of \({\bar{y}}\), and a constant \(l > 0\) such that

$$\begin{aligned} \Upsilon \left( x\right) \cap V \subseteq \Upsilon \left( u\right) +l\parallel u-x\parallel {\mathbb {B}}_{m} \ \ \text {for all} \ x,u \in U. \end{aligned}$$

The weaker concept of calmness is said to hold for a set-valued mapping \(\Upsilon\) at a point \(\left( {\bar{x}},{\bar{y}}\right) \in \text {gph} \ \Upsilon\) if there exist neighbourhoods U of \({\bar{x}}\), V of \({\bar{y}}\), and a constant \(l > 0\) such that

$$\begin{aligned} d\left( y,\Upsilon \left( {\bar{x}}\right) \right) \le l \parallel x-{\bar{x}}\parallel \ \ \text {for all} \ x\in U \ \ \text {and} \ \ y\in V \cap \Upsilon \left( x\right) . \end{aligned}$$

Considering the continuous functions \(h_{i}:{\mathbb {R}}^{n}\times {\mathbb {R}}^{m}\rightarrow {\mathbb {R}}\) for \(i=1, \ldots , q\), we associate the set-valued mapping \(\Upsilon\) defined by

$$\begin{aligned} \Upsilon \left( x\right) := \left\{ y\in {\mathbb {R}}^{m}:\;\, h_{i}\left( x,y\right) \le 0, \ i=1,\cdots , q \right\} \; \text{ for } \; x\in {\mathbb {R}}^{n}. \end{aligned}$$
(2.3)

The mapping \(\Upsilon\) in (2.3) is said to be R-regular at \(\left( {\bar{x}},{\bar{y}}\right)\) w.r.t. \(\Omega \subseteq {\mathbb {R}}^{n}\) if there exist positive numbers \(\sigma\) and \(\delta\) such that for all \(\left( x,y\right) \in {\mathbb {U}}_{\delta }\left( {\bar{x}},{\bar{y}}\right) \cap \left( \Omega \times {\mathbb {R}}^{m}\right)\),

$$\begin{aligned} d\left( y, \, \Upsilon \left( x\right) \right) \ \le \ \sigma \max \left\{ 0, \;\,\max \left\{ h_{i}\left( x, y\right) |\;\, i=1, \cdots ,q\right\} \right\} . \end{aligned}$$
(2.4)

For more details on R-regularity, see [21] and references therein.
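To illustrate (2.4) in the simplest case, take \(q=1\) and \(h_{1}\left( x,y\right) :=a^{T}y-b\left( x\right)\) for some fixed \(a\in {\mathbb {R}}^{m}\setminus \{0\}\) and an arbitrary function \(b:{\mathbb {R}}^{n}\rightarrow {\mathbb {R}}\) (a toy example of ours, not taken from [21]). Then \(\Upsilon \left( x\right)\) is a half-space for every x, and the projection formula for half-spaces yields

$$\begin{aligned} d\left( y, \, \Upsilon \left( x\right) \right) = \frac{\max \left\{ 0, \; a^{T}y-b\left( x\right) \right\} }{\parallel a\parallel } \ \ \text {for all} \ \left( x,y\right) \in {\mathbb {R}}^{n}\times {\mathbb {R}}^{m}, \end{aligned}$$

so that \(\Upsilon\) is R-regular at every point of its graph w.r.t. \(\Omega ={\mathbb {R}}^{n}\) with \(\sigma =1/\parallel a\parallel\) and any \(\delta >0\).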

A set-valued mapping \(\Upsilon : {\mathbb {R}}^{n}\rightrightarrows {\mathbb {R}}^{m}\) will be said to be order semicontinuous at a point \(\left( {\bar{x}},{\bar{y}}\right) \in \text {gph} \ \Upsilon\), if for any sequence \(\left( x_{k},y_{k}\right) \in \text {epi} \ \Upsilon\) converging to \(\left( {\bar{x}},{\bar{y}}\right)\), there is a sequence \(\left( x_{k},z_{k}\right) \in \text {gph} \ \Upsilon\) with \(y_{k}-z_{k}\in {\mathbb {R}}_{+}^{m}\) such that \(\left( z_{k}\right)\) contains a subsequence converging to \({\bar{y}}\). Here, \(\text {epi} \ \Upsilon\) corresponds to the epigraph of \(\Upsilon\) with respect to the ordering cone \({\mathbb {R}}_{+}^{m}\):

$$\begin{aligned} \text {epi} \ \Upsilon :=\left\{ \left( x,y\right) \in {\mathbb {R}}^{n} \times {\mathbb {R}}^{m}:\;\, y\in \Upsilon \left( x\right) +{\mathbb {R}}_{+}^{m} \right\} . \end{aligned}$$

Obviously, \(\Upsilon\) will be order semicontinuous around \(\left( {\bar{x}},{\bar{y}}\right) \in \text {gph} \ \Upsilon\) if there exists a neighbourhood U of \(\left( {\bar{x}},{\bar{y}}\right)\) such that \(\Upsilon\) is order semicontinuous at any \(\left( x,y\right) \in U\cap \text {gph} \ \Upsilon\).

2.2 Multiobjective optimization concepts

Let \(C\subset {\mathbb {R}}^{n}\) be a pointed, closed, and convex cone with nonempty interior, which induces a partial order, denoted by \(\preceq _{C}\), on \({\mathbb {R}}^{n}\).

Definition 2.1

Let \(\Omega\) be a nonempty subset of \({\mathbb {R}}^{n}\). A point \(x\in \Omega\) is said to be a Pareto (resp. weak Pareto) efficient/minimal vector of \(\Omega\) w.r.t. C if

$$\begin{aligned} \Omega \subset x+\left[ \left( {\mathbb {R}}^{n}\setminus \left( -C\right) \right) \cup \left\{ 0\right\} \right] \ \ \ \left( \text {resp.} \ \Omega \subset x+ \left( {\mathbb {R}}^{n}\setminus -\text{ int } C\right) \right) , \end{aligned}$$

where “int” denotes the topological interior of the set in question.

In the sequel, the set of all the Pareto (resp. weak Pareto) efficient/minimal vectors of a set \(\Omega\) w.r.t. C is denoted by \(\text {Eff}\left( \Omega ;\; C\right)\) (resp. \(\text {WEff}\left( \Omega ; \;C\right)\)). Let us now consider the following multiobjective optimization problem with respect to the partial order introduced by the pointed, closed, and convex cone C:

$$\begin{aligned} \min f (x) \ \ \text {s.t.} \ x\in \Omega , \end{aligned}$$
(2.5)

where f represents a vector-valued function defined on \({\mathbb {R}}^n\) and \(\Omega\) the nonempty feasible set. For a nonempty set \(N\subset \Omega\), the image of N by f is defined by

$$\begin{aligned} f\left( N\right) :=\left\{ f\left( x\right) :\;\, x\in N\right\} . \end{aligned}$$

Definition 2.2

A point \({\bar{x}}\in \Omega\) is said to be a Pareto (resp. weakly Pareto) optimal solution of problem (2.5) if \(f\left( {\bar{x}}\right)\) is a Pareto (resp. weak Pareto) minimal vector of \(f\left( \Omega \right)\), i.e., \(f\left( {\bar{x}}\right) \in \text {Eff}\left( f\left( \Omega \right) ;C\right)\) (resp. \(f\left( {\bar{x}}\right) \in \text {WEff}\left( f\left( \Omega \right) ;C\right)\)).

Similarly, a point \({\bar{x}}\in \Omega\) is said to be a local Pareto (resp. local weak Pareto) optimal solution of problem (2.5) if there exists a neighborhood U of \({\bar{x}}\) such that \(f\left( {\bar{x}}\right)\) is a Pareto (resp. weak Pareto) minimal vector of \(f \left( U\cap \Omega \right)\). For our analysis of the multiobjective bilevel program (MUL), both the efficient and the weakly efficient solution concepts will be used at the upper level, and similarly, both notions will be applied to the lower-level problem (L[x]).
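The following small numerical sketch (our own illustration, based on a hypothetical finite image set) makes the difference between the two concepts tangible for \(C={\mathbb {R}}_{+}^{2}\): a vector is removed from \(\text {Eff}\) as soon as another image vector is componentwise smaller with at least one strict inequality, and from \(\text {WEff}\) only if another image vector is strictly smaller in every component.

```python
# Toy sketch (not from the paper): Pareto vs. weak Pareto minimal vectors of a
# finite image set f(Omega) in R^2, w.r.t. the ordering cone C = R^2_+.
import numpy as np

def eff(points):
    """Pareto minimal vectors: no other point is <= componentwise and different."""
    pts = np.asarray(points, dtype=float)
    return np.array([p for i, p in enumerate(pts)
                     if not any(np.all(q <= p) and np.any(q < p)
                                for j, q in enumerate(pts) if j != i)])

def weff(points):
    """Weakly Pareto minimal vectors: no other point is strictly smaller."""
    pts = np.asarray(points, dtype=float)
    return np.array([p for p in pts if not any(np.all(q < p) for q in pts)])

# Hypothetical image set f(Omega):
images = [(0, 2), (1, 1), (2, 0), (0, 3), (2, 2)]
print("Eff :", eff(images))   # keeps (0,2), (1,1), (2,0)
print("WEff:", weff(images))  # additionally keeps (0,3); the point (2,2) is
                              # strictly dominated by (1,1) and belongs to neither
```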

3 Generalized value function constraint qualification

We start this section by introducing the main constraint qualification that will be used to derive necessary optimality conditions for problem (1.3).

Definition 3.1

The generalized value function constraint qualification (GVFCQ) holds at \(\left( {\bar{x}},{\bar{y}}\right)\) if the set-valued mapping \(\Psi : {\mathbb {R}}^{n}\times {\mathbb {R}}^{q} \rightrightarrows {\mathbb {R}}^{n}\times {\mathbb {R}}^{m}\) defined by

$$\begin{aligned} \Psi \left( u,v\right) :=\left\{ \left( x,y\right) \in \text {gph} \ Y: \;\;\left( \begin{array}{c} x \\ f(x,y) \end{array} \right) + \left( \begin{array}{c} u\\ v \end{array} \right) \in \text {gph}~\varPhi \right\} , \end{aligned}$$
(3.1)

is calm at the point \(\left( 0, 0, {\bar{x}},{\bar{y}}\right)\).

Note that if the lower-level problem (L[x]) has a scalar objective function, then the frontier map \(\varPhi\) reduces to the corresponding optimal value function \(\varphi\), and the value function constraint qualification (VFCQ) in this case is obtained by replacing (3.1) with

$$\begin{aligned} \Psi _{\varphi }(v):=\left\{ \left( x,y\right) \in \text {gph} \ Y: \;\; f(x,y)-\varphi (x)\le v\right\} . \end{aligned}$$
(3.2)

Clearly, we have \(\Psi _{\varphi }(v) = \Psi (0, -v)\) if \(\text {gph}~\varPhi\) is replaced in (3.1) by the hypograph of \(\varphi\).

It is well known that in bilevel programs with scalar objective functions, the VFCQ implies that the partial calmness condition holds in the case where the lower-level feasible set is unperturbed [15]. Moreover, to the best of our knowledge, the VFCQ is the weakest CQ that ensures that partial calmness holds. Hence, since partial calmness cannot be defined for (1.3), due to the multiobjective nature of the objective functions in (MUL)–(L[x]), it makes sense to consider the GVFCQ as a natural candidate for a tractable CQ for the problem under consideration. For the remainder of this section, we focus our attention on constructing sufficient conditions ensuring that the GVFCQ holds.

We start with an extension of the uniform weak sharp minimum condition, which enables an extension of a relationship well known to hold in standard bilevel optimization problems with scalar objective functions [15, 28–30].

Definition 3.2

The local uniform weak sharp minimum (LUWSM) condition holds at \((\bar{x},\bar{y})\), for the family of problems \((L[x])_{x\in X}\), if there exist \(\epsilon > 0\) and \(\lambda > 0\) such that

$$\begin{aligned} \forall \left( x,y\right) \in {\mathbb {U}}_{\epsilon }\left( {\bar{x}},{\bar{y}}\right) : \ \ y\in Y\left( x\right) \; \Longrightarrow \; \lambda d\left( y, \,S\left( x\right) \right) \le d\left( f\left( x, y\right) ;\; \varPhi \left( x\right) \right) . \end{aligned}$$

If \({\mathbb {U}}_{\epsilon }\left( {\bar{x}},{\bar{y}}\right)\) is replaced by the whole space \({\mathbb {R}}^{n}\times {\mathbb {R}}^{m}\) in this definition, we simply say that the uniform weak sharp minimum (UWSM) condition holds at \(\left( {\bar{x}},{\bar{y}}\right)\).
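As a simple illustration (a toy example of ours), take \(q=2\), \(Y\left( x\right) ={\mathbb {R}}_{+}\), and \(f\left( x,y\right) :=\left( y+x, \, 2y-x\right)\) for \(x\in X:={\mathbb {R}}\). Since both components of f are increasing in y, we get \(S\left( x\right) =\{0\}\) and \(\varPhi \left( x\right) =\{\left( x,-x\right) \}\) for every x, and hence, for all \(y\ge 0\),

$$\begin{aligned} d\left( y, \,S\left( x\right) \right) = y \le \parallel \left( y, \, 2y\right) \parallel = d\left( f\left( x, y\right) ;\; \varPhi \left( x\right) \right) , \end{aligned}$$

so that the UWSM condition holds with \(\lambda =1\), uniformly in x.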

Theorem 3.3

Let \(\left( {\bar{x}},{\bar{y}}\right) \in \text {gph} S\), let f be locally Lipschitzian around \(\left( {\bar{x}},{\bar{y}}\right)\) with constant L, and assume that \(\varPhi\) is Lipschitz-like around \(\left( {\bar{x}},{\bar{z}}\right)\), where \({\bar{z}}=f\left( {\bar{x}},{\bar{y}}\right)\). If the LUWSM condition holds at \(\left( {\bar{x}},{\bar{y}}\right)\), then the GVFCQ is satisfied at \(\left( {\bar{x}},{\bar{y}}\right)\).

Proof

Based on the assumptions, there exist \(l > 0\) and \(\delta >0\) such that

$$\begin{aligned} \varPhi \left( x_{0}\right) \cap \left( {\bar{z}}+\delta {\mathbb {B}}_{q}\right) \subset \varPhi \left( x_{1}\right) +l \parallel x_{0}-x_{1}\parallel {\mathbb {B}}_{q} \ \ \text {for all} \ x_{0}, \ x_{1}\in {\bar{x}}+\delta {\mathbb {B}}_{n}. \end{aligned}$$
(3.3)

Let \(0< \epsilon < \frac{\delta }{2}\) and \(\lambda > 0\) be the constants from Definition 3.2, and take \(u \in \epsilon {\mathbb {B}}_{n}\), \(v \in \epsilon {\mathbb {B}}_{q}\), and \(\left( x,y\right) \in \left( {\bar{x}},{\bar{y}}\right) +\epsilon {\mathbb {B}}_{n\times m}\) such that \(\left( x,y\right) \in \Psi \left( u,v\right)\). Since \(\Psi \left( 0,0\right) =\text {gph}S\), we have

$$\begin{aligned} d\left( \left( x,y\right) ,\Psi \left( 0,0\right) \right) =d\left( \left( x,y\right) ,\text {gph}S\right) \le d\left( y,S\left( x\right) \right) . \end{aligned}$$
(3.4)

By the local uniform weak sharp minimum condition, we have

$$\begin{aligned} d\left( y,S\left( x\right) \right) \ \le \ \lambda ^{-1} d\left( f\left( x,y\right) ; \varPhi \left( x\right) \right) . \end{aligned}$$
(3.5)

Since f is locally Lipschitzian around \(\left( {\bar{x}},{\bar{y}}\right)\) with constant L and radius \(\alpha\), setting \(\beta =\min ~\left\{ \alpha , \frac{\delta }{4L}\right\}\) leads to

$$\begin{aligned} \begin{array}{lcl} \parallel v+f\left( x,y\right) -f\left( {\bar{x}},{\bar{y}}\right) \parallel &{} \le &{} \parallel v\parallel + L\left( \parallel x-{\bar{x}}\parallel +\parallel y-{\bar{y}}\parallel \right) , \\ &{} \le &{} \epsilon +L\left( \frac{\delta }{4L}+\frac{\delta }{4L}\right) , \\ &{} \le &{} \frac{\delta }{2} +\frac{\delta }{2}, \\ &{} = &{} \delta \end{array} \end{aligned}$$

for all \(\left( x,y\right) \in \left( {\bar{x}},{\bar{y}}\right) + \beta {\mathbb {B}}_{n\times m}\). Thus, \(v+f\left( x,y\right) \in {\bar{z}}+\delta {\mathbb {B}}_{q}\). Taking \(x_{0}=x+u\) and \(x_{1}=x\) while considering that \(x+u\in {\bar{x}}+\delta {\mathbb {B}}_{n}\), it follows from (3.3) that there exists \(z\in \varPhi \left( x\right)\) such that

$$\begin{aligned} \parallel v+f\left( x,y\right) -z \parallel \le l \parallel u\parallel . \end{aligned}$$

Consequently,

$$\begin{aligned} d\left( f\left( x,y\right) ; \varPhi \left( x\right) \right) \le \parallel f\left( x,y\right) -z \parallel \le l \parallel u\parallel +\parallel v\parallel . \end{aligned}$$
(3.6)

Setting \(\tau =\max \left( l,1\right)\) and combining (3.4), (3.5), and (3.6), it follows that

$$\begin{aligned} d\left( \left( x,y\right) ,\Psi \left( 0,0\right) \right) \le \lambda ^{-1}\tau \parallel \left( u,v\right) \parallel \end{aligned}$$

for all \(\left( u,v\right) \in \epsilon {\mathbb {B}}_{n\times q}\) and \(\left( x,y\right) \in \Psi \left( u,v\right) \cap \left( \left( {\bar{x}},{\bar{y}}\right) + \epsilon {\mathbb {B}}_{n\times m}\right)\). Hence, the result follows. \(\square\)

To provide a concrete case where the UWSM condition (and hence the LUWSM condition) holds, we consider the parametric linear multiobjective optimization problem

$$\begin{aligned} \min _{y} \ Cy \ \ \ \text { s.t. } \;\ Ax+By\le d, \end{aligned}$$
(3.7)

where \(d\in {\mathbb {R}}^{k}\), \(C \in {\mathbb {R}}^{q\times m}\), \(A \in {\mathbb {R}}^{k\times n}\), and \(B \in {\mathbb {R}}^{k\times m}\). To state the corresponding result, let \(\Gamma\) denote the simplex defined by

$$\begin{aligned} \Gamma :=\left\{ \alpha \in {\mathbb {R}}^{q}: \;\; \alpha \ge 0, \ \ {\displaystyle \sum ^{q}_{i=1}} \ \alpha _{i}=1\right\} . \end{aligned}$$
(3.8)

Proposition 3.4

Consider a family of problems (L[x])\(_{x\in X}\) defined in (3.7) with \(X \subseteq {\mathbb {R}}^n\), and let the corresponding version of the set-valued mapping S (1.2) for problem (3.7) be uniformly bounded on X; i.e., there exists some \(k>0\) such that \(\parallel y\parallel \le k\) for all \(x\in X\) and \(y\in S\left( x\right)\). Furthermore, suppose that there exists a constant \(\delta > 0\) such that for all \(\alpha \in \Gamma\), \(x\in X\), and \(y\in S\left( x\right)\), we have \(\alpha ^{T}Cy \ge \delta\). Then the UWSM condition holds.

Proof

Let \(x\in X\) and consider the family of sets

$$\begin{aligned} S_{\alpha }\left( x\right) := \arg \underset{y}{\min }\left\{ \alpha ^{T}Cy: \;\; Ax+By\le d \right\} \; \text{ for } \; \alpha \in \Gamma . \end{aligned}$$
(3.9)

Given that the set-valued mapping \(G\left( x\right) =\{y\in {\mathbb {R}}^{m}:\; Ax+By\le d \}\) is polyhedral and convex-valued, it follows from [1] (see also [20, Theorem 3.3, p. 96]) that there are finitely many vectors \(\alpha _{1}\left( x\right)\),..., \(\alpha _{s}\left( x\right)\) in the set \(\Gamma\) (3.8) such that we have

$$\begin{aligned} S\left( x\right) \ = \ {\displaystyle \bigcup ^{s}_{j=1}} \ S_{\alpha _{j}\left( x\right) }\left( x\right) . \end{aligned}$$
(3.10)

Let \(y\in G\left( x\right)\). If \(y\in S\left( x\right)\), then \(d\left( y,S\left( x\right) \right) =0\) and the UWSM inequality holds trivially. Otherwise, considering any \(y\in G\left( x\right) \setminus S\left( x\right)\), we have \(0\notin Cy -\varPhi \left( x\right)\). Now, let \(z\in \varPhi \left( x\right)\); then there is some \({\tilde{y}}\in S\left( x\right)\) such that \(z=C{\tilde{y}}\) and \(Cy -C{\tilde{y}}\ne 0\), and, by (3.10), some \(j\in \{1,\ldots ,s\}\) such that \({\tilde{y}}\in S_{\alpha _{j}\left( x\right) }\left( x\right)\). On the other hand, setting \(a=\alpha _{j}^{T}\left( x\right) C\) and \(b=\alpha _{j}^{T}\left( x\right) C{\tilde{y}}\) and using Hoffman’s lemma (see [16, Theorem 1]), it follows from (3.9) and (3.10) that

$$\begin{aligned} d\left( y,S\left( x\right) \right) \le d\left( y,S_{\alpha _{j}}\left( x\right) \right) \le k \delta ^{-1} \alpha _{j}^{T}\left( x\right) \left( Cy -C{\tilde{y}}\right) , \end{aligned}$$

where k is the constant appearing in uniform boundedness of S. Hence,

$$\begin{aligned} \begin{array}{lcl} d\left( y,S\left( x\right) \right) &{} \le &{} k \delta ^{-1} \ \parallel \alpha _{j}\left( x\right) \parallel _{1} \ \parallel Cy - C{\tilde{y}} \parallel , \\ &{} \le &{} \lambda ^{-1} \ \parallel Cy - C{\tilde{y}} \parallel , \end{array} \end{aligned}$$

where \(\lambda ^{-1} =k \delta ^{-1}\) and \(\parallel \alpha _{j}\left( x\right) \parallel _{1}=1\). It follows from the last inequality that

$$\begin{aligned} Cy - C{\tilde{y}} \notin \lambda d\left( y,S\left( x\right) \right) {\mathbb {U}}_{q}. \end{aligned}$$

Finally, since \(z=C{\tilde{y}}\) is arbitrary, we get \(\left( Cy -\varPhi \left( x\right) \right) \cap \lambda d\left( y,S\left( x\right) \right) {\mathbb {U}}_{q} =\emptyset .\) This means that for all \(z\in \varPhi \left( x\right)\), \(Cy - z \notin \lambda d\left( y,S\left( x\right) \right) {\mathbb {U}}_{q}.\) Consequently, \(\lambda d\left( y,S\left( x\right) \right) \le \parallel Cy - z\parallel\) for all \(z\in \varPhi \left( x\right)\), which implies that \(\lambda d\left( y, S\left( x\right) \right) \le d\left( Cy, \varPhi \left( x\right) \right) .\) Hence, the result follows. \(\square\)

Next, we provide an example where all the assumptions of Proposition 3.4 are satisfied.

Example 3.1

Setting \(X:=[4, \;\infty )\times [3, \infty )\) and considering problem (3.7) with

$$\begin{aligned} C:=\left( \begin{array}{cc} 2 &{} 0 \\ 0 &{} 1 \end{array} \right) , \quad A:=\left( \begin{array}{rr} 0 &{} 0 \\ 0 &{} 0\\ 0 &{} 0\\ 0 &{} 0\\ -1&{} 0\\ 0 &{} -1 \end{array}\right) , \quad B:=\left( \begin{array}{rr} 1 &{} 0 \\ -1 &{} 0\\ 0 &{} 2\\ 0 &{} -1\\ 1&{} 0\\ 0 &{} 1 \end{array}\right) , \quad \text{ and } \quad d:=\left( \begin{array}{r} 4\\ -1\\ 6\\ -2\\ 0\\ 0 \end{array}\right) , \end{aligned}$$
(3.11)

we can easily check that \(Y(x)\subseteq [1,4]\times [2,3]\) for all \(x\in X\), so that the corresponding mapping S is uniformly bounded, and that for any \(x\in X\) and y such that \(Ax + By \le d\), taking any \((\mu , \nu )\in {\mathbb {R}}^2_+\) such that \(\mu + \nu =1\), we have the inequality

$$\begin{aligned} (\mu , \nu )C y = 2\mu y_1 + \nu y_2\ge 2. \end{aligned}$$
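The claim can also be verified numerically. The following sketch (our own illustration, relying on scipy.optimize.linprog) minimizes the scalarized objective \(\alpha ^{T}Cy\) over \(\left\{ y\in {\mathbb {R}}^{2}: \, Ax+By\le d\right\}\) for a few sample parameters \(x\in X\) and weights \(\alpha \in \Gamma\), and confirms that the optimal value, and hence \(\alpha ^{T}Cy\) for every \(y\in S\left( x\right)\), is bounded below by \(\delta =2\).

```python
# Numerical check of Example 3.1 (illustration only): for sample x in X and
# weights alpha in the simplex Gamma, minimize alpha^T C y over the feasible
# set {y : A x + B y <= d} and verify that the optimal value is at least 2.
import numpy as np
from scipy.optimize import linprog

C = np.array([[2.0, 0.0], [0.0, 1.0]])
A = np.array([[0, 0], [0, 0], [0, 0], [0, 0], [-1, 0], [0, -1]], dtype=float)
B = np.array([[1, 0], [-1, 0], [0, 2], [0, -1], [1, 0], [0, 1]], dtype=float)
d = np.array([4, -1, 6, -2, 0, 0], dtype=float)

for x in (np.array([4.0, 3.0]), np.array([5.0, 7.0]), np.array([10.0, 3.5])):
    for mu in (0.0, 0.3, 0.7, 1.0):
        alpha = np.array([mu, 1.0 - mu])       # weight vector in the simplex Gamma
        res = linprog(alpha @ C, A_ub=B, b_ub=d - A @ x,
                      bounds=[(None, None), (None, None)])
        assert res.success and res.fun >= 2.0 - 1e-9
        print(f"x = {x}, alpha = {alpha}: min alpha^T C y = {res.fun:.4f}")
```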

In case the uniform boundedness of the set-valued mapping S required in Proposition 3.4 is not satisfied, we can use the following alternative result.

Proposition 3.5

Consider a family of problems (L[x])\(_{x\in X}\) defined in (3.7) with \(X \subseteq {\mathbb {R}}^n\) such that for all \(x\in X\) and \(j\in \{1,\cdots ,s\}\), the sets \(S_{\alpha _{j}}\left( x\right)\) from (3.10) are unbounded. Furthermore, suppose that there exist \(\delta > 0\) and a unit vector \(z\in {\mathbb {R}}^{m}\) such that for all \(\alpha \in \Gamma\) and \(x\in X\), we have \(\alpha ^{T}Cz\ge \delta > 0\). Then the UWSM condition holds.

Proof

The proof follows along the lines of that of Proposition 3.4: we argue in the same way as above and use [16, Theorem 2] instead of [16, Theorem 1]. \(\square\)

The next result provides a sufficient condition for the existence of a uniform weak sharp minimum tailored to a more general class of multiobjective bilevel optimization problems.

Theorem 3.6

The UWSM condition holds for any general family of problems (L[x])\(_{x\in X}\), where f is Lipschitz continuous in y uniformly in \(x\in X\), the set \(Y\left( x\right)\) is closed for any fixed \(x\in X\), and there exists a strictly positive number \(\lambda\) such that we have

$$\begin{aligned} \begin{array}{ll} \parallel \varsigma \parallel \ge \lambda ^{-1}, &{} \forall \varsigma \in \partial _{y} \left\langle y^{*},f \right\rangle \left( x,y\right) +N\left( y,Y\left( x\right) \right) ,\\ &{} \forall y^{*}\in N\left( f\left( x,y\right) , \; z-{\mathbb {R}}_{+}^{q}\right) , \;\; z\in \varPhi \left( x\right) , \;\; \left( x,y\right) \in \text {gph} \ Y, \;\; y\notin S\left( x\right) . \end{array} \end{aligned}$$

Proof

Consider any closed subset \(\Lambda\) of \({\mathbb {R}}^{m}\), a locally Lipschitz function \(\phi :{\mathbb {R}}^{m}\rightarrow {\mathbb {R}}^{q}\) with constant L, a vector \(z\in {\mathbb {R}}^{q}\), and the set

$$\begin{aligned} \Xi \left( \phi ,z\right) =\left\{ y\in \Lambda \,: \ \phi \left( y\right) \le z\right\} \end{aligned}$$

and the function

$$\begin{aligned} \phi _{z}^{+}\left( y\right) = d\left( \phi \left( y\right) ,z-{\mathbb {R}}_{+}^{q}\right) = {\displaystyle \max ^{q}_{i=1}} \ \left( \phi _{i}\left( y\right) -z_{i}\right) _{+}, \end{aligned}$$

where the distance function is defined by the max norm on \({\mathbb {R}}^{q}\) and \(a_{+}=\max \{a,0\}\). Now, let us show that if there exist \(\lambda > 0\) and \(0< \epsilon \le +\infty\) such that

$$\begin{aligned} \parallel \varsigma \parallel \ge \lambda ^{-1} \end{aligned}$$
(3.12)

for all \(\varsigma \in \partial \left\langle y^{*},\phi \right\rangle \left( y\right) +N\left( y, \Lambda \right)\), \(y^{*}\in N\left( \phi \left( y\right) ,z-{\mathbb {R}}_{+}^{q}\right)\), \(y\in \Lambda\), and \(0< \phi _{i}\left( y\right) -z_{i} < \epsilon\) for some i, then we have

$$\begin{aligned} d\left( y,\Xi \left( \phi ,z\right) \right) \le \lambda \phi _{z}^{+}\left( y\right) , \ \ \ \forall y\in \Lambda \ \ \text {such that} \ \ \phi _{z}^{+}\left( y\right) < \epsilon \left( 1+L\lambda \right) ^{-1}. \end{aligned}$$
(3.13)

First, arguing by contraposition, suppose that there exists \({\bar{y}}\in \Lambda\) such that

$$\begin{aligned} \lambda \phi _{z}^{+}\left( {\bar{y}}\right)< d\left( {\bar{y}},\Xi \left( \phi ,z\right) \right) \ \ \ \text {and} \ \ \ \phi _{z}^{+}\left( {\bar{y}}\right) < \epsilon \left( 1+L\lambda \right) ^{-1}. \end{aligned}$$
(3.14)

It is obvious, by choosing a suitable \(r> 1\), that the following inequalities hold

$$\begin{aligned} \delta< d\left( {\bar{y}},\Xi \left( \phi ,z\right) \right) \ \ \ \text {and} \ \ \ \phi _{z}^{+}\left( {\bar{y}}\right) < \epsilon \left( 1+rL\lambda \right) ^{-1} \end{aligned}$$

with \(\delta =r \lambda \phi _{z}^{+}\left( {\bar{y}}\right)\). Now, observing that

$$\begin{aligned} \phi _{z}^{+}\left( {\bar{y}}\right) \le {\displaystyle \inf _{y\in \Lambda }} \ \phi _{z}^{+}\left( y\right) + \delta \left( r\lambda \right) ^{-1}, \end{aligned}$$

one can deduce that

$$\begin{aligned} \psi \left( {\bar{y}}\right) \le {\displaystyle \inf _{y\in \Lambda }} \ \psi \left( y\right) + \gamma \end{aligned}$$

with \(\psi \left( y\right) = \phi _{z}^{+}\left( y\right) +\delta _{\Lambda }\left( y\right)\), where \(\delta _{\Lambda }\) is the indicator function of the set \(\Lambda\) and \(\gamma =\delta \left( r\lambda \right) ^{-1}\). Hence, applying Ekeland's variational principle, we find \(v\in \Lambda\) such that

$$\begin{aligned} \left\{ \begin{array}{l} \parallel v-{\bar{y}} \parallel \le \delta ,\\ \psi \left( v\right) \le \psi \left( y\right) + \left( r\lambda \right) ^{-1} \parallel y-v \parallel \ \ \ \text {for all} \ \ y\in \Lambda . \end{array} \right. \end{aligned}$$
(3.15)

Hence, v is a minimizer of the function \(y \longmapsto \psi \left( y\right) +\left( r\lambda \right) ^{-1} \parallel y-v \parallel\), and we get, by the generalized Fermat rule and the subdifferential sum rule, that

$$\begin{aligned} 0\in \partial \phi _{z}^{+}\left( v\right) + N\left( v, \Lambda \right) +\left( r\lambda \right) ^{-1} {\mathbb {B}}_{m}. \end{aligned}$$
(3.16)

In view of [22, Theorem 1.97 and Corollary 3.43], it follows that

$$\begin{aligned} \partial \phi _{z}^{+}\left( v\right) \ \subset \ {\displaystyle \bigcup _{y^{*}\in N\left( \phi \left( v\right) ,z-{\mathbb {R}}_{+}^{q}\right) }} \ \partial \left\langle y^{*},\phi \right\rangle \left( v\right) . \end{aligned}$$

Consequently, there exist \(y^{*}\in N\left( \phi \left( v\right) ,z-{\mathbb {R}}_{+}^{q}\right)\) and \(\varsigma \in \partial \left\langle y^{*},\phi \right\rangle \left( v\right) + N\left( v,\Lambda \right)\) such that (3.16) yields

$$\begin{aligned} \parallel \varsigma \parallel \le \left( r\lambda \right) ^{-1} < \lambda ^{-1}. \end{aligned}$$

According to (3.14), (3.15), and \(v\in \Lambda\), we have \(v\notin \Xi \left( \phi ,z \right)\). Consequently, \(0< \phi _{i}\left( v\right) -z_{i}\) for some i. On the other hand, since \(\parallel v-{\bar{y}} \parallel \le \delta\), the condition (3.14) guarantees that

$$\begin{aligned} \begin{array}{lcl} \phi _{z}^{+}\left( v\right) &{} \le &{} \phi _{z}^{+}\left( {\bar{y}}\right) + L \parallel v-{\bar{y}} \parallel , \\ &{} \le &{} \phi _{z}^{+}\left( {\bar{y}}\right) + L \delta , \\ &{} = &{} \phi _{z}^{+}\left( {\bar{y}}\right) \left( 1+Lr\lambda \right) , \\ &{} < &{} \epsilon \left( 1+Lr\lambda \right) ^{-1}\left( 1+Lr\lambda \right) , \\ &{} = &{} \epsilon . \end{array} \end{aligned}$$

Since \(\phi _{i}\left( v\right) -z_{i} \le \phi _{z}^{+}\left( v\right)\), we deduce that \(\parallel \varsigma \parallel \le \left( r\lambda \right) ^{-1} < \lambda ^{-1}\) and \(0< \phi _{i}\left( v\right) -z_{i} < \epsilon\), which contradicts (3.12) and justifies the required estimate (3.13).

Secondly, taking \(\phi \left( y\right) =f\left( x,y\right)\), \(\Lambda =Y\left( x\right)\), \(z\in \varPhi \left( x\right)\), and observing that

$$\begin{aligned} \Xi \left( \phi ,z\right) =\{y\in Y\left( x\right) :\; f\left( x,y\right) \le z \} \ \subset \ S\left( x\right) , \end{aligned}$$

it holds that

$$\begin{aligned} \begin{array}{lcll} d\left( y,S\left( x\right) \right) &{} \le &{} d\left( y,\Xi \left( \phi ,z\right) \right) , &{} \\ &{} \le &{} \lambda d\left( f\left( x,y\right) , z-{\mathbb {R}}_{+}^{q}\right) , &{} \\ &{} \le &{} \lambda d\left( f\left( x,y\right) , z\right) . \end{array} \end{aligned}$$

Since z is arbitrary in \(\varPhi \left( x\right)\), we obtain \(d\left( y,S\left( x\right) \right) \le \lambda d\left( f\left( x, y\right) , \varPhi \left( x\right) \right) .\) \(\square\)

Fig. 1

Here, Linear CQ refers to the assumptions in Proposition 3.4 or Proposition 3.5, while NonLinear CQ represents the collection of assumptions in Theorem 3.6. As for UWSM (resp. LUWSM), it is the abbreviation used for the uniform weak sharp minimum (resp. local uniform weak sharp minimum) condition. Finally, GVFCQ denotes the generalized value function constraint qualification, and RRCQ corresponds to the R-regularity (2.4) of the set-valued mapping S in (1.2) associated with problem (MUL)–(L[x])

To conclude this section, we provide another sufficient condition for the LUWSM condition based on the R-regularity concept introduced in Subsection 2.1. To proceed, observe that the lower-level efficient solution mapping S (1.2) can be rewritten as

$$\begin{aligned} S(x)=\left\{ y\in {\mathbb {R}}^m:\;\; d\left( f\left( x,y\right) ,\varPhi \left( x\right) \right) \le 0, \;\; d\left( \left( x,y\right) ,\text {gph} \ Y\right) \le 0\right\} . \end{aligned}$$

Hence, we will say that the R-regularity constraint qualification (RRCQ) holds at the point \(\left( {\bar{x}}, {\bar{y}}\right) \in \text {gph} \ S\) if S is R-regular (2.4) at \(\left( {\bar{x}}, {\bar{y}}\right)\) w.r.t. \(\text {dom} \ S\).

Proposition 3.7

If RRCQ holds at \(\left( {\bar{x}}, {\bar{y}}\right)\) and there is some neighborhood \(U\subset {\mathbb {R}}^{n}\) of \({\bar{x}}\) such that \(\text {dom} \ Y\cap U= \text {dom} \ S\cap U\), then LUWSM is satisfied at \(\left( {\bar{x}}, {\bar{y}}\right)\).

Proof

Fix \(\left( {\bar{x}}, {\bar{y}}\right) \in \text {gph} \ S\). Since the mapping S is R-regular at \(\left( {\bar{x}}, {\bar{y}}\right)\) w.r.t. \(\text {dom} \ S\), there exist \(\sigma > 0\) and \(\epsilon > 0\) such that for all \(\left( x, y\right) \in {\mathbb {U}}_{\epsilon }\left( {\bar{x}}, {\bar{y}}\right) \cap \left( \text {dom} \ S \times {\mathbb {R}}^{m}\right)\) we have the inequality

$$\begin{aligned} d\left( y,S\left( x\right) \right) \ \le \ \sigma \ \max \{0,d\left( f\left( x,y\right) ,\varPhi \left( x\right) \right) ,d\left( \left( x,y\right) ,\text {gph} \ Y\right) \}. \end{aligned}$$

Since distance functions are nonnegative, for any \(\left( x,y\right)\) with \(y\in Y\left( x\right)\), i.e., \(d\left( \left( x,y\right) ,\text {gph} \ Y\right) = 0\), the maximum on the right-hand side of this inequality reduces to \(d\left( f\left( x,y\right) ,\varPhi \left( x\right) \right)\). Hence, for all \(\left( x,y\right) \in {\mathbb {U}}_{\epsilon }\left( {\bar{x}}, {\bar{y}}\right) \cap \left( \text {dom} \ S \times {\mathbb {R}}^{m}\right)\),

$$\begin{aligned} y\in Y\left( x\right) \Longrightarrow d\left( y,S\left( x\right) \right) \ \le \ \sigma \ d\left( f\left( x,y\right) ,\varPhi \left( x\right) \right) . \end{aligned}$$

On the other hand, shrinking \(\epsilon\) if necessary so that \({\mathbb {U}}_{\epsilon }\left( {\bar{x}}\right) \subset U\), the assumption \(\text {dom} \ Y\cap U= \text {dom} \ S\cap U\) ensures that any \(\left( x,y\right) \in {\mathbb {U}}_{\epsilon }\left( {\bar{x}}, {\bar{y}}\right)\) with \(y\in Y\left( x\right)\) satisfies \(x\in \text {dom} \ S\). Dividing the last inequality by \(\sigma\), we thus obtain the LUWSM condition at \(\left( {\bar{x}}, {\bar{y}}\right)\) with \(\lambda =\sigma ^{-1}\). \(\square\)

Finally, note that the relationships between all the constraint qualifications discussed above are summarized in Fig. 1.

4 Necessary optimality conditions

Our aim in this section is to use the GVFCQ, introduced and studied in the previous section, to derive necessary optimality conditions for problem (MUL)–(L[x]). To proceed, we consider the set

$$\begin{aligned} \Pi :=\left( X\times {\mathbb {R}}^{m}\right) \cap \text {gph} \ S \subset {\mathbb {R}}^{n}\times {\mathbb {R}}^{m} \end{aligned}$$
(4.1)

and the set-valued mapping

$$\begin{aligned} \Sigma \left( x\right) := f\left( x,Y\left( x\right) \right) := \left\{ f\left( x,y\right) : \;\, y\in Y\left( x\right) \right\} \;\, \text{ for } \;\, x\in {\mathbb {R}}^{n}. \end{aligned}$$
(4.2)

In the process, the upper estimates for coderivatives of the optimal solution set-valued mapping S and the frontier map \(\varPhi\) will also be useful. To specifically compute an estimate of the coderivative of \(\varPhi ^{E}\) (see (1.1) and related discussion), we additionally need the following strong domination property for the lower-level problem (L[x]):

$$\begin{aligned} f\left( x,Y\left( x\right) \right) \subset \varPhi ^{E}\left( x\right) + {\mathbb {R}}_{+}^{q} \ \ \ \ \ \forall x\in X. \end{aligned}$$
(4.3)

This property has been used in the literature under different names; for example, it is used in [26], where it is called the \({\mathbb {R}}_{+}^{q}\)-minicomplete property, and utilized to estimate the contingent derivative of the set-valued mapping \(\Sigma\). However, we borrow our vocabulary from the following weaker domination property used in [17]:

$$\begin{aligned} f\left( x,Y\left( x\right) \right) +{\mathbb {R}}_{+}^{q} = \varPhi ^{E}\left( x\right) + {\mathbb {R}}_{+}^{q}. \end{aligned}$$

In addition, for the necessary optimality conditions derived at the end of this section, we will need the following limiting qualification condition at a reference point \(\left( {\bar{x}},{\bar{y}}\right)\):

$$\begin{aligned} D^{*}S\left( {\bar{x}},{\bar{y}}\right) \left( 0\right) \cap \left( -N\left( {\bar{x}}, X \right) \right) =\{0\}. \end{aligned}$$
(4.4)

Theorem 4.1

Let \(\left( {\bar{x}},{\bar{y}}\right) \in \text {gph} \ S\) and set \({\bar{z}}=f\left( {\bar{x}},{\bar{y}}\right)\). Suppose that f is locally Lipschitzian around \(\left( {\bar{x}},{\bar{y}}\right)\), that the graph of the image map \(\Sigma\) is locally compact around \({\bar{x}}\), that the graph of Y is locally closed around \(\left( {\bar{x}},{\bar{y}}\right)\), and that the strong domination property (4.3) is satisfied. Suppose in addition that Y is Lipschitz-like around \(\left( {\bar{x}},{\bar{y}}\right)\). Then, it holds that

$$\begin{aligned} D^{*}\varPhi ^{E}\left( {\bar{x}},{\bar{z}}\right) \left( z^{*}\right) \subset {\displaystyle \bigcup _{\left( x^{*},y^{*}\right) \in D^{*}f\left( {\bar{x}},{\bar{y}}\right) \left( z^{*}\right) }} \ [x^{*}+D^{*} Y\left( {\bar{x}},{\bar{y}}\right) \left( y^{*}\right) ], \ \ \ \text {for all} \ z^{*}\in {\mathbb {R}}^{q} \end{aligned}$$
(4.5)

and \(\varPhi ^{E}\) is Lipschitz-like around \(\left( {\bar{x}},{\bar{z}}\right)\). Furthermore, if the function f is strictly differentiable at \(\left( {\bar{x}},{\bar{y}}\right)\), then for any \(z^{*}\in {\mathbb {R}}^{q}\), we have

$$\begin{aligned} D^{*}\varPhi ^{E}\left( {\bar{x}},{\bar{z}}\right) \left( z^{*}\right) \subset \nabla _{x}f\left( {\bar{x}},{\bar{y}}\right) ^{*} z^{*}+ D^{*} Y\left( {\bar{x}},{\bar{y}}\right) \left( \nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^{*} z^{*}\right) . \end{aligned}$$

Proof

First, observe that the image map \(\Sigma\) in (4.2) has a composite form. Hence, applying to this composition the coderivative chain rule from [22, Theorem 3.18(i)] for the locally Lipschitzian cost mapping \(f\left( x,y\right)\), we get

$$\begin{aligned} D^{*}\Sigma \left( {\bar{x}},{\bar{z}}\right) \left( z^{*}\right) \subset {\displaystyle \bigcup _{\left( x^{*},y^{*}\right) \in D^{*}f\left( {\bar{x}},{\bar{y}}\right) \left( z^{*}\right) }} \ [x^{*}+D^{*} Y\left( {\bar{x}},{\bar{y}}\right) \left( y^{*}\right) ], \ \ \ z^{*}\in {\mathbb {R}}^{q}. \end{aligned}$$
(4.6)

Fix \(z^{*}\in {\mathbb {R}}^{q}\) and let us prove that \(D^{*}\varPhi ^{E}\left( {\bar{x}},{\bar{z}}\right) \left( z^{*}\right) \subset D^{*}\Sigma \left( {\bar{x}},{\bar{z}}\right) \left( z^{*}\right)\). Let \(x^{*}\in D^{*}\varPhi ^{E}\left( {\bar{x}},{\bar{z}}\right) \left( z^{*}\right)\). Based on (2.2), there are sequences \(\begin{array}{c} {\left( x_{k},z_{k}\right) \overset{\text {gph} \ \varPhi ^{E}}{\rightarrow }\left( {\bar{x}},{\bar{z}}\right) } \end{array}\) and \(\left( x^{*}_{k},z^{*}_{k}\right) \rightarrow \left( x^{*},z^{*} \right)\) such that, for each \(k\in {\mathbb {N}}\),

$$\begin{aligned} {\displaystyle \limsup _{\begin{array}{c} {\left( x_{k_{s}},z_{k_{s}}\right) \overset{\text {gph} \ \varPhi ^{E}}{\rightarrow }\left( x_{k},z_{k}\right) } \end{array}}} \ \frac{\left\langle x^{*}_{k},x_{k_{s}}-x_{k}\right\rangle -\left\langle z^{*}_{k},z_{k_{s}}-z_{k}\right\rangle }{\parallel x_{k_{s}}-x_{k}\parallel + \parallel z_{k_{s}}-z_{k}\parallel } \le 0. \end{aligned}$$

We claim that in some neighborhood U of \(\left( {\bar{x}},{\bar{z}}\right)\) for any \(\left( x_{k_{s}},z_{k_{s}}\right) \in U\) such that

$$\begin{aligned} \begin{array}{c} {\left( x_{k_{s}},z_{k_{s}}\right) \overset{\text {gph} \ \Sigma }{\rightarrow }\left( x_{k},z_{k}\right) } \end{array}, \;\, \text{ one } \text{ has } \;\,\begin{array}{c} {\left( x_{k_{s}},z_{k_{s}}\right) \overset{\text {gph} \ \varPhi ^{E}}{\rightarrow }\left( x_{k},z_{k}\right) } \end{array}. \end{aligned}$$

Indeed, suppose, contrary to our claim, that there exists

$$\begin{aligned} \left( x_{k_{s}},z_{k_{s}}\right) \in \text {gph} \ \Sigma \setminus \text {gph} \ \varPhi ^{E}\; \text{ such } \text{ that } \;\left( x_{k_{s}},z_{k_{s}}\right) \rightarrow \left( x_{k},z_{k}\right) . \end{aligned}$$

It follows immediately from the strong domination property (4.3) that \(\left( x_{k_{s}},z_{k_{s}}\right) \in \text {epi} \ \varPhi ^{E}\). Since the graph of \(\Sigma\) is locally compact around \({\bar{x}}\), it follows from [19, Proposition 4.3 (iv)] that \(\varPhi ^{E}\) is order semicontinuous around \(\left( {\bar{x}},{\bar{z}}\right)\). Hence, for \(\left( x_{k_{s}},z_{k_{s}}\right) \in \text {epi} \ \varPhi ^{E}\), there exists a sequence \(\left( x_{k_{s}},t_{k_{s}}\right) \in \text {gph} \ \varPhi ^{E}\) such that \(z_{k_{s}} \in t_{k_{s}} +{\mathbb {R}}_{+}^{q}\). Then applying [18, Theorem 1.3], we get a contradiction, taking into account that \(D^{*}\Sigma \left( {\bar{x}},{\bar{z}}\right) \left( 0\right) =\{0\}\), which results from the Lipschitz-likeness of Y and the inclusion in (4.6). The above arguments ensure that \(x^{*}\in D^{*}\Sigma \left( {\bar{x}},{\bar{z}}\right) \left( z^{*}\right)\). Combining this with (4.6), the desired result follows. \(\square\)

Note that our formula in (4.5) is the same as the one obtained in [17]. However, in the latter reference, it is required that \(z^*\) be in the interior of the corresponding cone; such a requirement is very restrictive and would not make it possible to construct the necessary optimality conditions that represent our main goal in this section. Furthermore, under the strong domination property (4.3), the paper [27] provides an estimate of the coderivative of \(\varPhi ^{E}\) for all \(z^{*}\) in the uniformly positive polar of the cone \({\mathbb {R}}_{+}^{q}\) defined by

$$\begin{aligned} K^{*}_{up}:= \left\{ \alpha \in {\mathbb {R}}^{q}: \; \exists \beta > 0, \; \langle \alpha ,\, z\rangle \ge \beta \Vert z\Vert , \; \forall z\in {\mathbb {R}}_{+}^{q}\right\} . \end{aligned}$$
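For comparison, one can check (for instance with the Euclidean norm on \({\mathbb {R}}^{q}\)) that

$$\begin{aligned} K^{*}_{up} = \text {int} \ {\mathbb {R}}_{+}^{q} = \left\{ \alpha \in {\mathbb {R}}^{q}: \; \alpha _{i} > 0, \ i=1,\cdots ,q\right\} , \end{aligned}$$

so that the estimate in [27] excludes, in particular, \(z^{*}=0\) as well as any \(z^{*}\) with a vanishing or negative component.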

As can be seen in Theorem 4.1, our estimate of the coderivative of \(\varPhi ^{E}\) is valid at any point \(z^*\in {\mathbb {R}}^q\), thus enabling an easy derivation of optimality conditions for problem (MUL)–(L[x]), as will become clear by the end of this section. It is also important to note that a version of the strong domination property can well be defined for \(\varPhi ^W\). However, it is unclear how it would help in obtaining an estimate of the coderivative of \(\varPhi ^W\) analogous to the one derived in Theorem 4.1 for efficient Pareto points.

The next proposition gives a sufficient condition for the family of parametric linear programming problems (3.7) to satisfy the strong domination property (4.3).

Proposition 4.2

Assume that for all \(x\in X\), the set \(Y\left( x\right) =\{y\in {\mathbb {R}}^{m}: \;\, Ax+By\le d \}\) is bounded. Then, problem (3.7) satisfies the strong domination property (4.3).

Proof

Fix \(x\in X\) and let \({\bar{y}}\in Y\left( x\right)\). If \({\bar{y}}\in S\left( x\right)\), then \(C{\bar{y}}\in \varPhi ^{E}\left( x\right) \subset \varPhi ^{E}\left( x\right) +{\mathbb {R}}_{+}^{q}\), as required. Suppose now that \({\bar{y}}\notin S\left( x\right)\) and consider the set

$$\begin{aligned} {\mathcal DP}\left( x,{\bar{y}}\right) =\left\{ y\in Y\left( x\right) \left| \;\, C{\bar{y}}-Cy\in {\mathbb {R}}_{+}^{q} \right. \right\} . \end{aligned}$$

Since \(Y\left( x\right)\) is bounded, \({\mathcal DP}\left( x,{\bar{y}}\right)\) is also bounded. Hence, its support function \(\sigma \left( \cdot ,{\mathcal DP}\left( x,{\bar{y}}\right) \right)\) is defined everywhere. Now, choose \(u\in \text {int} \ {\mathbb {R}}_{-}^{q}\). Then, from [24, Corollary 23.5.3], there exists \(z\in {\mathcal DP}\left( x,{\bar{y}}\right)\) such that \(\langle C^{T}u, z \rangle = \sigma \left( C^{T}u,{\mathcal DP}\left( x,{\bar{y}}\right) \right)\). We claim that \(z\in S\left( x\right)\). Indeed, suppose, contrary to our claim, that there exists \(v\in Y\left( x\right)\) such that

$$\begin{aligned} Cv-Cz \in - {\mathbb {R}}_{+}^{q} \ \ \text {and} \ \ Cv\ne Cz. \end{aligned}$$

Or equivalently, that

$$\begin{aligned} Cv-Cz \in - {\mathbb {R}}_{+}^{q}\setminus \{0\} \ \ \text {for some} \ v\in Y\left( x\right) . \end{aligned}$$

On the one hand, \(C{\bar{y}}-Cv=C{\bar{y}}-Cz+Cz-Cv \in {\mathbb {R}}_{+}^{q}+{\mathbb {R}}_{+}^{q}{\setminus }\{0\}\subset {\mathbb {R}}_{+}^{q}\). Consequently, \(v\in {\mathcal DP}\left( x,{\bar{y}}\right)\). On the other hand, since \(u\in \text {int} \ {\mathbb {R}}_{-}^{q}\), one has \(0 < \langle u, Cv-Cz\rangle\). Thus, \(\sigma \left( C^{T}u,{\mathcal DP}\left( x,{\bar{y}}\right) \right) < \langle C^{T}u, v\rangle\), which contradicts \(v\in {\mathcal DP}\left( x,{\bar{y}}\right)\). Finally, since \(z\in {\mathcal DP}\left( x,{\bar{y}}\right)\) and \(Cz\in \varPhi ^{E}\left( x\right)\), it follows that \(C{\bar{y}} \in Cz+{\mathbb {R}}_{+}^{q} \subset \varPhi ^{E}\left( x\right) +{\mathbb {R}}_{+}^{q}\), which concludes the proof. \(\square\)

Now, we come to the final step before the statement of the main result of this section; i.e., we provide an estimate for the coderivative of the lower-level optimal solution set-valued mapping S (1.2) under the GVFCQ (3.1).

Proposition 4.3

Consider the lower-level optimal solution set-valued mapping S (1.2) and suppose that f is locally Lipschitz continuous and that the sets \(\text{ gph } \ Y\) and \(\text{ gph } \ \varPhi\) are closed. Furthermore, assume that the GVFCQ holds at \(({{\bar{x}}}, {{\bar{y}}})\). Then it holds that

$$\begin{aligned} D^*S({{\bar{x}}}, {{\bar{y}}})(y^*) \subset \underset{(u^*, v^*): \;\, u^* \in D^*\varPhi \left( {{\bar{x}}}, \, f({{\bar{x}}}, \bar{y})\right) (-v^*)}{\bigcup }\;\;\underset{(a^*, b^*)\in D^*f({{\bar{x}}}, {{\bar{y}}})(v^*)}{\bigcup } \left\{ u^* + a^* + D^*Y({{\bar{x}}},\, {{\bar{y}}})(y^* + b^*)\right\} . \end{aligned}$$

If additionally, f is strictly differentiable, then we have

$$\begin{aligned} D^*S({{\bar{x}}}, {{\bar{y}}})(y^*) \subset \underset{(u^*, v^*): \;\, u^* \in D^*\varPhi \left( {{\bar{x}}},\,f({{\bar{x}}}, \bar{y})\right) (-v^*)}{\bigcup } \left\{ u^* + \nabla _x f({{\bar{x}}}, \bar{y})^\top v^* + D^*Y({{\bar{x}}},\, {{\bar{y}}})\left( y^* + \nabla _y f({{\bar{x}}}, {{\bar{y}}})^\top v^*\right) \right\} . \end{aligned}$$

Proof

Note that the graph of S (1.2) can be written as

$$\begin{aligned} \begin{array}{l} \text{ gph } \ S = \Omega \cap \psi ^{-1}\left( \Lambda \right) \;\, \text{ with } \;\, \Omega :=\text{ gph } \ Y, \;\, \psi (x,y):= \left( \begin{array}{c} x\\ f(x,y) \end{array}\right) ,\;\, \text{ and } \;\, \Lambda :=\text{ gph } \ \varPhi . \end{array} \end{aligned}$$

Then, based on the assumptions made, it follows from [14, Theorem 4.1] that

$$\begin{aligned} N\left( \left( {{\bar{x}}}, {{\bar{y}}}\right) ;\; \text{ gph }~S\right) \subset \underset{(u^*, v^*) \in N\left( \psi ({{\bar{x}}}, {{\bar{y}}}); \; \Lambda \right) }{\bigcup }D^*\psi ({{\bar{x}}}, {{\bar{y}}})(u^*, v^*) \;\; + N\left( ({{\bar{x}}}, {{\bar{y}}}); \; \Omega \right) . \end{aligned}$$

Hence, considering the definition of the concept of coderivative in (2.2), we have

$$\begin{aligned} \begin{array}{lll} D^*S({{\bar{x}}}, {{\bar{y}}})(y^*) &{} \subset &{} \underset{(u^*, v^*) \in N\left( \psi ({{\bar{x}}}, {{\bar{y}}}); \; \Lambda \right) }{\bigcup }\left\{ x^*\in {\mathbb {R}}^n\left| \; (x^*, -y^*)\in N\left( ({{\bar{x}}}, {{\bar{y}}}); \; \Omega \right) + D^*\psi ({{\bar{x}}}, {{\bar{y}}})(u^*, v^*) \right. \right\} \\ &{} \subset &{} \underset{(u^*, v^*) \in N\left( \psi ({{\bar{x}}}, {{\bar{y}}}); \; \Lambda \right) }{\bigcup }\left\{ x^*\in {\mathbb {R}}^n\left| \; (x^*-u^*, -y^*)\in N\left( ({{\bar{x}}}, {{\bar{y}}}); \; \Omega \right) + D^*f({{\bar{x}}}, {{\bar{y}}})(v^*) \right. \right\} \\ &{} \subset &{} \underset{(u^*, v^*) \in N\left( \psi ({{\bar{x}}}, {{\bar{y}}}); \; \Lambda \right) }{\bigcup }\left\{ x^*\in {\mathbb {R}}^n\left| \;\exists (a^*, b^*)\in D^*f({{\bar{x}}}, {{\bar{y}}})(v^*):\right. \right. \\ &{} &{}\qquad \qquad \qquad \qquad \qquad \qquad \left. \left. x^* - u^* - a^* \in D^*Y({{\bar{x}}},\, {{\bar{y}}})(y^* + b^*) \right. \right\} \\ \end{array} \end{aligned}$$

with the second inclusion resulting from

$$\begin{aligned} D^*\psi ({{\bar{x}}}, {{\bar{y}}})(u^*, v^*) \subset \left( \begin{array}{c} u^*\\ 0 \end{array}\right) + D^*f({{\bar{x}}}, {{\bar{y}}})(v^*). \end{aligned}$$

Clearly, the last inclusion in the above sequence of inclusions gives the desired result for the upper bound of \(D^*S({{\bar{x}}},\,\bar{y})(y^*)\) when f is locally Lipschitz continuous. The case where f is strictly differentiable obviously follows from \(D^*f({{\bar{x}}}, {{\bar{y}}})(v^*)=\nabla f({{\bar{x}}}, {{\bar{y}}})^\top v^*\). \(\square\)

What is notable about this result is not the construction process of the proof, which is not necessarily new, but its reliance on the GVFCQ and the corresponding rich set of sufficient conditions provided in the previous section. Such an approach does not seem to have been used in the literature to construct an estimate of the coderivative of the optimal solution set-valued mapping of a parametric multiobjective optimization problem.

We are now ready to state one of the main results of this paper, providing new necessary optimality conditions for the multiobjective bilevel optimization problem (MUL)–(L[x]).

Theorem 4.4

Let \(({{\bar{x}}}, {{\bar{y}}})\) be a local efficient/weakly efficient Pareto point for problem (MUL)–(L[x]). We assume that the functions F and f are Lipschitz continuous around \(({{\bar{x}}}, {{\bar{y}}})\) and suppose that X, \(\text{ gph }~S\), \(\text{ gph }~Y\), and \(\text{ gph }~\varPhi\) are closed sets. Furthermore, assume that the GVFCQ holds at \(({{\bar{x}}}, {{\bar{y}}})\) and that the qualification condition (4.4) is satisfied there. Then, there exist vectors \(v^{*}\in {\mathbb {R}}^{q}\) and \(w^{*}\in {{\mathbb {R}}}_{+}^{p}\) with \(\Vert w^{*}\Vert =1\) such that

$$\begin{aligned} \begin{array}{l} 0 \in \partial \langle w^{*},\; F\rangle \left( {\bar{x}},{\bar{y}}\right) + \partial \langle v^{*},\; f\rangle \left( {\bar{x}},{\bar{y}}\right) + D^{*}\varPhi \left( {\bar{x}}, f({{\bar{x}}}, {{\bar{y}}})\right) \left( -v^{*}\right) \times \{0\}\\ \qquad \qquad \qquad \qquad \qquad \qquad \qquad + \;\, N\left( \left( {\bar{x}},{\bar{y}}\right) ;\; \text{ gph }~Y\right) + N\left( {\bar{x}};\; X\right) \times \{0\}. \end{array} \end{aligned}$$
(4.7)

If \(\varPhi = \varPhi ^E\) in (4.7) and, additionally, gph\(\Sigma\) is locally compact around \({\bar{x}}\), Y is locally closed and Lipschitz-like around \(\left( {\bar{x}},{\bar{y}}\right)\), and the strong domination property (4.3) is satisfied, then there exist vectors \(v^{*}\in {\mathbb {R}}^{q}\), \(\left( \alpha ^{*},\beta ^{*}\right) \in \partial \langle - v^*, \; f\rangle \left( {\bar{x}},{\bar{y}}\right)\), and \(w^{*}\in {{\mathbb {R}}}_{+}^{p}\) with \(\Vert w^{*}\Vert =1\) such that

$$\begin{aligned} \begin{array}{l} \left( -\alpha ^{*},\, 0\right) \in \partial \langle w^{*}, F\rangle \left( {\bar{x}},{\bar{y}}\right) + \partial \langle v^*, f\rangle \left( {\bar{x}},{\bar{y}}\right) + D^{*}Y\left( {\bar{x}},{\bar{y}}\right) \left( \beta ^*\right) \times \{0\}\\ \qquad \qquad \qquad \qquad \qquad \qquad \qquad + \;\, N\left( \left( {\bar{x}},{\bar{y}}\right) ;~\text {gph}~Y\right) + N\left( {\bar{x}};\; X\right) \times \{0\}. \end{array} \end{aligned}$$
(4.8)

If additionally, F and f are strictly differentiable at the point \(({{\bar{x}}}, {{\bar{y}}})\), then there exist vectors \(v^{*}\in {\mathbb {R}}^{q}\) and \(w^{*}\in {{\mathbb {R}}}_{+}^{p}\) with \(\Vert w^{*}\Vert =1\) such that we have

$$\begin{aligned} \begin{array}{lll} 0 \in \nabla _x F\left( {\bar{x}}, {\bar{y}}\right) ^\top w^{*} &{} + &{} D^{*}Y\left( {\bar{x}},{\bar{y}}\right) \left( -\nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\right) \\ &{} + &{} D^{*}Y\left( {\bar{x}},{\bar{y}}\right) \left( \nabla _{y}F\left( {\bar{x}},{\bar{y}}\right) ^\top w^{*} + \nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\right) + N\left( {{\bar{x}}}; \; X\right) . \end{array} \end{aligned}$$
(4.9)

Proof

We start by noting that, based on (4.1), problem (MUL)–(L[x]) can be rewritten as

$$\begin{aligned} {\displaystyle \min _{x,y}} \ F\left( x,y\right) :=\left( F_{1}\left( x,y\right) ,\cdots ,F_{p}\left( x,y\right) \right) ^{T} \;\; \text{ s.t. } \;\; (x, y) \in \Pi . \end{aligned}$$

Since the set \(\Pi\) is closed, as the intersection of two closed sets, and the function F is Lipschitz continuous around the point \(({{\bar{x}}}, {{\bar{y}}})\), which is a local efficient/weakly efficient Pareto point for problem (MUL)–(L[x]), there exists \(w^*\in {\mathbb {R}}^p_+\) with \(\Vert w^*\Vert =1\) such that we have

$$\begin{aligned} 0 \in \partial \langle w^*, \; F\rangle ({{\bar{x}}}, {{\bar{y}}}) + N\left( ({{\bar{x}}}, {{\bar{y}}}); \;\Pi \right) , \end{aligned}$$
(4.10)

according to [2, Theorem 5.3]. Hence, it suffices now to calculate an upper estimate of the normal cone to \(\Pi\). Using the intersection rule from [23, Corollary 3.5], one has

$$\begin{aligned} N\left( \left( {\bar{x}},{\bar{y}}\right) ;\;\Pi \right) \subset N\left( \left( {\bar{x}},{\bar{y}}\right) ; \; X\times {\mathbb {R}}^{m}\right) +N\left( \left( {\bar{x}},{\bar{y}}\right) ;\; \text {gph}~S\right) \end{aligned}$$
(4.11)

as X and \(\text {gph}~S\) are locally closed around \({\bar{x}}\) and \(\left( {\bar{x}}, {\bar{y}}\right)\), respectively, and provided that

$$\begin{aligned} N\left( \left( {\bar{x}},{\bar{y}}\right) ;\; X\times {\mathbb {R}}^{m}\right) \cap \left( -N\left( \left( {\bar{x}},{\bar{y}}\right) ;\; \text {gph}~S\right) \right) =\{0\} \end{aligned}$$
(4.12)

is satisfied. Using the coderivative calculus rules, one can easily show that the fulfilment of (4.4) implies that (4.12) holds. Combining (4.10) and (4.11) then gives

$$\begin{aligned} 0\ \in \ \partial \langle w^{*}, \; F\rangle \left( {\bar{x}},{\bar{y}}\right) + N\left( \left( {\bar{x}},{\bar{y}}\right) ; \;X\times {\mathbb {R}}^{m}\right) +N\left( \left( {\bar{x}},{\bar{y}}\right) ; \;\text {gph} \ S\right) . \end{aligned}$$
(4.13)

Hence, there exist \(\left( x^{*}, y^{*}\right) \in \partial \langle w^{*}, F\rangle \left( {\bar{x}},{\bar{y}}\right)\) and \(c^{*}\in N\left( {\bar{x}}; \;X\right)\) such that

$$\begin{aligned} \left( -x^{*}-c^{*},-y^{*}\right) \in N\left( \left( {\bar{x}},{\bar{y}}\right) ;\; \text {gph} \ S\right) \;\, \text{ or } \text{ equivalently, } \;\, -x^{*}-c^{*} \in D^{*}S\left( {\bar{x}},{\bar{y}}\right) \left( y^{*}\right) . \end{aligned}$$
(4.14)

Thanks to the upper estimate of the coderivative of the optimal solution set-valued mapping S from Proposition 4.3, we can find vectors \(u^{*}\), \(v^{*}\), \(a^{*}\), and \(b^{*}\) such that

$$\begin{aligned} \left. \begin{array}{r} u^{*} \in D^{*}\varPhi \left( {\bar{x}}, \,f({{\bar{x}}}, {{\bar{y}}})\right) \left( -v^{*}\right) \\ \left( a^{*},b^{*}\right) \in D^{*}f\left( {\bar{x}},{\bar{y}}\right) \left( v^{*}\right) \\ -x^{*}-c^{*} \in u^{*}+a^{*}+D^{*}Y\left( {\bar{x}}, {\bar{y}}\right) \left( y^{*}+b^{*}\right) \end{array}\right\} \end{aligned}$$
(4.15)

given that \(\text{ gph }~Y\) and \(\text{ gph }~\varPhi\) are closed, and f is Lipschitz continuous around \(({{\bar{x}}}, {{\bar{y}}})\). Then, combining (4.13), (4.14), and (4.15), we immediately arrive at (4.7).
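For the reader's convenience, this last combination can be spelled out as follows. Since f is Lipschitz continuous around \(({{\bar{x}}}, {{\bar{y}}})\), the coderivative scalarization formula gives \(D^{*}f\left( {\bar{x}},{\bar{y}}\right) \left( v^{*}\right) = \partial \langle v^{*}, \; f\rangle \left( {\bar{x}},{\bar{y}}\right)\), so that \(\left( a^{*},b^{*}\right) \in \partial \langle v^{*}, \; f\rangle \left( {\bar{x}},{\bar{y}}\right)\). Moreover, the last line of (4.15) provides some \(s^{*}\in D^{*}Y\left( {\bar{x}}, {\bar{y}}\right) \left( y^{*}+b^{*}\right)\), i.e., \(\left( s^{*}, -y^{*}-b^{*}\right) \in N\left( \left( {\bar{x}},{\bar{y}}\right) ;\; \text {gph}~Y\right)\), with \(-x^{*}-c^{*}=u^{*}+a^{*}+s^{*}\). Hence,

$$\begin{aligned} \left( 0, 0\right) = \left( x^{*}, y^{*}\right) + \left( a^{*}, b^{*}\right) + \left( u^{*}, 0\right) + \left( s^{*}, -y^{*}-b^{*}\right) + \left( c^{*}, 0\right) , \end{aligned}$$

where the five terms on the right-hand side belong, respectively, to \(\partial \langle w^{*}, \; F\rangle \left( {\bar{x}},{\bar{y}}\right)\), \(\partial \langle v^{*}, \; f\rangle \left( {\bar{x}},{\bar{y}}\right)\), \(D^{*}\varPhi \left( {\bar{x}}, f({{\bar{x}}}, {{\bar{y}}})\right) \left( -v^{*}\right) \times \{0\}\), \(N\left( \left( {\bar{x}},{\bar{y}}\right) ;\; \text {gph}~Y\right)\), and \(N\left( {\bar{x}};\; X\right) \times \{0\}\); this is exactly the inclusion (4.7).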

As for the inclusion in (4.8), it follows from Theorem 4.1 that, under the assumptions made, one has the following upper estimate for the coderivative of the frontier map \(\varPhi ^{E}\) with respect to the local Pareto optimality concept:

$$\begin{aligned} D^{*}\varPhi ^{E}\left( {\bar{x}},{\bar{z}}\right) \left( -v^{*}\right) \subseteq {\displaystyle \bigcup _{\left( \alpha ^{*},\beta ^{*}\right) \in D^{*}f\left( {\bar{x}},{\bar{y}}\right) \left( -v^{*}\right) }} \bigg [ \alpha ^{*}+D^{*}Y\left( {\bar{x}},{\bar{y}}\right) \left( \beta ^{*}\right) \bigg ] \end{aligned}$$
(4.16)

Substituting (4.16) into (4.7), it follows that we can find \(\left( \alpha ^{*},\beta ^{*}\right) \in D^{*}f\left( {\bar{x}},{\bar{y}}\right) \left( -v^{*}\right)\) such that we have (4.8), which leads to (4.9) under the additional differentiability assumptions. \(\square\)
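For completeness, let us record how (4.9) follows from (4.8) when F and f are strictly differentiable at \(({{\bar{x}}}, {{\bar{y}}})\). In that case, \(\partial \langle w^{*}, \; F\rangle \left( {\bar{x}},{\bar{y}}\right) =\left\{ \nabla F\left( {\bar{x}},{\bar{y}}\right) ^\top w^{*}\right\}\), \(\partial \langle v^{*}, \; f\rangle \left( {\bar{x}},{\bar{y}}\right) =\left\{ \nabla f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\right\}\), \(\alpha ^{*}=-\nabla _{x}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\), and \(\beta ^{*}=-\nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\). Writing the element of \(N\left( \left( {\bar{x}},{\bar{y}}\right) ;~\text {gph}~Y\right)\) appearing in (4.8) as \(\left( x^{*}_{Y}, y^{*}_{Y}\right)\), so that \(x^{*}_{Y}\in D^{*}Y\left( {\bar{x}},{\bar{y}}\right) \left( -y^{*}_{Y}\right)\), the \(y\)-component of (4.8) reads

$$\begin{aligned} 0 = \nabla _{y}F\left( {\bar{x}},{\bar{y}}\right) ^\top w^{*} + \nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*} + y^{*}_{Y}, \end{aligned}$$

so that \(x^{*}_{Y}\in D^{*}Y\left( {\bar{x}},{\bar{y}}\right) \left( \nabla _{y}F\left( {\bar{x}},{\bar{y}}\right) ^\top w^{*} + \nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\right)\), while the \(x\)-component reads

$$\begin{aligned} \nabla _{x}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*} \in \nabla _{x}F\left( {\bar{x}},{\bar{y}}\right) ^\top w^{*} + \nabla _{x}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*} + D^{*}Y\left( {\bar{x}},{\bar{y}}\right) \left( -\nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\right) + x^{*}_{Y} + N\left( {\bar{x}};\; X\right) . \end{aligned}$$

Cancelling \(\nabla _{x}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\) on both sides of the latter inclusion gives precisely (4.9).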

Remark 4.5

Recall that the CQ (4.4) is automatically satisfied at \(\left( {\bar{x}},{\bar{y}}\right)\) provided that problem (MUL)–(L[x]) has no upper-level constraints (i.e., \(X ={\mathbb {R}}^{n}\)) or the lower-level optimal solution set-valued mapping S is Lipschitz-like around \(\left( {\bar{x}},{\bar{y}}\right)\); the latter holds, in particular, when the upper estimate of \(D^*S({{\bar{x}}}, {{\bar{y}}})(0)\) from Proposition 4.3 reduces to \(\{0\}\), thanks to the Mordukhovich criterion [22, 23]; see also [31, 32] for further details and references.

To have a clear view of the fact that the necessary optimality conditions obtained in Theorem 4.4 represent a natural extension of those from a standard optimistic bilevel optimization problem, consider problem (MUL)–(L[x]) with \(p=1\) and \(q=1\). Let \((\bar{x}, {{\bar{y}}})\) be a local optimal solution of the problem in this case. If the point satisfies the corresponding version of CQ (4.12) and F and f are strictly differentiable, then we have

$$\begin{aligned} 0\in \nabla _x F({{\bar{x}}}, {{\bar{y}}}) + D^*S({{\bar{x}}}, {{\bar{y}}})(\nabla _y F({{\bar{x}}}, {{\bar{y}}})) + N({{\bar{x}}}; \; X). \end{aligned}$$
(4.17)

This inclusion obviously coincides with (4.13) in this context where \(w^*\) reduces to 1. Secondly, if \(\varphi\) denotes the optimal value function of the corresponding parametric optimization problem (L[x]) and we additionally suppose that the function \(\varphi\) is lower semicontinuous around \({{\bar{x}}}\) and the set-valued mapping

$$\begin{aligned} \Psi _{\varphi }(v):=\left\{ \left( x,y\right) \in \text {gph} \ Y: \;\; f(x,y)-\varphi (x) + v =0\right\} \end{aligned}$$
(4.18)

is calm at \((0, {{\bar{x}}}, {{\bar{y}}})\), then condition (4.17) can be detailed further to obtain

$$\begin{aligned} 0\in \nabla F({{\bar{x}}}, {{\bar{y}}}) + \nabla f({{\bar{x}}}, {{\bar{y}}})v^* + \partial \langle -v^*, \; \varphi \rangle ({{\bar{x}}}) \times \{0\} + N(({{\bar{x}}}, {{\bar{y}}}); \; \text{ gph }~Y) + N({{\bar{x}}}; \; X)\times \{0\} \end{aligned}$$
(4.19)

for some \(v^*\in {\mathbb {R}}\). Similarly, this coincides with (4.7) for \(w^*=1\). Note that the set-valued mapping (4.18) is slightly different from (3.2), as in the latter case, we instead have an inequality on the perturbed value function constraint. Of course, using the version of the set-valued mapping in (3.2) would have led to \(v^*\ge 0\).
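To see the coincidence with (4.7) more explicitly, recall that for \(q=1\), the frontier map reduces to the (single-valued) optimal value function whenever the lower-level minimum is attained, i.e., \(\varPhi (x)=\{\varphi (x)\}\), and that, whenever \(\varphi\) is Lipschitz continuous around \({{\bar{x}}}\), the coderivative scalarization formula for single-valued Lipschitz continuous mappings yields

$$\begin{aligned} D^{*}\varPhi \left( {\bar{x}}, \varphi ({{\bar{x}}})\right) \left( -v^{*}\right) = \partial \langle -v^{*}, \; \varphi \rangle ({{\bar{x}}}). \end{aligned}$$

Inserting this identity into (4.7), with \(w^{*}=1\), \(\partial \langle w^{*}, \; F\rangle ({{\bar{x}}}, {{\bar{y}}})=\{\nabla F({{\bar{x}}}, {{\bar{y}}})\}\), and \(\partial \langle v^{*}, \; f\rangle ({{\bar{x}}}, {{\bar{y}}})=\{\nabla f({{\bar{x}}}, {{\bar{y}}})v^{*}\}\) in the strictly differentiable case, indeed produces (4.19).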

Finally, still in the case \(p=1\) and \(q=1\), if S is inner semicontinuous and has a closed graph around \(({{\bar{x}}}, {{\bar{y}}})\), and \(\varphi\) is Lipschitz continuous around \({{\bar{x}}}\), then from (4.19), we recover inclusion (4.9) with the corresponding \(w^*=1\). For more background details on the constructions and relevant concepts above, in the context of standard optimistic bilevel optimization, interested readers are referred to [32], where, unlike in (4.9), a scalarization approach is used to deal with the lower-level multiobjective problem, as is, to the best of our knowledge, the case for all previous references on necessary optimality conditions for multiobjective bilevel optimization.

5 Application to smooth constraint functionals

Let us consider the multiobjective bilevel optimization problem (MUL)–(L[x]) in the case where the upper- and lower-level feasible sets are defined by

$$\begin{aligned} X:=\left\{ x\in {\mathbb {R}}^n:\; G(x)\le 0\right\} \; \text{ and } \; Y(x):=\left\{ y\in {\mathbb {R}}^m:\; g(x, y)\le 0\right\} , \end{aligned}$$

respectively, with \(G:{\mathbb {R}}^n \longrightarrow {\mathbb {R}}^r\) and \(g:{\mathbb {R}}^n\times {\mathbb {R}}^m \longrightarrow {\mathbb {R}}^s\) being continuously differentiable functions. The upper-level regularity condition will be said to hold at \({{\bar{x}}}\) if there exists a vector \(d\in {\mathbb {R}}^n\) such that we have

$$\begin{aligned} \nabla G_i({{\bar{x}}})^\top d < 0 \;\, \text{ for } \text{ all } \;\, i\in I_G({{\bar{x}}}):=\left\{ i\in \{1, \ldots , r\}:\;\, G_i(\bar{x})=0\right\} . \end{aligned}$$

Similarly, the lower-level regularity condition will be said to be satisfied at \(({{\bar{x}}}, {{\bar{y}}})\) if there exists a vector \(d\in {\mathbb {R}}^{m}\) that verifies

$$\begin{aligned} \nabla _y g_j({{\bar{x}}}, {{\bar{y}}})^\top d < 0 \;\, \text{ for } \text{ all } \;\, j\in I_g({{\bar{x}}}, {{\bar{y}}}):=\left\{ j\in \{1, \ldots , s\}:\;\, g_j(\bar{x}, {{\bar{y}}})=0\right\} . \end{aligned}$$

Obviously, these upper- and lower-level regularity conditions correspond to the Mangasarian-Fromovitz constraint qualification for the feasible set of the corresponding (upper- or lower-) level of our problem (MUL)–(L[x]). It is well-known that under the upper- and lower-level regularity conditions, we respectively have

$$\begin{aligned} N({{\bar{x}}}; \; X) = \left\{ \nabla G({{\bar{x}}})^\top u: \; u\ge 0, \; u^\top G({{\bar{x}}})=0\right\} \end{aligned}$$

and

$$\begin{aligned} D^*Y({{\bar{x}}}, {{\bar{y}}})(y^*) = \left\{ \nabla _x g({{\bar{x}}}, {{\bar{y}}})^\top v:\; -y^*= \nabla _y g({{\bar{x}}}, {{\bar{y}}})^\top v, \; v\ge 0,\; v^\top g({{\bar{x}}}, {{\bar{y}}})=0\right\} . \end{aligned}$$
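To illustrate the latter formula on a simple hypothetical instance (not taken from the examples of this paper), let \(n=m=s=1\) and \(g(x,y):=y-x\), so that \(Y(x)=\left\{ y\in {\mathbb {R}}:\; y\le x\right\}\). At a point \(({{\bar{x}}}, {{\bar{y}}})\) with \({{\bar{y}}}={{\bar{x}}}\), the lower-level regularity condition holds (take \(d=-1\)) and the formula reads

$$\begin{aligned} D^*Y({{\bar{x}}}, {{\bar{y}}})(y^*) = \left\{ -v:\; -y^*= v, \; v\ge 0\right\} , \end{aligned}$$

that is, \(D^*Y({{\bar{x}}}, {{\bar{y}}})(y^*)=\{y^*\}\) if \(y^*\le 0\) and \(D^*Y({{\bar{x}}}, {{\bar{y}}})(y^*)=\emptyset\) otherwise. At a point with \({{\bar{y}}}<{{\bar{x}}}\), the complementarity requirement forces \(v=0\), so that \(D^*Y({{\bar{x}}}, {{\bar{y}}})(y^*)=\{0\}\) for \(y^*=0\) and \(D^*Y({{\bar{x}}}, {{\bar{y}}})(y^*)=\emptyset\) otherwise.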

Now, assume, in addition to all the assumptions of Theorem 4.4, that the upper- and lower-level regularity conditions hold at \({{\bar{x}}}\) and \(({{\bar{x}}}, {{\bar{y}}})\), respectively. Then it follows from (4.9) that there exist \(w^*\in {\mathbb {R}}^p_+\) with \(\Vert w^*\Vert =1\), \(v^*\in {\mathbb {R}}^q\), \(u\in {\mathbb {R}}^r\), and \(v\in {\mathbb {R}}^s\) satisfying the relationships

$$\begin{aligned} \nabla _y f({{\bar{x}}}, {{\bar{y}}})^\top (-v^*) + \nabla _y g({{\bar{x}}}, {{\bar{y}}})^\top v=0,\end{aligned}$$
(5.1)
$$\begin{aligned} u\ge 0, \;\; G({{\bar{x}}})\le 0, \;\; u^\top G({{\bar{x}}})=0, \end{aligned}$$
(5.2)
$$\begin{aligned} v\ge 0, \;\; g({{\bar{x}}}, {{\bar{y}}})\le 0, \;\; v^\top g({{\bar{x}}}, \bar{y})=0, \end{aligned}$$
(5.3)

and such that

$$\begin{aligned} - \nabla _x F\left( {\bar{x}}, {\bar{y}}\right) ^\top w^{*} - \nabla G({{\bar{x}}})^\top u - \nabla _x g({{\bar{x}}}, {{\bar{y}}})^\top v \in D^{*}Y\left( {\bar{x}},{\bar{y}}\right) \left( \nabla _{y}F\left( {\bar{x}},{\bar{y}}\right) ^\top w^{*} + \nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\right) . \end{aligned}$$

Then from a second application of the above coderivative formula for \(D^*Y({{\bar{x}}}, {{\bar{y}}})(y^*)\) to the latter inclusion, it follows that we can find \(w\in {\mathbb {R}}^s\), \(w^*\in {\mathbb {R}}^p_+\) with \(\Vert w^*\Vert =1\), \(u\in {\mathbb {R}}^r\), and \(v\in {\mathbb {R}}^s\) such that the relationships (5.1)–(5.3) hold together with

$$\begin{aligned} \nabla F({{\bar{x}}}, {{\bar{y}}})^\top w^* + \left[ \begin{array}{c} \nabla G({{\bar{x}}})^\top u\\ 0 \end{array}\right] + \nabla g({{\bar{x}}}, {{\bar{y}}})^\top (v + w) =0,\end{aligned}$$
(5.4)
$$\begin{aligned} w\ge 0, \;\; g({{\bar{x}}}, {{\bar{y}}})\le 0, \;\; w^\top g({{\bar{x}}}, \bar{y})=0. \end{aligned}$$
(5.5)
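In more detail, the above formula for \(D^*Y({{\bar{x}}}, {{\bar{y}}})(y^*)\), applied with \(y^{*}=\nabla _{y}F\left( {\bar{x}},{\bar{y}}\right) ^\top w^{*} + \nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*}\) to the inclusion displayed just before (5.4), provides a vector \(w\) satisfying (5.5) together with

$$\begin{aligned} \begin{array}{l} - \nabla _x F\left( {\bar{x}}, {\bar{y}}\right) ^\top w^{*} - \nabla G({{\bar{x}}})^\top u - \nabla _x g({{\bar{x}}}, {{\bar{y}}})^\top v = \nabla _x g({{\bar{x}}}, {{\bar{y}}})^\top w,\\ - \nabla _{y}F\left( {\bar{x}},{\bar{y}}\right) ^\top w^{*} - \nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*} = \nabla _y g({{\bar{x}}}, {{\bar{y}}})^\top w. \end{array} \end{aligned}$$

Rearranging the first equation yields the \(x\)-rows of (5.4), while substituting \(\nabla _{y}f\left( {\bar{x}},{\bar{y}}\right) ^\top v^{*} = \nabla _y g({{\bar{x}}}, {{\bar{y}}})^\top v\), which follows from (5.1), into the second equation yields its \(y\)-rows.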

The optimality conditions (5.1)–(5.5) are very similar to their counterparts for the standard optimistic bilevel optimization problem with scalar objective functions, as can be seen in [5, 9], for example. The corresponding conditions in the latter paper have been shown in the recent papers [12, 13, 33] to be suitable for efficiently solving the standard optimistic bilevel optimization problem. Hence, the extension of the methods in these papers to multiobjective bilevel programs will be explored in future work.

Finally, we end this section with the following illustrative example, where the lower-level problem is the linear parametric multiobjective problem from Example 3.1.

Example 5.1

Consider problem (MUL)–(L[x]), where \(F:{\mathbb {R}}^2\times {\mathbb {R}}^2 \rightarrow {\mathbb {R}}^p\) is any differentiable function and X and the lower-level problem are defined as in Example 3.1. We can easily check that for all \(x\in X\), the set \(Y(x):=\left\{ y\in {\mathbb {R}}^2|\; Ax + By\le d \right\}\) is bounded. Hence, the strong domination property (4.3) is satisfied according to Proposition 4.2. Furthermore, all the other assumptions of Theorem 4.4 hold. Therefore, for any local efficient Pareto point \(({{\bar{x}}}, {{\bar{y}}})\) of the problem, we have

$$\begin{aligned} \nabla _x F({{\bar{x}}}, {{\bar{y}}})^\top w^* - \left( \begin{array}{c} u_1\\ u_2 \end{array}\right) - \left( \begin{array}{c} v_5 + w_5\\ v_6 + w_6 \end{array}\right) =0,\\ \nabla _y F({{\bar{x}}}, {{\bar{y}}})^\top w^* + \left( \begin{array}{r} v_1+w_1 - v_2-w_2 + v_5+w_5\\ 2(v_3 + w_3) -v_4-w_4 + v_6 +w_6 \end{array}\right) =0,\\ \left( \begin{array}{r} 2v^*_1\\ v^*_2 \end{array}\right) - \left( \begin{array}{r} v_1 - v_2 + v_5\\ 2v_3 -v_4 + v_6 \end{array}\right) =0,\\ u_1\ge 0, \;\; {{\bar{x}}}_1\ge 4, \;\; u_1 ({{\bar{x}}}_1 - 4)=0,\\ u_2\ge 0, \;\; {{\bar{x}}}_2\ge 3, \;\; u_2 ({{\bar{x}}}_2 - 3)=0,\\ v\ge 0, \;\; A{{\bar{x}}} + B{{\bar{y}}} \le d, \;\; v^\top \left( A{{\bar{x}}} + B{{\bar{y}}} -d\right) =0,\\ w\ge 0, \;\; A{{\bar{x}}} + B{{\bar{y}}} \le d, \;\; w^\top \left( A{{\bar{x}}} + B{{\bar{y}}} -d\right) =0, \end{aligned}$$

for some \(u\in {\mathbb {R}}^2\), \(v\in {\mathbb {R}}^6\), \(w\in {\mathbb {R}}^6\), \(v^{*}\in {\mathbb {R}}^{2}\), and \(w^{*}\in {{\mathbb {R}}}_{+}^{p}\) with \(\Vert w^{*}\Vert =1\). Note that here, the Pareto efficient solution concept is also considered for the lower-level problem. The matrices A and B and the vector d appearing in the last two lines of this system are given in (3.11).