Abstract
We consider a quasi-variational inequality governed by a moving set. We employ the assumption that the movement of the set has a small Lipschitz constant. Under this requirement, we show that the quasi-variational inequality has a unique solution which depends Lipschitz-continuously on the source term. If the data of the problem is (directionally) differentiable, the solution map is directionally differentiable as well. We also study the optimal control of the quasi-variational inequality and provide necessary optimality conditions of strongly stationary type.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
We consider the quasi-variational inequality (QVI)
Here, V is a Hilbert space, \(A :V \rightarrow V^\star \) is a (possibly nonlinear) mapping, and \(f \in V^\star \). We will not cover the general situation of a set-valued mapping \(Q :V \rightrightarrows V\), but we restrict the treatment of (1.1) to the case in which Q(y) is a moving set, i.e.,
for some non-empty, closed and convex subset \(K \subset V\) and \(\varPhi :V \rightarrow V\). It is well-known that QVIs have many important real-world applications, we refer exemplarily to [1, 3, 5, 16] and the references therein.
The main contributions of this paper are the following.
We prove existence and uniqueness of solutions to (1.1) under a smallness assumption on the mapping \(\varPhi \), see Sect. 3.
If, additionally, the functions A and \(\varPhi \) are differentiable and if K is polyhedric, we establish the directional differentiability of the solution mapping of (1.1), see Sect. 5.
For the associated optimal control problem, we derive necessary optimality conditions of strongly stationary type, see Sect. 6.
In particular, our results are applicable if the Lipschitz constant of \(\varPhi \) is small. Let us put our work in perspective. In the following discussion, we will assume that A is \(\mu _A\)-strongly monotone and \(L_A\)-Lipschitz and that \(\varPhi \) is \(L_\varPhi \)-Lipschitz. We refer to Sect. 2 for the definitions. We further define the condition number of A via \(\gamma _A := L_A / \mu _A \ge 1\). An existence and uniqueness result for the general QVI (1.1) was given in [15, Theorem 9]. This result can be applied to the moving set case (1.2) via [14, Lemma 3.2]. One obtains the unique solvability of (1.1) under the condition
In the work [14, Corollary 2] the requirement was relaxed to
In this work, we shall show that
is sufficient for existence and uniqueness under the condition that A is the derivative of a convex function. Note that A is indeed a derivative of a convex function in many important applications. Moreover, the conditions (1.4) and (1.5) are necessary for uniqueness in the following sense: Whenever the constants \(L_\varPhi < 1\), \(0 < \mu _A \le L_A\) violate (1.4) with \(\gamma _A := L_A / \mu _A\), there exist bounded and linear operators \(A :V \rightarrow V^\star \) and \(\varPhi :V \rightarrow V\) possessing these constants such that (1.1) does not have a unique solution for every \(f \in V^\star \). If even (1.5) is violated, A can chosen to be symmetric. We refer to Theorems 3.6 and 3.7 below for the precise formulation of this result.
For a different approach to obtain uniqueness of solutions to (1.1), we refer to [10].
To our knowledge, [1] is the only contribution concerning differentiability of the solution mapping of (1.1). Their approach is based on monotonicity considerations and only the differentiability into non-negative directions is obtained. In what follows, we are able to relax the assumption required for the differentiability and we also obtain differentiability in all directions, see Theorem 5.5.
Finally, we are not aware of any contribution in which stationarity conditions for the optimal control of (1.1) are obtained.
2 Notation and preliminaries
Throughout this work, V will denote a Hilbert space. Its dual space is denoted by \(V^\star \). The radial cone, the tangent cone and the normal cone of a closed, convex set \(K \subset V\) at \(y \in K\) are given by
respectively. The critical cone of K w.r.t. \((y,\lambda ) \in K \times \mathcal {T}_K(y)^\circ \) is given by
The set K is called polyhedric at \((y,\lambda )\) if \(\mathcal {K}_K(y,\lambda ) = {\text {cl}}\{\mathcal {R}_K(y) \cap \lambda ^\perp \}\). We refer to [17] for a recent review of polyhedricity.
A mapping \(B:V \rightarrow V^\star \) is called monotone if
It is called \(\mu \)-strongly monotone if \(\mu > 0\) satisfies
If H is another Hilbert space, a mapping \(C:V \rightarrow H\) is called L-Lipschitz for some \(L \ge 0\) if
The monotonicity of an operator implies some weak lower semicontinuity.
Lemma 2.1
Let \(A :V \rightarrow V^\star \) be a monotone operator. Suppose that \(y_n \rightharpoonup y\) in V and \(A(y_n) \rightharpoonup A(y)\) in \(V^\star \). Then,
Proof
From the monotonicity of A we find
The right-hand side converges towards \(\langle A(y),y\rangle \) due to the weak convergences \(y_n \rightharpoonup y\) and \(A(y_n) \rightharpoonup A(y)\). This implies the claim. \(\square \)
In the case that A is additionally bounded and linear, the above claim can be obtained from the observation that \(y \mapsto \langle A(y),y\rangle \) is convex. This convexity does not hold in the nonlinear setting: consider \(A :\mathbb {R}\rightarrow \mathbb {R}\), \(y \mapsto \max (-\,1,\min (1,y))\).
In order to obtain unique solvability of (1.1) via contraction-type arguments, one typically requires an inequality like
for some \(L_Q \ge 0\), see, e.g., [14, Theorem 4.1]. Note that this inequality is not related to the Lipschitz continuity of the projection since the arguments of the projections in (2.1) coincide. By means of an example, we show that (2.1) does not hold for obstacle-type problems if Q is not of the moving-set type. We consider the setting \(\varOmega = (0,1)\), \(V = H_0^1(\varOmega )\)\(f = 1\), \(A = -\,\varDelta \) and
for \(\mathbb {R}\ni h \ge 0\). It is easy to check that the projection of \(A^{-1} f\) onto the set K(h) is given by
with \(t_h := \sqrt{2\,h}\) for all \(h \in [0,1/8]\). Here, we used the norm \(\Vert v\Vert _{H_0^1}^2 = \int _\varOmega |\nabla v|^2\,{\mathrm {d}}x\). Then, \(\Vert y_h\Vert _{H_0^1(\varOmega )} = C \, h^{3/4}\) for some \(C > 0\). Since \(y_0 = 0\), the mapping \(h \mapsto y_h\) is not Lipschitz at \(h = 0\). By choosing a suitable \(\varPsi :V \rightarrow \mathbb {R}\) it can be checked that \(Q = K \circ \varPsi \) violates (2.1).
3 Moving-set QVIs as VIs
In this section, we utilize the moving-set structure of Q(y) to recast the QVI (1.1) as an equivalent variational inequality (VI). This is a classical approach, see also [1, Section 2], [2, Section 5.1]. We start by defining the new solution variable \(z := y - \varPhi (y) \in K\). In order to not lose any information, we require that the function \(I - \varPhi :V \rightarrow V\) is a bijection. Hence, \(y = (I - \varPhi )^{-1}(z)\). Now, it is easy to check that (1.1) is equivalent to
and \(y = (I - \varPhi )^{-1}(z)\). This means the following: Under the assumption that \(I-\varPhi \) is a bijection, y is a solution of (1.1) if and only if \(z = (I - \varPhi )(y)\) is a solution of (3.1). In what follows, we are going to use the VI (3.1) in order to obtain information about the QVI (1.1). In the case in which both A and \(\varPhi \) are linear, such a strategy was suggested in [1, Remark 7]. We shall see that this is also viable in the fully nonlinear case.
In order to analyze (3.1) we will frequently make use of the following assumption.
Assumption 3.1
The operator \(I - \varPhi :V \rightarrow V\) is a bijection and the operator \(B := A \circ (I-\varPhi )^{-1}\) is strongly monotone and Lipschitz continuous.
Using the equivalence of (1.1) and (3.1) as well as the existence result [11, Corollary III.1.8] for (3.1), we obtain the following existence and uniqueness result for (1.1).
Theorem 3.2
Under Assumption 3.1, the QVI (1.1) has a unique solution \(y \in V\) for any \(f \in V^\star \). Moreover, the mapping \(f \mapsto y\) is Lipschitz continuous.
In the remainder of this section, we give some conditions implying Assumption 3.1.
Lemma 3.3
We assume that A is \(\mu _A\)-strongly monotone and \(L_A\)-Lipschitz and that \(\varPhi \) is \(L_\varPhi \)-Lipschitz. We further assume that
where \(\gamma _A = L_A / \mu _A\). Then, the operator \(B := A \circ (I - \varPhi )^{-1}\) is \(\mu _B\)-strongly monotone and \(L_B\)-Lipschitz with
In particular, Assumption 3.1 is satisfied and (1.1) has a unique solution for every \(f \in V^\star \).
Proof
First we remark that we have \(L_\varPhi < \gamma _A^{-1} \le 1\). Thus, Banach’s fixed point theorem implies that \(I-\varPhi \) is a bijection. We claim that \((1-L_\varPhi )^{-1}\) is a Lipschitz constant of \((I-\varPhi )^{-1}\). Indeed, let \(y_1, y_2 \in V\) be arbitrary. We define \(x_i := (I-\varPhi )^{-1}(y_i)\) for \(i = 1,2\). Then, \(x_i - y_i = \varPhi (x_i)\), \(i = 1,2\) and this yields
This shows the claim concerning a Lipschitz constant of \((I-\varPhi )^{-1}\). Moreover, this directly shows that \(L_B\) is a Lipschitz constant of B.
For arbitrary \(y_1, y_2 \in V\) we again use \(x_i := (I-\varPhi )^{-1}(y_i)\) for \(i = 1,2\). Then,
The estimate
yields the assertion concerning the strong monotonicity of B. The final claim follows from Theorem 3.2. \(\square \)
We recall that the condition (3.2) was used in [14, Corollary 2] to obtain existence and uniqueness for solutions of (3.1). The above analysis shows, that this condition even implies Assumption 3.1.
Next, we show that the estimate (3.2) can be significantly relaxed if A is the derivative of a convex function. To this end, we need to recall an important inequality for convex functions. This inequality is well-known in the finite-dimensional case, see, e.g., [13, Theorem 2.1.12] or [6, Lemma 3.10], and the proof carries over to arbitrary Hilbert spaces. We are, however, not aware of a reference in the infinite-dimensional case.
Lemma 3.4
Let \(g:V \rightarrow \mathbb {R}\) be a Fréchet differentiable convex function such that the derivative \(g' :V \rightarrow V^\star \) is \(\mu _g\)-strongly monotone and \(L_g\)-Lipschitz. Then,
holds for all \(x_1, x_2 \in V\).
Proof
One can transfer the proofs of [13, Theorem 2.1.12] or [6, Lemma 3.10] to the infinite-dimensional case by using [4, Theorem 18.15]. \(\square \)
Lemma 3.5
We assume that A is \(\mu _A\)-strongly monotone and \(L_A\)-Lipschitz and that \(\varPhi \) is \(L_\varPhi \)-Lipschitz. We further assume that there exists a Fréchet differentiable convex function \(g:V \rightarrow \mathbb {R}\) such that \(A = g'\) and
where \(\gamma _A = L_A / \mu _A\). Then, the operator \(B := A \, (I - \varPhi )^{-1}\) is \(\mu _B\)-strongly monotone and \(L_B\)-Lipschitz with
In particular, Assumption 3.1 is satisfied and (1.1) has a unique solution for every \(f \in V^\star \).
Proof
By arguing as in the proof of Lemma 3.3, we obtain that \(I-\varPhi \) is invertible and the value of the Lipschitz constant \(L_B\) follows.
Now, let \(y_1, y_2 \in V\) be arbitrary and we set \(x_i := (I-\varPhi )^{-1}(y_i)\), \(i =1,2\). Then, we apply Lemma 3.4 to obtain
Next, we employ Young’s inequality
and get
From the proof of Lemma 3.3 we find \(\Vert y_1 - y_2 \Vert _V \le (1 + L_\varPhi ) \, \Vert x_1 - x_2\Vert _V\) and this yields the monotonicity. The final claim follows from Theorem 3.2. \(\square \)
Note that the inequality (3.3) is weaker than (3.2), unless \(\gamma _A = 1\). Lemma 3.5 is an improvement of the corresponding results in the literature, e.g., [14, Corollary 2], in the case that A is the derivative of a convex function. It is well known that A is a derivative of a convex function if and only if A is maximally cyclically monotone, see, e.g., [4, Theorem 22.14].
Finally, we demonstrate by the mean of two examples that the assumptions (3.2) and (3.3) are sharp, even in the case of linear operators. These examples are found by constructing operators for which the estimates in the proofs of Lemmas 3.3 and 3.5 are sharp. First, we validate the sharpness of (3.3).
Theorem 3.6
Let the constants \(0< \mu _A < L_A\) be given. We define \(L_\varPhi := \mu _A / L_A < 1\), i.e., (3.2) is violated. Then, there exist linear operators A and \(\varPhi \) on \(V = \mathbb {R}^2\) (equipped with the Euclidean inner product), such that A is \(\mu _A\)-strongly monotone and \(L_A\)-Lipschitz, \(\varPhi \) is \(L_\varPhi \)-Lipschitz and \(A \, (I-\varPhi )^{-1}\) is not coercive. Moreover, there exists a one-dimensional subspace \(K \subset \mathbb {R}^2\) such that (3.1) and (1.1) are not uniquely solvable for all \(f \in V^\star \).
Proof
We define the constant \(c_A := \sqrt{L_A^2 - \mu _A^2} > 0\) and the operators
where
Since A is the combination of a rotation and a scaling by \(L_A\), it is easy to check that \(z^\top A \, z = \mu _A \, \Vert z\Vert ^2\) and \(\Vert A \, z\Vert = L_A \, \Vert z\Vert \) hold for all \(z \in \mathbb {R}^2\). Moreover, the Lipschitz constant of \(\varPhi \) is \(L_\varPhi \). However,
Hence, \(A \, (I-\varPhi )^{-1}\) is not coercive. Moreover, if we set \(K = {\text {span}}\{z\}\) it is clear that (3.1) is not uniquely solvable for all \(f \in V^\star = \mathbb {R}^2\). Since \(I - \varPhi \) is a bijection, this implies that (1.1) is not uniquely solvable for all \(f \in V^\star = \mathbb {R}^2\). \(\square \)
The next result shows that (3.3) is sharp.
Theorem 3.7
Let \(0< \mu _A < L_A\) be given. We define \(L_\varPhi := 2 \, \sqrt{\mu _A \, L_A} / (\mu _A + L_A) < 1\), i.e., (3.3) is violated. Then, there exists a linear symmetric operator A on \(V = \mathbb {R}^2\) (equipped with the Euclidean inner product) and a linear operator \(\varPhi \) in \(\mathbb {R}^2\), such that A is \(\mu _A\)-strongly monotone and \(L_A\)-Lipschitz, \(\varPhi \) is \(L_\varPhi \)-Lipschitz and \(A \, (I-\varPhi )^{-1}\) is not coercive. Moreover, there exists a one-dimensional subspace \(K \subset \mathbb {R}^2\) such that (3.1) and (1.1) cannot be uniquely solvable for all \(f \in V^\star \).
Proof
We define
It is clear that the operator A is \(\mu _A\)-strongly monotone and \(L_A\)-Lipschitz. We further set
It can be checked that \(\varPhi \) is \(L_\varPhi \)-Lipschitz and \(\Vert x\Vert = 1\). However,
where \(y = (I-\varPhi ) \, x \ne 0\). Hence, \(A \, (I-\varPhi )\) and \(A \, (I-\varPhi )^{-1}\) are not coercive. Moreover, if we set \(K = {\text {span}}\{y\}\) it is clear that (3.1) cannot be uniquely solvable for all \(f \in V^\star = \mathbb {R}^2\). Since \(I - \varPhi \) is a bijection, this implies that (1.1) is not uniquely solvable for all \(f \in V^\star = \mathbb {R}^2\). \(\square \)
We further mention that it is also possible to obtain Assumption 3.1 in situations in which \(\varPhi \) is “not small”, e.g., if \(\varPhi = \lambda \, I\) with some \(\lambda < 1\), Assumption 3.1 follows automatically if A is strongly monotone and Lipschitz since \((I-\varPhi )^{-1} = (1-\lambda )^{-1} \, I\) in this case. Moreover, it is possible to analyze the situation in which A is a small perturbation of the derivative of a convex function by combining the ideas of Lemmas 3.3 and 3.5.
The combination of Theorem 3.2 and 3.3 yields a well-known result: under the assumption (3.2), the QVI (1.1) has a unique solution. Such a result is typically shown via contraction-type arguments, see, e.g., [14] or [2, Section 3.1.1]. Thus, the approach of this section is able to reproduce this classical result. However, the combination of Theorem 3.2 and Lemma 3.5 yields a new result in case that A has a convex potential in which the condition (3.2) on the Lipschitz constant \(L_\varPhi \) of \(\varPhi \) is relaxed to (3.3).
4 Localization of the smallness assumption
We localize the assumptions concerning the Lipschitz constant of \(\varPhi \).
Assumption 4.1
We assume that \(A :V \rightarrow V^\star \) is (globally) \(\mu _A\)-strongly monotone and \(L_A\)-Lipschitz. Further, let \({\bar{f}} \in V^\star \) be given and let \({\bar{y}}\) be a solution of (1.1). We suppose that there is a closed, convex neighborhood \(Y \subset V\) of \({\bar{y}}\) such that \(\varPhi \) is \(L_\varPhi \)-Lipschitz continuous on Y. Finally,
- (i)
inequality (3.2) holds or
- (ii)
inequality (3.3) holds and A is the Fréchet derivative of a convex function.
Theorem 4.2
Suppose that Assumption 4.1 is satisfied. There is a neighborhood \(F \subset V^\star \) of f such that (1.1) has exactly one solution in Y for all \(f \in F\). Moreover, this solution depends Lipschitz-continuously on f.
Note that we do not claim that (1.1) is uniquely solvable for all \(f \in F\) and (1.1) might have further solutions in \(V {\setminus } Y\).
Proof
We define \({\tilde{\varPhi }} :V \rightarrow V\) via
Since projections are 1-Lipschitz, \({\tilde{\varPhi }}\) is \(L_\varPhi \)-Lipschitz. Now, we consider the modified QVI
with
From Assumption 4.1, Lemmas 3.3 and 3.5, and Theorem 3.2 it follows that (4.1) has a unique solution \(y = \tilde{S}(f)\) for every \(f \in F\) and the solution operator \(S :V^\star \rightarrow V\) is Lipschitz continuous. Hence, we can choose a neighborhood \(F \subset V^\star \) of \({\bar{f}}\), such that \(\tilde{S}(f) \in Y\) for all \(f \in F\).
Since \(Q(y) = {\tilde{Q}}(y)\) for all \(y \in Y\), it is clear that \(y \in Y\) is a solution of (1.1) if and only if \(y \in Y\) solves (4.1). Hence, (1.1) has a unique solution in Y for all \(f \in F\). \(\square \)
5 Differential stability
In this section, we consider the situation of Assumption 4.1. However, we do not need Assumption 4.1 directly, but the assertion of Theorem 4.2 is enough.
Assumption 5.1
We suppose that the following assumptions are satisfied.
- (i).
We assume the existence of sets \(F \subset V^\star \), \(Y \subset V\) such that for every \(f \in F\), (1.1) has a unique solution y in Y and the solution map \(S :F \rightarrow Y\), \(f \mapsto y\) is Lipschitz continuous. For fixed \({\bar{f}} \in F\), we set \({\bar{y}} := S({\bar{f}})\). The sets F, Y are assumed to be neighborhoods of \({\bar{f}}\), \({\bar{y}}\), respectively.
- (ii).
The operator \(\varPhi :V \rightarrow V\) is Lipschitz on Y, i.e., there exists \(L_\varPhi > 0\) with
$$\begin{aligned} \Vert \varPhi ( y_1 ) - \varPhi ( y_2 ) \Vert _V \le L_\varPhi \, \Vert y_1 - y_2 \Vert _V \qquad \forall y_1, y_2 \in Y. \end{aligned}$$We suppose that \(I - \varPhi :Y \rightarrow Z\) is bijective with a Lipschitz continuous inverse, where \(Z := (I - \varPhi )(Y)\). Further, \(\varPhi \) is Fréchet differentiable at \({\bar{y}}\) and the bounded linear operator \(I - \varPhi '({\bar{y}})\) is bijective.
- (iii).
The operator A is Fréchet differentiable at \({\bar{y}}\) and the bounded linear operator
$$\begin{aligned} A'({\bar{y}}) \, (I - \varPhi '(\bar{y}))^{-1} \end{aligned}$$(5.1)is assumed to be coercive.
- (iv).
The set K is polyhedric at \(({\bar{z}}, {\bar{f}} - A({\bar{y}}))\), where \({\bar{z}} = (I - \varPhi )({\bar{y}})\).
Due to (1.2), the last assumption is equivalent to the polyhedricity of Q(y) at \(({\bar{y}}, {\bar{f}} - A({\bar{y}}))\).
First, we show that Assumption 5.1 follows from Assumption 4.1 and from the differentiability of \(\varPhi \) and A.
Theorem 5.2
Suppose that Assumption 4.1 is satisfied. Then, Assumption 5.1 (i) holds. If \(\varPhi \) is Fréchet differentiable at \({\bar{y}}\), then Assumption 5.1 (ii) holds. If, additionally, A is Fréchet differentiable at \(\bar{y}\), then Assumption 5.1 (iii) is satisfied.
Proof
Assumption 5.1 (i) follows from Theorem 4.2.
Since \(L_\varPhi < 1\), Banach’s fixed point theorem implies that \(I - \varPhi \) is bijective with a Lipschitz continuous inverse. The invertibility of \(I - \varPhi '({\bar{y}})\) follows from the Neumann series since \(\Vert \varPhi '({\bar{y}})\Vert \le L_\varPhi < 1\).
If A is Fréchet differentiable at \({\bar{y}}\), Assumption 4.1 implies that \(A'({\bar{y}})\) is \(\mu _A\)-strongly monotone and \(L_A\)-Lipschitz. In case that (3.2) is satisfied, we can invoke Lemma 3.3 to obtain Assumption 5.1 (iii). Otherwise, A is the Fréchet derivative of a convex function. Hence, \(A'({\bar{y}})\) is symmetric since it is a second Fréchet derivative, see [7, Theorem 5.1.1]. Thus, \(A'({\bar{y}})\) is the derivative of the convex function \(v \mapsto \langle A'(\bar{y})\,v,v\rangle /2\). Therefore, we can invoke Lemma 3.5 to obtain Assumption 5.1 (iii). \(\square \)
Lemma 5.3
Let us assume that Assumption 5.1 (i)–(ii) is satisfied. Then, \((I - \varPhi )^{-1}\) is Fréchet differentiable at \({\bar{z}} := (I - \varPhi )({\bar{y}})\) and \(( (I-\varPhi )^{-1} )'({\bar{z}}) = (I-\varPhi '({\bar{y}}))^{-1}\).
Proof
For arbitrary \(h \in V\) we have
Using the Fréchet differentiability of \(\varPhi \) at \({\bar{y}}\) implies
as \(\Vert (I-\varPhi )^{-1}({\bar{z}} + h) - {\bar{y}}\Vert _V \rightarrow 0\). Finally, using the fact that \((I-\varPhi )^{-1}\) is Lipschitz implies
This shows the claim. \(\square \)
Lemma 5.4
Let us assume that Assumption 5.1 (i)–(iii) is satisfied. The operator \(B := A \circ (I - \varPhi )^{-1}\) is Fréchet differentiable at \({\bar{z}}\) and its Fréchet derivative is given by \(B'({\bar{z}})= A'({\bar{y}}) \, (I - \varPhi '(\bar{y}))^{-1}\).
Proof
Follows from Lemma 5.3 together with a chain rule. \(\square \)
Theorem 5.5
Let us assume that Assumption 5.1 is satisfied. Then, the solution map S is directionally differentiable at \({\bar{f}}\) and the directional derivative \(x := S'({\bar{f}}; h)\) in direction \(h \in V^\star \) is given by the unique solution of the QVI
where the set-valued mapping \(Q^{{\bar{y}}} :V \rightrightarrows V\) is given by
Note that we have
due to (1.2).
Proof
Let \(h \in V^\star \) be given. There exists \(T > 0\) such that \({\bar{f}} + t \, h \in F\) for all \(t \in [0,T)\). For \(t \in (0,T)\) we define
Since S is assumed to be Lipschitz continuous on F, the difference quotients \(\{ x_t \big |t \in (0,T) \}\) are bounded in V. The Lipschitz continuity of \(\varPhi \) implies the boundedness of \(\{ w_t \big |t \in (0,T) \}\) in V.
Since \(z_t\) solves the VI (3.1), i.e.,
with \(f := {\bar{f}} + t \, h\), we can apply [9, Theorem 2.13] to obtain the convergence of the difference quotients \(w_t\). Let us check that the assumptions of [9, Theorem 2.13] are satisfied. The standing assumption [9, Assumption 2.1] is satisfied in our Hilbert space setting with \(j = \delta _K\) being the indicator function (in the sense of convex analysis) of the set K. The validity of [9, Assumption 2.2] follows from the Taylor expansion
with , see Lemma 5.4. It remains to check that the assumption [9, Theorem 2.13 (ii)] holds:
Since K is assumed to be polyhedric at \({\bar{z}}\) w.r.t. \({\bar{f}} - A({\bar{y}})\), its indicator function \(\delta _K\) is twice epi-differentiable at \(({\bar{z}}, {\bar{f}} - A({\bar{y}}))\), see [9, Corollary 3.3]. Moreover, its second subderivative is the indicator function of the critical cone \(\mathcal {K}_K({\bar{z}}, {\bar{f}} - A({\bar{y}})) := \mathcal {T}_K({\bar{z}}) \cap ({\bar{f}} - A({\bar{y}}))^\perp \).
The weak convergence \(w_n \rightharpoonup w\) in V implies \(B'({\bar{z}}) \, w_n \rightharpoonup B'({\bar{z}}) \, w\) in \(V^\star \) and \(\liminf _{n \rightarrow \infty } \langle B'({\bar{z}}) w_n,w_n\rangle \ge \langle B'({\bar{z}})\,w,w\rangle \) follows from the coercivity of the linear operator \(B'({\bar{z}}) = A'({\bar{y}}) \, (I - \varPhi '({\bar{y}}))^{-1}\), see Lemma 2.1, Assumption 5.1 (iii) and Lemma 5.4.
Thus, the application of [9, Theorem 2.13] yields that all accumulation points w of \(w_t\) for \(t \searrow 0\) are solutions of the linearized VI
Since \(B'({\bar{z}})\) is coercive, this linearized VI has a unique solution. Hence, the last part of [9, Theorem 2.13] implies \(w_t \rightarrow w\) as \(t \searrow 0\).
It remains to prove the convergence of \(x_t\) towards the solution of (5.2). Using the differentiability of \((I - \varPhi )^{-1}\), we find
The change of variables \(w = (I - \varPhi '({\bar{y}})) \, x\) shows the equivalence of (5.2) and (5.3). Thus, x is the unique solution of (5.2). \(\square \)
Some remarks concerning Theorem 5.5 are in order.
Remark 5.6
-
(i)
The polyhedricity assumption Assumption 5.1 (iv) can be replaced by the strong twice epi-differentiability of the indicator function \(\delta _K\) in the sense of [9, Definition 2.9]. Under this generalized assumption, the second epi-derivative of \(\delta _K\) appears as a curvature term in the linearized inequalities (5.2) and (5.3). Note that the indicator function of the critical cone \(\mathcal {K}_K({\bar{z}}, {\bar{f}} - A({\bar{y}}))\), which appears implicitly in (5.2) and (5.3), is just the second epi-derivative of \(\delta _K\) in the case of K being polyhedric.
-
(ii)
We have derived the differentiability result under the assumption that \(\varPhi \) is Fréchet differentiable at \({\bar{y}}\). In the notation of [9], this translates to linearity of the operator \(A_x\). However, the inspection of the proof of [9, Theorem 2.13] entails that it is possible to replace the Fréchet differentiability of \(\varPhi \) by the following set of assumptions:
- (a)
\(\varPhi \) is Bouligand differentiable at \({\bar{y}}\), i.e., there exists \(\varPhi '({\bar{y}}; \cdot ) :V \rightarrow V\) such that
- (b)
For every sequence \(w_n \rightharpoonup w\) in V, we assume
$$\begin{aligned} (I - \varPhi '({\bar{y}}; \cdot ))^{-1}(w_n)&\rightharpoonup (I - \varPhi '({\bar{y}}; \cdot ))^{-1}(w) , \end{aligned}$$(5.4a)$$\begin{aligned} \liminf _{n \rightarrow \infty } \langle A'({\bar{y}}) \, (I - \varPhi '({\bar{y}}; \cdot ))^{-1}(w_n) , w_n \rangle&\ge \langle A'({\bar{y}}) \, (I - \varPhi '({\bar{y}}; \cdot ))^{-1}(w) , w \rangle . \end{aligned}$$(5.4b)
Note that (a) implies that \(\varPhi '({\bar{y}}; \cdot )\) is Lipschitz on V with constant \(L_\varPhi \). Hence, \((I - \varPhi '({\bar{y}};\cdot ))\) is invertible and Lemmas 3.3 and 3.5 can be used to obtain the strong monotonicity of \(A'({\bar{y}}) \, (I - \varPhi '({\bar{y}}; \cdot ))^{-1}\).
Property (5.4a) can be verified by assuming, e.g., weak continuity of \(\varPhi '({\bar{y}}; \cdot )\). Indeed, the sequence \(z_n := (I - \varPhi '({\bar{y}}; \cdot ))^{-1}(w_n)\) is bounded, hence, \(z_n \rightharpoonup z\) along a subsequence. Now, weak continuity implies \(w_n = z_n - \varPhi '({\bar{y}}; z_n) \rightharpoonup z - \varPhi '({\bar{y}}; z)\) and \(w_n \rightharpoonup w\) implies \(z = (I - \varPhi '({\bar{y}}; \cdot ))^{-1}(w)\), i.e. \(z_n \rightharpoonup (I - \varPhi '({\bar{y}}; \cdot ))^{-1}(w)\) along a subsequence. The uniqueness of the limit point implies the convergence of the entire sequence.
- (a)
In the next remark, we compare our differentiability result with [1, Theorem 1].
Remark 5.7
In [1, Theorem 1] a similar differentiability result is obtained in a more restrictive setting:
- (i)
Therein, the leading operator A has to be linear and T-monotone (w.r.t. a vector space order on V). Our approach also allows for non-linear operators and we do not need any order structure on V. Similarly, we do not need any monotonicity assumptions on \(\varPhi \).
- (ii)
They require the complete continuity of \(\varPhi '({\bar{y}})\), which is not needed in Theorem 5.5.
- (iii)
One of their most restrictive assumptions is the assumption (A5). Via [7, Theorem 3.1.2], this assumption is equivalent to \(\varPhi \) being \(L_\varPhi \)-Lipschitz with
$$\begin{aligned} L_\varPhi < \frac{1}{1 + \gamma _A}. \end{aligned}$$(5.5)This inequality is much stronger than (3.2). Thus, their assumption (A5) implies that the solutions to the QVI (1.1) are unique.
Moreover, they obtained the differentiability only for non-negative directions whereas our approach is applicable to arbitrary perturbations of the right-hand side.
One assumption in [1] is weaker: they only need Hadamard differentiability of \(\varPhi \). We need Fréchet differentiability (or Bouligand differentiability, see Remark 5.6).
6 Optimal control
In this section, we consider the optimal control problem
Here, \(f \in V^\star \) is fixed, U is a Hilbert space and the bounded, linear operator \(B :U \rightarrow V^\star \) is assumed to have a dense range. Moreover, the objective \(J :V \times U \rightarrow \mathbb {R}\) is Fréchet differentiable.
The main goal of this section is the derivation of stationarity conditions for local solutions of (6.1). Since the constraints of (6.1) contain a QVI, this is a delicate issue. Using the (local) solution map S of the QVI, one can consider the reduced problem
Under the assumptions of Theorem 5.5, this reduced objective function is directionally differentiable and we obtain the stationarity condition
In the literature, such an inequality is called B-stationarity. In some situations, it is possible to introduce dual variables to obtain a so-called system of strong stationarity which is equivalent to B-stationarity. The next theorem shows that this is indeed possible for (6.1).
Theorem 6.1
Suppose that \(({\bar{y}}, {\bar{u}})\) is locally optimal for (6.1). In addition to the assumptions on B and J, we assume that Assumption 5.1 is satisfied at \({\bar{f}} := B \, {\bar{u}} + f\). Then, there exist unique multipliers \(p \in V\), \(\mu \in V^\star \) such that the system
is satisfied. Here,
and \(J_y(\cdot ) \in V^\star \) and \(J_u(\cdot ) \in U^\star \) are the partial Fréchet derivatives of J at \(({\bar{u}}, \bar{y})\).
Proof
We use classical arguments dating back to [12, Proposition 4.1], see also [17, Theorem 5.3].
Due to Assumption 5.1 we can invoke Theorem 5.5 to obtain the directional differentiability of the control-to-state map. Combined with the local optimality of \(({\bar{y}}, {\bar{u}})\), this implies
Due to the Lipschitz estimate \(\Vert S'(B\,{\bar{u}} + f; B \, h)\Vert _{V} \le C \, \Vert B \, h\Vert _{V^\star }\), the above inequality implies
Hence, there is \(p \in V^{\star \star }\cong V\) (by defining it as in the next line on the dense subspace \({\text {image}}(B) \subset V^\star \) and extending it by continuity on the whole space \(V^\star \)) such that
This yields (6.2b) and
Using the density of \({\text {image}}(B)\) in \(V^\star \) we get
In what follows, we set \(\mathcal {K}:= \mathcal {K}_K({\bar{z}}, {\bar{\lambda }})\) for convenience. We recall that \(S'(B \, {\bar{u}} + f; h)\) is the unique solution of
where the set-valued mapping \(Q^{{\bar{y}}} :V \rightrightarrows V\) is given by
We choose \(h \in \mathcal {K}^\circ \) in (*). We check that (**) implies \(S'({\bar{u}} + f; h) = 0\). Indeed, \(0 \in Q^{{\bar{y}}}(0) = \mathcal {K}\) and
holds since \(h \in \mathcal {K}^\circ \). Thus, (*) implies
i.e., \(p \in -\mathcal {K}\), which shows (6.2c).
Now, we choose \(w \in (I - \varPhi '({\bar{y}}))^{-1} \mathcal {K}\) and set \(h = A'({\bar{y}}) \, w\). It can be checked that (**) implies \(S'(B \, {\bar{u}} + f ; h) = w\). Indeed, \(w \in Q^{{\bar{y}}}(w) = \mathcal {K}+ \varPhi '({\bar{y}})\,w\) and
due to the definition of h. With this choice, (*) implies
We define \(\mu := -(I - \varPhi '({\bar{y}}))^{-\star }(J_y(\cdot ) + A^\star \, p)\) and get (6.2a) and
Since \(I - \varPhi '({\bar{y}})\) is a bijection, this is equivalent to (6.2d).
The uniqueness of p and \(\mu \) follows from the injectivity of \(B^\star \) and the bijectivity of \((I - \varPhi '({\bar{y}}))^\star \). \(\square \)
The approach of [8, Section 6.1] can be used to provide strong stationarity systems under less restrictive assumptions on K, i.e., the polyhedricity assumption can be replaced by the twice epi-differentiability of the indicator function \(\delta _K\).
In the case that \(\varPhi \) is merely Bouligand differentiable, cf. Remark 5.6, conditions (6.2a) and (6.2d) could be rewritten as
see the proof of Theorem 6.1.
Finally, we show that the system of strong stationarity is of reasonable strength, i.e., it implies the B-stationarity (6.4).
Lemma 6.2
Let \(({\bar{y}}, {\bar{u}})\) be a feasible point of (6.1) such that Assumption 5.1 is satisfied at \({\bar{f}} := B \, {\bar{u}} + f\). Moreover, suppose that J is Fréchet differentiable. If there exist multipliers \(p \in V\), \(\mu \in V^\star \) satisfying (6.2), then (6.4) holds.
Proof
For an arbitrary \(h \in U\) we define \(x := S'(B \, {\bar{u}} + f; B \, h)\). Then
From the linearized QVI (5.2) and the strong stationarity system (6.2), we have
References
Alphonse, A., Hintermüller, M., Rautenberg, C.N.: Directional differentiability for elliptic quasi-variational inequalities of obstacle type. Calc. Var. Partial Differ. Equ. (2019). https://doi.org/10.1007/s00526-018-1473-0
Alphonse, A., Hintermüller, M., Rautenberg, C.N.: Recent trends and views on elliptic quasi-variational inequalities. In: CIM Series in Mathematical Sciences, pp. 1–31. Springer, Berlin (2019). https://doi.org/10.1007/978-3-030-33116-0_1
Barrett, J.W., Prigozhin, L.: Sandpiles and superconductors: nonconforming linear finite element approximations for mixed formulations of quasi-variational inequalities. IMA J. Numer. Anal. 35(1), 1–38 (2013). https://doi.org/10.1093/imanum/drt062
Bauschke, H.H., Combettes, P.L.: Convex Analysis and Monotone Operator Theory in Hilbert Spaces. Springer, Berlin (2011). https://doi.org/10.1007/978-1-4419-9467-7
Bensoussan, A., Lions, J.L.: Impulse Control and Quasi Variational Inequalities. Modern Applied Mathematics Series. Wiley, Berlin (1987)
Bubeck, S.: Convex optimization: algorithms and complexity. Found. Trends\(^{\textregistered }\) Mach. Learning 8(3–4), 231–357 (2015). https://doi.org/10.1561/2200000050
Cartan, H.: Calcul Différentiel. Hermann, Paris (1967)
Christof, C.: Sensitivity analysis of elliptic variational inequalities of the first and the second kind. Ph.D. Thesis (2018). https://doi.org/10.17877/de290r-19056
Christof, C., Wachsmuth, G.: Differential sensitivity analysis of variational inequalities with locally Lipschitz continuous solution operators. Appl. Math. Optim. 81, 23–62 (2020). https://doi.org/10.1007/s00245-018-09553-y
Dreves, A.: Uniqueness for quasi-variational inequalities. Set Valued Var. Anal. 24(2), 285–297 (2015). https://doi.org/10.1007/s11228-015-0339-2
Kinderlehrer, D., Stampacchia, G.: An Introduction to Variational Inequalities and Their Applications. Academic Press, New York (1980)
Mignot, F.: Contrôle dans les inéquations variationelles elliptiques. J. Funct. Anal. 22(2), 130–185 (1976). https://doi.org/10.1016/0022-1236(76)90017-3
Nesterov, Y.: Introductory Lectures on Convex Optimization. Springer, New York (2004). https://doi.org/10.1007/978-1-4419-8853-9
Nesterov, Y., Scrimali, L.: Solving strongly monotone variational and quasi-variational inequalities. Discrete Contin. Dyn. Syst. 31(4), 1383–1396 (2011). https://doi.org/10.3934/dcds.2011.31.1383
Noor, M.A., Oettli, W.: On general nonlinear complementarity problems and quasi-equilibria. Le Mat. 49(2), 313–331 (1994)
Prigozhin, L.: Variational model of sandpile growth. Eur. J. Appl. Math. (1996). https://doi.org/10.1017/s0956792500002321
Wachsmuth, G.: A guided tour of polyhedric sets. J. Convex Anal. 26(1), 153–188 (2019). http://www.heldermann.de/JCA/JCA26/JCA261/jca26010.htm
Acknowledgements
Open Access funding provided by Projekt DEAL. This work is supported by the DFG Grant Analysis and Solution Methods for Bilevel Optimal Control Problems (Grant No. WA3636/4-1) within the Priority Program SPP 1962 (Non-smooth and Complementarity-based Distributed Parameter Systems: Simulation and Hierarchical Optimization). The simple proof of Lemma 2.1 was pointed out by Felix Harder.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by L. Caffarelli.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wachsmuth, G. Elliptic quasi-variational inequalities under a smallness assumption: uniqueness, differential stability and optimal control. Calc. Var. 59, 82 (2020). https://doi.org/10.1007/s00526-020-01743-3
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00526-020-01743-3